Fixed-effect variable selection in linear mixed models using R2 statistics

Academic Article

Abstract

  • In the linear mixed model (LMM), several R2 statistics have been proposed for assessing the goodness-of-fit of fixed effects. However, the performance of these statistics has not been fully demonstrated either analytically or through simulations. We report results of simulations to asses the ability of these statistics to select the most parsimonious model. R2 statistics from a full model were compared to other models in which fixed-effect covariates were removed. The full model was also compared to an overfitted model that included additional covariates not linked to the outcome. All models compared involved the same random effects. In this paper, we show that R2 statistics that involve the residuals are unable to adequately discriminate between the correct model and one from which important fixed-effect covariates are omitted if the computation of the predicted values for the residuals included the random effects (referred to as conditional R2 statistics). However, if the random effects are excluded from the computation of the predicted values that lead to the residuals, these R2 statistics (referred to as marginal R2 statistics) are able to select the most parsimonious model. Other R2 statistics that have been proposed by Xu [2003. Measuring explained variation in linear mixed effects models. Statist. Med. 22(22), 3527-3541] performed poorly in that there was little variation in the value of these statistics from a full model to a reduced model. © 2007 Elsevier B.V. All rights reserved.
  • Authors

    Digital Object Identifier (doi)

    Author List

  • Orelien JG; Edwards LJ
  • Start Page

  • 1896
  • End Page

  • 1907
  • Volume

  • 52
  • Issue

  • 4