Variable selection for random effects two-part models

Dongxiao Han, Lei Liu, Xiaogang Su, Bankole Johnson, Liuquan Sun

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

Random effects two-part models have been applied to longitudinal studies for zero-inflated (or semi-continuous) data, characterized by a large portion of zero values and continuous non-zero (positive) values. Examples include monthly medical costs, daily alcohol drinks, relative abundance of microbiome, etc. With the advance of information technology for data collection and storage, the number of variables available to researchers can be rather large in such studies. To avoid curse of dimensionality and facilitate decision making, it is critically important to select covariates that are truly related to the outcome. However, owing to its intricate nature, there is not yet a satisfactory variable selection method available for such sophisticated models. In this paper, we seek a feasible way of conducting variable selection for random effects two-part models on the basis of the recently proposed “minimum information criterion” (MIC) method. We demonstrate that the MIC formulation leads to a reasonable formulation of sparse estimation, which can be conveniently solved with SAS Proc NLMIXED. The performance of our approach is evaluated through simulation, and an application to a longitudinal alcohol dependence study is provided.

Original languageEnglish
Pages (from-to)2697-2709
Number of pages13
JournalStatistical Methods in Medical Research
Volume28
Issue number9
DOIs
StatePublished - Sep 1 2019

Keywords

  • High dimensional
  • mixed effects
  • pharmacogenetics
  • precision medicine
  • tuning parameter
  • variable selection

Fingerprint

Dive into the research topics of 'Variable selection for random effects two-part models'. Together they form a unique fingerprint.

Cite this