Abstract
Investigators routinely use unidimensional summaries for multidimensional data. In microarray data analysis, for example, the gene expression level is indeed a unidimensional summary of probe-level or SNP measurements. In this paper, we propose a mixture factor model for the low-level data, which enables us to examine the adequacy of a unidimensional summary while accommodating known or latent subgroups in the population. We also develop screening procedures based on the proposed model to identify potentially informative genes in biomedical studies. As shown in our empirical studies, the proposed methods are often more effective than existing methods because the new model goes beyond the conventional unidimensional summaries of gene expressions.
| Original language | English |
|---|---|
| Pages (from-to) | 60-76 |
| Number of pages | 17 |
| Journal | Test |
| Volume | 28 |
| Issue number | 1 |
| DOIs | |
| State | Published - Mar 12 2019 |
Keywords
- Gene screening
- Mean structure
- Mixture factor models
- Unidimensional test