TY - JOUR
T1 - Comparing regression-based approaches for identifying microbial functional groups
AU - Yu, Fang
AU - Tikhonov, Mikhail
N1 - Publisher Copyright:
© 2025 The Author(s). Published by IOP Publishing Ltd.
PY - 2025/7/1
Y1 - 2025/7/1
N2 - Microbial communities are composed of functionally integrated taxa, and identifying which taxa contribute to a given ecosystem function is essential for predicting community behaviors. This study compares the effectiveness of a previously proposed method for identifying ‘functional taxa,’ ensemble quotient optimization (EQO), to a potentially simpler approach based on the least absolute shrinkage and selection operator (LASSO). In contrast to LASSO, EQO uses a binary prior on coefficients, assuming uniform contribution strength across taxa. Using synthetic datasets with increasingly realistic structure, we demonstrate that EQO’s strong prior enables it to perform better in low-data regime. However, LASSO’s flexibility and efficiency can make it preferable as data complexity increases. Our results detail the favorable conditions for EQO and emphasize LASSO as a viable alternative.
AB - Microbial communities are composed of functionally integrated taxa, and identifying which taxa contribute to a given ecosystem function is essential for predicting community behaviors. This study compares the effectiveness of a previously proposed method for identifying ‘functional taxa,’ ensemble quotient optimization (EQO), to a potentially simpler approach based on the least absolute shrinkage and selection operator (LASSO). In contrast to LASSO, EQO uses a binary prior on coefficients, assuming uniform contribution strength across taxa. Using synthetic datasets with increasingly realistic structure, we demonstrate that EQO’s strong prior enables it to perform better in low-data regime. However, LASSO’s flexibility and efficiency can make it preferable as data complexity increases. Our results detail the favorable conditions for EQO and emphasize LASSO as a viable alternative.
KW - Ensemble Quotient Optimization (EQO)
KW - Least Absolute Shrinkage and Selection Operator (LASSO)
KW - data-driven inference
KW - functional taxa identification
KW - microbial functional groups
KW - phylogenetic regularization
KW - sparse regression
UR - https://www.scopus.com/pages/publications/105007065924
U2 - 10.1088/1478-3975/addc2a
DO - 10.1088/1478-3975/addc2a
M3 - Article
C2 - 40403753
AN - SCOPUS:105007065924
SN - 1478-3967
VL - 22
JO - Physical Biology
JF - Physical Biology
IS - 4
M1 - 046001
ER -