TY - JOUR
T1 - Unveiling sub-populations in critical care settings
T2 - a real-world data approach in COVID-19
AU - Anderson, Wesley
AU - Gould, Ruth
AU - Patil, Namrata
AU - Mohr, Nicholas
AU - Dodd, Kenneth
AU - Boyce, Danielle
AU - Dasher, Pam
AU - Guerin, Philippe J.
AU - Khan, Reham
AU - Cheruku, Sreekanth
AU - Kumar, Vishakha K.
AU - Mathé, Ewy
AU - Mehta, Aneesh K.
AU - Michelson, Andrew P.
AU - Williams, Andrew
AU - Heavner, Smith F.
AU - Podichetty, Jagdeep T.
N1 - Publisher Copyright:
Copyright © 2025 Anderson, Gould, Patil, Mohr, Dodd, Boyce, Dasher, Guerin, Khan, Cheruku, Kumar, Mathé, Mehta, Michelson, Williams, Heavner and Podichetty.
PY - 2025
Y1 - 2025
N2 - Background: Disease presentation and progression can vary greatly in heterogeneous diseases, such as COVID-19, with variability in patient outcomes, even within the hospital setting. This variability underscores the need for tailored treatment approaches based on distinct clinical subgroups. Objectives: This study aimed to identify COVID-19 patient subgroups with unique clinical characteristics using real-world data (RWD) from electronic health records (EHRs) to inform individualized treatment plans. Materials and methods: A Factor Analysis of Mixed Data (FAMD)-based agglomerative hierarchical clustering approach was employed to analyze the real-world data, enabling the identification of distinct patient subgroups. Statistical tests evaluated cluster differences, and machine learning models classified the identified subgroups. Results: Three clusters of COVID-19 in patients with unique clinical characteristics were identified. The analysis revealed significant differences in hospital stay durations and survival rates among the clusters, with more severe clinical features correlating with worse prognoses and machine learning classifiers achieving high accuracy in subgroup identification. Conclusion: By leveraging RWD and advanced clustering techniques, the study provides insights into the heterogeneity of COVID-19 presentations. The findings support the development of classification models that can inform more individualized and effective treatment plans, improving patient outcomes in the future.
AB - Background: Disease presentation and progression can vary greatly in heterogeneous diseases, such as COVID-19, with variability in patient outcomes, even within the hospital setting. This variability underscores the need for tailored treatment approaches based on distinct clinical subgroups. Objectives: This study aimed to identify COVID-19 patient subgroups with unique clinical characteristics using real-world data (RWD) from electronic health records (EHRs) to inform individualized treatment plans. Materials and methods: A Factor Analysis of Mixed Data (FAMD)-based agglomerative hierarchical clustering approach was employed to analyze the real-world data, enabling the identification of distinct patient subgroups. Statistical tests evaluated cluster differences, and machine learning models classified the identified subgroups. Results: Three clusters of COVID-19 in patients with unique clinical characteristics were identified. The analysis revealed significant differences in hospital stay durations and survival rates among the clusters, with more severe clinical features correlating with worse prognoses and machine learning classifiers achieving high accuracy in subgroup identification. Conclusion: By leveraging RWD and advanced clustering techniques, the study provides insights into the heterogeneity of COVID-19 presentations. The findings support the development of classification models that can inform more individualized and effective treatment plans, improving patient outcomes in the future.
KW - classification
KW - clustering analysis
KW - critical care
KW - factor analysis of mixed data
KW - real-world data
UR - http://www.scopus.com/inward/record.url?scp=105006901034&partnerID=8YFLogxK
U2 - 10.3389/fpubh.2025.1544904
DO - 10.3389/fpubh.2025.1544904
M3 - Article
C2 - 40443932
AN - SCOPUS:105006901034
SN - 2296-2565
VL - 13
JO - Frontiers in Public Health
JF - Frontiers in Public Health
M1 - 1544904
ER -