TY - JOUR
T1 - Machine Learning Approximations to Predict Epigenetic Age Acceleration in Stroke Patients
AU - Fernández-Pérez, Isabel
AU - Jiménez-Balado, Joan
AU - Lazcano, Uxue
AU - Giralt-Steinhauer, Eva
AU - Rey Álvarez, Lucía
AU - Cuadrado-Godia, Elisa
AU - Rodríguez-Campello, Ana
AU - Macias-Gómez, Adrià
AU - Suárez-Pérez, Antoni
AU - Revert-Barberá, Anna
AU - Estragués-Gázquez, Isabel
AU - Soriano-Tarraga, Carolina
AU - Roquer, Jaume
AU - Ois, Angel
AU - Jiménez-Conde, Jordi
N1 - Publisher Copyright:
© 2023 by the authors.
PY - 2023/2
Y1 - 2023/2
N2 - Age acceleration (Age-A) is a useful tool that is able to predict a broad range of health outcomes. It is necessary to determine DNA methylation levels to estimate it, and it is known that Age-A is influenced by environmental, lifestyle, and vascular risk factors (VRF). The aim of this study is to estimate the contribution of these easily measurable factors to Age-A in patients with cerebrovascular disease (CVD), using different machine learning (ML) approximations, and try to find a more accessible model able to predict Age-A. We studied a CVD cohort of 952 patients with information about VRF, lifestyle habits, and target organ damage. We estimated Age-A using Hannum’s epigenetic clock, and trained six different models to predict Age-A: a conventional linear regression model, four ML models (elastic net regression (EN), K-Nearest neighbors, random forest, and support vector machine models), and one deep learning approximation (multilayer perceptron (MLP) model). The best-performing models were EN and MLP; although, the predictive capability was modest (R2 0.358 and 0.378, respectively). In conclusion, our results support the influence of these factors on Age-A; although, they were not enough to explain most of its variability.
AB - Age acceleration (Age-A) is a useful tool that is able to predict a broad range of health outcomes. It is necessary to determine DNA methylation levels to estimate it, and it is known that Age-A is influenced by environmental, lifestyle, and vascular risk factors (VRF). The aim of this study is to estimate the contribution of these easily measurable factors to Age-A in patients with cerebrovascular disease (CVD), using different machine learning (ML) approximations, and try to find a more accessible model able to predict Age-A. We studied a CVD cohort of 952 patients with information about VRF, lifestyle habits, and target organ damage. We estimated Age-A using Hannum’s epigenetic clock, and trained six different models to predict Age-A: a conventional linear regression model, four ML models (elastic net regression (EN), K-Nearest neighbors, random forest, and support vector machine models), and one deep learning approximation (multilayer perceptron (MLP) model). The best-performing models were EN and MLP; although, the predictive capability was modest (R2 0.358 and 0.378, respectively). In conclusion, our results support the influence of these factors on Age-A; although, they were not enough to explain most of its variability.
KW - aging
KW - epigenetic clock
KW - machine learning
KW - stroke
KW - vascular risk factors
UR - https://www.scopus.com/pages/publications/85147894367
U2 - 10.3390/ijms24032759
DO - 10.3390/ijms24032759
M3 - Article
C2 - 36769083
AN - SCOPUS:85147894367
SN - 1661-6596
VL - 24
JO - International journal of molecular sciences
JF - International journal of molecular sciences
IS - 3
M1 - 2759
ER -