TY - JOUR
T1 - DeepH&M
T2 - Estimating single-CpG hydroxymethylation and methylation levels from enrichment and restriction enzyme sequencing methods
AU - He, Yu
AU - Jang, Hyo Sik
AU - Xing, Xiaoyun
AU - Li, Daofeng
AU - Vasek, Michael J.
AU - Dougherty, Joseph D.
AU - Wang, Ting
N1 - Publisher Copyright:
Copyright © 2020 The Authors, some rights reserved.
PY - 2020/7
Y1 - 2020/7
N2 - Increased appreciation of 5-hydroxymethylcytosine (5hmC) as a stable epigenetic mark, which defines cell identity and disease progress, has engendered a need for cost-effective, but high-resolution, 5hmC mapping technology. Current enrichment-based technologies provide cheap but low-resolution and relative enrichment of 5hmC levels, while single-base resolution methods can be prohibitively expensive to scale up to large experiments. To address this problem, we developed a deep learning-based method, "DeepH&M,"which integrates enrichment and restriction enzyme sequencing methods to simultaneously estimate absolute hydroxymethylation and methylation levels at single-CpG resolution. Using 7-week-old mouse cerebellum data for training the DeepH&M model, we demonstrated that the 5hmC and 5mC levels predicted by DeepH&M were in high concordance with whole-genome bisulfite-based approaches. The DeepH&M model can be applied to 7-week-old frontal cortex and 79-week-old cerebellum, revealing the robust generalizability of this method to other tissues from various biological time points.
AB - Increased appreciation of 5-hydroxymethylcytosine (5hmC) as a stable epigenetic mark, which defines cell identity and disease progress, has engendered a need for cost-effective, but high-resolution, 5hmC mapping technology. Current enrichment-based technologies provide cheap but low-resolution and relative enrichment of 5hmC levels, while single-base resolution methods can be prohibitively expensive to scale up to large experiments. To address this problem, we developed a deep learning-based method, "DeepH&M,"which integrates enrichment and restriction enzyme sequencing methods to simultaneously estimate absolute hydroxymethylation and methylation levels at single-CpG resolution. Using 7-week-old mouse cerebellum data for training the DeepH&M model, we demonstrated that the 5hmC and 5mC levels predicted by DeepH&M were in high concordance with whole-genome bisulfite-based approaches. The DeepH&M model can be applied to 7-week-old frontal cortex and 79-week-old cerebellum, revealing the robust generalizability of this method to other tissues from various biological time points.
UR - http://www.scopus.com/inward/record.url?scp=85088880083&partnerID=8YFLogxK
U2 - 10.1126/sciadv.aba0521
DO - 10.1126/sciadv.aba0521
M3 - Article
C2 - 32937429
AN - SCOPUS:85088880083
SN - 2375-2548
VL - 6
JO - Science Advances
JF - Science Advances
IS - 27
M1 - eaba0521
ER -