TY - JOUR
T1 - Leveraging sequences missing from the human genome to diagnose cancer
AU - Georgakopoulos-Soares, Ilias
AU - Yizhar-Barnea, Ofer
AU - Mouratidis, Ioannis
AU - Chan, Candace S.Y.
AU - Patsakis, Michail
AU - Nayak, Akshatha
AU - Bradley, Rachael
AU - Mahajan, Mayank
AU - Sims, Jasmine
AU - Cintron, Dianne Laboy
AU - Easterlin, Ryder
AU - Kim, Julia S.
AU - Chen, Emmalyn
AU - Pineda, Geovanni
AU - Parada, Guillermo E.
AU - Witte, John S.
AU - Maher, Christopher A.
AU - Feng, Felix
AU - Vathiotis, Ioannis
AU - Syrigos, Nikolaos
AU - Panagiotou, Emmanouil
AU - Charpidou, Andriani
AU - Syrigos, Konstantinos
AU - Chapman, Jocelyn
AU - Kvale, Mark
AU - Hemberg, Martin
AU - Ahituv, Nadav
N1 - Publisher Copyright:
© The Author(s) 2025.
PY - 2025/12
Y1 - 2025/12
N2 - Background: Cancer diagnosis using cell-free DNA (cfDNA) has the potential to improve treatment and survival but has several technical limitations. Methods: In this study, we developed a prediction model based on neomers, DNA sequences 13–17 nucleotides in length that are predominantly absent from the genomes of healthy individuals and are created by tumor-associated mutations. Results: We show that neomer-based classifiers can accurately detect cancer, including early stages, and distinguish subtypes and features. Analysis of 2577 cancer genomes from 21 cancer types shows that neomers can distinguish tumor types with higher accuracy than state-of-the-art methods. Generation and analysis of 465 cfDNA whole-genome sequences demonstrates that neomers can precisely detect lung and ovarian cancer, including early stages, with an area under the curve ranging from 0.89 to 0.94. By testing various promoters or over 9000 candidate enhancer sequences with massively parallel reporter assays, we show that neomers can identify cancer-associated mutations that alter regulatory activity. Conclusions: Combined, our results identify a sensitive, specific, and simple cancer diagnostic tool that can also identify cancer-associated mutations in gene regulatory elements.
AB - Background: Cancer diagnosis using cell-free DNA (cfDNA) has the potential to improve treatment and survival but has several technical limitations. Methods: In this study, we developed a prediction model based on neomers, DNA sequences 13–17 nucleotides in length that are predominantly absent from the genomes of healthy individuals and are created by tumor-associated mutations. Results: We show that neomer-based classifiers can accurately detect cancer, including early stages, and distinguish subtypes and features. Analysis of 2577 cancer genomes from 21 cancer types shows that neomers can distinguish tumor types with higher accuracy than state-of-the-art methods. Generation and analysis of 465 cfDNA whole-genome sequences demonstrates that neomers can precisely detect lung and ovarian cancer, including early stages, with an area under the curve ranging from 0.89 to 0.94. By testing various promoters or over 9000 candidate enhancer sequences with massively parallel reporter assays, we show that neomers can identify cancer-associated mutations that alter regulatory activity. Conclusions: Combined, our results identify a sensitive, specific, and simple cancer diagnostic tool that can also identify cancer-associated mutations in gene regulatory elements.
UR - https://www.scopus.com/pages/publications/105013812777
U2 - 10.1038/s43856-025-01067-3
DO - 10.1038/s43856-025-01067-3
M3 - Article
C2 - 40841759
AN - SCOPUS:105013812777
SN - 2730-664X
VL - 5
JO - Communications Medicine
JF - Communications Medicine
IS - 1
M1 - 363
ER -