Abstract
The National Cancer Institute's Clinical Proteomic Tumor Analysis Consortium (CPTAC) investigates tumors from a proteogenomic perspective, creating rich multi-omics datasets connecting genomic aberrations to cancer phenotypes. To facilitate pan-cancer investigations, we have generated harmonized genomic, transcriptomic, proteomic, and clinical data for >1000 tumors in 10 cohorts to create a cohesive and powerful dataset for scientific discovery. We outline efforts by the CPTAC pan-cancer working group in data harmonization, data dissemination, and computational resources for aiding biological discoveries. We also discuss challenges for multi-omics data integration and analysis, specifically the unique challenges of working with both nucleotide sequencing and mass spectrometry proteomics data.
Original language | English |
---|---|
Pages (from-to) | 1397-1406 |
Number of pages | 10 |
Journal | Cancer Cell |
Volume | 41 |
Issue number | 8 |
DOIs | |
State | Published - Aug 14 2023 |
Keywords
- CPTAC
- data harmonization
- multi-omics
- open data
- pan-cancer
- proteogenomics