Abstract

Summary: Karyotype data are the most common form of genetic data that is regularly used clinically. They are collected as part of the standard of care in many diseases, particularly in pediatric and cancer medicine contexts. Karyotypes are represented in a unique text-based format, with a syntax defined by the International System for human Cytogenetic Nomenclature (ISCN). While human-readable, ISCN is not intrinsically machine-readable. This limitation has prevented the full use of complex karyotype data in discovery science use cases. To enhance the utility and value of karyotype data, we developed a tool named CytoGPS. CytoGPS first parses ISCN karyotypes into a machine-readable format. It then converts the ISCN karyotype into a binary Loss-Gain-Fusion (LGF) model, which represents all cytogenetic abnormalities as combinations of loss, gain, or fusion events, in a format that is analyzable using modern computational methods. Such data is then made available for comprehensive 'downstream' analyses that previously were not feasible.

Original languageEnglish
Pages (from-to)5365-5366
Number of pages2
JournalBioinformatics
Volume35
Issue number24
DOIs
StatePublished - Dec 15 2019

Fingerprint

Dive into the research topics of 'CytoGPS: A web-enabled karyotype analysis tool for cytogenetics'. Together they form a unique fingerprint.

Cite this