Challenges of sequencing human genomes

Daniel C. Koboldt, Li Ding, Elaine R. Mardis, Richard K. Wilson

Research output: Contribution to journal › Review article › peer-review

109 Scopus citations

Abstract

Massively parallel sequencing technologies continue to alter the study of human genetics. As the cost of sequencing declines, next-generation sequencing (NGS) instruments and datasets will become increasingly accessible to the wider research community. Investigators are understandably eager to harness the power of these new technologies. Sequencing human genomes on these platforms, however, presents numerous production and bioinformatics challenges. Production issues such as sample contamination, library chimaeras and variable run quality have become increasingly problematic in the transition from technology development lab to production floor. Analysis of NGS data, too, remains challenging, particularly given the short read lengths (35-250 bp) and sheer volume of data. The development of streamlined, highly automated pipelines for data analysis is critical for the transition from technology adoption to accelerated research and publication. This review aims to describe the state of current NGS technologies, as well as the strategies that enable NGS users to characterize the full spectrum of DNA sequence variation in humans.

Original language: English
Article number: bbq016
Pages (from-to): 484-498
Number of pages: 15
Journal: Briefings in Bioinformatics
Volume: 11
Issue number: 5
State: Published - Sep 20 2010

Keywords

  • Human genome
  • Massively parallel sequencing
  • Next generation sequencing
  • Short read alignment
  • Variant detection
  • Whole genome sequencing
