Whole-genome sequencing and variant discovery in C. elegans

La Deana W. Hillier, Gabor T. Marth, Aaron R. Quinlan, David Dooling, Ginger Fewell, Derek Barnett, Paul Fox, Jarret I. Glasscock, Matthew Hickenbotham, Weichun Huang, Vincent J. Magrini, Ryan J. Richt, Sacha N. Sander, Donald A. Stewart, Michael Stromberg, Eric F. Tsung, Todd Wylie, Tim Schedl, Richard K. Wilson, Elaine R. Mardis

Research output: Contribution to journalArticlepeer-review

350 Scopus citations

Abstract

Massively parallel sequencing instruments enable rapid and inexpensive DNA sequence data production. Because these instruments are new, their data require characterization with respect to accuracy and utility. To address this, we sequenced a Caernohabditis elegans N2 Bristol strain isolate using the Solexa Sequence Analyzer, and compared the reads to the reference genome to characterize the data and to evaluate coverage and representation. Massively parallel sequencing facilitates strain-to-reference comparison for genome-wide sequence variant discovery. Owing to the short-read-length sequences produced, we developed a revised approach to determine the regions of the genome to which short reads could be uniquely mapped. We then aligned Solexa reads from C. elegans strain CB4858 to the reference, and screened for single-nucleotide polymorphisms (SNPs) and small indels. This study demonstrates the utility of massively parallel short read sequencing for whole genome resequencing and for accurate discovery of genome-wide polymorphisms.

Original languageEnglish
Pages (from-to)183-188
Number of pages6
JournalNature Methods
Volume5
Issue number2
DOIs
StatePublished - Feb 2008

Fingerprint

Dive into the research topics of 'Whole-genome sequencing and variant discovery in C. elegans'. Together they form a unique fingerprint.

Cite this