TY - JOUR
T1 - Computational and analytical framework for small RNA profiling by high-throughput sequencing
AU - Fahlgren, Noah
AU - Sullivan, Christopher M.
AU - Kasschau, Kristin D.
AU - Chapman, Elisabeth J.
AU - Cumbie, Jason S.
AU - Montgomery, Taiowa A.
AU - Gilbert, Sunny D.
AU - Dasenko, Mark
AU - Backman, Tyler W.H.
AU - Givan, Scott A.
AU - Carrington, James C.
PY - 2009/5
Y1 - 2009/5
N2 - The advent of high-throughput sequencing (HTS) methods has enabled direct approaches to quantitatively profile small RNA populations. However, these methods have been limited by several factors, including representational artifacts and lack of established statistical methods of analysis. Furthermore, massive HTS data sets present new problems related to data processing and mapping to a reference genome. Here, we show that cluster-based sequencing-by-synthesis technology is highly reproducible as a quantitative profiling tool for several classes of small RNA from Arabidopsis thaliana. We introduce the use of synthetic RNA oligoribonucleotide standards to facilitate objective normalization between HTS data sets, and adapt icroarray-type methods for statistical analysis of multiple samples. These methods were tested successfully using mutants withsmall RNA biogenesis (mi RNA-defective dcl 1 mutant and si RNA-defective dcl 2 dcl 3 dcl 4 triple mutant) or effector protein (ago 1 mutant) deficiencies. Computational methods were also developed to rapidly and accurately parse, quantify, and map small RNA data.
AB - The advent of high-throughput sequencing (HTS) methods has enabled direct approaches to quantitatively profile small RNA populations. However, these methods have been limited by several factors, including representational artifacts and lack of established statistical methods of analysis. Furthermore, massive HTS data sets present new problems related to data processing and mapping to a reference genome. Here, we show that cluster-based sequencing-by-synthesis technology is highly reproducible as a quantitative profiling tool for several classes of small RNA from Arabidopsis thaliana. We introduce the use of synthetic RNA oligoribonucleotide standards to facilitate objective normalization between HTS data sets, and adapt icroarray-type methods for statistical analysis of multiple samples. These methods were tested successfully using mutants withsmall RNA biogenesis (mi RNA-defective dcl 1 mutant and si RNA-defective dcl 2 dcl 3 dcl 4 triple mutant) or effector protein (ago 1 mutant) deficiencies. Computational methods were also developed to rapidly and accurately parse, quantify, and map small RNA data.
KW - CASHX
KW - Oligoribonucleotide standards
KW - SAM-seq
KW - Sequencing-by-synthesis
KW - Small rNA
KW - Statistical methods
UR - http://www.scopus.com/inward/record.url?scp=65349094759&partnerID=8YFLogxK
U2 - 10.1261/rna.1473809
DO - 10.1261/rna.1473809
M3 - Article
C2 - 19307293
AN - SCOPUS:65349094759
SN - 1355-8382
VL - 15
SP - 992
EP - 1002
JO - RNA
JF - RNA
IS - 5
ER -