TY - JOUR
T1 - Pash 3.0
T2 - A versatile software package for read mapping and integrative analysis of genomic and epigenomic variation using massively parallel DNA sequencing
AU - Coarfa, Cristian
AU - Yu, Fuli
AU - Miller, Christopher A.
AU - Chen, Zuozhou
AU - Harris, R. Alan
AU - Milosavljevic, Aleksandar
N1 - Funding Information:
We thank Richard A. Gibbs, Wei Li, Gordon Robertson, and David R. Deiros for their insightful comments and suggestions. We thank Andrew Jackson for his careful review of the manuscript. This work was partially funded by the National Human Genome Research Institute, National Institutes of Health (NIH), under the grant 5R01HG004009, and the NIH Epigenomics Roadmap grant 5U01DA025956 to AM.
PY - 2010/11/23
Y1 - 2010/11/23
N2 - Background: Massively parallel sequencing readouts of epigenomic assays are enabling integrative genome-wide analyses of genomic and epigenomic variation. Pash 3.0 performs sequence comparison and read mapping and can be employed as a module within diverse configurable analysis pipelines, including ChIP-Seq and methylome mapping by whole-genome bisulfite sequencing.Results: Pash 3.0 generally matches the accuracy and speed of niche programs for fast mapping of short reads, and exceeds their performance on longer reads generated by a new generation of massively parallel sequencing technologies. By exploiting longer read lengths, Pash 3.0 maps reads onto the large fraction of genomic DNA that contains repetitive elements and polymorphic sites, including indel polymorphisms.Conclusions: We demonstrate the versatility of Pash 3.0 by analyzing the interaction between CpG methylation, CpG SNPs, and imprinting based on publicly available whole-genome shotgun bisulfite sequencing data. Pash 3.0 makes use of gapped k-mer alignment, a non-seed based comparison method, which is implemented using multi-positional hash tables. This allows Pash 3.0 to run on diverse hardware platforms, including individual computers with standard RAM capacity, multi-core hardware architectures and large clusters.
AB - Background: Massively parallel sequencing readouts of epigenomic assays are enabling integrative genome-wide analyses of genomic and epigenomic variation. Pash 3.0 performs sequence comparison and read mapping and can be employed as a module within diverse configurable analysis pipelines, including ChIP-Seq and methylome mapping by whole-genome bisulfite sequencing.Results: Pash 3.0 generally matches the accuracy and speed of niche programs for fast mapping of short reads, and exceeds their performance on longer reads generated by a new generation of massively parallel sequencing technologies. By exploiting longer read lengths, Pash 3.0 maps reads onto the large fraction of genomic DNA that contains repetitive elements and polymorphic sites, including indel polymorphisms.Conclusions: We demonstrate the versatility of Pash 3.0 by analyzing the interaction between CpG methylation, CpG SNPs, and imprinting based on publicly available whole-genome shotgun bisulfite sequencing data. Pash 3.0 makes use of gapped k-mer alignment, a non-seed based comparison method, which is implemented using multi-positional hash tables. This allows Pash 3.0 to run on diverse hardware platforms, including individual computers with standard RAM capacity, multi-core hardware architectures and large clusters.
UR - https://www.scopus.com/pages/publications/78549291439
U2 - 10.1186/1471-2105-11-572
DO - 10.1186/1471-2105-11-572
M3 - Article
C2 - 21092284
AN - SCOPUS:78549291439
SN - 1471-2105
VL - 11
JO - BMC bioinformatics
JF - BMC bioinformatics
M1 - 572
ER -