Large structural variations in the haplotype-resolved African cassava genome

Ben N. Mansfeld, Adam Boyher, Jeffrey C. Berry, Mark Wilson, Shujun Ou, Seth Polydore, Todd P. Michael, Noah Fahlgren, Rebecca S. Bart

Research output: Contribution to journalArticlepeer-review

15 Scopus citations

Abstract

Cassava (Manihot esculenta Crantz, 2n = 36) is a global food security crop. It has a highly heterozygous genome, high genetic load, and genotype-dependent asynchronous flowering. It is typically propagated by stem cuttings and any genetic variation between haplotypes, including large structural variations, is preserved by such clonal propagation. Traditional genome assembly approaches generate a collapsed haplotype representation of the genome. In highly heterozygous plants, this results in artifacts and an oversimplification of heterozygous regions. We used a combination of Pacific Biosciences (PacBio), Illumina, and Hi-C to resolve each haplotype of the genome of a farmer-preferred cassava line, TME7 (Oko-iyawo). PacBio reads were assembled using the FALCON suite. Phase switch errors were corrected using FALCON-Phase and Hi-C read data. The ultralong-range information from Hi-C sequencing was also used for scaffolding. Comparison of the two phases revealed >5000 large haplotype-specific structural variants affecting over 8 Mb, including insertions and deletions spanning thousands of base pairs. The potential of these variants to affect allele-specific expression was further explored. RNA-sequencing data from 11 different tissue types were mapped against the scaffolded haploid assembly and gene expression data are incorporated into our existing easy-to-use web-based interface to facilitate use by the broader plant science community. These two assemblies provide an excellent means to study the effects of heterozygosity, haplotype-specific structural variation, gene hemizygosity, and allele-specific gene expression contributing to important agricultural traits and further our understanding of the genetics and domestication of cassava.

Original languageEnglish
Pages (from-to)1830-1848
Number of pages19
JournalPlant Journal
Volume108
Issue number6
DOIs
StatePublished - Dec 2021

Keywords

  • cassava
  • genome assembly
  • haplotype phasing
  • high heterozygosity
  • structural variants

Fingerprint

Dive into the research topics of 'Large structural variations in the haplotype-resolved African cassava genome'. Together they form a unique fingerprint.

Cite this