TY - JOUR
T1 - Evaluating character partitioning and molecular models in plastid phylogenomics at low taxonomic levels
T2 - A case study using Amphilophium (Bignonieae, Bignoniaceae)
AU - Thode, Verônica A.
AU - Lohmann, Lúcia G.
AU - Sanmartín, Isabel
N1 - Publisher Copyright:
© 2020 Institute of Botany, Chinese Academy of Sciences
PY - 2020/11
Y1 - 2020/11
N2 - The accurate analyses of massive amounts of data obtained through next-generation sequencing depend on the selection of appropriate evolutionary models. Many plastid phylogenomic studies typically analyze plastome data as a single partition, or divided by a region, using a concatenate “supergene” approach. The effects of molecular evolutionary models and character partition strategies on plastome-based phylogenies have generally been evaluated at higher taxonomic levels in green plants. Using plastome data from 32 species of Amphilophium, a genus of Neotropical lianas, we explored potential sources of topological incongruence with different plastid genome datasets and approaches. Specifically, we evaluated the effects of compositional heterogeneity, codon usage bias, positive selection, and incomplete lineage sorting as sources of systematic error (i.e., the recovery of well-supported conflicting topologies). We compared different datasets (e.g., non-coding regions, exons, and codon-aligned and translated amino acids) using concatenated approaches under site-heterogeneous and site-homogeneous models, as well as multispecies coalescent (MSC) methods. We found incongruences in recovered phylogenetic relationships, which were mainly located in short internodes. The MSC and concatenated approaches recovered similar topologies. The analysis of GC content and codon usage bias indicated higher substitution rates and AT excess at the third codon positions, and we found evidence of positive selection in 3% of amino acid sites. There were no significant differences among species in site biochemical profiles. We argue that the selection of appropriate partition strategies and evolutionary models is important to increase accuracy in phylogenetic relationships, even when using plastome datasets, which is still the primarily used genome in plant phylogenetics.
AB - The accurate analyses of massive amounts of data obtained through next-generation sequencing depend on the selection of appropriate evolutionary models. Many plastid phylogenomic studies typically analyze plastome data as a single partition, or divided by a region, using a concatenate “supergene” approach. The effects of molecular evolutionary models and character partition strategies on plastome-based phylogenies have generally been evaluated at higher taxonomic levels in green plants. Using plastome data from 32 species of Amphilophium, a genus of Neotropical lianas, we explored potential sources of topological incongruence with different plastid genome datasets and approaches. Specifically, we evaluated the effects of compositional heterogeneity, codon usage bias, positive selection, and incomplete lineage sorting as sources of systematic error (i.e., the recovery of well-supported conflicting topologies). We compared different datasets (e.g., non-coding regions, exons, and codon-aligned and translated amino acids) using concatenated approaches under site-heterogeneous and site-homogeneous models, as well as multispecies coalescent (MSC) methods. We found incongruences in recovered phylogenetic relationships, which were mainly located in short internodes. The MSC and concatenated approaches recovered similar topologies. The analysis of GC content and codon usage bias indicated higher substitution rates and AT excess at the third codon positions, and we found evidence of positive selection in 3% of amino acid sites. There were no significant differences among species in site biochemical profiles. We argue that the selection of appropriate partition strategies and evolutionary models is important to increase accuracy in phylogenetic relationships, even when using plastome datasets, which is still the primarily used genome in plant phylogenetics.
KW - codon usage bias
KW - compositional heterogeneity
KW - gene tree incongruence
KW - Neotropical lianas
KW - NGS
KW - plastome
KW - positive selection
KW - species-level phylogenomics
UR - https://www.scopus.com/pages/publications/85082410391
U2 - 10.1111/jse.12579
DO - 10.1111/jse.12579
M3 - Article
AN - SCOPUS:85082410391
SN - 1674-4918
VL - 58
SP - 1071
EP - 1089
JO - Journal of Systematics and Evolution
JF - Journal of Systematics and Evolution
IS - 6
ER -