J Syst Evol ›› 2020, Vol. 58 ›› Issue (6): 1071-1089.DOI: 10.1111/jse.12579

• Research Articles • Previous Articles     Next Articles

Evaluating character partitioning and molecular models in plastid phylogenomics at low taxonomic levels: A case study using Amphilophium (Bignonieae, Bignoniaceae)

Verônica A. Thode1,2*, Lúcia G. Lohmann2, and Isabel Sanmartín3*   

  1. 1 Instituto de Biociências, Programa de Pós‐Graduação em Botânica, Universidade Federal do Rio Grande do Sul, Av. Bento Gonçalves 9500, Prédio 43433, Porto Alegre, RS 91501‐970, Brazil
    2 Departamento de Botânica, Instituto de Biociências, Universidade de São Paulo, Rua do Matão 277, São Paulo, SP 05508‐090, Brazil
    3 Real Jardín Botánico (RJB), CSIC, Plaza de Murillo, 2, Madrid E‐28014, Spain
  • Received:2019-05-27 Accepted:2020-02-21 Online:2020-02-25 Published:2020-11-01

Abstract:

The accurate analyses of massive amounts of data obtained through next‐generation sequencing depend on the selection of appropriate evolutionary models. Many plastid phylogenomic studies typically analyze plastome data as a single partition, or divided by a region, using a concatenate “supergene” approach. The effects of molecular evolutionary models and character partition strategies on plastome‐based phylogenies have generally been evaluated at higher taxonomic levels in green plants. Using plastome data from 32 species of Amphilophium, a genus of Neotropical lianas, we explored potential sources of topological incongruence with different plastid genome datasets and approaches. Specifically, we evaluated the effects of compositional heterogeneity, codon usage bias, positive selection, and incomplete lineage sorting as sources of systematic error (i.e., the recovery of well‐supported conflicting topologies). We compared different datasets (e.g., non‐coding regions, exons, and codon‐aligned and translated amino acids) using concatenated approaches under site‐heterogeneous and site‐homogeneous models, as well as multispecies coalescent (MSC) methods. We found incongruences in recovered phylogenetic relationships, which were mainly located in short internodes. The MSC and concatenated approaches recovered similar topologies. The analysis of GC content and codon usage bias indicated higher substitution rates and AT excess at the third codon positions, and we found evidence of positive selection in 3% of amino acid sites. There were no significant differences among species in site biochemical profiles. We argue that the selection of appropriate partition strategies and evolutionary models is important to increase accuracy in phylogenetic relationships, even when using plastome datasets, which is still the primarily used genome in plant phylogenetics.

Key words: codon usage bias, compositional heterogeneity, gene tree incongruence, Neotropical lianas, NGS, plastome, positive selection, species‐level phylogenomics