J Syst Evol, 2017, Vol. 55, Issue 2: 85-109. doi: 10.1111/jse.12233

• Review •

Relative benefits of amino-acid, codon, degeneracy, DNA, and purine-pyrimidine character coding for phylogenetic analyses of exons

Mark P. Simmons*   

  1. Department of Biology, Colorado State University, Fort Collins, CO 80523-1878, USA
  • Received: 2016-09-30 Online: 2016-11-14 Published: 2017-03-08

Abstract: Traditional methods of coding characters from exons of protein-coding genes are reviewed together with 10 more recent methods. The more recent methods collectively blur the distinction between nucleotide and amino-acid coding and enable investigators to carefully quantify the effects of different sources of phylogenetic signal, as well as their potential biases. Codon models, which explicitly model silent and replacement substitutions, are a major advance and, unlike amino-acid coding, are expected to be broadly useful for simultaneously inferring recent and ancient divergences. Degeneracy coding, wherein ambiguity codes are used to eliminate silent substitutions at the individual-nucleotide level, has clear advantages over scoring amino-acid characters. Nucleotide, codon, and amino-acid models are now directly comparable with easy-to-use programs, and widely used phylogenetics programs can analyze partitioned supermatrices that incorporate all three model types. Therefore, it should become standard practice to test among these alternative model types before conducting parametric phylogenetic analyses. An earlier study of 78 protein-coding genes from 360 green-plant plastid genomes is used as an empirical example with which to quantify the relative performance of alternative character-coding methods using five quantification measures. Codon models were selected as having the best fit to the data, yet were outperformed by nucleotide models for all five quantification measures. Third-codon positions were found to be an important source of phylogenetic signal and even outperformed analyses of first and second positions for some measures. Degeneracy coding generally performed at least as well as amino-acid coding and is arguably a more effective alternative.

Key words: character-state space, codon models, composite characters, phylogenetic signal, phylogenomics, plastomics, saturation, transcriptomics
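As a rough illustration of three of the coding schemes named in the title and abstract, the Python sketch below recodes an in-frame exon sequence by purine-pyrimidine (RY) coding, by codon position, and by a simplified form of degeneracy coding. It is not the procedure used in the article or in any published degeneracy-coding script: the function names are hypothetical, the standard genetic code is assumed, and the degeneracy step only considers single-nucleotide changes within one codon, so it is less thorough than full degeneracy coding.

```python
from itertools import product

# Standard genetic code (an assumption), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# IUPAC nucleotide ambiguity codes, keyed by the set of bases they represent.
_CODES = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "AG", "Y": "CT",
          "S": "CG", "W": "AT", "K": "GT", "M": "AC", "B": "CGT",
          "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT"}
IUPAC = {frozenset(bases): code for code, bases in _CODES.items()}


def ry_code(seq):
    """Purine-pyrimidine coding: A and G become R, C and T become Y."""
    return seq.upper().translate(str.maketrans("AGCT", "RRYY"))


def codon_positions(seq, position):
    """Return every nucleotide at codon position 1, 2, or 3 (in-frame input)."""
    return seq[position - 1::3]


def degenerate_codon(codon):
    """Simplified degeneracy coding of one codon (hypothetical helper).

    Each site is replaced by the ambiguity code for all bases that, with the
    other two sites held fixed, still encode the same amino acid.
    """
    if codon not in CODON_TABLE:  # gaps, ambiguity codes, partial codons
        return codon
    aa = CODON_TABLE[codon]
    out = []
    for i in range(3):
        same = {b for b in "ACGT"
                if CODON_TABLE[codon[:i] + b + codon[i + 1:]] == aa}
        out.append(IUPAC[frozenset(same)])
    return "".join(out)


def degeneracy_code(seq):
    """Apply the simplified degeneracy coding codon by codon."""
    seq = seq.upper()
    return "".join(degenerate_codon(seq[i:i + 3]) for i in range(0, len(seq), 3))
```

For example, `degeneracy_code("ATGCTAAAA")` recodes the Leu codon CTA as YTN and the Lys codon AAA as AAR while leaving the Met codon ATG unchanged, so a silent change such as CTA to CTG no longer registers as a character-state difference.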

