J Syst Evol

• Research Article • Previous Articles    

Missing Data and Model Selection in Phylogenomics: A Re-Evaluation of Cicadomorpha (Hemiptera: Auchenorrhyncha) Superfamily Level Relationships

Bin Yan1,2,3, Christopher H. Dietrich4, Xiao-Fei Yu3,5, Yan Jiang1,3,6, Ren-Huai Dai1, Shi-Yu Du7, Chen-Yang Cai8, Mao-Fa Yang1,3,5,*, Feng Zhang7,*   

  1. 1Institute of Entomology, Guizhou University; Guizhou Provincial Key Laboratory for Agricultural Pest Management of the Mountainous Region, Guiyang 550025, China
    2Guizhou Provincial Key Laboratory for Mountainous Environmental Information and Ecological Protection, Guizhou Normal University, Guiyang 550001, China
    3Guizhou Natural Enemy Propagation Center, Ministry of Agriculture and Rural Affairs, Guizhou University, Guiyang 550025, China
    4Illinois Natural History Survey, Prairie Research Institute, University of Illinois, Champaign, IL, 61820, USA
    5College of Tobacco Science, Guizhou University, Guiyang 550025, PChina
    6College of Pharmacy Science, Guiyang Healthcare Vocational University, Guiyang 550081, China
    7Department of Entomology, College of Plant Protection, Nanjing Agricultural University, Nanjing 210095, China
    8State Key Laboratory of Palaeobiology and Stratigraphy, Center for Excellence in Life and Paleoenvironment, Nanjing Institute of Geology and Palaeontology, Chinese Academy of Sciences, Nanjing 210008, China

    *Authors for correspondence. Mao-Fa Yang. E-mail: gdgdly@126.com; Feng Zhang. E-mail: fzhang@njau.edu.cn
  • Received:2024-10-05 Accepted:2025-04-28
  • Supported by:
    This study was supported by the Basic Research Program (the natural science projects), Guizhou Province, China (grant number ZK[2022]125), and the National Natural Science Foundation of China (grant numbers 32300380, 32360125, 32360393).

Abstract: Cicadomorpha, the most diverse infraorder of hemipteran insects, is a group of plant sap-sucking insects consisting of three monophyletic superfamilies: Cercopoidea, Cicadoidea, and Membracoidea. This study aims to resolve the controversy surrounding the phylogenetic relationships between these superfamilies, specifically whether Cercopoidea is more closely related to Cicadoidea (topology H1) or Membracoidea (topology H2). To tackle this issue, we assembled four matrices, including three amino acid datasets and one nucleotide dataset, and performed comprehensive phylogenomic analyses using both coalescent and concatenation methods. Our results showed that the amino acid matrix with low ratio of missing data-based analyses strongly supported topology H2, indicating that Cercopoidea is sister to Membracoidea. However, the other three matrices yielded conflicting results. Interestingly, when site-heterogeneous models were used, the results flipped, and the datasets that previously supported topology H1 supported topology H2. Further analyses, including model comparisons, tree topology tests, and phylogenetic analyses without an outgroup, also supported the conclusion that Cercopoidea is more closely related to Membracoidea. The study highlights the importance of modelling among site compositional heterogeneity in phylogenomic analyses to mitigate systematic errors. Additionally, we noted that datasets with high missing data ratios may lead to conflicting phylogenetic relationships. Our study provides strong evidence that Cercopoidea is sister to Membracoidea and highlights the importance of careful methodological considerations in phylogenetic analyses.

Key words: gene property analysis, phylogenomic analysis, site-heterogeneous model, systematic error, taxa occupancy.