Cargando…

Current State of DNA Barcoding of Sciaroidea (Diptera)—Highlighting the Need to Build the Reference Library

SIMPLE SUMMARY: DNA barcoding is a method by which a specific region of the mitochondrial genome is used to quantify genetic distances within and between animal species. Most DNA barcodes of the world are assembled on the Barcode of Life online database BoldSystems (BOLD). There, machine-generated b...

Descripción completa

Detalles Bibliográficos
Autor principal: Kjærandsen, Jostein
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8879535/
https://www.ncbi.nlm.nih.gov/pubmed/35206721
http://dx.doi.org/10.3390/insects13020147
Descripción
Sumario:SIMPLE SUMMARY: DNA barcoding is a method by which a specific region of the mitochondrial genome is used to quantify genetic distances within and between animal species. Most DNA barcodes of the world are assembled on the Barcode of Life online database BoldSystems (BOLD). There, machine-generated barcode index numbers (BINs) are automatically assigned to clusters of specimens thought to represent species. I review the current state of DNA barcoding of the superfamily Sciaroidea, a diverse insect group consisting of close to 16,000 described fly species in eight families. To date, over 1.2 million specimens of Sciaroidea have been barcoded and the 56,648 assigned BINs on BOLD already represent 3.5 times the number of described species. Still, 95% of the BINs have currently no associated scientific name and very little effort has been put into building a quality-checked reference library where named species are linked to the BINs on BOLD. In the Nordic region, however, substantial progress is made towards building a complete reference library. While DNA barcoding has tremendous potential for advancing the knowledge for many diverse groups of insects, its potential will never be fully reached absent more engagement of trained taxonomists to build voucher collections, curate the reference libraries, and describe new species. ABSTRACT: DNA barcoding has tremendous potential for advancing species knowledge for many diverse groups of insects, potentially paving way for machine identification and semi-automated monitoring of whole insect faunas. Here, I review the current state of DNA barcoding of the superfamily Sciaroidea (Diptera), a diverse group consisting of eight understudied fly families where the described species in the world makes up some 10% (≈16,000 species) of all Diptera. World data of Sciaroidea were extracted from the Barcode of Life online database BoldSystems (BOLD) and contrasted with results and experiences from a Nordic project to build the reference library. Well over 1.2 million (1,224,877) Sciaroidea specimens have been submitted for barcoding, giving barcode-compliant sequences resulting in 56,648 so-called barcode index numbers (BINs, machine-generated proxies for species). Although the BINs on BOLD already represent 3.5 times the number of described species, merely some 2850 named species (described or interim names, 5% of the BINs) currently have been assigned a BIN. The other 95% remain as dark taxa figuring in many frontier publications as statistics representing proxies for species diversity within a family. In the Nordic region, however, substantial progress has been made towards building a complete reference library, currently making up 55% of all named Sciaroidea BINs on BOLD. Another major source (31%) of named Sciaroidea BINs on BOLD comes from COI sequences mined from GenBank, generated through phylogenetic and integrative studies outside of BOLD. Building a quality reference library for understudied insects such as Sciaroidea requires heavy investment, both pre sequence and post sequence, by trained taxonomists to build and curate voucher collections, to continually improve the quality of the data and describe new species. Only when the BINs are properly calibrated by a rigorously quality-checked reference library can the great potential of both classical taxonomic barcoding, metabarcoding, and eDNA ecology be realized.