
Improving Species Identification of Ancient Mammals Based on Next-Generation Sequencing Data

Taxonomic identification based on morphology alone is often difficult for ancient remains. Therefore, universal or specific PCR amplification followed by sequencing and a BLAST (basic local alignment search tool) search has become the most frequently used genetic method for the species id...


Bibliographic Details
Main Authors: Lan, Tian Ming; Lin, Yu; Njaramba-Ngatia, Jacob; Guo, Xiao Sen; Li, Ren Gui; Li, Hai Meng; Kumar-Sahu, Sunil; Wang, Xie; Yang, Xiu Juan; Guo, Hua Bing; Xu, Wen Hao; Kristiansen, Karsten; Liu, Huan; Xu, Yan Chun
Format: Online Article Text
Language: English
Published: MDPI 2019
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6679096/
https://www.ncbi.nlm.nih.gov/pubmed/31284503
http://dx.doi.org/10.3390/genes10070509
author Lan, Tian Ming
Lin, Yu
Njaramba-Ngatia, Jacob
Guo, Xiao Sen
Li, Ren Gui
Li, Hai Meng
Kumar-Sahu, Sunil
Wang, Xie
Yang, Xiu Juan
Guo, Hua Bing
Xu, Wen Hao
Kristiansen, Karsten
Liu, Huan
Xu, Yan Chun
author_sort Lan, Tian Ming
collection PubMed
description Taxonomic identification based on morphology alone is often difficult for ancient remains. Therefore, universal or specific PCR amplification followed by sequencing and a BLAST (basic local alignment search tool) search has become the most frequently used genetic method for the species identification of biological samples, including ancient remains. However, these methods struggle with extremely ancient samples that show severe DNA fragmentation and contamination. Here, we used whole-genome sequencing data from 12 ancient samples, with ages ranging from 2.7 to 700 kya, to compare different mapping algorithms, and we tested different reference databases, mapping similarities, and query coverage values to find the method and mapping parameters that best improve the accuracy of ancient mammal species identification. The selected method and parameters were tested on 152 ancient samples, of which 150 were successfully identified. We further screened the BLAST-based mapping results according to the deamination characteristics of ancient DNA to improve ancient species identification. Our findings demonstrate a marked improvement over the standard procedures used for ancient species identification, achieved by defining mapping and filtering guidelines that identify true ancient DNA sequences. The guidelines summarized in this study could be valuable in archaeology, paleontology, evolution, and forensic science. For the convenience of the scientific community, we wrote a Perl script, called AncSid, which is available on GitHub.
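The mapping-and-filtering idea described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' AncSid script (which is written in Perl): the `Hit` record, the function names, and the 95% identity and 90% query-coverage thresholds are hypothetical placeholders, since this record does not state the values the study selected. The sketch keeps only hits that pass both thresholds, retains the best hit per read, and reports the species supported by the most reads.

```python
from collections import Counter
from typing import Iterable, NamedTuple, Optional, Tuple


class Hit(NamedTuple):
    """One alignment of a read against a reference (hypothetical record)."""
    read_id: str
    species: str
    identity: float  # percent identity of the alignment
    aln_len: int     # number of read bases covered by the alignment
    read_len: int    # total length of the read


def query_coverage(hit: Hit) -> float:
    """Percentage of the read that is covered by the alignment."""
    return 100.0 * hit.aln_len / hit.read_len


def identify_species(
    hits: Iterable[Hit],
    min_identity: float = 95.0,   # assumed threshold, not from the study
    min_coverage: float = 90.0,   # assumed threshold, not from the study
) -> Tuple[Optional[str], Counter]:
    """Filter hits, keep the best hit per read, and count reads per species."""
    best = {}
    for hit in hits:
        # Discard alignments below either threshold.
        if hit.identity < min_identity or query_coverage(hit) < min_coverage:
            continue
        # Keep only the highest-identity hit for each read.
        previous = best.get(hit.read_id)
        if previous is None or hit.identity > previous.identity:
            best[hit.read_id] = hit
    votes = Counter(hit.species for hit in best.values())
    top = votes.most_common(1)[0][0] if votes else None
    return top, votes


# Toy input with placeholder species names; not data from the study.
toy_hits = [
    Hit("r1", "species_A", 99.0, 50, 50),
    Hit("r1", "species_B", 96.0, 50, 50),
    Hit("r2", "species_A", 97.0, 45, 50),
    Hit("r3", "species_B", 99.0, 20, 50),  # fails the coverage threshold
    Hit("r4", "species_B", 90.0, 50, 50),  # fails the identity threshold
]
species, votes = identify_species(toy_hits)
print(species, dict(votes))
```

A screening step based on deamination patterns, as the abstract mentions, would be applied to the retained hits afterwards; it is left out here because the record gives no details of how that screening is performed.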
format Online
Article
Text
id pubmed-6679096
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-6679096 2019-08-19 Genes (Basel) Article
MDPI 2019-07-05 /pmc/articles/PMC6679096/ /pubmed/31284503 http://dx.doi.org/10.3390/genes10070509 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
title Improving Species Identification of Ancient Mammals Based on Next-Generation Sequencing Data
topic Article