Cargando…

FACEPAI: a script for fast and consistent environmental DNA processing and identification

BACKGROUND: The use of environmental DNA (eDNA) has become an increasing important tool in environmental surveys and taxonomic research. High throughput sequencing of samples from soil, water, sediment, trap alcohol or bulk samples generate large amount of barcode sequences that can be assigned to a...

Descripción completa

Detalles Bibliográficos
Autor principal: Wahlberg, Emma
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6896590/
https://www.ncbi.nlm.nih.gov/pubmed/31810454
http://dx.doi.org/10.1186/s12898-019-0269-1
_version_ 1783476812492308480
author Wahlberg, Emma
author_facet Wahlberg, Emma
author_sort Wahlberg, Emma
collection PubMed
description BACKGROUND: The use of environmental DNA (eDNA) has become an increasing important tool in environmental surveys and taxonomic research. High throughput sequencing of samples from soil, water, sediment, trap alcohol or bulk samples generate large amount of barcode sequences that can be assigned to a known taxon with a reference sequence. This process can however be bioinformatic cumbersome and time consuming, especially for researchers without specialised bioinformatic training. A number of different software packages and pipelines are available, but require training in preparation of data, running of analysis and formatting results. Comparison of results produced by different packages are often difficult. RESULTS: FACEPIE is an open source script dependant on a few open source applications that provides a pipeline for rapid analysis and taxonomic assignment of environmental DNA samples. It requires an initial formatting of a reference database, using the script CaPReSe, and a configuration file and can thereafter be run to process any number of samples in succession using the same settings and references. Both configuration and executing are designed to demand as little hands on work as possible, while assuring repeatable results. CONCLUSION: The demonstration using example data from real environmental samples provides results in a time span ranging from less than 3 min to just above 15 min depending on the numbers of sequences to process. The memory usage is below 2 GB on a desktop PC. FACEPAI and CaPReSe provides a pipeline for analysing a large number of eDNA samples on common equipment, with little bioinformatic skills necessary, for subsequent ecological and taxonomical studies.
format Online
Article
Text
id pubmed-6896590
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-68965902019-12-11 FACEPAI: a script for fast and consistent environmental DNA processing and identification Wahlberg, Emma BMC Ecol Software BACKGROUND: The use of environmental DNA (eDNA) has become an increasing important tool in environmental surveys and taxonomic research. High throughput sequencing of samples from soil, water, sediment, trap alcohol or bulk samples generate large amount of barcode sequences that can be assigned to a known taxon with a reference sequence. This process can however be bioinformatic cumbersome and time consuming, especially for researchers without specialised bioinformatic training. A number of different software packages and pipelines are available, but require training in preparation of data, running of analysis and formatting results. Comparison of results produced by different packages are often difficult. RESULTS: FACEPIE is an open source script dependant on a few open source applications that provides a pipeline for rapid analysis and taxonomic assignment of environmental DNA samples. It requires an initial formatting of a reference database, using the script CaPReSe, and a configuration file and can thereafter be run to process any number of samples in succession using the same settings and references. Both configuration and executing are designed to demand as little hands on work as possible, while assuring repeatable results. CONCLUSION: The demonstration using example data from real environmental samples provides results in a time span ranging from less than 3 min to just above 15 min depending on the numbers of sequences to process. The memory usage is below 2 GB on a desktop PC. FACEPAI and CaPReSe provides a pipeline for analysing a large number of eDNA samples on common equipment, with little bioinformatic skills necessary, for subsequent ecological and taxonomical studies. BioMed Central 2019-12-06 /pmc/articles/PMC6896590/ /pubmed/31810454 http://dx.doi.org/10.1186/s12898-019-0269-1 Text en © The Author(s) 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Software
Wahlberg, Emma
FACEPAI: a script for fast and consistent environmental DNA processing and identification
title FACEPAI: a script for fast and consistent environmental DNA processing and identification
title_full FACEPAI: a script for fast and consistent environmental DNA processing and identification
title_fullStr FACEPAI: a script for fast and consistent environmental DNA processing and identification
title_full_unstemmed FACEPAI: a script for fast and consistent environmental DNA processing and identification
title_short FACEPAI: a script for fast and consistent environmental DNA processing and identification
title_sort facepai: a script for fast and consistent environmental dna processing and identification
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6896590/
https://www.ncbi.nlm.nih.gov/pubmed/31810454
http://dx.doi.org/10.1186/s12898-019-0269-1
work_keys_str_mv AT wahlbergemma facepaiascriptforfastandconsistentenvironmentaldnaprocessingandidentification