Cargando…

RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets

For the past 20 years, the recombination detection program (RDP) project has focused on the development of a fast, flexible, and easy to use Windows-based recombination analysis tool. Whereas previous versions of this tool have relied on considerable user-mediated verification of detected recombinat...

Descripción completa

Detalles Bibliográficos
Autores principales: Martin, Darren P, Varsani, Arvind, Roumagnac, Philippe, Botha, Gerrit, Maslamoney, Suresh, Schwab, Tiana, Kelz, Zena, Kumar, Venkatesh, Murrell, Ben
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8062008/
https://www.ncbi.nlm.nih.gov/pubmed/33936774
http://dx.doi.org/10.1093/ve/veaa087
_version_ 1783681680036331520
author Martin, Darren P
Varsani, Arvind
Roumagnac, Philippe
Botha, Gerrit
Maslamoney, Suresh
Schwab, Tiana
Kelz, Zena
Kumar, Venkatesh
Murrell, Ben
author_facet Martin, Darren P
Varsani, Arvind
Roumagnac, Philippe
Botha, Gerrit
Maslamoney, Suresh
Schwab, Tiana
Kelz, Zena
Kumar, Venkatesh
Murrell, Ben
author_sort Martin, Darren P
collection PubMed
description For the past 20 years, the recombination detection program (RDP) project has focused on the development of a fast, flexible, and easy to use Windows-based recombination analysis tool. Whereas previous versions of this tool have relied on considerable user-mediated verification of detected recombination events, the latest iteration, RDP5, is automated enough that it can be integrated within analysis pipelines and run without any user input. The main innovation enabling this degree of automation is the implementation of statistical tests to identify recombination signals that could be attributable to evolutionary processes other than recombination. The additional analysis time required for these tests has been offset by algorithmic improvements throughout the program such that, relative to RDP4, RDP5 will still run up to five times faster and be capable of analyzing alignments containing twice as many sequences (up to 5000) that are five times longer (up to 50 million sites). For users wanting to remove signals of recombination from their datasets before using them for downstream phylogenetics-based molecular evolution analyses, RDP5 can disassemble detected recombinant sequences into their constituent parts and output a variety of different recombination-free datasets in an array of different alignment formats. For users that are interested in exploring the recombination history of their datasets, all the manual verification, data management and data visualization components of RDP5 have been extensively updated to minimize the amount of time needed by users to individually verify and refine the program’s interpretation of each of the individual recombination events that it detects.
format Online
Article
Text
id pubmed-8062008
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-80620082021-04-29 RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets Martin, Darren P Varsani, Arvind Roumagnac, Philippe Botha, Gerrit Maslamoney, Suresh Schwab, Tiana Kelz, Zena Kumar, Venkatesh Murrell, Ben Virus Evol Resources For the past 20 years, the recombination detection program (RDP) project has focused on the development of a fast, flexible, and easy to use Windows-based recombination analysis tool. Whereas previous versions of this tool have relied on considerable user-mediated verification of detected recombination events, the latest iteration, RDP5, is automated enough that it can be integrated within analysis pipelines and run without any user input. The main innovation enabling this degree of automation is the implementation of statistical tests to identify recombination signals that could be attributable to evolutionary processes other than recombination. The additional analysis time required for these tests has been offset by algorithmic improvements throughout the program such that, relative to RDP4, RDP5 will still run up to five times faster and be capable of analyzing alignments containing twice as many sequences (up to 5000) that are five times longer (up to 50 million sites). For users wanting to remove signals of recombination from their datasets before using them for downstream phylogenetics-based molecular evolution analyses, RDP5 can disassemble detected recombinant sequences into their constituent parts and output a variety of different recombination-free datasets in an array of different alignment formats. For users that are interested in exploring the recombination history of their datasets, all the manual verification, data management and data visualization components of RDP5 have been extensively updated to minimize the amount of time needed by users to individually verify and refine the program’s interpretation of each of the individual recombination events that it detects. Oxford University Press 2020-04-12 /pmc/articles/PMC8062008/ /pubmed/33936774 http://dx.doi.org/10.1093/ve/veaa087 Text en © The Author(s) 2020. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Resources
Martin, Darren P
Varsani, Arvind
Roumagnac, Philippe
Botha, Gerrit
Maslamoney, Suresh
Schwab, Tiana
Kelz, Zena
Kumar, Venkatesh
Murrell, Ben
RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets
title RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets
title_full RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets
title_fullStr RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets
title_full_unstemmed RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets
title_short RDP5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets
title_sort rdp5: a computer program for analyzing recombination in, and removing signals of recombination from, nucleotide sequence datasets
topic Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8062008/
https://www.ncbi.nlm.nih.gov/pubmed/33936774
http://dx.doi.org/10.1093/ve/veaa087
work_keys_str_mv AT martindarrenp rdp5acomputerprogramforanalyzingrecombinationinandremovingsignalsofrecombinationfromnucleotidesequencedatasets
AT varsaniarvind rdp5acomputerprogramforanalyzingrecombinationinandremovingsignalsofrecombinationfromnucleotidesequencedatasets
AT roumagnacphilippe rdp5acomputerprogramforanalyzingrecombinationinandremovingsignalsofrecombinationfromnucleotidesequencedatasets
AT bothagerrit rdp5acomputerprogramforanalyzingrecombinationinandremovingsignalsofrecombinationfromnucleotidesequencedatasets
AT maslamoneysuresh rdp5acomputerprogramforanalyzingrecombinationinandremovingsignalsofrecombinationfromnucleotidesequencedatasets
AT schwabtiana rdp5acomputerprogramforanalyzingrecombinationinandremovingsignalsofrecombinationfromnucleotidesequencedatasets
AT kelzzena rdp5acomputerprogramforanalyzingrecombinationinandremovingsignalsofrecombinationfromnucleotidesequencedatasets
AT kumarvenkatesh rdp5acomputerprogramforanalyzingrecombinationinandremovingsignalsofrecombinationfromnucleotidesequencedatasets
AT murrellben rdp5acomputerprogramforanalyzingrecombinationinandremovingsignalsofrecombinationfromnucleotidesequencedatasets