Cargando…

Tree pruner: An efficient tool for selecting data from a biased genetic database

BACKGROUND: Large databases of genetic data are often biased in their representation. Thus, selection of genetic data with desired properties, such as evolutionary representation or shared genotypes, is problematic. Selection on the basis of epidemiological variables may not achieve the desired prop...

Descripción completa

Detalles Bibliográficos
Autores principales: Krishnamoorthy, Mohan, Patel, Pragneshkumar, Dimitrijevic, Mira, Dietrich, Jonathan, Green, Margaret, Macken, Catherine
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3045304/
https://www.ncbi.nlm.nih.gov/pubmed/21306634
http://dx.doi.org/10.1186/1471-2105-12-51
_version_ 1782198806887006208
author Krishnamoorthy, Mohan
Patel, Pragneshkumar
Dimitrijevic, Mira
Dietrich, Jonathan
Green, Margaret
Macken, Catherine
author_facet Krishnamoorthy, Mohan
Patel, Pragneshkumar
Dimitrijevic, Mira
Dietrich, Jonathan
Green, Margaret
Macken, Catherine
author_sort Krishnamoorthy, Mohan
collection PubMed
description BACKGROUND: Large databases of genetic data are often biased in their representation. Thus, selection of genetic data with desired properties, such as evolutionary representation or shared genotypes, is problematic. Selection on the basis of epidemiological variables may not achieve the desired properties. Available automated approaches to the selection of influenza genetic data make a tradeoff between speed and simplicity on the one hand and control over quality and contents of the dataset on the other hand. A poorly chosen dataset may be detrimental to subsequent analyses. RESULTS: We developed a tool, Tree Pruner, for obtaining a dataset with desired evolutionary properties from a large, biased genetic database. Tree Pruner provides the user with an interactive phylogenetic tree as a means of editing the initial dataset from which the tree was inferred. The tree visualization changes dynamically, using colors and shading, reflecting Tree Pruner actions. At the end of a Tree Pruner session, the editing actions are implemented in the dataset. Currently, Tree Pruner is implemented on the Influenza Research Database (IRD). The data management capabilities of the IRD allow the user to store a pruned dataset for additional pruning or for subsequent analysis. Tree Pruner can be easily adapted for use with other organisms. CONCLUSIONS: Tree Pruner is an efficient, manual tool for selecting a high-quality dataset with desired evolutionary properties from a biased database of genetic sequences. It offers an important alternative to automated approaches to the same goal, by providing the user with a dynamic, visual guide to the ongoing selection process and ultimate control over the contents (and therefore quality) of the dataset.
format Text
id pubmed-3045304
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-30453042011-02-26 Tree pruner: An efficient tool for selecting data from a biased genetic database Krishnamoorthy, Mohan Patel, Pragneshkumar Dimitrijevic, Mira Dietrich, Jonathan Green, Margaret Macken, Catherine BMC Bioinformatics Software BACKGROUND: Large databases of genetic data are often biased in their representation. Thus, selection of genetic data with desired properties, such as evolutionary representation or shared genotypes, is problematic. Selection on the basis of epidemiological variables may not achieve the desired properties. Available automated approaches to the selection of influenza genetic data make a tradeoff between speed and simplicity on the one hand and control over quality and contents of the dataset on the other hand. A poorly chosen dataset may be detrimental to subsequent analyses. RESULTS: We developed a tool, Tree Pruner, for obtaining a dataset with desired evolutionary properties from a large, biased genetic database. Tree Pruner provides the user with an interactive phylogenetic tree as a means of editing the initial dataset from which the tree was inferred. The tree visualization changes dynamically, using colors and shading, reflecting Tree Pruner actions. At the end of a Tree Pruner session, the editing actions are implemented in the dataset. Currently, Tree Pruner is implemented on the Influenza Research Database (IRD). The data management capabilities of the IRD allow the user to store a pruned dataset for additional pruning or for subsequent analysis. Tree Pruner can be easily adapted for use with other organisms. CONCLUSIONS: Tree Pruner is an efficient, manual tool for selecting a high-quality dataset with desired evolutionary properties from a biased database of genetic sequences. It offers an important alternative to automated approaches to the same goal, by providing the user with a dynamic, visual guide to the ongoing selection process and ultimate control over the contents (and therefore quality) of the dataset. BioMed Central 2011-02-09 /pmc/articles/PMC3045304/ /pubmed/21306634 http://dx.doi.org/10.1186/1471-2105-12-51 Text en Copyright © 2011 Krishnamoorthy et al; licensee BioMed Central Ltd. https://creativecommons.org/licenses/by/2.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0 (https://creativecommons.org/licenses/by/2.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Software
Krishnamoorthy, Mohan
Patel, Pragneshkumar
Dimitrijevic, Mira
Dietrich, Jonathan
Green, Margaret
Macken, Catherine
Tree pruner: An efficient tool for selecting data from a biased genetic database
title Tree pruner: An efficient tool for selecting data from a biased genetic database
title_full Tree pruner: An efficient tool for selecting data from a biased genetic database
title_fullStr Tree pruner: An efficient tool for selecting data from a biased genetic database
title_full_unstemmed Tree pruner: An efficient tool for selecting data from a biased genetic database
title_short Tree pruner: An efficient tool for selecting data from a biased genetic database
title_sort tree pruner: an efficient tool for selecting data from a biased genetic database
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3045304/
https://www.ncbi.nlm.nih.gov/pubmed/21306634
http://dx.doi.org/10.1186/1471-2105-12-51
work_keys_str_mv AT krishnamoorthymohan treepruneranefficienttoolforselectingdatafromabiasedgeneticdatabase
AT patelpragneshkumar treepruneranefficienttoolforselectingdatafromabiasedgeneticdatabase
AT dimitrijevicmira treepruneranefficienttoolforselectingdatafromabiasedgeneticdatabase
AT dietrichjonathan treepruneranefficienttoolforselectingdatafromabiasedgeneticdatabase
AT greenmargaret treepruneranefficienttoolforselectingdatafromabiasedgeneticdatabase
AT mackencatherine treepruneranefficienttoolforselectingdatafromabiasedgeneticdatabase