Cargando…

Discriminatory power of RNA family models

Motivation: RNA family models group nucleotide sequences that share a common biological function. These models can be used to find new sequences belonging to the same family. To succeed in this task, a model needs to exhibit high sensitivity as well as high specificity. As model construction is guid...

Descripción completa

Detalles Bibliográficos
Autores principales: zu Siederdissen, Christian Höner, Hofacker, Ivo L.
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935435/
https://www.ncbi.nlm.nih.gov/pubmed/20823307
http://dx.doi.org/10.1093/bioinformatics/btq370
_version_ 1782186402382872576
author zu Siederdissen, Christian Höner
Hofacker, Ivo L.
author_facet zu Siederdissen, Christian Höner
Hofacker, Ivo L.
author_sort zu Siederdissen, Christian Höner
collection PubMed
description Motivation: RNA family models group nucleotide sequences that share a common biological function. These models can be used to find new sequences belonging to the same family. To succeed in this task, a model needs to exhibit high sensitivity as well as high specificity. As model construction is guided by a manual process, a number of problems can occur, such as the introduction of more than one model for the same family or poorly constructed models. We explore the Rfam database to discover such problems. Results: Our main contribution is in the definition of the discriminatory power of RNA family models, together with a first algorithm for its computation. In addition, we present calculations across the whole Rfam database that show several families lacking high specificity when compared to other families. We give a list of these clusters of families and provide a tentative explanation. Our program can be used to: (i) make sure that new models are not equivalent to any model already present in the database; and (ii) new models are not simply submodels of existing families. Availability: www.tbi.univie.ac.at/software/cmcompare/. The code is licensed under the GPLv3. Results for the whole Rfam database and supporting scripts are available together with the software. Contact: choener@tbi.univie.ac.at
format Text
id pubmed-2935435
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-29354352010-09-08 Discriminatory power of RNA family models zu Siederdissen, Christian Höner Hofacker, Ivo L. Bioinformatics Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium Motivation: RNA family models group nucleotide sequences that share a common biological function. These models can be used to find new sequences belonging to the same family. To succeed in this task, a model needs to exhibit high sensitivity as well as high specificity. As model construction is guided by a manual process, a number of problems can occur, such as the introduction of more than one model for the same family or poorly constructed models. We explore the Rfam database to discover such problems. Results: Our main contribution is in the definition of the discriminatory power of RNA family models, together with a first algorithm for its computation. In addition, we present calculations across the whole Rfam database that show several families lacking high specificity when compared to other families. We give a list of these clusters of families and provide a tentative explanation. Our program can be used to: (i) make sure that new models are not equivalent to any model already present in the database; and (ii) new models are not simply submodels of existing families. Availability: www.tbi.univie.ac.at/software/cmcompare/. The code is licensed under the GPLv3. Results for the whole Rfam database and supporting scripts are available together with the software. Contact: choener@tbi.univie.ac.at Oxford University Press 2010-09-15 2010-09-04 /pmc/articles/PMC2935435/ /pubmed/20823307 http://dx.doi.org/10.1093/bioinformatics/btq370 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
zu Siederdissen, Christian Höner
Hofacker, Ivo L.
Discriminatory power of RNA family models
title Discriminatory power of RNA family models
title_full Discriminatory power of RNA family models
title_fullStr Discriminatory power of RNA family models
title_full_unstemmed Discriminatory power of RNA family models
title_short Discriminatory power of RNA family models
title_sort discriminatory power of rna family models
topic Eccb 2010 Conference Proceedings September 26 to September 29, 2010, Ghent, Belgium
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2935435/
https://www.ncbi.nlm.nih.gov/pubmed/20823307
http://dx.doi.org/10.1093/bioinformatics/btq370
work_keys_str_mv AT zusiederdissenchristianhoner discriminatorypowerofrnafamilymodels
AT hofackerivol discriminatorypowerofrnafamilymodels