Cargando…
An Alignment-Free Approach for Eukaryotic ITS2 Annotation and Phylogenetic Inference
The ITS2 gene class shows a high sequence divergence among its members that have complicated its annotation and its use for reconstructing phylogenies at a higher taxonomical level (beyond species and genus). Several alignment strategies have been implemented to improve the ITS2 annotation quality a...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3202569/ https://www.ncbi.nlm.nih.gov/pubmed/22046320 http://dx.doi.org/10.1371/journal.pone.0026638 |
_version_ | 1782215014235504640 |
---|---|
author | Agüero-Chapin, Guillermin Sánchez-Rodríguez, Aminael Hidalgo-Yanes, Pedro I. Pérez-Castillo, Yunierkis Molina-Ruiz, Reinaldo Marchal, Kathleen Vasconcelos, Vítor Antunes, Agostinho |
author_facet | Agüero-Chapin, Guillermin Sánchez-Rodríguez, Aminael Hidalgo-Yanes, Pedro I. Pérez-Castillo, Yunierkis Molina-Ruiz, Reinaldo Marchal, Kathleen Vasconcelos, Vítor Antunes, Agostinho |
author_sort | Agüero-Chapin, Guillermin |
collection | PubMed |
description | The ITS2 gene class shows a high sequence divergence among its members that have complicated its annotation and its use for reconstructing phylogenies at a higher taxonomical level (beyond species and genus). Several alignment strategies have been implemented to improve the ITS2 annotation quality and its use for phylogenetic inferences. Although, alignment based methods have been exploited to the top of its complexity to tackle both issues, no alignment-free approaches have been able to successfully address both topics. By contrast, the use of simple alignment-free classifiers, like the topological indices (TIs) containing information about the sequence and structure of ITS2, may reveal to be a useful approach for the gene prediction and for assessing the phylogenetic relationships of the ITS2 class in eukaryotes. Thus, we used the TI2BioP (Topological Indices to BioPolymers) methodology [1], [2], freely available at http://ti2biop.sourceforge.net/ to calculate two different TIs. One class was derived from the ITS2 artificial 2D structures generated from DNA strings and the other from the secondary structure inferred from RNA folding algorithms. Two alignment-free models based on Artificial Neural Networks were developed for the ITS2 class prediction using the two classes of TIs referred above. Both models showed similar performances on the training and the test sets reaching values above 95% in the overall classification. Due to the importance of the ITS2 region for fungi identification, a novel ITS2 genomic sequence was isolated from Petrakia sp. This sequence and the test set were used to comparatively evaluate the conventional classification models based on multiple sequence alignments like Hidden Markov based approaches, revealing the success of our models to identify novel ITS2 members. The isolated sequence was assessed using traditional and alignment-free based techniques applied to phylogenetic inference to complement the taxonomy of the Petrakia sp. fungal isolate. |
format | Online Article Text |
id | pubmed-3202569 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-32025692011-11-01 An Alignment-Free Approach for Eukaryotic ITS2 Annotation and Phylogenetic Inference Agüero-Chapin, Guillermin Sánchez-Rodríguez, Aminael Hidalgo-Yanes, Pedro I. Pérez-Castillo, Yunierkis Molina-Ruiz, Reinaldo Marchal, Kathleen Vasconcelos, Vítor Antunes, Agostinho PLoS One Research Article The ITS2 gene class shows a high sequence divergence among its members that have complicated its annotation and its use for reconstructing phylogenies at a higher taxonomical level (beyond species and genus). Several alignment strategies have been implemented to improve the ITS2 annotation quality and its use for phylogenetic inferences. Although, alignment based methods have been exploited to the top of its complexity to tackle both issues, no alignment-free approaches have been able to successfully address both topics. By contrast, the use of simple alignment-free classifiers, like the topological indices (TIs) containing information about the sequence and structure of ITS2, may reveal to be a useful approach for the gene prediction and for assessing the phylogenetic relationships of the ITS2 class in eukaryotes. Thus, we used the TI2BioP (Topological Indices to BioPolymers) methodology [1], [2], freely available at http://ti2biop.sourceforge.net/ to calculate two different TIs. One class was derived from the ITS2 artificial 2D structures generated from DNA strings and the other from the secondary structure inferred from RNA folding algorithms. Two alignment-free models based on Artificial Neural Networks were developed for the ITS2 class prediction using the two classes of TIs referred above. Both models showed similar performances on the training and the test sets reaching values above 95% in the overall classification. Due to the importance of the ITS2 region for fungi identification, a novel ITS2 genomic sequence was isolated from Petrakia sp. This sequence and the test set were used to comparatively evaluate the conventional classification models based on multiple sequence alignments like Hidden Markov based approaches, revealing the success of our models to identify novel ITS2 members. The isolated sequence was assessed using traditional and alignment-free based techniques applied to phylogenetic inference to complement the taxonomy of the Petrakia sp. fungal isolate. Public Library of Science 2011-10-26 /pmc/articles/PMC3202569/ /pubmed/22046320 http://dx.doi.org/10.1371/journal.pone.0026638 Text en Agüero-Chapin et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Agüero-Chapin, Guillermin Sánchez-Rodríguez, Aminael Hidalgo-Yanes, Pedro I. Pérez-Castillo, Yunierkis Molina-Ruiz, Reinaldo Marchal, Kathleen Vasconcelos, Vítor Antunes, Agostinho An Alignment-Free Approach for Eukaryotic ITS2 Annotation and Phylogenetic Inference |
title | An Alignment-Free Approach for Eukaryotic ITS2 Annotation and Phylogenetic Inference |
title_full | An Alignment-Free Approach for Eukaryotic ITS2 Annotation and Phylogenetic Inference |
title_fullStr | An Alignment-Free Approach for Eukaryotic ITS2 Annotation and Phylogenetic Inference |
title_full_unstemmed | An Alignment-Free Approach for Eukaryotic ITS2 Annotation and Phylogenetic Inference |
title_short | An Alignment-Free Approach for Eukaryotic ITS2 Annotation and Phylogenetic Inference |
title_sort | alignment-free approach for eukaryotic its2 annotation and phylogenetic inference |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3202569/ https://www.ncbi.nlm.nih.gov/pubmed/22046320 http://dx.doi.org/10.1371/journal.pone.0026638 |
work_keys_str_mv | AT aguerochapinguillermin analignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT sanchezrodriguezaminael analignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT hidalgoyanespedroi analignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT perezcastilloyunierkis analignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT molinaruizreinaldo analignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT marchalkathleen analignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT vasconcelosvitor analignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT antunesagostinho analignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT aguerochapinguillermin alignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT sanchezrodriguezaminael alignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT hidalgoyanespedroi alignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT perezcastilloyunierkis alignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT molinaruizreinaldo alignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT marchalkathleen alignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT vasconcelosvitor alignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference AT antunesagostinho alignmentfreeapproachforeukaryoticits2annotationandphylogeneticinference |