Cargando…
Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features
Spliced alignment plays a central role in the precise identification of eukaryotic gene structures. Even though many spliced alignment programs have been developed, recent rapid progress in DNA sequencing technologies demands further improvements in software tools. Benchmarking algorithms under vari...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488211/ https://www.ncbi.nlm.nih.gov/pubmed/22848105 http://dx.doi.org/10.1093/nar/gks708 |
_version_ | 1782248582805454848 |
---|---|
author | Iwata, Hiroaki Gotoh, Osamu |
author_facet | Iwata, Hiroaki Gotoh, Osamu |
author_sort | Iwata, Hiroaki |
collection | PubMed |
description | Spliced alignment plays a central role in the precise identification of eukaryotic gene structures. Even though many spliced alignment programs have been developed, recent rapid progress in DNA sequencing technologies demands further improvements in software tools. Benchmarking algorithms under various conditions is an indispensable task for the development of better software; however, there is a dire lack of appropriate datasets usable for benchmarking spliced alignment programs. In this study, we have constructed two types of datasets: simulated sequence datasets and actual cross-species datasets. The datasets are designed to correspond to various real situations, i.e. divergent eukaryotic species, different types of reference sequences, and the wide divergence between query and target sequences. In addition, we have developed an extended version of our program Spaln, which incorporates two additional features to the scoring scheme of the original version, and examined this extended version, Spaln2, together with the original Spaln and other representative aligners based on our benchmark datasets. Although the effects of the modifications are not individually striking, Spaln2 is consistently most accurate and reasonably fast in most practical cases, especially for plants and fungi and for increasingly divergent pairs of target and query sequences. |
format | Online Article Text |
id | pubmed-3488211 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-34882112012-11-06 Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features Iwata, Hiroaki Gotoh, Osamu Nucleic Acids Res Methods Online Spliced alignment plays a central role in the precise identification of eukaryotic gene structures. Even though many spliced alignment programs have been developed, recent rapid progress in DNA sequencing technologies demands further improvements in software tools. Benchmarking algorithms under various conditions is an indispensable task for the development of better software; however, there is a dire lack of appropriate datasets usable for benchmarking spliced alignment programs. In this study, we have constructed two types of datasets: simulated sequence datasets and actual cross-species datasets. The datasets are designed to correspond to various real situations, i.e. divergent eukaryotic species, different types of reference sequences, and the wide divergence between query and target sequences. In addition, we have developed an extended version of our program Spaln, which incorporates two additional features to the scoring scheme of the original version, and examined this extended version, Spaln2, together with the original Spaln and other representative aligners based on our benchmark datasets. Although the effects of the modifications are not individually striking, Spaln2 is consistently most accurate and reasonably fast in most practical cases, especially for plants and fungi and for increasingly divergent pairs of target and query sequences. Oxford University Press 2012-11 2012-07-30 /pmc/articles/PMC3488211/ /pubmed/22848105 http://dx.doi.org/10.1093/nar/gks708 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methods Online Iwata, Hiroaki Gotoh, Osamu Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features |
title | Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features |
title_full | Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features |
title_fullStr | Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features |
title_full_unstemmed | Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features |
title_short | Benchmarking spliced alignment programs including Spaln2, an extended version of Spaln that incorporates additional species-specific features |
title_sort | benchmarking spliced alignment programs including spaln2, an extended version of spaln that incorporates additional species-specific features |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3488211/ https://www.ncbi.nlm.nih.gov/pubmed/22848105 http://dx.doi.org/10.1093/nar/gks708 |
work_keys_str_mv | AT iwatahiroaki benchmarkingsplicedalignmentprogramsincludingspaln2anextendedversionofspalnthatincorporatesadditionalspeciesspecificfeatures AT gotohosamu benchmarkingsplicedalignmentprogramsincludingspaln2anextendedversionofspalnthatincorporatesadditionalspeciesspecificfeatures |