Cargando…

QuanTest2: benchmarking multiple sequence alignments using secondary structure prediction

MOTIVATION: Secondary structure prediction accuracy (SSPA) in the QuanTest benchmark can be used to measure accuracy of a multiple sequence alignment. SSPA correlates well with the sum-of-pairs score, if the results are averaged over many alignments but not on an alignment-by-alignment basis. This i...

Descripción completa

Detalles Bibliográficos
Autores principales: Sievers, Fabian, Higgins, Desmond G
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9881607/
https://www.ncbi.nlm.nih.gov/pubmed/31292629
http://dx.doi.org/10.1093/bioinformatics/btz552
_version_ 1784879141858836480
author Sievers, Fabian
Higgins, Desmond G
author_facet Sievers, Fabian
Higgins, Desmond G
author_sort Sievers, Fabian
collection PubMed
description MOTIVATION: Secondary structure prediction accuracy (SSPA) in the QuanTest benchmark can be used to measure accuracy of a multiple sequence alignment. SSPA correlates well with the sum-of-pairs score, if the results are averaged over many alignments but not on an alignment-by-alignment basis. This is due to a sub-optimal selection of reference and non-reference sequences in QuanTest. RESULTS: We develop an improved strategy for selecting reference and non-reference sequences for a new benchmark, QuanTest2. In QuanTest2, SSPA and SP correlate better on an alignment-by-alignment basis than in QuanTest. Guide-trees for QuanTest2 are more balanced with respect to reference sequences than in QuanTest. QuanTest2 scores correlate well with other well-established benchmarks. AVAILABILITY AND IMPLEMENTATION: QuanTest2 is available at http://bioinf.ucd.ie/quantest2.tar, comprises of reference and non-reference sequence sets and a scoring script. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online
format Online
Article
Text
id pubmed-9881607
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98816072023-01-31 QuanTest2: benchmarking multiple sequence alignments using secondary structure prediction Sievers, Fabian Higgins, Desmond G Bioinformatics Original Papers MOTIVATION: Secondary structure prediction accuracy (SSPA) in the QuanTest benchmark can be used to measure accuracy of a multiple sequence alignment. SSPA correlates well with the sum-of-pairs score, if the results are averaged over many alignments but not on an alignment-by-alignment basis. This is due to a sub-optimal selection of reference and non-reference sequences in QuanTest. RESULTS: We develop an improved strategy for selecting reference and non-reference sequences for a new benchmark, QuanTest2. In QuanTest2, SSPA and SP correlate better on an alignment-by-alignment basis than in QuanTest. Guide-trees for QuanTest2 are more balanced with respect to reference sequences than in QuanTest. QuanTest2 scores correlate well with other well-established benchmarks. AVAILABILITY AND IMPLEMENTATION: QuanTest2 is available at http://bioinf.ucd.ie/quantest2.tar, comprises of reference and non-reference sequence sets and a scoring script. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online Oxford University Press 2019-07-11 /pmc/articles/PMC9881607/ /pubmed/31292629 http://dx.doi.org/10.1093/bioinformatics/btz552 Text en © The Author(s) 2019. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Sievers, Fabian
Higgins, Desmond G
QuanTest2: benchmarking multiple sequence alignments using secondary structure prediction
title QuanTest2: benchmarking multiple sequence alignments using secondary structure prediction
title_full QuanTest2: benchmarking multiple sequence alignments using secondary structure prediction
title_fullStr QuanTest2: benchmarking multiple sequence alignments using secondary structure prediction
title_full_unstemmed QuanTest2: benchmarking multiple sequence alignments using secondary structure prediction
title_short QuanTest2: benchmarking multiple sequence alignments using secondary structure prediction
title_sort quantest2: benchmarking multiple sequence alignments using secondary structure prediction
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9881607/
https://www.ncbi.nlm.nih.gov/pubmed/31292629
http://dx.doi.org/10.1093/bioinformatics/btz552
work_keys_str_mv AT sieversfabian quantest2benchmarkingmultiplesequencealignmentsusingsecondarystructureprediction
AT higginsdesmondg quantest2benchmarkingmultiplesequencealignmentsusingsecondarystructureprediction