Cargando…

Benchmarking of alignment-free sequence comparison methods

BACKGROUND: Alignment-free (AF) sequence comparison is attracting persistent interest driven by data-intensive applications. Hence, many AF procedures have been proposed in recent years, but a lack of a clearly defined benchmarking consensus hampers their performance assessment. RESULTS: Here, we pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Zielezinski, Andrzej, Girgis, Hani Z., Bernard, Guillaume, Leimeister, Chris-Andre, Tang, Kujin, Dencker, Thomas, Lau, Anna Katharina, Röhling, Sophie, Choi, Jae Jin, Waterman, Michael S., Comin, Matteo, Kim, Sung-Hou, Vinga, Susana, Almeida, Jonas S., Chan, Cheong Xin, James, Benjamin T., Sun, Fengzhu, Morgenstern, Burkhard, Karlowski, Wojciech M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6659240/
https://www.ncbi.nlm.nih.gov/pubmed/31345254
http://dx.doi.org/10.1186/s13059-019-1755-7
_version_ 1783439095631970304
author Zielezinski, Andrzej
Girgis, Hani Z.
Bernard, Guillaume
Leimeister, Chris-Andre
Tang, Kujin
Dencker, Thomas
Lau, Anna Katharina
Röhling, Sophie
Choi, Jae Jin
Waterman, Michael S.
Comin, Matteo
Kim, Sung-Hou
Vinga, Susana
Almeida, Jonas S.
Chan, Cheong Xin
James, Benjamin T.
Sun, Fengzhu
Morgenstern, Burkhard
Karlowski, Wojciech M.
author_facet Zielezinski, Andrzej
Girgis, Hani Z.
Bernard, Guillaume
Leimeister, Chris-Andre
Tang, Kujin
Dencker, Thomas
Lau, Anna Katharina
Röhling, Sophie
Choi, Jae Jin
Waterman, Michael S.
Comin, Matteo
Kim, Sung-Hou
Vinga, Susana
Almeida, Jonas S.
Chan, Cheong Xin
James, Benjamin T.
Sun, Fengzhu
Morgenstern, Burkhard
Karlowski, Wojciech M.
author_sort Zielezinski, Andrzej
collection PubMed
description BACKGROUND: Alignment-free (AF) sequence comparison is attracting persistent interest driven by data-intensive applications. Hence, many AF procedures have been proposed in recent years, but a lack of a clearly defined benchmarking consensus hampers their performance assessment. RESULTS: Here, we present a community resource (http://afproject.org) to establish standards for comparing alignment-free approaches across different areas of sequence-based research. We characterize 74 AF methods available in 24 software tools for five research applications, namely, protein sequence classification, gene tree inference, regulatory element detection, genome-based phylogenetic inference, and reconstruction of species trees under horizontal gene transfer and recombination events. CONCLUSION: The interactive web service allows researchers to explore the performance of alignment-free tools relevant to their data types and analytical goals. It also allows method developers to assess their own algorithms and compare them with current state-of-the-art tools, accelerating the development of new, more accurate AF solutions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-019-1755-7) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6659240
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-66592402019-08-01 Benchmarking of alignment-free sequence comparison methods Zielezinski, Andrzej Girgis, Hani Z. Bernard, Guillaume Leimeister, Chris-Andre Tang, Kujin Dencker, Thomas Lau, Anna Katharina Röhling, Sophie Choi, Jae Jin Waterman, Michael S. Comin, Matteo Kim, Sung-Hou Vinga, Susana Almeida, Jonas S. Chan, Cheong Xin James, Benjamin T. Sun, Fengzhu Morgenstern, Burkhard Karlowski, Wojciech M. Genome Biol Research BACKGROUND: Alignment-free (AF) sequence comparison is attracting persistent interest driven by data-intensive applications. Hence, many AF procedures have been proposed in recent years, but a lack of a clearly defined benchmarking consensus hampers their performance assessment. RESULTS: Here, we present a community resource (http://afproject.org) to establish standards for comparing alignment-free approaches across different areas of sequence-based research. We characterize 74 AF methods available in 24 software tools for five research applications, namely, protein sequence classification, gene tree inference, regulatory element detection, genome-based phylogenetic inference, and reconstruction of species trees under horizontal gene transfer and recombination events. CONCLUSION: The interactive web service allows researchers to explore the performance of alignment-free tools relevant to their data types and analytical goals. It also allows method developers to assess their own algorithms and compare them with current state-of-the-art tools, accelerating the development of new, more accurate AF solutions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-019-1755-7) contains supplementary material, which is available to authorized users. BioMed Central 2019-07-25 /pmc/articles/PMC6659240/ /pubmed/31345254 http://dx.doi.org/10.1186/s13059-019-1755-7 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zielezinski, Andrzej
Girgis, Hani Z.
Bernard, Guillaume
Leimeister, Chris-Andre
Tang, Kujin
Dencker, Thomas
Lau, Anna Katharina
Röhling, Sophie
Choi, Jae Jin
Waterman, Michael S.
Comin, Matteo
Kim, Sung-Hou
Vinga, Susana
Almeida, Jonas S.
Chan, Cheong Xin
James, Benjamin T.
Sun, Fengzhu
Morgenstern, Burkhard
Karlowski, Wojciech M.
Benchmarking of alignment-free sequence comparison methods
title Benchmarking of alignment-free sequence comparison methods
title_full Benchmarking of alignment-free sequence comparison methods
title_fullStr Benchmarking of alignment-free sequence comparison methods
title_full_unstemmed Benchmarking of alignment-free sequence comparison methods
title_short Benchmarking of alignment-free sequence comparison methods
title_sort benchmarking of alignment-free sequence comparison methods
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6659240/
https://www.ncbi.nlm.nih.gov/pubmed/31345254
http://dx.doi.org/10.1186/s13059-019-1755-7
work_keys_str_mv AT zielezinskiandrzej benchmarkingofalignmentfreesequencecomparisonmethods
AT girgishaniz benchmarkingofalignmentfreesequencecomparisonmethods
AT bernardguillaume benchmarkingofalignmentfreesequencecomparisonmethods
AT leimeisterchrisandre benchmarkingofalignmentfreesequencecomparisonmethods
AT tangkujin benchmarkingofalignmentfreesequencecomparisonmethods
AT denckerthomas benchmarkingofalignmentfreesequencecomparisonmethods
AT lauannakatharina benchmarkingofalignmentfreesequencecomparisonmethods
AT rohlingsophie benchmarkingofalignmentfreesequencecomparisonmethods
AT choijaejin benchmarkingofalignmentfreesequencecomparisonmethods
AT watermanmichaels benchmarkingofalignmentfreesequencecomparisonmethods
AT cominmatteo benchmarkingofalignmentfreesequencecomparisonmethods
AT kimsunghou benchmarkingofalignmentfreesequencecomparisonmethods
AT vingasusana benchmarkingofalignmentfreesequencecomparisonmethods
AT almeidajonass benchmarkingofalignmentfreesequencecomparisonmethods
AT chancheongxin benchmarkingofalignmentfreesequencecomparisonmethods
AT jamesbenjamint benchmarkingofalignmentfreesequencecomparisonmethods
AT sunfengzhu benchmarkingofalignmentfreesequencecomparisonmethods
AT morgensternburkhard benchmarkingofalignmentfreesequencecomparisonmethods
AT karlowskiwojciechm benchmarkingofalignmentfreesequencecomparisonmethods