
Benchmarking of alignment-free sequence comparison methods

Bibliographic Details
Main authors: Zielezinski, Andrzej; Girgis, Hani Z.; Bernard, Guillaume; Leimeister, Chris-Andre; Tang, Kujin; Dencker, Thomas; Lau, Anna Katharina; Röhling, Sophie; Choi, Jae Jin; Waterman, Michael S.; Comin, Matteo; Kim, Sung-Hou; Vinga, Susana; Almeida, Jonas S.; Chan, Cheong Xin; James, Benjamin T.; Sun, Fengzhu; Morgenstern, Burkhard; Karlowski, Wojciech M.
Format: Online Article Text
Language: English
Published: BioMed Central 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6659240/
https://www.ncbi.nlm.nih.gov/pubmed/31345254
http://dx.doi.org/10.1186/s13059-019-1755-7
Description
Summary: BACKGROUND: Alignment-free (AF) sequence comparison is attracting persistent interest driven by data-intensive applications. Hence, many AF procedures have been proposed in recent years, but a lack of a clearly defined benchmarking consensus hampers their performance assessment. RESULTS: Here, we present a community resource (http://afproject.org) to establish standards for comparing alignment-free approaches across different areas of sequence-based research. We characterize 74 AF methods available in 24 software tools for five research applications, namely, protein sequence classification, gene tree inference, regulatory element detection, genome-based phylogenetic inference, and reconstruction of species trees under horizontal gene transfer and recombination events. CONCLUSION: The interactive web service allows researchers to explore the performance of alignment-free tools relevant to their data types and analytical goals. It also allows method developers to assess their own algorithms and compare them with current state-of-the-art tools, accelerating the development of new, more accurate AF solutions. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13059-019-1755-7) contains supplementary material, which is available to authorized users.