
OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy

Identifying homology relationships between sequences is fundamental to biological research. Here we provide a novel orthogroup inference algorithm called OrthoFinder that solves a previously undetected gene length bias in orthogroup inference, resulting in significant improvements in accuracy. Using...


Bibliographic Details
Main Authors: Emms, David M., Kelly, Steven
Format: Online Article Text
Language: English
Published: BioMed Central 2015
Subjects: Software
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4531804/
https://www.ncbi.nlm.nih.gov/pubmed/26243257
http://dx.doi.org/10.1186/s13059-015-0721-2
author Emms, David M.
Kelly, Steven
collection PubMed
description Identifying homology relationships between sequences is fundamental to biological research. Here we provide a novel orthogroup inference algorithm called OrthoFinder that solves a previously undetected gene length bias in orthogroup inference, resulting in significant improvements in accuracy. Using real benchmark datasets we demonstrate that OrthoFinder is more accurate than other orthogroup inference methods by between 8 % and 33 %. Furthermore, we demonstrate the utility of OrthoFinder by providing a complete classification of transcription factor gene families in plants revealing 6.9 million previously unobserved relationships. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s13059-015-0721-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4531804
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-4531804 2015-08-12 Genome Biol Software BioMed Central 2015-08-06 2015 /pmc/articles/PMC4531804/ /pubmed/26243257 http://dx.doi.org/10.1186/s13059-015-0721-2 Text en © Emms and Kelly. 2015 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title OrthoFinder: solving fundamental biases in whole genome comparisons dramatically improves orthogroup inference accuracy
topic Software