Cargando…

The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes

BACKGROUND: The nematode Pristionchus pacificus is an established model organism for comparative studies with Caenorhabditis elegans. Over the past years, it developed into an independent animal model organism for elucidating the genetic basis of phenotypic plasticity. Community-based curations were...

Descripción completa

Detalles Bibliográficos
Autor principal: Rödelsperger, Christian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7992802/
https://www.ncbi.nlm.nih.gov/pubmed/33765927
http://dx.doi.org/10.1186/s12864-021-07529-x
_version_ 1783669454734884864
author Rödelsperger, Christian
author_facet Rödelsperger, Christian
author_sort Rödelsperger, Christian
collection PubMed
description BACKGROUND: The nematode Pristionchus pacificus is an established model organism for comparative studies with Caenorhabditis elegans. Over the past years, it developed into an independent animal model organism for elucidating the genetic basis of phenotypic plasticity. Community-based curations were employed recently to improve the quality of gene annotations of P. pacificus and to more easily facilitate reverse genetic studies using candidate genes from C. elegans. RESULTS: Here, I demonstrate that the reannotation of phylogenomic data from nine related nematode species using the community-curated P. pacificus gene set as homology data substantially improves the quality of gene annotations. Benchmarking of universal single copy orthologs (BUSCO) estimates a median completeness of 84% which corresponds to a 9% increase over previous annotations. Nevertheless, the ability to infer gene models based on homology already drops beyond the genus level reflecting the rapid evolution of nematode lineages. This also indicates that the highly curated C. elegans genome is not optimally suited for annotating non-Caenorhabditis genomes based on homology. Furthermore, comparative genomic analysis of apparently missing BUSCO genes indicates a failure of ortholog detection by the BUSCO pipeline due to the insufficient sample size and phylogenetic breadth of the underlying OrthoDB data set. As a consequence, the quality of multiple divergent nematode genomes might be underestimated. CONCLUSIONS: This study highlights the need for optimizing gene annotation protocols and it demonstrates the benefit of a high quality genome for phylogenomic data of related species. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07529-x.
format Online
Article
Text
id pubmed-7992802
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-79928022021-03-25 The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes Rödelsperger, Christian BMC Genomics Research Article BACKGROUND: The nematode Pristionchus pacificus is an established model organism for comparative studies with Caenorhabditis elegans. Over the past years, it developed into an independent animal model organism for elucidating the genetic basis of phenotypic plasticity. Community-based curations were employed recently to improve the quality of gene annotations of P. pacificus and to more easily facilitate reverse genetic studies using candidate genes from C. elegans. RESULTS: Here, I demonstrate that the reannotation of phylogenomic data from nine related nematode species using the community-curated P. pacificus gene set as homology data substantially improves the quality of gene annotations. Benchmarking of universal single copy orthologs (BUSCO) estimates a median completeness of 84% which corresponds to a 9% increase over previous annotations. Nevertheless, the ability to infer gene models based on homology already drops beyond the genus level reflecting the rapid evolution of nematode lineages. This also indicates that the highly curated C. elegans genome is not optimally suited for annotating non-Caenorhabditis genomes based on homology. Furthermore, comparative genomic analysis of apparently missing BUSCO genes indicates a failure of ortholog detection by the BUSCO pipeline due to the insufficient sample size and phylogenetic breadth of the underlying OrthoDB data set. As a consequence, the quality of multiple divergent nematode genomes might be underestimated. CONCLUSIONS: This study highlights the need for optimizing gene annotation protocols and it demonstrates the benefit of a high quality genome for phylogenomic data of related species. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07529-x. BioMed Central 2021-03-25 /pmc/articles/PMC7992802/ /pubmed/33765927 http://dx.doi.org/10.1186/s12864-021-07529-x Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Rödelsperger, Christian
The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes
title The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes
title_full The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes
title_fullStr The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes
title_full_unstemmed The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes
title_short The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes
title_sort community-curated pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7992802/
https://www.ncbi.nlm.nih.gov/pubmed/33765927
http://dx.doi.org/10.1186/s12864-021-07529-x
work_keys_str_mv AT rodelspergerchristian thecommunitycuratedpristionchuspacificusgenomefacilitatesautomatedgeneannotationimprovementinrelatednematodes
AT rodelspergerchristian communitycuratedpristionchuspacificusgenomefacilitatesautomatedgeneannotationimprovementinrelatednematodes