Cargando…
The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes
BACKGROUND: The nematode Pristionchus pacificus is an established model organism for comparative studies with Caenorhabditis elegans. Over the past years, it developed into an independent animal model organism for elucidating the genetic basis of phenotypic plasticity. Community-based curations were...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7992802/ https://www.ncbi.nlm.nih.gov/pubmed/33765927 http://dx.doi.org/10.1186/s12864-021-07529-x |
_version_ | 1783669454734884864 |
---|---|
author | Rödelsperger, Christian |
author_facet | Rödelsperger, Christian |
author_sort | Rödelsperger, Christian |
collection | PubMed |
description | BACKGROUND: The nematode Pristionchus pacificus is an established model organism for comparative studies with Caenorhabditis elegans. Over the past years, it developed into an independent animal model organism for elucidating the genetic basis of phenotypic plasticity. Community-based curations were employed recently to improve the quality of gene annotations of P. pacificus and to more easily facilitate reverse genetic studies using candidate genes from C. elegans. RESULTS: Here, I demonstrate that the reannotation of phylogenomic data from nine related nematode species using the community-curated P. pacificus gene set as homology data substantially improves the quality of gene annotations. Benchmarking of universal single copy orthologs (BUSCO) estimates a median completeness of 84% which corresponds to a 9% increase over previous annotations. Nevertheless, the ability to infer gene models based on homology already drops beyond the genus level reflecting the rapid evolution of nematode lineages. This also indicates that the highly curated C. elegans genome is not optimally suited for annotating non-Caenorhabditis genomes based on homology. Furthermore, comparative genomic analysis of apparently missing BUSCO genes indicates a failure of ortholog detection by the BUSCO pipeline due to the insufficient sample size and phylogenetic breadth of the underlying OrthoDB data set. As a consequence, the quality of multiple divergent nematode genomes might be underestimated. CONCLUSIONS: This study highlights the need for optimizing gene annotation protocols and it demonstrates the benefit of a high quality genome for phylogenomic data of related species. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07529-x. |
format | Online Article Text |
id | pubmed-7992802 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-79928022021-03-25 The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes Rödelsperger, Christian BMC Genomics Research Article BACKGROUND: The nematode Pristionchus pacificus is an established model organism for comparative studies with Caenorhabditis elegans. Over the past years, it developed into an independent animal model organism for elucidating the genetic basis of phenotypic plasticity. Community-based curations were employed recently to improve the quality of gene annotations of P. pacificus and to more easily facilitate reverse genetic studies using candidate genes from C. elegans. RESULTS: Here, I demonstrate that the reannotation of phylogenomic data from nine related nematode species using the community-curated P. pacificus gene set as homology data substantially improves the quality of gene annotations. Benchmarking of universal single copy orthologs (BUSCO) estimates a median completeness of 84% which corresponds to a 9% increase over previous annotations. Nevertheless, the ability to infer gene models based on homology already drops beyond the genus level reflecting the rapid evolution of nematode lineages. This also indicates that the highly curated C. elegans genome is not optimally suited for annotating non-Caenorhabditis genomes based on homology. Furthermore, comparative genomic analysis of apparently missing BUSCO genes indicates a failure of ortholog detection by the BUSCO pipeline due to the insufficient sample size and phylogenetic breadth of the underlying OrthoDB data set. As a consequence, the quality of multiple divergent nematode genomes might be underestimated. CONCLUSIONS: This study highlights the need for optimizing gene annotation protocols and it demonstrates the benefit of a high quality genome for phylogenomic data of related species. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12864-021-07529-x. BioMed Central 2021-03-25 /pmc/articles/PMC7992802/ /pubmed/33765927 http://dx.doi.org/10.1186/s12864-021-07529-x Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Rödelsperger, Christian The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes |
title | The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes |
title_full | The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes |
title_fullStr | The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes |
title_full_unstemmed | The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes |
title_short | The community-curated Pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes |
title_sort | community-curated pristionchus pacificus genome facilitates automated gene annotation improvement in related nematodes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7992802/ https://www.ncbi.nlm.nih.gov/pubmed/33765927 http://dx.doi.org/10.1186/s12864-021-07529-x |
work_keys_str_mv | AT rodelspergerchristian thecommunitycuratedpristionchuspacificusgenomefacilitatesautomatedgeneannotationimprovementinrelatednematodes AT rodelspergerchristian communitycuratedpristionchuspacificusgenomefacilitatesautomatedgeneannotationimprovementinrelatednematodes |