Cargando…

Defining characteristics and conservation of poorly annotated genes in Caenorhabditis elegans using WormCat 2.0

Omics tools provide broad datasets for biological discovery. However, the computational tools for identifying important genes or pathways in RNA-seq, proteomics, or GWAS (Genome-Wide Association Study) data depend on Gene Ontogeny annotations and are biased toward well-described pathways. This limit...

Descripción completa

Detalles Bibliográficos
Autores principales: Higgins, Daniel P, Weisman, Caroline M, Lui, Dominique S, D’Agostino, Frank A, Walker, Amy K
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9339291/
https://www.ncbi.nlm.nih.gov/pubmed/35587742
http://dx.doi.org/10.1093/genetics/iyac085
_version_ 1784760157118398464
author Higgins, Daniel P
Weisman, Caroline M
Lui, Dominique S
D’Agostino, Frank A
Walker, Amy K
author_facet Higgins, Daniel P
Weisman, Caroline M
Lui, Dominique S
D’Agostino, Frank A
Walker, Amy K
author_sort Higgins, Daniel P
collection PubMed
description Omics tools provide broad datasets for biological discovery. However, the computational tools for identifying important genes or pathways in RNA-seq, proteomics, or GWAS (Genome-Wide Association Study) data depend on Gene Ontogeny annotations and are biased toward well-described pathways. This limits their utility as poorly annotated genes, which could have novel functions, are often passed over. Recently, we developed an annotation and category enrichment tool for Caenorhabditis elegans genomic data, WormCat, which provides an intuitive visualization output. Unlike Gene Ontogeny-based enrichment tools, which exclude genes with no annotation information, WormCat 2.0 retains these genes as a special UNASSIGNED category. Here, we show that the UNASSIGNED gene category enrichment exhibits tissue-specific expression patterns and can include genes with biological functions identified in published datasets. Poorly annotated genes are often considered to be potentially species-specific and thus, of reduced interest to the biomedical community. Instead, we find that around 3% of the UNASSIGNED genes have human orthologs, including some linked to human diseases. These human orthologs themselves have little annotation information. A recently developed method that incorporates lineage relationships (abSENSE) indicates that the failure of BLAST to detect homology explains the apparent lineage specificity for many UNASSIGNED genes. This suggests that a larger subset could be related to human genes. WormCat provides an annotation strategy that allows the association of UNASSIGNED genes with specific phenotypes and known pathways. Building these associations in C. elegans, with its robust genetic tools, provides a path to further functional study and insight into these understudied genes.
format Online
Article
Text
id pubmed-9339291
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-93392912022-08-01 Defining characteristics and conservation of poorly annotated genes in Caenorhabditis elegans using WormCat 2.0 Higgins, Daniel P Weisman, Caroline M Lui, Dominique S D’Agostino, Frank A Walker, Amy K Genetics Investigation Omics tools provide broad datasets for biological discovery. However, the computational tools for identifying important genes or pathways in RNA-seq, proteomics, or GWAS (Genome-Wide Association Study) data depend on Gene Ontogeny annotations and are biased toward well-described pathways. This limits their utility as poorly annotated genes, which could have novel functions, are often passed over. Recently, we developed an annotation and category enrichment tool for Caenorhabditis elegans genomic data, WormCat, which provides an intuitive visualization output. Unlike Gene Ontogeny-based enrichment tools, which exclude genes with no annotation information, WormCat 2.0 retains these genes as a special UNASSIGNED category. Here, we show that the UNASSIGNED gene category enrichment exhibits tissue-specific expression patterns and can include genes with biological functions identified in published datasets. Poorly annotated genes are often considered to be potentially species-specific and thus, of reduced interest to the biomedical community. Instead, we find that around 3% of the UNASSIGNED genes have human orthologs, including some linked to human diseases. These human orthologs themselves have little annotation information. A recently developed method that incorporates lineage relationships (abSENSE) indicates that the failure of BLAST to detect homology explains the apparent lineage specificity for many UNASSIGNED genes. This suggests that a larger subset could be related to human genes. WormCat provides an annotation strategy that allows the association of UNASSIGNED genes with specific phenotypes and known pathways. Building these associations in C. elegans, with its robust genetic tools, provides a path to further functional study and insight into these understudied genes. Oxford University Press 2022-05-19 /pmc/articles/PMC9339291/ /pubmed/35587742 http://dx.doi.org/10.1093/genetics/iyac085 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Genetics Society of America. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigation
Higgins, Daniel P
Weisman, Caroline M
Lui, Dominique S
D’Agostino, Frank A
Walker, Amy K
Defining characteristics and conservation of poorly annotated genes in Caenorhabditis elegans using WormCat 2.0
title Defining characteristics and conservation of poorly annotated genes in Caenorhabditis elegans using WormCat 2.0
title_full Defining characteristics and conservation of poorly annotated genes in Caenorhabditis elegans using WormCat 2.0
title_fullStr Defining characteristics and conservation of poorly annotated genes in Caenorhabditis elegans using WormCat 2.0
title_full_unstemmed Defining characteristics and conservation of poorly annotated genes in Caenorhabditis elegans using WormCat 2.0
title_short Defining characteristics and conservation of poorly annotated genes in Caenorhabditis elegans using WormCat 2.0
title_sort defining characteristics and conservation of poorly annotated genes in caenorhabditis elegans using wormcat 2.0
topic Investigation
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9339291/
https://www.ncbi.nlm.nih.gov/pubmed/35587742
http://dx.doi.org/10.1093/genetics/iyac085
work_keys_str_mv AT higginsdanielp definingcharacteristicsandconservationofpoorlyannotatedgenesincaenorhabditiselegansusingwormcat20
AT weismancarolinem definingcharacteristicsandconservationofpoorlyannotatedgenesincaenorhabditiselegansusingwormcat20
AT luidominiques definingcharacteristicsandconservationofpoorlyannotatedgenesincaenorhabditiselegansusingwormcat20
AT dagostinofranka definingcharacteristicsandconservationofpoorlyannotatedgenesincaenorhabditiselegansusingwormcat20
AT walkeramyk definingcharacteristicsandconservationofpoorlyannotatedgenesincaenorhabditiselegansusingwormcat20