Cargando…

NovelFam3000 – Uncharacterized human protein domains conserved across model organisms

BACKGROUND: Despite significant efforts from the research community, an extensive portion of the proteins encoded by human genes lack an assigned cellular function. Most metazoan proteins are composed of structural and/or functional domains, of which many appear in multiple proteins. Once a domain i...

Descripción completa

Detalles Bibliográficos
Autores principales: Kemmer, Danielle, Podowski, Raf M, Arenillas, David, Lim, Jonathan, Hodges, Emily, Roth, Peggy, Sonnhammer, Erik LL, Höög, Christer, Wasserman, Wyeth W
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1440326/
https://www.ncbi.nlm.nih.gov/pubmed/16533400
http://dx.doi.org/10.1186/1471-2164-7-48
_version_ 1782127317672263680
author Kemmer, Danielle
Podowski, Raf M
Arenillas, David
Lim, Jonathan
Hodges, Emily
Roth, Peggy
Sonnhammer, Erik LL
Höög, Christer
Wasserman, Wyeth W
author_facet Kemmer, Danielle
Podowski, Raf M
Arenillas, David
Lim, Jonathan
Hodges, Emily
Roth, Peggy
Sonnhammer, Erik LL
Höög, Christer
Wasserman, Wyeth W
author_sort Kemmer, Danielle
collection PubMed
description BACKGROUND: Despite significant efforts from the research community, an extensive portion of the proteins encoded by human genes lack an assigned cellular function. Most metazoan proteins are composed of structural and/or functional domains, of which many appear in multiple proteins. Once a domain is characterized in one protein, the presence of a similar sequence in an uncharacterized protein serves as a basis for inference of function. Thus knowledge of a domain's function, or the protein within which it arises, can facilitate the analysis of an entire set of proteins. DESCRIPTION: From the Pfam domain database, we extracted uncharacterized protein domains represented in proteins from humans, worms, and flies. A data centre was created to facilitate the analysis of the uncharacterized domain-containing proteins. The centre both provides researchers with links to dispersed internet resources containing gene-specific experimental data and enables them to post relevant experimental results or comments. For each human gene in the system, a characterization score is posted, allowing users to track the progress of characterization over time or to identify for study uncharacterized domains in well-characterized genes. As a test of the system, a subset of 39 domains was selected for analysis and the experimental results posted to the NovelFam3000 system. For 25 human protein members of these 39 domain families, detailed sub-cellular localizations were determined. Specific observations are presented based on the analysis of the integrated information provided through the online NovelFam3000 system. CONCLUSION: Consistent experimental results between multiple members of a domain family allow for inferences of the domain's functional role. We unite bioinformatics resources and experimental data in order to accelerate the functional characterization of scarcely annotated domain families.
format Text
id pubmed-1440326
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-14403262006-04-18 NovelFam3000 – Uncharacterized human protein domains conserved across model organisms Kemmer, Danielle Podowski, Raf M Arenillas, David Lim, Jonathan Hodges, Emily Roth, Peggy Sonnhammer, Erik LL Höög, Christer Wasserman, Wyeth W BMC Genomics Database BACKGROUND: Despite significant efforts from the research community, an extensive portion of the proteins encoded by human genes lack an assigned cellular function. Most metazoan proteins are composed of structural and/or functional domains, of which many appear in multiple proteins. Once a domain is characterized in one protein, the presence of a similar sequence in an uncharacterized protein serves as a basis for inference of function. Thus knowledge of a domain's function, or the protein within which it arises, can facilitate the analysis of an entire set of proteins. DESCRIPTION: From the Pfam domain database, we extracted uncharacterized protein domains represented in proteins from humans, worms, and flies. A data centre was created to facilitate the analysis of the uncharacterized domain-containing proteins. The centre both provides researchers with links to dispersed internet resources containing gene-specific experimental data and enables them to post relevant experimental results or comments. For each human gene in the system, a characterization score is posted, allowing users to track the progress of characterization over time or to identify for study uncharacterized domains in well-characterized genes. As a test of the system, a subset of 39 domains was selected for analysis and the experimental results posted to the NovelFam3000 system. For 25 human protein members of these 39 domain families, detailed sub-cellular localizations were determined. Specific observations are presented based on the analysis of the integrated information provided through the online NovelFam3000 system. CONCLUSION: Consistent experimental results between multiple members of a domain family allow for inferences of the domain's functional role. We unite bioinformatics resources and experimental data in order to accelerate the functional characterization of scarcely annotated domain families. BioMed Central 2006-03-13 /pmc/articles/PMC1440326/ /pubmed/16533400 http://dx.doi.org/10.1186/1471-2164-7-48 Text en Copyright © 2006 Kemmer et al; licensee BioMed Central Ltd.
spellingShingle Database
Kemmer, Danielle
Podowski, Raf M
Arenillas, David
Lim, Jonathan
Hodges, Emily
Roth, Peggy
Sonnhammer, Erik LL
Höög, Christer
Wasserman, Wyeth W
NovelFam3000 – Uncharacterized human protein domains conserved across model organisms
title NovelFam3000 – Uncharacterized human protein domains conserved across model organisms
title_full NovelFam3000 – Uncharacterized human protein domains conserved across model organisms
title_fullStr NovelFam3000 – Uncharacterized human protein domains conserved across model organisms
title_full_unstemmed NovelFam3000 – Uncharacterized human protein domains conserved across model organisms
title_short NovelFam3000 – Uncharacterized human protein domains conserved across model organisms
title_sort novelfam3000 – uncharacterized human protein domains conserved across model organisms
topic Database
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1440326/
https://www.ncbi.nlm.nih.gov/pubmed/16533400
http://dx.doi.org/10.1186/1471-2164-7-48
work_keys_str_mv AT kemmerdanielle novelfam3000uncharacterizedhumanproteindomainsconservedacrossmodelorganisms
AT podowskirafm novelfam3000uncharacterizedhumanproteindomainsconservedacrossmodelorganisms
AT arenillasdavid novelfam3000uncharacterizedhumanproteindomainsconservedacrossmodelorganisms
AT limjonathan novelfam3000uncharacterizedhumanproteindomainsconservedacrossmodelorganisms
AT hodgesemily novelfam3000uncharacterizedhumanproteindomainsconservedacrossmodelorganisms
AT rothpeggy novelfam3000uncharacterizedhumanproteindomainsconservedacrossmodelorganisms
AT sonnhammererikll novelfam3000uncharacterizedhumanproteindomainsconservedacrossmodelorganisms
AT hoogchrister novelfam3000uncharacterizedhumanproteindomainsconservedacrossmodelorganisms
AT wassermanwyethw novelfam3000uncharacterizedhumanproteindomainsconservedacrossmodelorganisms