Cargando…

TIGRFAMs and Genome Properties in 2013

TIGRFAMs, available online at http://www.jcvi.org/tigrfams is a database of protein family definitions. Each entry features a seed alignment of trusted representative sequences, a hidden Markov model (HMM) built from that alignment, cutoff scores that let automated annotation pipelines decide which...

Descripción completa

Detalles Bibliográficos
Autores principales: Haft, Daniel H., Selengut, Jeremy D., Richter, Roland A., Harkins, Derek, Basu, Malay K., Beck, Erin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3531188/
https://www.ncbi.nlm.nih.gov/pubmed/23197656
http://dx.doi.org/10.1093/nar/gks1234
_version_ 1782254130513838080
author Haft, Daniel H.
Selengut, Jeremy D.
Richter, Roland A.
Harkins, Derek
Basu, Malay K.
Beck, Erin
author_facet Haft, Daniel H.
Selengut, Jeremy D.
Richter, Roland A.
Harkins, Derek
Basu, Malay K.
Beck, Erin
author_sort Haft, Daniel H.
collection PubMed
description TIGRFAMs, available online at http://www.jcvi.org/tigrfams is a database of protein family definitions. Each entry features a seed alignment of trusted representative sequences, a hidden Markov model (HMM) built from that alignment, cutoff scores that let automated annotation pipelines decide which proteins are members, and annotations for transfer onto member proteins. Most TIGRFAMs models are designated equivalog, meaning they assign a specific name to proteins conserved in function from a common ancestral sequence. Models describing more functionally heterogeneous families are designated subfamily or domain, and assign less specific but more widely applicable annotations. The Genome Properties database, available at http://www.jcvi.org/genome-properties, specifies how computed evidence, including TIGRFAMs HMM results, should be used to judge whether an enzymatic pathway, a protein complex or another type of molecular subsystem is encoded in a genome. TIGRFAMs and Genome Properties content are developed in concert because subsystems reconstruction for large numbers of genomes guides selection of seed alignment sequences and cutoff values during protein family construction. Both databases specialize heavily in bacterial and archaeal subsystems. At present, 4284 models appear in TIGRFAMs, while 628 systems are described by Genome Properties. Content derives both from subsystem discovery work and from biocuration of the scientific literature.
format Online
Article
Text
id pubmed-3531188
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-35311882013-03-07 TIGRFAMs and Genome Properties in 2013 Haft, Daniel H. Selengut, Jeremy D. Richter, Roland A. Harkins, Derek Basu, Malay K. Beck, Erin Nucleic Acids Res Articles TIGRFAMs, available online at http://www.jcvi.org/tigrfams is a database of protein family definitions. Each entry features a seed alignment of trusted representative sequences, a hidden Markov model (HMM) built from that alignment, cutoff scores that let automated annotation pipelines decide which proteins are members, and annotations for transfer onto member proteins. Most TIGRFAMs models are designated equivalog, meaning they assign a specific name to proteins conserved in function from a common ancestral sequence. Models describing more functionally heterogeneous families are designated subfamily or domain, and assign less specific but more widely applicable annotations. The Genome Properties database, available at http://www.jcvi.org/genome-properties, specifies how computed evidence, including TIGRFAMs HMM results, should be used to judge whether an enzymatic pathway, a protein complex or another type of molecular subsystem is encoded in a genome. TIGRFAMs and Genome Properties content are developed in concert because subsystems reconstruction for large numbers of genomes guides selection of seed alignment sequences and cutoff values during protein family construction. Both databases specialize heavily in bacterial and archaeal subsystems. At present, 4284 models appear in TIGRFAMs, while 628 systems are described by Genome Properties. Content derives both from subsystem discovery work and from biocuration of the scientific literature. Oxford University Press 2013-01 2012-11-28 /pmc/articles/PMC3531188/ /pubmed/23197656 http://dx.doi.org/10.1093/nar/gks1234 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/3.0/), which permits non-commercial reuse, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com.
spellingShingle Articles
Haft, Daniel H.
Selengut, Jeremy D.
Richter, Roland A.
Harkins, Derek
Basu, Malay K.
Beck, Erin
TIGRFAMs and Genome Properties in 2013
title TIGRFAMs and Genome Properties in 2013
title_full TIGRFAMs and Genome Properties in 2013
title_fullStr TIGRFAMs and Genome Properties in 2013
title_full_unstemmed TIGRFAMs and Genome Properties in 2013
title_short TIGRFAMs and Genome Properties in 2013
title_sort tigrfams and genome properties in 2013
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3531188/
https://www.ncbi.nlm.nih.gov/pubmed/23197656
http://dx.doi.org/10.1093/nar/gks1234
work_keys_str_mv AT haftdanielh tigrfamsandgenomepropertiesin2013
AT selengutjeremyd tigrfamsandgenomepropertiesin2013
AT richterrolanda tigrfamsandgenomepropertiesin2013
AT harkinsderek tigrfamsandgenomepropertiesin2013
AT basumalayk tigrfamsandgenomepropertiesin2013
AT beckerin tigrfamsandgenomepropertiesin2013