Cargando…

The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam

Transport systems comprise roughly 10% of all proteins in a cell, playing critical roles in many processes. Improving and expanding their classification is an important goal that can affect studies ranging from comparative genomics to potential drug target searches. It is not surprising that differe...

Descripción completa

Detalles Bibliográficos
Autores principales: Chiang, Zachary, Vastermark, Ake, Punta, Marco, Coggill, Penelope C., Mistry, Jaina, Finn, Robert D., Saier, Milton H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4570203/
https://www.ncbi.nlm.nih.gov/pubmed/25614388
http://dx.doi.org/10.1093/bib/bbu053
_version_ 1782390163923533824
author Chiang, Zachary
Vastermark, Ake
Punta, Marco
Coggill, Penelope C.
Mistry, Jaina
Finn, Robert D.
Saier, Milton H.
author_facet Chiang, Zachary
Vastermark, Ake
Punta, Marco
Coggill, Penelope C.
Mistry, Jaina
Finn, Robert D.
Saier, Milton H.
author_sort Chiang, Zachary
collection PubMed
description Transport systems comprise roughly 10% of all proteins in a cell, playing critical roles in many processes. Improving and expanding their classification is an important goal that can affect studies ranging from comparative genomics to potential drug target searches. It is not surprising that different classification systems for transport proteins have arisen, be it within a specialized database, focused on this functional class of proteins, or as part of a broader classification system for all proteins. Two such databases are the Transporter Classification Database (TCDB) and the Protein family (Pfam) database. As part of a long-term endeavor to improve consistency between the two classification systems, we have compared transporter annotations in the two databases to understand the rationale for differences and to improve both systems. Differences sometimes reflect the fact that one database has a particular transporter family while the other does not. Differing family definitions and hierarchical organizations were reconciled, resulting in recognition of 69 Pfam ‘Domains of Unknown Function’, which proved to be transport protein families to be renamed using TCDB annotations. Of over 400 potential new Pfam families identified from TCDB, 10% have already been added to Pfam, and TCDB has created 60 new entries based on Pfam data. This work, for the first time, reveals the benefits of comprehensive database comparisons and explains the differences between Pfam and TCDB.
format Online
Article
Text
id pubmed-4570203
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-45702032015-09-21 The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam Chiang, Zachary Vastermark, Ake Punta, Marco Coggill, Penelope C. Mistry, Jaina Finn, Robert D. Saier, Milton H. Brief Bioinform Papers Transport systems comprise roughly 10% of all proteins in a cell, playing critical roles in many processes. Improving and expanding their classification is an important goal that can affect studies ranging from comparative genomics to potential drug target searches. It is not surprising that different classification systems for transport proteins have arisen, be it within a specialized database, focused on this functional class of proteins, or as part of a broader classification system for all proteins. Two such databases are the Transporter Classification Database (TCDB) and the Protein family (Pfam) database. As part of a long-term endeavor to improve consistency between the two classification systems, we have compared transporter annotations in the two databases to understand the rationale for differences and to improve both systems. Differences sometimes reflect the fact that one database has a particular transporter family while the other does not. Differing family definitions and hierarchical organizations were reconciled, resulting in recognition of 69 Pfam ‘Domains of Unknown Function’, which proved to be transport protein families to be renamed using TCDB annotations. Of over 400 potential new Pfam families identified from TCDB, 10% have already been added to Pfam, and TCDB has created 60 new entries based on Pfam data. This work, for the first time, reveals the benefits of comprehensive database comparisons and explains the differences between Pfam and TCDB. Oxford University Press 2015-09 2015-01-21 /pmc/articles/PMC4570203/ /pubmed/25614388 http://dx.doi.org/10.1093/bib/bbu053 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Papers
Chiang, Zachary
Vastermark, Ake
Punta, Marco
Coggill, Penelope C.
Mistry, Jaina
Finn, Robert D.
Saier, Milton H.
The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam
title The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam
title_full The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam
title_fullStr The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam
title_full_unstemmed The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam
title_short The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam
title_sort complexity, challenges and benefits of comparing two transporter classification systems in tcdb and pfam
topic Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4570203/
https://www.ncbi.nlm.nih.gov/pubmed/25614388
http://dx.doi.org/10.1093/bib/bbu053
work_keys_str_mv AT chiangzachary thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT vastermarkake thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT puntamarco thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT coggillpenelopec thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT mistryjaina thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT finnrobertd thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT saiermiltonh thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT chiangzachary complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT vastermarkake complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT puntamarco complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT coggillpenelopec complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT mistryjaina complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT finnrobertd complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam
AT saiermiltonh complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam