Cargando…
The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam
Transport systems comprise roughly 10% of all proteins in a cell, playing critical roles in many processes. Improving and expanding their classification is an important goal that can affect studies ranging from comparative genomics to potential drug target searches. It is not surprising that differe...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4570203/ https://www.ncbi.nlm.nih.gov/pubmed/25614388 http://dx.doi.org/10.1093/bib/bbu053 |
_version_ | 1782390163923533824 |
---|---|
author | Chiang, Zachary Vastermark, Ake Punta, Marco Coggill, Penelope C. Mistry, Jaina Finn, Robert D. Saier, Milton H. |
author_facet | Chiang, Zachary Vastermark, Ake Punta, Marco Coggill, Penelope C. Mistry, Jaina Finn, Robert D. Saier, Milton H. |
author_sort | Chiang, Zachary |
collection | PubMed |
description | Transport systems comprise roughly 10% of all proteins in a cell, playing critical roles in many processes. Improving and expanding their classification is an important goal that can affect studies ranging from comparative genomics to potential drug target searches. It is not surprising that different classification systems for transport proteins have arisen, be it within a specialized database, focused on this functional class of proteins, or as part of a broader classification system for all proteins. Two such databases are the Transporter Classification Database (TCDB) and the Protein family (Pfam) database. As part of a long-term endeavor to improve consistency between the two classification systems, we have compared transporter annotations in the two databases to understand the rationale for differences and to improve both systems. Differences sometimes reflect the fact that one database has a particular transporter family while the other does not. Differing family definitions and hierarchical organizations were reconciled, resulting in recognition of 69 Pfam ‘Domains of Unknown Function’, which proved to be transport protein families to be renamed using TCDB annotations. Of over 400 potential new Pfam families identified from TCDB, 10% have already been added to Pfam, and TCDB has created 60 new entries based on Pfam data. This work, for the first time, reveals the benefits of comprehensive database comparisons and explains the differences between Pfam and TCDB. |
format | Online Article Text |
id | pubmed-4570203 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-45702032015-09-21 The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam Chiang, Zachary Vastermark, Ake Punta, Marco Coggill, Penelope C. Mistry, Jaina Finn, Robert D. Saier, Milton H. Brief Bioinform Papers Transport systems comprise roughly 10% of all proteins in a cell, playing critical roles in many processes. Improving and expanding their classification is an important goal that can affect studies ranging from comparative genomics to potential drug target searches. It is not surprising that different classification systems for transport proteins have arisen, be it within a specialized database, focused on this functional class of proteins, or as part of a broader classification system for all proteins. Two such databases are the Transporter Classification Database (TCDB) and the Protein family (Pfam) database. As part of a long-term endeavor to improve consistency between the two classification systems, we have compared transporter annotations in the two databases to understand the rationale for differences and to improve both systems. Differences sometimes reflect the fact that one database has a particular transporter family while the other does not. Differing family definitions and hierarchical organizations were reconciled, resulting in recognition of 69 Pfam ‘Domains of Unknown Function’, which proved to be transport protein families to be renamed using TCDB annotations. Of over 400 potential new Pfam families identified from TCDB, 10% have already been added to Pfam, and TCDB has created 60 new entries based on Pfam data. This work, for the first time, reveals the benefits of comprehensive database comparisons and explains the differences between Pfam and TCDB. Oxford University Press 2015-09 2015-01-21 /pmc/articles/PMC4570203/ /pubmed/25614388 http://dx.doi.org/10.1093/bib/bbu053 Text en © The Author 2015. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Papers Chiang, Zachary Vastermark, Ake Punta, Marco Coggill, Penelope C. Mistry, Jaina Finn, Robert D. Saier, Milton H. The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam |
title | The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam |
title_full | The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam |
title_fullStr | The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam |
title_full_unstemmed | The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam |
title_short | The complexity, challenges and benefits of comparing two transporter classification systems in TCDB and Pfam |
title_sort | complexity, challenges and benefits of comparing two transporter classification systems in tcdb and pfam |
topic | Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4570203/ https://www.ncbi.nlm.nih.gov/pubmed/25614388 http://dx.doi.org/10.1093/bib/bbu053 |
work_keys_str_mv | AT chiangzachary thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT vastermarkake thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT puntamarco thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT coggillpenelopec thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT mistryjaina thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT finnrobertd thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT saiermiltonh thecomplexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT chiangzachary complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT vastermarkake complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT puntamarco complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT coggillpenelopec complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT mistryjaina complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT finnrobertd complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam AT saiermiltonh complexitychallengesandbenefitsofcomparingtwotransporterclassificationsystemsintcdbandpfam |