Cargando…
Protein domain recurrence and order can enhance prediction of protein functions
Motivation: Burgeoning sequencing technologies have generated massive amounts of genomic and proteomic data. Annotating the functions of proteins identified in this data has become a big and crucial problem. Various computational methods have been developed to infer the protein functions based on ei...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3436825/ https://www.ncbi.nlm.nih.gov/pubmed/22962465 http://dx.doi.org/10.1093/bioinformatics/bts398 |
_version_ | 1782242706415681536 |
---|---|
author | Messih, Mario Abdel Chitale, Meghana Bajic, Vladimir B. Kihara, Daisuke Gao, Xin |
author_facet | Messih, Mario Abdel Chitale, Meghana Bajic, Vladimir B. Kihara, Daisuke Gao, Xin |
author_sort | Messih, Mario Abdel |
collection | PubMed |
description | Motivation: Burgeoning sequencing technologies have generated massive amounts of genomic and proteomic data. Annotating the functions of proteins identified in this data has become a big and crucial problem. Various computational methods have been developed to infer the protein functions based on either the sequences or domains of proteins. The existing methods, however, ignore the recurrence and the order of the protein domains in this function inference. Results: We developed two new methods to infer protein functions based on protein domain recurrence and domain order. Our first method, DRDO, calculates the posterior probability of the Gene Ontology terms based on domain recurrence and domain order information, whereas our second method, DRDO-NB, relies on the naïve Bayes methodology using the same domain architecture information. Our large-scale benchmark comparisons show strong improvements in the accuracy of the protein function inference achieved by our new methods, demonstrating that domain recurrence and order can provide important information for inference of protein functions. Availability: The new models are provided as open source programs at http://sfb.kaust.edu.sa/Pages/Software.aspx. Contact: dkihara@cs.purdue.edu, xin.gao@kaust.edu.sa Supplementary information: Supplementary data are available at Bioinformatics Online. |
format | Online Article Text |
id | pubmed-3436825 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-34368252012-12-12 Protein domain recurrence and order can enhance prediction of protein functions Messih, Mario Abdel Chitale, Meghana Bajic, Vladimir B. Kihara, Daisuke Gao, Xin Bioinformatics Original Papers Motivation: Burgeoning sequencing technologies have generated massive amounts of genomic and proteomic data. Annotating the functions of proteins identified in this data has become a big and crucial problem. Various computational methods have been developed to infer the protein functions based on either the sequences or domains of proteins. The existing methods, however, ignore the recurrence and the order of the protein domains in this function inference. Results: We developed two new methods to infer protein functions based on protein domain recurrence and domain order. Our first method, DRDO, calculates the posterior probability of the Gene Ontology terms based on domain recurrence and domain order information, whereas our second method, DRDO-NB, relies on the naïve Bayes methodology using the same domain architecture information. Our large-scale benchmark comparisons show strong improvements in the accuracy of the protein function inference achieved by our new methods, demonstrating that domain recurrence and order can provide important information for inference of protein functions. Availability: The new models are provided as open source programs at http://sfb.kaust.edu.sa/Pages/Software.aspx. Contact: dkihara@cs.purdue.edu, xin.gao@kaust.edu.sa Supplementary information: Supplementary data are available at Bioinformatics Online. Oxford University Press 2012-09-15 2012-09-03 /pmc/articles/PMC3436825/ /pubmed/22962465 http://dx.doi.org/10.1093/bioinformatics/bts398 Text en © The Author(s) (2012). Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Messih, Mario Abdel Chitale, Meghana Bajic, Vladimir B. Kihara, Daisuke Gao, Xin Protein domain recurrence and order can enhance prediction of protein functions |
title | Protein domain recurrence and order can enhance prediction of protein functions |
title_full | Protein domain recurrence and order can enhance prediction of protein functions |
title_fullStr | Protein domain recurrence and order can enhance prediction of protein functions |
title_full_unstemmed | Protein domain recurrence and order can enhance prediction of protein functions |
title_short | Protein domain recurrence and order can enhance prediction of protein functions |
title_sort | protein domain recurrence and order can enhance prediction of protein functions |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3436825/ https://www.ncbi.nlm.nih.gov/pubmed/22962465 http://dx.doi.org/10.1093/bioinformatics/bts398 |
work_keys_str_mv | AT messihmarioabdel proteindomainrecurrenceandordercanenhancepredictionofproteinfunctions AT chitalemeghana proteindomainrecurrenceandordercanenhancepredictionofproteinfunctions AT bajicvladimirb proteindomainrecurrenceandordercanenhancepredictionofproteinfunctions AT kiharadaisuke proteindomainrecurrenceandordercanenhancepredictionofproteinfunctions AT gaoxin proteindomainrecurrenceandordercanenhancepredictionofproteinfunctions |