Cargando…

Protein domain recurrence and order can enhance prediction of protein functions

Motivation: Burgeoning sequencing technologies have generated massive amounts of genomic and proteomic data. Annotating the functions of proteins identified in this data has become a big and crucial problem. Various computational methods have been developed to infer the protein functions based on ei...

Descripción completa

Detalles Bibliográficos
Autores principales: Messih, Mario Abdel, Chitale, Meghana, Bajic, Vladimir B., Kihara, Daisuke, Gao, Xin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3436825/
https://www.ncbi.nlm.nih.gov/pubmed/22962465
http://dx.doi.org/10.1093/bioinformatics/bts398
_version_ 1782242706415681536
author Messih, Mario Abdel
Chitale, Meghana
Bajic, Vladimir B.
Kihara, Daisuke
Gao, Xin
author_facet Messih, Mario Abdel
Chitale, Meghana
Bajic, Vladimir B.
Kihara, Daisuke
Gao, Xin
author_sort Messih, Mario Abdel
collection PubMed
description Motivation: Burgeoning sequencing technologies have generated massive amounts of genomic and proteomic data. Annotating the functions of proteins identified in this data has become a big and crucial problem. Various computational methods have been developed to infer the protein functions based on either the sequences or domains of proteins. The existing methods, however, ignore the recurrence and the order of the protein domains in this function inference. Results: We developed two new methods to infer protein functions based on protein domain recurrence and domain order. Our first method, DRDO, calculates the posterior probability of the Gene Ontology terms based on domain recurrence and domain order information, whereas our second method, DRDO-NB, relies on the naïve Bayes methodology using the same domain architecture information. Our large-scale benchmark comparisons show strong improvements in the accuracy of the protein function inference achieved by our new methods, demonstrating that domain recurrence and order can provide important information for inference of protein functions. Availability: The new models are provided as open source programs at http://sfb.kaust.edu.sa/Pages/Software.aspx. Contact: dkihara@cs.purdue.edu, xin.gao@kaust.edu.sa Supplementary information: Supplementary data are available at Bioinformatics Online.
format Online
Article
Text
id pubmed-3436825
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-34368252012-12-12 Protein domain recurrence and order can enhance prediction of protein functions Messih, Mario Abdel Chitale, Meghana Bajic, Vladimir B. Kihara, Daisuke Gao, Xin Bioinformatics Original Papers Motivation: Burgeoning sequencing technologies have generated massive amounts of genomic and proteomic data. Annotating the functions of proteins identified in this data has become a big and crucial problem. Various computational methods have been developed to infer the protein functions based on either the sequences or domains of proteins. The existing methods, however, ignore the recurrence and the order of the protein domains in this function inference. Results: We developed two new methods to infer protein functions based on protein domain recurrence and domain order. Our first method, DRDO, calculates the posterior probability of the Gene Ontology terms based on domain recurrence and domain order information, whereas our second method, DRDO-NB, relies on the naïve Bayes methodology using the same domain architecture information. Our large-scale benchmark comparisons show strong improvements in the accuracy of the protein function inference achieved by our new methods, demonstrating that domain recurrence and order can provide important information for inference of protein functions. Availability: The new models are provided as open source programs at http://sfb.kaust.edu.sa/Pages/Software.aspx. Contact: dkihara@cs.purdue.edu, xin.gao@kaust.edu.sa Supplementary information: Supplementary data are available at Bioinformatics Online. Oxford University Press 2012-09-15 2012-09-03 /pmc/articles/PMC3436825/ /pubmed/22962465 http://dx.doi.org/10.1093/bioinformatics/bts398 Text en © The Author(s) (2012). Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Messih, Mario Abdel
Chitale, Meghana
Bajic, Vladimir B.
Kihara, Daisuke
Gao, Xin
Protein domain recurrence and order can enhance prediction of protein functions
title Protein domain recurrence and order can enhance prediction of protein functions
title_full Protein domain recurrence and order can enhance prediction of protein functions
title_fullStr Protein domain recurrence and order can enhance prediction of protein functions
title_full_unstemmed Protein domain recurrence and order can enhance prediction of protein functions
title_short Protein domain recurrence and order can enhance prediction of protein functions
title_sort protein domain recurrence and order can enhance prediction of protein functions
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3436825/
https://www.ncbi.nlm.nih.gov/pubmed/22962465
http://dx.doi.org/10.1093/bioinformatics/bts398
work_keys_str_mv AT messihmarioabdel proteindomainrecurrenceandordercanenhancepredictionofproteinfunctions
AT chitalemeghana proteindomainrecurrenceandordercanenhancepredictionofproteinfunctions
AT bajicvladimirb proteindomainrecurrenceandordercanenhancepredictionofproteinfunctions
AT kiharadaisuke proteindomainrecurrenceandordercanenhancepredictionofproteinfunctions
AT gaoxin proteindomainrecurrenceandordercanenhancepredictionofproteinfunctions