Cargando…
Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences
Classifying proteins into subgroups with similar molecular function on the basis of sequence is an important step in deriving reliable functional annotations computationally. So far, however, available classification procedures have been evaluated against protein subgroups that are defined by expert...
Autores principales: | , , , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3001082/ https://www.ncbi.nlm.nih.gov/pubmed/20705649 http://dx.doi.org/10.1093/nar/gkq714 |
_version_ | 1782193593935462400 |
---|---|
author | Santos, Miguel A. Turinsky, Andrei L. Ong, Serene Tsai, Jennifer Berger, Michael F. Badis, Gwenael Talukder, Shaheynoor Gehrke, Andrew R. Bulyk, Martha L. Hughes, Timothy R. Wodak, Shoshana J. |
author_facet | Santos, Miguel A. Turinsky, Andrei L. Ong, Serene Tsai, Jennifer Berger, Michael F. Badis, Gwenael Talukder, Shaheynoor Gehrke, Andrew R. Bulyk, Martha L. Hughes, Timothy R. Wodak, Shoshana J. |
author_sort | Santos, Miguel A. |
collection | PubMed |
description | Classifying proteins into subgroups with similar molecular function on the basis of sequence is an important step in deriving reliable functional annotations computationally. So far, however, available classification procedures have been evaluated against protein subgroups that are defined by experts using mainly qualitative descriptions of molecular function. Recently, in vitro DNA-binding preferences to all possible 8-nt DNA sequences have been measured for 178 mouse homeodomains using protein-binding microarrays, offering the unprecedented opportunity of evaluating the classification methods against quantitative measures of molecular function. To this end, we automatically derive homeodomain subtypes from the DNA-binding data and independently group the same domains using sequence information alone. We test five sequence-based methods, which use different sequence-similarity measures and algorithms to group sequences. Results show that methods that optimize the classification robustness reflect well the detailed functional specificity revealed by the experimental data. In some of these classifications, 73–83% of the subfamilies exactly correspond to, or are completely contained in, the function-based subtypes. Our findings demonstrate that certain sequence-based classifications are capable of yielding very specific molecular function annotations. The availability of quantitative descriptions of molecular function, such as DNA-binding data, will be a key factor in exploiting this potential in the future. |
format | Text |
id | pubmed-3001082 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-30010822010-12-13 Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences Santos, Miguel A. Turinsky, Andrei L. Ong, Serene Tsai, Jennifer Berger, Michael F. Badis, Gwenael Talukder, Shaheynoor Gehrke, Andrew R. Bulyk, Martha L. Hughes, Timothy R. Wodak, Shoshana J. Nucleic Acids Res Computational Biology Classifying proteins into subgroups with similar molecular function on the basis of sequence is an important step in deriving reliable functional annotations computationally. So far, however, available classification procedures have been evaluated against protein subgroups that are defined by experts using mainly qualitative descriptions of molecular function. Recently, in vitro DNA-binding preferences to all possible 8-nt DNA sequences have been measured for 178 mouse homeodomains using protein-binding microarrays, offering the unprecedented opportunity of evaluating the classification methods against quantitative measures of molecular function. To this end, we automatically derive homeodomain subtypes from the DNA-binding data and independently group the same domains using sequence information alone. We test five sequence-based methods, which use different sequence-similarity measures and algorithms to group sequences. Results show that methods that optimize the classification robustness reflect well the detailed functional specificity revealed by the experimental data. In some of these classifications, 73–83% of the subfamilies exactly correspond to, or are completely contained in, the function-based subtypes. Our findings demonstrate that certain sequence-based classifications are capable of yielding very specific molecular function annotations. The availability of quantitative descriptions of molecular function, such as DNA-binding data, will be a key factor in exploiting this potential in the future. Oxford University Press 2010-12 2010-08-12 /pmc/articles/PMC3001082/ /pubmed/20705649 http://dx.doi.org/10.1093/nar/gkq714 Text en © The Author(s) 2010. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/2.5 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Santos, Miguel A. Turinsky, Andrei L. Ong, Serene Tsai, Jennifer Berger, Michael F. Badis, Gwenael Talukder, Shaheynoor Gehrke, Andrew R. Bulyk, Martha L. Hughes, Timothy R. Wodak, Shoshana J. Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences |
title | Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences |
title_full | Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences |
title_fullStr | Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences |
title_full_unstemmed | Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences |
title_short | Objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro DNA-binding preferences |
title_sort | objective sequence-based subfamily classifications of mouse homeodomains reflect their in vitro dna-binding preferences |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3001082/ https://www.ncbi.nlm.nih.gov/pubmed/20705649 http://dx.doi.org/10.1093/nar/gkq714 |
work_keys_str_mv | AT santosmiguela objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences AT turinskyandreil objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences AT ongserene objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences AT tsaijennifer objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences AT bergermichaelf objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences AT badisgwenael objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences AT talukdershaheynoor objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences AT gehrkeandrewr objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences AT bulykmarthal objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences AT hughestimothyr objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences AT wodakshoshanaj objectivesequencebasedsubfamilyclassificationsofmousehomeodomainsreflecttheirinvitrodnabindingpreferences |