Cargando…

Systematic discovery of novel eukaryotic transcriptional regulators using sequence homology independent prediction

BACKGROUND: The molecular function of a gene is most commonly inferred by sequence similarity. Therefore, genes that lack sufficient sequence similarity to characterized genes (such as certain classes of transcriptional regulators) are difficult to classify using most function prediction algorithms...

Descripción completa

Detalles Bibliográficos
Autores principales: Bossi, Flavia, Fan, Jue, Xiao, Jun, Chandra, Lilyana, Shen, Max, Dorone, Yanniv, Wagner, Doris, Rhee, Seung Y.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5485742/
https://www.ncbi.nlm.nih.gov/pubmed/28651538
http://dx.doi.org/10.1186/s12864-017-3853-9
_version_ 1783246131488096256
author Bossi, Flavia
Fan, Jue
Xiao, Jun
Chandra, Lilyana
Shen, Max
Dorone, Yanniv
Wagner, Doris
Rhee, Seung Y.
author_facet Bossi, Flavia
Fan, Jue
Xiao, Jun
Chandra, Lilyana
Shen, Max
Dorone, Yanniv
Wagner, Doris
Rhee, Seung Y.
author_sort Bossi, Flavia
collection PubMed
description BACKGROUND: The molecular function of a gene is most commonly inferred by sequence similarity. Therefore, genes that lack sufficient sequence similarity to characterized genes (such as certain classes of transcriptional regulators) are difficult to classify using most function prediction algorithms and have remained uncharacterized. RESULTS: To identify novel transcriptional regulators systematically, we used a feature-based pipeline to screen protein families of unknown function. This method predicted 43 transcriptional regulator families in Arabidopsis thaliana, 7 families in Drosophila melanogaster, and 9 families in Homo sapiens. Literature curation validated 12 of the predicted families to be involved in transcriptional regulation. We tested 33 out of the 195 Arabidopsis putative transcriptional regulators for their ability to activate transcription of a reporter gene in planta and found twelve coactivators, five of which had no prior literature support. To investigate mechanisms of action in which the predicted regulators might work, we looked for interactors of an Arabidopsis candidate that did not show transactivation activity in planta and found that it might work with other members of its own family and a subunit of the Polycomb Repressive Complex 2 to regulate transcription. CONCLUSIONS: Our results demonstrate the feasibility of assigning molecular function to proteins of unknown function without depending on sequence similarity. In particular, we identified novel transcriptional regulators using biological features enriched in transcription factors. The predictions reported here should accelerate the characterization of novel regulators. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3853-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5485742
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54857422017-07-03 Systematic discovery of novel eukaryotic transcriptional regulators using sequence homology independent prediction Bossi, Flavia Fan, Jue Xiao, Jun Chandra, Lilyana Shen, Max Dorone, Yanniv Wagner, Doris Rhee, Seung Y. BMC Genomics Research Article BACKGROUND: The molecular function of a gene is most commonly inferred by sequence similarity. Therefore, genes that lack sufficient sequence similarity to characterized genes (such as certain classes of transcriptional regulators) are difficult to classify using most function prediction algorithms and have remained uncharacterized. RESULTS: To identify novel transcriptional regulators systematically, we used a feature-based pipeline to screen protein families of unknown function. This method predicted 43 transcriptional regulator families in Arabidopsis thaliana, 7 families in Drosophila melanogaster, and 9 families in Homo sapiens. Literature curation validated 12 of the predicted families to be involved in transcriptional regulation. We tested 33 out of the 195 Arabidopsis putative transcriptional regulators for their ability to activate transcription of a reporter gene in planta and found twelve coactivators, five of which had no prior literature support. To investigate mechanisms of action in which the predicted regulators might work, we looked for interactors of an Arabidopsis candidate that did not show transactivation activity in planta and found that it might work with other members of its own family and a subunit of the Polycomb Repressive Complex 2 to regulate transcription. CONCLUSIONS: Our results demonstrate the feasibility of assigning molecular function to proteins of unknown function without depending on sequence similarity. In particular, we identified novel transcriptional regulators using biological features enriched in transcription factors. The predictions reported here should accelerate the characterization of novel regulators. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-017-3853-9) contains supplementary material, which is available to authorized users. BioMed Central 2017-06-26 /pmc/articles/PMC5485742/ /pubmed/28651538 http://dx.doi.org/10.1186/s12864-017-3853-9 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Bossi, Flavia
Fan, Jue
Xiao, Jun
Chandra, Lilyana
Shen, Max
Dorone, Yanniv
Wagner, Doris
Rhee, Seung Y.
Systematic discovery of novel eukaryotic transcriptional regulators using sequence homology independent prediction
title Systematic discovery of novel eukaryotic transcriptional regulators using sequence homology independent prediction
title_full Systematic discovery of novel eukaryotic transcriptional regulators using sequence homology independent prediction
title_fullStr Systematic discovery of novel eukaryotic transcriptional regulators using sequence homology independent prediction
title_full_unstemmed Systematic discovery of novel eukaryotic transcriptional regulators using sequence homology independent prediction
title_short Systematic discovery of novel eukaryotic transcriptional regulators using sequence homology independent prediction
title_sort systematic discovery of novel eukaryotic transcriptional regulators using sequence homology independent prediction
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5485742/
https://www.ncbi.nlm.nih.gov/pubmed/28651538
http://dx.doi.org/10.1186/s12864-017-3853-9
work_keys_str_mv AT bossiflavia systematicdiscoveryofnoveleukaryotictranscriptionalregulatorsusingsequencehomologyindependentprediction
AT fanjue systematicdiscoveryofnoveleukaryotictranscriptionalregulatorsusingsequencehomologyindependentprediction
AT xiaojun systematicdiscoveryofnoveleukaryotictranscriptionalregulatorsusingsequencehomologyindependentprediction
AT chandralilyana systematicdiscoveryofnoveleukaryotictranscriptionalregulatorsusingsequencehomologyindependentprediction
AT shenmax systematicdiscoveryofnoveleukaryotictranscriptionalregulatorsusingsequencehomologyindependentprediction
AT doroneyanniv systematicdiscoveryofnoveleukaryotictranscriptionalregulatorsusingsequencehomologyindependentprediction
AT wagnerdoris systematicdiscoveryofnoveleukaryotictranscriptionalregulatorsusingsequencehomologyindependentprediction
AT rheeseungy systematicdiscoveryofnoveleukaryotictranscriptionalregulatorsusingsequencehomologyindependentprediction