Cargando…

Identification of upstream transcription factor binding sites in orthologous genes using mixed Student’s t-test statistics

BACKGROUND: Transcription factor (TF) regulates the transcription of DNA to messenger RNA by binding to upstream sequence motifs. Identifying the locations of known motifs in whole genomes is computationally intensive. METHODOLOGY/PRINCIPAL FINDINGS: This study presents a computational tool, named “...

Descripción completa

Detalles Bibliográficos
Autores principales: Huang, Tinghua, Xiao, Hong, Tian, Qi, He, Zhen, Yuan, Cheng, Lin, Zezhao, Gao, Xuejun, Yao, Min
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205514/
https://www.ncbi.nlm.nih.gov/pubmed/35671296
http://dx.doi.org/10.1371/journal.pcbi.1009773
_version_ 1784729149742514176
author Huang, Tinghua
Xiao, Hong
Tian, Qi
He, Zhen
Yuan, Cheng
Lin, Zezhao
Gao, Xuejun
Yao, Min
author_facet Huang, Tinghua
Xiao, Hong
Tian, Qi
He, Zhen
Yuan, Cheng
Lin, Zezhao
Gao, Xuejun
Yao, Min
author_sort Huang, Tinghua
collection PubMed
description BACKGROUND: Transcription factor (TF) regulates the transcription of DNA to messenger RNA by binding to upstream sequence motifs. Identifying the locations of known motifs in whole genomes is computationally intensive. METHODOLOGY/PRINCIPAL FINDINGS: This study presents a computational tool, named “Grit”, for screening TF-binding sites (TFBS) by coordinating transcription factors to their promoter sequences in orthologous genes. This tool employs a newly developed mixed Student’s t-test statistical method that detects high-scoring binding sites utilizing conservation information among species. The program performs sequence scanning at a rate of 3.2 Mbp/s on a quad-core Amazon server and has been benchmarked by the well-established ChIP-Seq datasets, putting Grit amongst the top-ranked TFBS predictors. It significantly outperforms the well-known transcription factor motif scanning tools, Pscan (4.8%) and FIMO (17.8%), in analyzing well-documented ChIP-Atlas human genome Chip-Seq datasets. SIGNIFICANCE: Grit is a good alternative to current available motif scanning tools.
format Online
Article
Text
id pubmed-9205514
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-92055142022-06-18 Identification of upstream transcription factor binding sites in orthologous genes using mixed Student’s t-test statistics Huang, Tinghua Xiao, Hong Tian, Qi He, Zhen Yuan, Cheng Lin, Zezhao Gao, Xuejun Yao, Min PLoS Comput Biol Research Article BACKGROUND: Transcription factor (TF) regulates the transcription of DNA to messenger RNA by binding to upstream sequence motifs. Identifying the locations of known motifs in whole genomes is computationally intensive. METHODOLOGY/PRINCIPAL FINDINGS: This study presents a computational tool, named “Grit”, for screening TF-binding sites (TFBS) by coordinating transcription factors to their promoter sequences in orthologous genes. This tool employs a newly developed mixed Student’s t-test statistical method that detects high-scoring binding sites utilizing conservation information among species. The program performs sequence scanning at a rate of 3.2 Mbp/s on a quad-core Amazon server and has been benchmarked by the well-established ChIP-Seq datasets, putting Grit amongst the top-ranked TFBS predictors. It significantly outperforms the well-known transcription factor motif scanning tools, Pscan (4.8%) and FIMO (17.8%), in analyzing well-documented ChIP-Atlas human genome Chip-Seq datasets. SIGNIFICANCE: Grit is a good alternative to current available motif scanning tools. Public Library of Science 2022-06-07 /pmc/articles/PMC9205514/ /pubmed/35671296 http://dx.doi.org/10.1371/journal.pcbi.1009773 Text en © 2022 Huang et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Huang, Tinghua
Xiao, Hong
Tian, Qi
He, Zhen
Yuan, Cheng
Lin, Zezhao
Gao, Xuejun
Yao, Min
Identification of upstream transcription factor binding sites in orthologous genes using mixed Student’s t-test statistics
title Identification of upstream transcription factor binding sites in orthologous genes using mixed Student’s t-test statistics
title_full Identification of upstream transcription factor binding sites in orthologous genes using mixed Student’s t-test statistics
title_fullStr Identification of upstream transcription factor binding sites in orthologous genes using mixed Student’s t-test statistics
title_full_unstemmed Identification of upstream transcription factor binding sites in orthologous genes using mixed Student’s t-test statistics
title_short Identification of upstream transcription factor binding sites in orthologous genes using mixed Student’s t-test statistics
title_sort identification of upstream transcription factor binding sites in orthologous genes using mixed student’s t-test statistics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205514/
https://www.ncbi.nlm.nih.gov/pubmed/35671296
http://dx.doi.org/10.1371/journal.pcbi.1009773
work_keys_str_mv AT huangtinghua identificationofupstreamtranscriptionfactorbindingsitesinorthologousgenesusingmixedstudentstteststatistics
AT xiaohong identificationofupstreamtranscriptionfactorbindingsitesinorthologousgenesusingmixedstudentstteststatistics
AT tianqi identificationofupstreamtranscriptionfactorbindingsitesinorthologousgenesusingmixedstudentstteststatistics
AT hezhen identificationofupstreamtranscriptionfactorbindingsitesinorthologousgenesusingmixedstudentstteststatistics
AT yuancheng identificationofupstreamtranscriptionfactorbindingsitesinorthologousgenesusingmixedstudentstteststatistics
AT linzezhao identificationofupstreamtranscriptionfactorbindingsitesinorthologousgenesusingmixedstudentstteststatistics
AT gaoxuejun identificationofupstreamtranscriptionfactorbindingsitesinorthologousgenesusingmixedstudentstteststatistics
AT yaomin identificationofupstreamtranscriptionfactorbindingsitesinorthologousgenesusingmixedstudentstteststatistics