Cargando…

CardioTF, a database of deconstructing transcriptional circuits in the heart system

Background: Information on cardiovascular gene transcription is fragmented and far behind the present requirements of the systems biology field. To create a comprehensive source of data for cardiovascular gene regulation and to facilitate a deeper understanding of genomic data, the CardioTF database...

Descripción completa

Detalles Bibliográficos
Autor principal: Zhen, Yisong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5012272/
https://www.ncbi.nlm.nih.gov/pubmed/27635320
http://dx.doi.org/10.7717/peerj.2339
_version_ 1782451978299768832
author Zhen, Yisong
author_facet Zhen, Yisong
author_sort Zhen, Yisong
collection PubMed
description Background: Information on cardiovascular gene transcription is fragmented and far behind the present requirements of the systems biology field. To create a comprehensive source of data for cardiovascular gene regulation and to facilitate a deeper understanding of genomic data, the CardioTF database was constructed. The purpose of this database is to collate information on cardiovascular transcription factors (TFs), position weight matrices (PWMs), and enhancer sequences discovered using the ChIP-seq method. Methods: The Naïve-Bayes algorithm was used to classify literature and identify all PubMed abstracts on cardiovascular development. The natural language learning tool GNAT was then used to identify corresponding gene names embedded within these abstracts. Local Perl scripts were used to integrate and dump data from public databases into the MariaDB management system (MySQL). In-house R scripts were written to analyze and visualize the results. Results: Known cardiovascular TFs from humans and human homologs from fly, Ciona, zebrafish, frog, chicken, and mouse were identified and deposited in the database. PWMs from Jaspar, hPDI, and UniPROBE databases were deposited in the database and can be retrieved using their corresponding TF names. Gene enhancer regions from various sources of ChIP-seq data were deposited into the database and were able to be visualized by graphical output. Besides biocuration, mouse homologs of the 81 core cardiac TFs were selected using a Naïve-Bayes approach and then by intersecting four independent data sources: RNA profiling, expert annotation, PubMed abstracts and phenotype. Discussion: The CardioTF database can be used as a portal to construct transcriptional network of cardiac development. Availability and Implementation: Database URL: http://www.cardiosignal.org/database/cardiotf.html.
format Online
Article
Text
id pubmed-5012272
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-50122722016-09-15 CardioTF, a database of deconstructing transcriptional circuits in the heart system Zhen, Yisong PeerJ Bioinformatics Background: Information on cardiovascular gene transcription is fragmented and far behind the present requirements of the systems biology field. To create a comprehensive source of data for cardiovascular gene regulation and to facilitate a deeper understanding of genomic data, the CardioTF database was constructed. The purpose of this database is to collate information on cardiovascular transcription factors (TFs), position weight matrices (PWMs), and enhancer sequences discovered using the ChIP-seq method. Methods: The Naïve-Bayes algorithm was used to classify literature and identify all PubMed abstracts on cardiovascular development. The natural language learning tool GNAT was then used to identify corresponding gene names embedded within these abstracts. Local Perl scripts were used to integrate and dump data from public databases into the MariaDB management system (MySQL). In-house R scripts were written to analyze and visualize the results. Results: Known cardiovascular TFs from humans and human homologs from fly, Ciona, zebrafish, frog, chicken, and mouse were identified and deposited in the database. PWMs from Jaspar, hPDI, and UniPROBE databases were deposited in the database and can be retrieved using their corresponding TF names. Gene enhancer regions from various sources of ChIP-seq data were deposited into the database and were able to be visualized by graphical output. Besides biocuration, mouse homologs of the 81 core cardiac TFs were selected using a Naïve-Bayes approach and then by intersecting four independent data sources: RNA profiling, expert annotation, PubMed abstracts and phenotype. Discussion: The CardioTF database can be used as a portal to construct transcriptional network of cardiac development. Availability and Implementation: Database URL: http://www.cardiosignal.org/database/cardiotf.html. PeerJ Inc. 2016-08-23 /pmc/articles/PMC5012272/ /pubmed/27635320 http://dx.doi.org/10.7717/peerj.2339 Text en © 2016 Zhen http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Bioinformatics
Zhen, Yisong
CardioTF, a database of deconstructing transcriptional circuits in the heart system
title CardioTF, a database of deconstructing transcriptional circuits in the heart system
title_full CardioTF, a database of deconstructing transcriptional circuits in the heart system
title_fullStr CardioTF, a database of deconstructing transcriptional circuits in the heart system
title_full_unstemmed CardioTF, a database of deconstructing transcriptional circuits in the heart system
title_short CardioTF, a database of deconstructing transcriptional circuits in the heart system
title_sort cardiotf, a database of deconstructing transcriptional circuits in the heart system
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5012272/
https://www.ncbi.nlm.nih.gov/pubmed/27635320
http://dx.doi.org/10.7717/peerj.2339
work_keys_str_mv AT zhenyisong cardiotfadatabaseofdeconstructingtranscriptionalcircuitsintheheartsystem