Cargando…

Genome-wide computational analysis of potential long noncoding RNA mediated DNA:DNA:RNA triplexes in the human genome

BACKGROUND: Only a handful of long noncoding RNAs have been functionally characterized. They are known to modulate regulation through interacting with other biomolecules in the cell: DNA, RNA and protein. Though there have been detailed investigations on lncRNA-miRNA and lncRNA-protein interactions,...

Descripción completa

Detalles Bibliográficos
Autores principales: Jalali, Saakshi, Singh, Amrita, Maiti, Souvik, Scaria, Vinod
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7670996/
https://www.ncbi.nlm.nih.gov/pubmed/28865451
http://dx.doi.org/10.1186/s12967-017-1282-9
_version_ 1783610849264402432
author Jalali, Saakshi
Singh, Amrita
Maiti, Souvik
Scaria, Vinod
author_facet Jalali, Saakshi
Singh, Amrita
Maiti, Souvik
Scaria, Vinod
author_sort Jalali, Saakshi
collection PubMed
description BACKGROUND: Only a handful of long noncoding RNAs have been functionally characterized. They are known to modulate regulation through interacting with other biomolecules in the cell: DNA, RNA and protein. Though there have been detailed investigations on lncRNA-miRNA and lncRNA-protein interactions, the interaction of lncRNAs with DNA have not been studied extensively. In the present study, we explore whether lncRNAs could modulate genomic regulation by interacting with DNA through the formation of highly stable DNA:DNA:RNA triplexes. METHODS: We computationally screened 23,898 lncRNA transcripts as annotated by GENCODE, across the human genome for potential triplex forming sequence stretches (PTS). The PTS frequencies were compared across 5′UTR, CDS, 3′UTR, introns, promoter and 1000 bases downstream of the transcription termination sites. These regions were annotated by mapping to experimental regulatory regions, classes of repeat regions and transcription factors. We validated few putative triplex mediated interactions where lncRNA-gene pair interaction is via pyrimidine triplex motif using biophysical methods. RESULTS: We identified 20,04,034 PTS sites to be enriched in promoter and intronic regions across human genome. Additional analysis of the association of PTS with core promoter elements revealed a systematic paucity of PTS in all regulatory regions, except TF binding sites. A total of 25 transcription factors were found to be associated with PTS. Using an interaction network, we showed that a subset of the triplex forming lncRNAs, have a positive association with gene promoters. We also demonstrated an in vitro interaction of one lncRNA candidate with its predicted gene target promoter regions. CONCLUSIONS: Our analysis shows that PTS are enriched in gene promoter and largely associated with simple repeats. The current study suggests a major role of a subset of lncRNAs in mediating chromatin organization modulation through CTCF and NSRF proteins. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12967-017-1282-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-7670996
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-76709962020-11-20 Genome-wide computational analysis of potential long noncoding RNA mediated DNA:DNA:RNA triplexes in the human genome Jalali, Saakshi Singh, Amrita Maiti, Souvik Scaria, Vinod J Transl Med Research BACKGROUND: Only a handful of long noncoding RNAs have been functionally characterized. They are known to modulate regulation through interacting with other biomolecules in the cell: DNA, RNA and protein. Though there have been detailed investigations on lncRNA-miRNA and lncRNA-protein interactions, the interaction of lncRNAs with DNA have not been studied extensively. In the present study, we explore whether lncRNAs could modulate genomic regulation by interacting with DNA through the formation of highly stable DNA:DNA:RNA triplexes. METHODS: We computationally screened 23,898 lncRNA transcripts as annotated by GENCODE, across the human genome for potential triplex forming sequence stretches (PTS). The PTS frequencies were compared across 5′UTR, CDS, 3′UTR, introns, promoter and 1000 bases downstream of the transcription termination sites. These regions were annotated by mapping to experimental regulatory regions, classes of repeat regions and transcription factors. We validated few putative triplex mediated interactions where lncRNA-gene pair interaction is via pyrimidine triplex motif using biophysical methods. RESULTS: We identified 20,04,034 PTS sites to be enriched in promoter and intronic regions across human genome. Additional analysis of the association of PTS with core promoter elements revealed a systematic paucity of PTS in all regulatory regions, except TF binding sites. A total of 25 transcription factors were found to be associated with PTS. Using an interaction network, we showed that a subset of the triplex forming lncRNAs, have a positive association with gene promoters. We also demonstrated an in vitro interaction of one lncRNA candidate with its predicted gene target promoter regions. CONCLUSIONS: Our analysis shows that PTS are enriched in gene promoter and largely associated with simple repeats. The current study suggests a major role of a subset of lncRNAs in mediating chromatin organization modulation through CTCF and NSRF proteins. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12967-017-1282-9) contains supplementary material, which is available to authorized users. BioMed Central 2017-09-02 /pmc/articles/PMC7670996/ /pubmed/28865451 http://dx.doi.org/10.1186/s12967-017-1282-9 Text en © The Author(s) 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Jalali, Saakshi
Singh, Amrita
Maiti, Souvik
Scaria, Vinod
Genome-wide computational analysis of potential long noncoding RNA mediated DNA:DNA:RNA triplexes in the human genome
title Genome-wide computational analysis of potential long noncoding RNA mediated DNA:DNA:RNA triplexes in the human genome
title_full Genome-wide computational analysis of potential long noncoding RNA mediated DNA:DNA:RNA triplexes in the human genome
title_fullStr Genome-wide computational analysis of potential long noncoding RNA mediated DNA:DNA:RNA triplexes in the human genome
title_full_unstemmed Genome-wide computational analysis of potential long noncoding RNA mediated DNA:DNA:RNA triplexes in the human genome
title_short Genome-wide computational analysis of potential long noncoding RNA mediated DNA:DNA:RNA triplexes in the human genome
title_sort genome-wide computational analysis of potential long noncoding rna mediated dna:dna:rna triplexes in the human genome
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7670996/
https://www.ncbi.nlm.nih.gov/pubmed/28865451
http://dx.doi.org/10.1186/s12967-017-1282-9
work_keys_str_mv AT jalalisaakshi genomewidecomputationalanalysisofpotentiallongnoncodingrnamediateddnadnarnatriplexesinthehumangenome
AT singhamrita genomewidecomputationalanalysisofpotentiallongnoncodingrnamediateddnadnarnatriplexesinthehumangenome
AT maitisouvik genomewidecomputationalanalysisofpotentiallongnoncodingrnamediateddnadnarnatriplexesinthehumangenome
AT scariavinod genomewidecomputationalanalysisofpotentiallongnoncodingrnamediateddnadnarnatriplexesinthehumangenome