Cargando…

A Global Clustering Algorithm to Identify Long Intergenic Non-Coding RNA - with Applications in Mouse Macrophages

Identification of diffuse signals from the chromatin immunoprecipitation and high-throughput massively parallel sequencing (ChIP-Seq) technology poses significant computational challenges, and there are few methods currently available. We present a novel global clustering approach to enrich diffuse...

Descripción completa

Detalles Bibliográficos
Autores principales: Garmire, Lana X., Garmire, David G., Huang, Wendy, Yao, Joyee, Glass, Christopher K., Subramaniam, Shankar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3184070/
https://www.ncbi.nlm.nih.gov/pubmed/21980340
http://dx.doi.org/10.1371/journal.pone.0024051
_version_ 1782213052230270976
author Garmire, Lana X.
Garmire, David G.
Huang, Wendy
Yao, Joyee
Glass, Christopher K.
Subramaniam, Shankar
author_facet Garmire, Lana X.
Garmire, David G.
Huang, Wendy
Yao, Joyee
Glass, Christopher K.
Subramaniam, Shankar
author_sort Garmire, Lana X.
collection PubMed
description Identification of diffuse signals from the chromatin immunoprecipitation and high-throughput massively parallel sequencing (ChIP-Seq) technology poses significant computational challenges, and there are few methods currently available. We present a novel global clustering approach to enrich diffuse CHIP-Seq signals of RNA polymerase II and histone 3 lysine 4 trimethylation (H3K4Me3) and apply it to identify putative long intergenic non-coding RNAs (lincRNAs) in macrophage cells. Our global clustering method compares favorably to the local clustering method SICER that was also designed to identify diffuse CHIP-Seq signals. The validity of the algorithm is confirmed at several levels. First, 8 out of a total of 11 selected putative lincRNA regions in primary macrophages respond to lipopolysaccharides (LPS) treatment as predicted by our computational method. Second, the genes nearest to lincRNAs are enriched with biological functions related to metabolic processes under resting conditions but with developmental and immune-related functions under LPS treatment. Third, the putative lincRNAs have conserved promoters, modestly conserved exons, and expected secondary structures by prediction. Last, they are enriched with motifs of transcription factors such as PU.1 and AP.1, previously shown to be important lineage determining factors in macrophages, and 83% of them overlap with distal enhancers markers. In summary, GCLS based on RNA polymerase II and H3K4Me3 CHIP-Seq method can effectively detect putative lincRNAs that exhibit expected characteristics, as exemplified by macrophages in the study.
format Online
Article
Text
id pubmed-3184070
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31840702011-10-06 A Global Clustering Algorithm to Identify Long Intergenic Non-Coding RNA - with Applications in Mouse Macrophages Garmire, Lana X. Garmire, David G. Huang, Wendy Yao, Joyee Glass, Christopher K. Subramaniam, Shankar PLoS One Research Article Identification of diffuse signals from the chromatin immunoprecipitation and high-throughput massively parallel sequencing (ChIP-Seq) technology poses significant computational challenges, and there are few methods currently available. We present a novel global clustering approach to enrich diffuse CHIP-Seq signals of RNA polymerase II and histone 3 lysine 4 trimethylation (H3K4Me3) and apply it to identify putative long intergenic non-coding RNAs (lincRNAs) in macrophage cells. Our global clustering method compares favorably to the local clustering method SICER that was also designed to identify diffuse CHIP-Seq signals. The validity of the algorithm is confirmed at several levels. First, 8 out of a total of 11 selected putative lincRNA regions in primary macrophages respond to lipopolysaccharides (LPS) treatment as predicted by our computational method. Second, the genes nearest to lincRNAs are enriched with biological functions related to metabolic processes under resting conditions but with developmental and immune-related functions under LPS treatment. Third, the putative lincRNAs have conserved promoters, modestly conserved exons, and expected secondary structures by prediction. Last, they are enriched with motifs of transcription factors such as PU.1 and AP.1, previously shown to be important lineage determining factors in macrophages, and 83% of them overlap with distal enhancers markers. In summary, GCLS based on RNA polymerase II and H3K4Me3 CHIP-Seq method can effectively detect putative lincRNAs that exhibit expected characteristics, as exemplified by macrophages in the study. Public Library of Science 2011-09-30 /pmc/articles/PMC3184070/ /pubmed/21980340 http://dx.doi.org/10.1371/journal.pone.0024051 Text en This is an open-access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 public domain dedication. https://creativecommons.org/publicdomain/zero/1.0/ This is an open-access article distributed under the terms of the Creative Commons Public Domain declaration, which stipulates that, once placed in the public domain, this work may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose.
spellingShingle Research Article
Garmire, Lana X.
Garmire, David G.
Huang, Wendy
Yao, Joyee
Glass, Christopher K.
Subramaniam, Shankar
A Global Clustering Algorithm to Identify Long Intergenic Non-Coding RNA - with Applications in Mouse Macrophages
title A Global Clustering Algorithm to Identify Long Intergenic Non-Coding RNA - with Applications in Mouse Macrophages
title_full A Global Clustering Algorithm to Identify Long Intergenic Non-Coding RNA - with Applications in Mouse Macrophages
title_fullStr A Global Clustering Algorithm to Identify Long Intergenic Non-Coding RNA - with Applications in Mouse Macrophages
title_full_unstemmed A Global Clustering Algorithm to Identify Long Intergenic Non-Coding RNA - with Applications in Mouse Macrophages
title_short A Global Clustering Algorithm to Identify Long Intergenic Non-Coding RNA - with Applications in Mouse Macrophages
title_sort global clustering algorithm to identify long intergenic non-coding rna - with applications in mouse macrophages
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3184070/
https://www.ncbi.nlm.nih.gov/pubmed/21980340
http://dx.doi.org/10.1371/journal.pone.0024051
work_keys_str_mv AT garmirelanax aglobalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT garmiredavidg aglobalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT huangwendy aglobalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT yaojoyee aglobalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT glasschristopherk aglobalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT subramaniamshankar aglobalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT garmirelanax globalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT garmiredavidg globalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT huangwendy globalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT yaojoyee globalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT glasschristopherk globalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages
AT subramaniamshankar globalclusteringalgorithmtoidentifylongintergenicnoncodingrnawithapplicationsinmousemacrophages