Cargando…

GINOM: A statistical framework for assessing interval overlap of multiple genomic features

A common problem in genomics is to test for associations between two or more genomic features, typically represented as intervals interspersed across the genome. Existing methodologies can test for significant pairwise associations between two genomic intervals; however, they cannot test for associa...

Descripción completa

Detalles Bibliográficos
Autores principales: Bryner, Darshan, Criscione, Stephen, Leith, Andrew, Huynh, Quyen, Huffer, Fred, Neretti, Nicola
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5491313/
https://www.ncbi.nlm.nih.gov/pubmed/28617797
http://dx.doi.org/10.1371/journal.pcbi.1005586
_version_ 1783247116934578176
author Bryner, Darshan
Criscione, Stephen
Leith, Andrew
Huynh, Quyen
Huffer, Fred
Neretti, Nicola
author_facet Bryner, Darshan
Criscione, Stephen
Leith, Andrew
Huynh, Quyen
Huffer, Fred
Neretti, Nicola
author_sort Bryner, Darshan
collection PubMed
description A common problem in genomics is to test for associations between two or more genomic features, typically represented as intervals interspersed across the genome. Existing methodologies can test for significant pairwise associations between two genomic intervals; however, they cannot test for associations involving multiple sets of intervals. This limits our ability to uncover more complex, yet biologically important associations between multiple sets of genomic features. We introduce GINOM (Genomic INterval Overlap Model), a new method that enables testing of significant associations between multiple genomic features. We demonstrate GINOM’s ability to identify higher-order associations with both simulated and real data. In particular, we used GINOM to explore L1 retrotransposable element insertion bias in lung cancer and found a significant pairwise association between L1 insertions and heterochromatic marks. Unlike other methods, GINOM also detected an association between L1 insertions and gene bodies marked by a facultative heterochromatic mark, which could explain the observed bias for L1 insertions towards cancer-associated genes.
format Online
Article
Text
id pubmed-5491313
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-54913132017-07-18 GINOM: A statistical framework for assessing interval overlap of multiple genomic features Bryner, Darshan Criscione, Stephen Leith, Andrew Huynh, Quyen Huffer, Fred Neretti, Nicola PLoS Comput Biol Research Article A common problem in genomics is to test for associations between two or more genomic features, typically represented as intervals interspersed across the genome. Existing methodologies can test for significant pairwise associations between two genomic intervals; however, they cannot test for associations involving multiple sets of intervals. This limits our ability to uncover more complex, yet biologically important associations between multiple sets of genomic features. We introduce GINOM (Genomic INterval Overlap Model), a new method that enables testing of significant associations between multiple genomic features. We demonstrate GINOM’s ability to identify higher-order associations with both simulated and real data. In particular, we used GINOM to explore L1 retrotransposable element insertion bias in lung cancer and found a significant pairwise association between L1 insertions and heterochromatic marks. Unlike other methods, GINOM also detected an association between L1 insertions and gene bodies marked by a facultative heterochromatic mark, which could explain the observed bias for L1 insertions towards cancer-associated genes. Public Library of Science 2017-06-15 /pmc/articles/PMC5491313/ /pubmed/28617797 http://dx.doi.org/10.1371/journal.pcbi.1005586 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Bryner, Darshan
Criscione, Stephen
Leith, Andrew
Huynh, Quyen
Huffer, Fred
Neretti, Nicola
GINOM: A statistical framework for assessing interval overlap of multiple genomic features
title GINOM: A statistical framework for assessing interval overlap of multiple genomic features
title_full GINOM: A statistical framework for assessing interval overlap of multiple genomic features
title_fullStr GINOM: A statistical framework for assessing interval overlap of multiple genomic features
title_full_unstemmed GINOM: A statistical framework for assessing interval overlap of multiple genomic features
title_short GINOM: A statistical framework for assessing interval overlap of multiple genomic features
title_sort ginom: a statistical framework for assessing interval overlap of multiple genomic features
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5491313/
https://www.ncbi.nlm.nih.gov/pubmed/28617797
http://dx.doi.org/10.1371/journal.pcbi.1005586
work_keys_str_mv AT brynerdarshan ginomastatisticalframeworkforassessingintervaloverlapofmultiplegenomicfeatures
AT criscionestephen ginomastatisticalframeworkforassessingintervaloverlapofmultiplegenomicfeatures
AT leithandrew ginomastatisticalframeworkforassessingintervaloverlapofmultiplegenomicfeatures
AT huynhquyen ginomastatisticalframeworkforassessingintervaloverlapofmultiplegenomicfeatures
AT hufferfred ginomastatisticalframeworkforassessingintervaloverlapofmultiplegenomicfeatures
AT nerettinicola ginomastatisticalframeworkforassessingintervaloverlapofmultiplegenomicfeatures