Cargando…
Expression-based segmentation of the Drosophila genome
BACKGROUND: It is generally accepted that gene order in eukaryotes is nonrandom, with adjacent genes often sharing expression patterns across tissues, and that this organization may be important for gene regulation. Here we describe a novel method, based on an explicit probability model instead of c...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3909303/ https://www.ncbi.nlm.nih.gov/pubmed/24256206 http://dx.doi.org/10.1186/1471-2164-14-812 |
_version_ | 1782301824692256768 |
---|---|
author | Rubin, Alan F Green, Phil |
author_facet | Rubin, Alan F Green, Phil |
author_sort | Rubin, Alan F |
collection | PubMed |
description | BACKGROUND: It is generally accepted that gene order in eukaryotes is nonrandom, with adjacent genes often sharing expression patterns across tissues, and that this organization may be important for gene regulation. Here we describe a novel method, based on an explicit probability model instead of correlation analysis, for identifying coordinately expressed gene clusters (‘coexpression segments’), apply it to Drosophila melanogaster, and look for epigenetic associations using publicly available data. RESULTS: We find that two-thirds of Drosophila genes fall into multigenic coexpression segments, and that such segments are of two main types, housekeeping and tissue-restricted. Consistent with correlation-based studies, we find that adjacent genes within the same segment tend to be physically closer to each other than to the adjacent genes in different segments, and that tissue-restricted segments are enriched for testis-expressed genes. Our segmentation pattern correlates with Hi-C based physical interaction domains, but segments are generally much smaller than domains. Intersegment regions (including those which do not correspond to physical domain boundaries) are enriched for insulator binding sites. CONCLUSIONS: We describe a novel approach for identifying coexpression clusters that does not require arbitrary cutoff values or heuristics, and find that coexpression of adjacent genes is widespread in the Drosophila genome. Coexpression segments appear to reflect a level of regulatory organization related to, but below that of physical interaction domains, and depending in part on insulator binding. |
format | Online Article Text |
id | pubmed-3909303 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-39093032014-02-13 Expression-based segmentation of the Drosophila genome Rubin, Alan F Green, Phil BMC Genomics Research Article BACKGROUND: It is generally accepted that gene order in eukaryotes is nonrandom, with adjacent genes often sharing expression patterns across tissues, and that this organization may be important for gene regulation. Here we describe a novel method, based on an explicit probability model instead of correlation analysis, for identifying coordinately expressed gene clusters (‘coexpression segments’), apply it to Drosophila melanogaster, and look for epigenetic associations using publicly available data. RESULTS: We find that two-thirds of Drosophila genes fall into multigenic coexpression segments, and that such segments are of two main types, housekeeping and tissue-restricted. Consistent with correlation-based studies, we find that adjacent genes within the same segment tend to be physically closer to each other than to the adjacent genes in different segments, and that tissue-restricted segments are enriched for testis-expressed genes. Our segmentation pattern correlates with Hi-C based physical interaction domains, but segments are generally much smaller than domains. Intersegment regions (including those which do not correspond to physical domain boundaries) are enriched for insulator binding sites. CONCLUSIONS: We describe a novel approach for identifying coexpression clusters that does not require arbitrary cutoff values or heuristics, and find that coexpression of adjacent genes is widespread in the Drosophila genome. Coexpression segments appear to reflect a level of regulatory organization related to, but below that of physical interaction domains, and depending in part on insulator binding. BioMed Central 2013-11-20 /pmc/articles/PMC3909303/ /pubmed/24256206 http://dx.doi.org/10.1186/1471-2164-14-812 Text en Copyright © 2013 Rubin and Green; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Rubin, Alan F Green, Phil Expression-based segmentation of the Drosophila genome |
title | Expression-based segmentation of the Drosophila genome |
title_full | Expression-based segmentation of the Drosophila genome |
title_fullStr | Expression-based segmentation of the Drosophila genome |
title_full_unstemmed | Expression-based segmentation of the Drosophila genome |
title_short | Expression-based segmentation of the Drosophila genome |
title_sort | expression-based segmentation of the drosophila genome |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3909303/ https://www.ncbi.nlm.nih.gov/pubmed/24256206 http://dx.doi.org/10.1186/1471-2164-14-812 |
work_keys_str_mv | AT rubinalanf expressionbasedsegmentationofthedrosophilagenome AT greenphil expressionbasedsegmentationofthedrosophilagenome |