Cargando…

Expression-based segmentation of the Drosophila genome

BACKGROUND: It is generally accepted that gene order in eukaryotes is nonrandom, with adjacent genes often sharing expression patterns across tissues, and that this organization may be important for gene regulation. Here we describe a novel method, based on an explicit probability model instead of c...

Descripción completa

Detalles Bibliográficos
Autores principales: Rubin, Alan F, Green, Phil
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3909303/
https://www.ncbi.nlm.nih.gov/pubmed/24256206
http://dx.doi.org/10.1186/1471-2164-14-812
_version_ 1782301824692256768
author Rubin, Alan F
Green, Phil
author_facet Rubin, Alan F
Green, Phil
author_sort Rubin, Alan F
collection PubMed
description BACKGROUND: It is generally accepted that gene order in eukaryotes is nonrandom, with adjacent genes often sharing expression patterns across tissues, and that this organization may be important for gene regulation. Here we describe a novel method, based on an explicit probability model instead of correlation analysis, for identifying coordinately expressed gene clusters (‘coexpression segments’), apply it to Drosophila melanogaster, and look for epigenetic associations using publicly available data. RESULTS: We find that two-thirds of Drosophila genes fall into multigenic coexpression segments, and that such segments are of two main types, housekeeping and tissue-restricted. Consistent with correlation-based studies, we find that adjacent genes within the same segment tend to be physically closer to each other than to the adjacent genes in different segments, and that tissue-restricted segments are enriched for testis-expressed genes. Our segmentation pattern correlates with Hi-C based physical interaction domains, but segments are generally much smaller than domains. Intersegment regions (including those which do not correspond to physical domain boundaries) are enriched for insulator binding sites. CONCLUSIONS: We describe a novel approach for identifying coexpression clusters that does not require arbitrary cutoff values or heuristics, and find that coexpression of adjacent genes is widespread in the Drosophila genome. Coexpression segments appear to reflect a level of regulatory organization related to, but below that of physical interaction domains, and depending in part on insulator binding.
format Online
Article
Text
id pubmed-3909303
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39093032014-02-13 Expression-based segmentation of the Drosophila genome Rubin, Alan F Green, Phil BMC Genomics Research Article BACKGROUND: It is generally accepted that gene order in eukaryotes is nonrandom, with adjacent genes often sharing expression patterns across tissues, and that this organization may be important for gene regulation. Here we describe a novel method, based on an explicit probability model instead of correlation analysis, for identifying coordinately expressed gene clusters (‘coexpression segments’), apply it to Drosophila melanogaster, and look for epigenetic associations using publicly available data. RESULTS: We find that two-thirds of Drosophila genes fall into multigenic coexpression segments, and that such segments are of two main types, housekeeping and tissue-restricted. Consistent with correlation-based studies, we find that adjacent genes within the same segment tend to be physically closer to each other than to the adjacent genes in different segments, and that tissue-restricted segments are enriched for testis-expressed genes. Our segmentation pattern correlates with Hi-C based physical interaction domains, but segments are generally much smaller than domains. Intersegment regions (including those which do not correspond to physical domain boundaries) are enriched for insulator binding sites. CONCLUSIONS: We describe a novel approach for identifying coexpression clusters that does not require arbitrary cutoff values or heuristics, and find that coexpression of adjacent genes is widespread in the Drosophila genome. Coexpression segments appear to reflect a level of regulatory organization related to, but below that of physical interaction domains, and depending in part on insulator binding. BioMed Central 2013-11-20 /pmc/articles/PMC3909303/ /pubmed/24256206 http://dx.doi.org/10.1186/1471-2164-14-812 Text en Copyright © 2013 Rubin and Green; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Rubin, Alan F
Green, Phil
Expression-based segmentation of the Drosophila genome
title Expression-based segmentation of the Drosophila genome
title_full Expression-based segmentation of the Drosophila genome
title_fullStr Expression-based segmentation of the Drosophila genome
title_full_unstemmed Expression-based segmentation of the Drosophila genome
title_short Expression-based segmentation of the Drosophila genome
title_sort expression-based segmentation of the drosophila genome
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3909303/
https://www.ncbi.nlm.nih.gov/pubmed/24256206
http://dx.doi.org/10.1186/1471-2164-14-812
work_keys_str_mv AT rubinalanf expressionbasedsegmentationofthedrosophilagenome
AT greenphil expressionbasedsegmentationofthedrosophilagenome