Cargando…

Heterochromatic sequences in a Drosophila whole-genome shotgun assembly

BACKGROUND: Most eukaryotic genomes include a substantial repeat-rich fraction termed heterochromatin, which is concentrated in centric and telomeric regions. The repetitive nature of heterochromatic sequence makes it difficult to assemble and analyze. To better understand the heterochromatic compon...

Descripción completa

Detalles Bibliográficos
Autores principales: Hoskins, Roger A, Smith, Christopher D, Carlson, Joseph W, Carvalho, A Bernardo, Halpern, Aaron, Kaminker, Joshua S, Kennedy, Cameron, Mungall, Chris J, Sullivan, Beth A, Sutton, Granger G, Yasuhara, Jiro C, Wakimoto, Barbara T, Myers, Eugene W, Celniker, Susan E, Rubin, Gerald M, Karpen, Gary H
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2002
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC151187/
https://www.ncbi.nlm.nih.gov/pubmed/12537574
http://dx.doi.org/10.1186/gb-2002-3-12-research0085
_version_ 1782120664211128320
author Hoskins, Roger A
Smith, Christopher D
Carlson, Joseph W
Carvalho, A Bernardo
Halpern, Aaron
Kaminker, Joshua S
Kennedy, Cameron
Mungall, Chris J
Sullivan, Beth A
Sutton, Granger G
Yasuhara, Jiro C
Wakimoto, Barbara T
Myers, Eugene W
Celniker, Susan E
Rubin, Gerald M
Karpen, Gary H
author_facet Hoskins, Roger A
Smith, Christopher D
Carlson, Joseph W
Carvalho, A Bernardo
Halpern, Aaron
Kaminker, Joshua S
Kennedy, Cameron
Mungall, Chris J
Sullivan, Beth A
Sutton, Granger G
Yasuhara, Jiro C
Wakimoto, Barbara T
Myers, Eugene W
Celniker, Susan E
Rubin, Gerald M
Karpen, Gary H
author_sort Hoskins, Roger A
collection PubMed
description BACKGROUND: Most eukaryotic genomes include a substantial repeat-rich fraction termed heterochromatin, which is concentrated in centric and telomeric regions. The repetitive nature of heterochromatic sequence makes it difficult to assemble and analyze. To better understand the heterochromatic component of the Drosophila melanogaster genome, we characterized and annotated portions of a whole-genome shotgun sequence assembly. RESULTS: WGS3, an improved whole-genome shotgun assembly, includes 20.7 Mb of draft-quality sequence not represented in the Release 3 sequence spanning the euchromatin. We annotated this sequence using the methods employed in the re-annotation of the Release 3 euchromatic sequence. This analysis predicted 297 protein-coding genes and six non-protein-coding genes, including known heterochromatic genes, and regions of similarity to known transposable elements. Bacterial artificial chromosome (BAC)-based fluorescence in situ hybridization analysis was used to correlate the genomic sequence with the cytogenetic map in order to refine the genomic definition of the centric heterochromatin; on the basis of our cytological definition, the annotated Release 3 euchromatic sequence extends into the centric heterochromatin on each chromosome arm. CONCLUSIONS: Whole-genome shotgun assembly produced a reliable draft-quality sequence of a significant part of the Drosophila heterochromatin. Annotation of this sequence defined the intron-exon structures of 30 known protein-coding genes and 267 protein-coding gene models. The cytogenetic mapping suggests that an additional 150 predicted genes are located in heterochromatin at the base of the Release 3 euchromatic sequence. Our analysis suggests strategies for improving the sequence and annotation of the heterochromatic portions of the Drosophila and other complex genomes.
format Text
id pubmed-151187
institution National Center for Biotechnology Information
language English
publishDate 2002
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1511872003-03-13 Heterochromatic sequences in a Drosophila whole-genome shotgun assembly Hoskins, Roger A Smith, Christopher D Carlson, Joseph W Carvalho, A Bernardo Halpern, Aaron Kaminker, Joshua S Kennedy, Cameron Mungall, Chris J Sullivan, Beth A Sutton, Granger G Yasuhara, Jiro C Wakimoto, Barbara T Myers, Eugene W Celniker, Susan E Rubin, Gerald M Karpen, Gary H Genome Biol Research BACKGROUND: Most eukaryotic genomes include a substantial repeat-rich fraction termed heterochromatin, which is concentrated in centric and telomeric regions. The repetitive nature of heterochromatic sequence makes it difficult to assemble and analyze. To better understand the heterochromatic component of the Drosophila melanogaster genome, we characterized and annotated portions of a whole-genome shotgun sequence assembly. RESULTS: WGS3, an improved whole-genome shotgun assembly, includes 20.7 Mb of draft-quality sequence not represented in the Release 3 sequence spanning the euchromatin. We annotated this sequence using the methods employed in the re-annotation of the Release 3 euchromatic sequence. This analysis predicted 297 protein-coding genes and six non-protein-coding genes, including known heterochromatic genes, and regions of similarity to known transposable elements. Bacterial artificial chromosome (BAC)-based fluorescence in situ hybridization analysis was used to correlate the genomic sequence with the cytogenetic map in order to refine the genomic definition of the centric heterochromatin; on the basis of our cytological definition, the annotated Release 3 euchromatic sequence extends into the centric heterochromatin on each chromosome arm. CONCLUSIONS: Whole-genome shotgun assembly produced a reliable draft-quality sequence of a significant part of the Drosophila heterochromatin. Annotation of this sequence defined the intron-exon structures of 30 known protein-coding genes and 267 protein-coding gene models. The cytogenetic mapping suggests that an additional 150 predicted genes are located in heterochromatin at the base of the Release 3 euchromatic sequence. Our analysis suggests strategies for improving the sequence and annotation of the heterochromatic portions of the Drosophila and other complex genomes. BioMed Central 2002 2002-12-31 /pmc/articles/PMC151187/ /pubmed/12537574 http://dx.doi.org/10.1186/gb-2002-3-12-research0085 Text en Copyright © 2002 Hoskins et al., licensee BioMed Central Ltd
spellingShingle Research
Hoskins, Roger A
Smith, Christopher D
Carlson, Joseph W
Carvalho, A Bernardo
Halpern, Aaron
Kaminker, Joshua S
Kennedy, Cameron
Mungall, Chris J
Sullivan, Beth A
Sutton, Granger G
Yasuhara, Jiro C
Wakimoto, Barbara T
Myers, Eugene W
Celniker, Susan E
Rubin, Gerald M
Karpen, Gary H
Heterochromatic sequences in a Drosophila whole-genome shotgun assembly
title Heterochromatic sequences in a Drosophila whole-genome shotgun assembly
title_full Heterochromatic sequences in a Drosophila whole-genome shotgun assembly
title_fullStr Heterochromatic sequences in a Drosophila whole-genome shotgun assembly
title_full_unstemmed Heterochromatic sequences in a Drosophila whole-genome shotgun assembly
title_short Heterochromatic sequences in a Drosophila whole-genome shotgun assembly
title_sort heterochromatic sequences in a drosophila whole-genome shotgun assembly
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC151187/
https://www.ncbi.nlm.nih.gov/pubmed/12537574
http://dx.doi.org/10.1186/gb-2002-3-12-research0085
work_keys_str_mv AT hoskinsrogera heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT smithchristopherd heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT carlsonjosephw heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT carvalhoabernardo heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT halpernaaron heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT kaminkerjoshuas heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT kennedycameron heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT mungallchrisj heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT sullivanbetha heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT suttongrangerg heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT yasuharajiroc heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT wakimotobarbarat heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT myerseugenew heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT celnikersusane heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT rubingeraldm heterochromaticsequencesinadrosophilawholegenomeshotgunassembly
AT karpengaryh heterochromaticsequencesinadrosophilawholegenomeshotgunassembly