Cargando…

Mouse chromocenters DNA content: sequencing and in silico analysis

BACKGROUND: Chromocenters are defined as a punctate condensed blocks of chromatin in the interphase cell nuclei of certain cell types with unknown biological significance. In recent years a progress in revealing of chromocenters protein content has been made although the details of DNA content withi...

Descripción completa

Detalles Bibliográficos
Autores principales: Ostromyshenskii, Dmitrii I., Chernyaeva, Ekaterina N., Kuznetsova, Inna S., Podgornaya, Olga I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5819297/
https://www.ncbi.nlm.nih.gov/pubmed/29458329
http://dx.doi.org/10.1186/s12864-018-4534-z
_version_ 1783301187419766784
author Ostromyshenskii, Dmitrii I.
Chernyaeva, Ekaterina N.
Kuznetsova, Inna S.
Podgornaya, Olga I.
author_facet Ostromyshenskii, Dmitrii I.
Chernyaeva, Ekaterina N.
Kuznetsova, Inna S.
Podgornaya, Olga I.
author_sort Ostromyshenskii, Dmitrii I.
collection PubMed
description BACKGROUND: Chromocenters are defined as a punctate condensed blocks of chromatin in the interphase cell nuclei of certain cell types with unknown biological significance. In recent years a progress in revealing of chromocenters protein content has been made although the details of DNA content within constitutive heterochromatin still remain unclear. It is known that these regions are enriched in tandem repeats (TR) and transposable elements. Quick improvement of genome sequencing does not help to assemble the heterochromatic regions due to lack of appropriate bioinformatics techniques. RESULTS: Chromocenters DNA have been isolated by a biochemical approach from mouse liver cells nuclei and sequenced on the Illumina MiSeq resulting in ChrmC dataset. Analysis of ChrmC dataset by the bioinformatics tools available revealed that the major component of chromocenter DNA are TRs: ~ 66% MaSat and ~ 4% MiSat. Other previously classified TR families constitute ~ 1% of ChrmC dataset. About 6% of chromocenters DNA are mostly unannotated sequences. In the contigs assembled with IDBA_UD there are many fragments of heterochromatic Y-chromosome, rDNA and other pseudo-genes and non-coding DNA. A protein coding sfi1 homolog gene fragment was also found in contigs. The Sfi1 homolog gene is located on the chromosome 11 in the reference genome very close to the Golden Pass Gap (a ~ 3 Mb empty region reserved to the pericentromeric region) and proves the purity of chromocenters isolation. The second major fraction are non-LTR retroposons (SINE and LINE) with overwhelming majority of LINE - ~ 11% of ChrmC. Most of the LINE fragments are from the ~ 2 kb region at the end of the 2nd ORF and its’ flanking region. The precise LINEs’ segment of ~ 2 kb is the necessary mouse constitutive heterohromatin component together with TR. The third most abundant fraction are ERVs. The ERV distribution in chromocenters differs from the whole genome: IAP (ERV2 class) is the most numerous in ChrmC while MaLR (ERV3 class) prevails in the reference genome. IAP and its LTR also prevail in TR containing contigs extracted from the WGS dataset. In silico prediction of IAP and LINE fragments in chromocenters was confirmed by direct fluorescent in situ hybridization (FISH). CONCLUSION: Our data of chromocenters’ DNA (ChrmC) sequencing demonstrate that IAP with LTR and a precise ~ 2 kb fragment of LINE represent a substantial fraction of mouse chromocenters (constitutive heteroсhromatin) along with TRs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4534-z) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5819297
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-58192972018-02-21 Mouse chromocenters DNA content: sequencing and in silico analysis Ostromyshenskii, Dmitrii I. Chernyaeva, Ekaterina N. Kuznetsova, Inna S. Podgornaya, Olga I. BMC Genomics Research Article BACKGROUND: Chromocenters are defined as a punctate condensed blocks of chromatin in the interphase cell nuclei of certain cell types with unknown biological significance. In recent years a progress in revealing of chromocenters protein content has been made although the details of DNA content within constitutive heterochromatin still remain unclear. It is known that these regions are enriched in tandem repeats (TR) and transposable elements. Quick improvement of genome sequencing does not help to assemble the heterochromatic regions due to lack of appropriate bioinformatics techniques. RESULTS: Chromocenters DNA have been isolated by a biochemical approach from mouse liver cells nuclei and sequenced on the Illumina MiSeq resulting in ChrmC dataset. Analysis of ChrmC dataset by the bioinformatics tools available revealed that the major component of chromocenter DNA are TRs: ~ 66% MaSat and ~ 4% MiSat. Other previously classified TR families constitute ~ 1% of ChrmC dataset. About 6% of chromocenters DNA are mostly unannotated sequences. In the contigs assembled with IDBA_UD there are many fragments of heterochromatic Y-chromosome, rDNA and other pseudo-genes and non-coding DNA. A protein coding sfi1 homolog gene fragment was also found in contigs. The Sfi1 homolog gene is located on the chromosome 11 in the reference genome very close to the Golden Pass Gap (a ~ 3 Mb empty region reserved to the pericentromeric region) and proves the purity of chromocenters isolation. The second major fraction are non-LTR retroposons (SINE and LINE) with overwhelming majority of LINE - ~ 11% of ChrmC. Most of the LINE fragments are from the ~ 2 kb region at the end of the 2nd ORF and its’ flanking region. The precise LINEs’ segment of ~ 2 kb is the necessary mouse constitutive heterohromatin component together with TR. The third most abundant fraction are ERVs. The ERV distribution in chromocenters differs from the whole genome: IAP (ERV2 class) is the most numerous in ChrmC while MaLR (ERV3 class) prevails in the reference genome. IAP and its LTR also prevail in TR containing contigs extracted from the WGS dataset. In silico prediction of IAP and LINE fragments in chromocenters was confirmed by direct fluorescent in situ hybridization (FISH). CONCLUSION: Our data of chromocenters’ DNA (ChrmC) sequencing demonstrate that IAP with LTR and a precise ~ 2 kb fragment of LINE represent a substantial fraction of mouse chromocenters (constitutive heteroсhromatin) along with TRs. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4534-z) contains supplementary material, which is available to authorized users. BioMed Central 2018-02-20 /pmc/articles/PMC5819297/ /pubmed/29458329 http://dx.doi.org/10.1186/s12864-018-4534-z Text en © The Author(s). 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Ostromyshenskii, Dmitrii I.
Chernyaeva, Ekaterina N.
Kuznetsova, Inna S.
Podgornaya, Olga I.
Mouse chromocenters DNA content: sequencing and in silico analysis
title Mouse chromocenters DNA content: sequencing and in silico analysis
title_full Mouse chromocenters DNA content: sequencing and in silico analysis
title_fullStr Mouse chromocenters DNA content: sequencing and in silico analysis
title_full_unstemmed Mouse chromocenters DNA content: sequencing and in silico analysis
title_short Mouse chromocenters DNA content: sequencing and in silico analysis
title_sort mouse chromocenters dna content: sequencing and in silico analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5819297/
https://www.ncbi.nlm.nih.gov/pubmed/29458329
http://dx.doi.org/10.1186/s12864-018-4534-z
work_keys_str_mv AT ostromyshenskiidmitriii mousechromocentersdnacontentsequencingandinsilicoanalysis
AT chernyaevaekaterinan mousechromocentersdnacontentsequencingandinsilicoanalysis
AT kuznetsovainnas mousechromocentersdnacontentsequencingandinsilicoanalysis
AT podgornayaolgai mousechromocentersdnacontentsequencingandinsilicoanalysis