Cargando…

In Silico Identification and Characterization of Satellite DNAs in 23 Drosophila Species from the Montium Group

Satellite DNA (satDNA) is a class of tandemly repeated non-protein coding DNA sequences which can be found in abundance in eukaryotic genomes. They can be functional, impact the genomic architecture in many ways, and their rapid evolution has consequences for species diversification. We took advanta...

Descripción completa

Detalles Bibliográficos
Autores principales: Silva, Bráulio S. M. L., Picorelli, Agnello C. R., Kuhn, Gustavo C. S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9957191/
https://www.ncbi.nlm.nih.gov/pubmed/36833227
http://dx.doi.org/10.3390/genes14020300
_version_ 1784894765022576640
author Silva, Bráulio S. M. L.
Picorelli, Agnello C. R.
Kuhn, Gustavo C. S.
author_facet Silva, Bráulio S. M. L.
Picorelli, Agnello C. R.
Kuhn, Gustavo C. S.
author_sort Silva, Bráulio S. M. L.
collection PubMed
description Satellite DNA (satDNA) is a class of tandemly repeated non-protein coding DNA sequences which can be found in abundance in eukaryotic genomes. They can be functional, impact the genomic architecture in many ways, and their rapid evolution has consequences for species diversification. We took advantage of the recent availability of sequenced genomes from 23 Drosophila species from the montium group to study their satDNA landscape. For this purpose, we used publicly available whole-genome sequencing Illumina reads and the TAREAN (tandem repeat analyzer) pipeline. We provide the characterization of 101 non-homologous satDNA families in this group, 93 of which are described here for the first time. Their repeat units vary in size from 4 bp to 1897 bp, but most satDNAs show repeat units < 100 bp long and, among them, repeats ≤ 10 bp are the most frequent ones. The genomic contribution of the satDNAs ranges from ~1.4% to 21.6%. There is no significant correlation between satDNA content and genome sizes in the 23 species. We also found that at least one satDNA originated from an expansion of the central tandem repeats (CTRs) present inside a Helitron transposon. Finally, some satDNAs may be useful as taxonomic markers for the identification of species or subgroups within the group.
format Online
Article
Text
id pubmed-9957191
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-99571912023-02-25 In Silico Identification and Characterization of Satellite DNAs in 23 Drosophila Species from the Montium Group Silva, Bráulio S. M. L. Picorelli, Agnello C. R. Kuhn, Gustavo C. S. Genes (Basel) Article Satellite DNA (satDNA) is a class of tandemly repeated non-protein coding DNA sequences which can be found in abundance in eukaryotic genomes. They can be functional, impact the genomic architecture in many ways, and their rapid evolution has consequences for species diversification. We took advantage of the recent availability of sequenced genomes from 23 Drosophila species from the montium group to study their satDNA landscape. For this purpose, we used publicly available whole-genome sequencing Illumina reads and the TAREAN (tandem repeat analyzer) pipeline. We provide the characterization of 101 non-homologous satDNA families in this group, 93 of which are described here for the first time. Their repeat units vary in size from 4 bp to 1897 bp, but most satDNAs show repeat units < 100 bp long and, among them, repeats ≤ 10 bp are the most frequent ones. The genomic contribution of the satDNAs ranges from ~1.4% to 21.6%. There is no significant correlation between satDNA content and genome sizes in the 23 species. We also found that at least one satDNA originated from an expansion of the central tandem repeats (CTRs) present inside a Helitron transposon. Finally, some satDNAs may be useful as taxonomic markers for the identification of species or subgroups within the group. MDPI 2023-01-23 /pmc/articles/PMC9957191/ /pubmed/36833227 http://dx.doi.org/10.3390/genes14020300 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Silva, Bráulio S. M. L.
Picorelli, Agnello C. R.
Kuhn, Gustavo C. S.
In Silico Identification and Characterization of Satellite DNAs in 23 Drosophila Species from the Montium Group
title In Silico Identification and Characterization of Satellite DNAs in 23 Drosophila Species from the Montium Group
title_full In Silico Identification and Characterization of Satellite DNAs in 23 Drosophila Species from the Montium Group
title_fullStr In Silico Identification and Characterization of Satellite DNAs in 23 Drosophila Species from the Montium Group
title_full_unstemmed In Silico Identification and Characterization of Satellite DNAs in 23 Drosophila Species from the Montium Group
title_short In Silico Identification and Characterization of Satellite DNAs in 23 Drosophila Species from the Montium Group
title_sort in silico identification and characterization of satellite dnas in 23 drosophila species from the montium group
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9957191/
https://www.ncbi.nlm.nih.gov/pubmed/36833227
http://dx.doi.org/10.3390/genes14020300
work_keys_str_mv AT silvabrauliosml insilicoidentificationandcharacterizationofsatellitednasin23drosophilaspeciesfromthemontiumgroup
AT picorelliagnellocr insilicoidentificationandcharacterizationofsatellitednasin23drosophilaspeciesfromthemontiumgroup
AT kuhngustavocs insilicoidentificationandcharacterizationofsatellitednasin23drosophilaspeciesfromthemontiumgroup