Cargando…

In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes

Mosquitoes are important vectors for human and animal diseases. Genetic markers, like the mitochondrial COI gene, can facilitate the taxonomic classification of disease vectors, vector-borne disease surveillance, and prevention. Within the control region (CR) of the mitochondrial genome, there exist...

Descripción completa

Detalles Bibliográficos
Autores principales: Harrison, Thomas M. R., Rudar, Josip, Ogden, Nicholas, Steeves, Royce, Lapen, David R., Baird, Donald, Gagné, Nellie, Lung, Oliver
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9763401/
https://www.ncbi.nlm.nih.gov/pubmed/36536037
http://dx.doi.org/10.1038/s41598-022-26236-5
_version_ 1784853051365916672
author Harrison, Thomas M. R.
Rudar, Josip
Ogden, Nicholas
Steeves, Royce
Lapen, David R.
Baird, Donald
Gagné, Nellie
Lung, Oliver
author_facet Harrison, Thomas M. R.
Rudar, Josip
Ogden, Nicholas
Steeves, Royce
Lapen, David R.
Baird, Donald
Gagné, Nellie
Lung, Oliver
author_sort Harrison, Thomas M. R.
collection PubMed
description Mosquitoes are important vectors for human and animal diseases. Genetic markers, like the mitochondrial COI gene, can facilitate the taxonomic classification of disease vectors, vector-borne disease surveillance, and prevention. Within the control region (CR) of the mitochondrial genome, there exists a highly variable and poorly studied non-coding AT-rich area that contains the origin of replication. Although the CR hypervariable region has been used for species differentiation of some animals, few studies have investigated the mosquito CR. In this study, we analyze the mosquito mitogenome CR sequences from 125 species and 17 genera. We discovered four conserved motifs located 80 to 230 bp upstream of the 12S rRNA gene. Two of these motifs were found within all 392 Anopheles (An.) CR sequences while the other two motifs were identified in all 37 Culex (Cx.) CR sequences. However, only 3 of the 304 non-Culicidae Dipteran mitogenome CR sequences contained these motifs. Interestingly, the short motif found in all 37 Culex sequences had poly-A and poly-T stretch of similar length that is predicted to form a stable hairpin. We show that supervised learning using the frequency chaos game representation of the CR can be used to differentiate mosquito genera from their dipteran relatives.
format Online
Article
Text
id pubmed-9763401
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-97634012022-12-21 In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes Harrison, Thomas M. R. Rudar, Josip Ogden, Nicholas Steeves, Royce Lapen, David R. Baird, Donald Gagné, Nellie Lung, Oliver Sci Rep Article Mosquitoes are important vectors for human and animal diseases. Genetic markers, like the mitochondrial COI gene, can facilitate the taxonomic classification of disease vectors, vector-borne disease surveillance, and prevention. Within the control region (CR) of the mitochondrial genome, there exists a highly variable and poorly studied non-coding AT-rich area that contains the origin of replication. Although the CR hypervariable region has been used for species differentiation of some animals, few studies have investigated the mosquito CR. In this study, we analyze the mosquito mitogenome CR sequences from 125 species and 17 genera. We discovered four conserved motifs located 80 to 230 bp upstream of the 12S rRNA gene. Two of these motifs were found within all 392 Anopheles (An.) CR sequences while the other two motifs were identified in all 37 Culex (Cx.) CR sequences. However, only 3 of the 304 non-Culicidae Dipteran mitogenome CR sequences contained these motifs. Interestingly, the short motif found in all 37 Culex sequences had poly-A and poly-T stretch of similar length that is predicted to form a stable hairpin. We show that supervised learning using the frequency chaos game representation of the CR can be used to differentiate mosquito genera from their dipteran relatives. Nature Publishing Group UK 2022-12-19 /pmc/articles/PMC9763401/ /pubmed/36536037 http://dx.doi.org/10.1038/s41598-022-26236-5 Text en © Crown 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Harrison, Thomas M. R.
Rudar, Josip
Ogden, Nicholas
Steeves, Royce
Lapen, David R.
Baird, Donald
Gagné, Nellie
Lung, Oliver
In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title_full In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title_fullStr In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title_full_unstemmed In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title_short In silico identification of multiple conserved motifs within the control region of Culicidae mitogenomes
title_sort in silico identification of multiple conserved motifs within the control region of culicidae mitogenomes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9763401/
https://www.ncbi.nlm.nih.gov/pubmed/36536037
http://dx.doi.org/10.1038/s41598-022-26236-5
work_keys_str_mv AT harrisonthomasmr insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT rudarjosip insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT ogdennicholas insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT steevesroyce insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT lapendavidr insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT bairddonald insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT gagnenellie insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes
AT lungoliver insilicoidentificationofmultipleconservedmotifswithinthecontrolregionofculicidaemitogenomes