Cargando…

New methodology for repetitive sequences identification in human X and Y chromosomes

Repetitive DNA sequences occupy the major proportion of DNA in the human genome and even in the other species’ genomes. The importance of each repetitive DNA type depends on many factors: structural and functional roles, positions, lengths and numbers of these repetitions are clear examples. Conserv...

Descripción completa

Detalles Bibliográficos
Autores principales: Touati, Rabeb, Tajouri, Asma, Mesaoudi, Imen, Oueslati, Afef Elloumi, Lachiri, Zied, Kharrat, Maher
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier Ltd. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7572123/
https://www.ncbi.nlm.nih.gov/pubmed/33101452
http://dx.doi.org/10.1016/j.bspc.2020.102207
_version_ 1783597285137973248
author Touati, Rabeb
Tajouri, Asma
Mesaoudi, Imen
Oueslati, Afef Elloumi
Lachiri, Zied
Kharrat, Maher
author_facet Touati, Rabeb
Tajouri, Asma
Mesaoudi, Imen
Oueslati, Afef Elloumi
Lachiri, Zied
Kharrat, Maher
author_sort Touati, Rabeb
collection PubMed
description Repetitive DNA sequences occupy the major proportion of DNA in the human genome and even in the other species’ genomes. The importance of each repetitive DNA type depends on many factors: structural and functional roles, positions, lengths and numbers of these repetitions are clear examples. Conserving such DNA sequences or not in different locations in the chromosome remains a challenge for researchers in biology. Detecting their location despite their great variability and finding novel repetitive sequences remains a challenging task. To side-step this problem, we developed a new method based on signal and image processing tools. In fact, using this method we could find repetitive patterns in DNA images regardless of the repetition length. This new technique seems to be more efficient in detecting new repetitive sequences than bioinformatics tools. In fact, the classical tools present limited performances especially in case of mutations (insertion or deletion). However, modifying one or a few numbers of pixels in the image doesn’t affect the global form of the repetitive pattern. As a consequence, we generated a new repetitive patterns database which contains tandem and dispersed repeated sequences. The highly repetitive sequences, we have identified in X and Y chromosomes, are shown to be located in other human chromosomes or in other genomes. The data we have generated is then taken as input to a Convolutional neural network classifier in order to classify them. The system we have constructed is efficient and gives an average of 94.4% as recognition score.
format Online
Article
Text
id pubmed-7572123
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Elsevier Ltd.
record_format MEDLINE/PubMed
spelling pubmed-75721232020-10-20 New methodology for repetitive sequences identification in human X and Y chromosomes Touati, Rabeb Tajouri, Asma Mesaoudi, Imen Oueslati, Afef Elloumi Lachiri, Zied Kharrat, Maher Biomed Signal Process Control Article Repetitive DNA sequences occupy the major proportion of DNA in the human genome and even in the other species’ genomes. The importance of each repetitive DNA type depends on many factors: structural and functional roles, positions, lengths and numbers of these repetitions are clear examples. Conserving such DNA sequences or not in different locations in the chromosome remains a challenge for researchers in biology. Detecting their location despite their great variability and finding novel repetitive sequences remains a challenging task. To side-step this problem, we developed a new method based on signal and image processing tools. In fact, using this method we could find repetitive patterns in DNA images regardless of the repetition length. This new technique seems to be more efficient in detecting new repetitive sequences than bioinformatics tools. In fact, the classical tools present limited performances especially in case of mutations (insertion or deletion). However, modifying one or a few numbers of pixels in the image doesn’t affect the global form of the repetitive pattern. As a consequence, we generated a new repetitive patterns database which contains tandem and dispersed repeated sequences. The highly repetitive sequences, we have identified in X and Y chromosomes, are shown to be located in other human chromosomes or in other genomes. The data we have generated is then taken as input to a Convolutional neural network classifier in order to classify them. The system we have constructed is efficient and gives an average of 94.4% as recognition score. Elsevier Ltd. 2021-02 2020-10-19 /pmc/articles/PMC7572123/ /pubmed/33101452 http://dx.doi.org/10.1016/j.bspc.2020.102207 Text en © 2020 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active.
spellingShingle Article
Touati, Rabeb
Tajouri, Asma
Mesaoudi, Imen
Oueslati, Afef Elloumi
Lachiri, Zied
Kharrat, Maher
New methodology for repetitive sequences identification in human X and Y chromosomes
title New methodology for repetitive sequences identification in human X and Y chromosomes
title_full New methodology for repetitive sequences identification in human X and Y chromosomes
title_fullStr New methodology for repetitive sequences identification in human X and Y chromosomes
title_full_unstemmed New methodology for repetitive sequences identification in human X and Y chromosomes
title_short New methodology for repetitive sequences identification in human X and Y chromosomes
title_sort new methodology for repetitive sequences identification in human x and y chromosomes
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7572123/
https://www.ncbi.nlm.nih.gov/pubmed/33101452
http://dx.doi.org/10.1016/j.bspc.2020.102207
work_keys_str_mv AT touatirabeb newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes
AT tajouriasma newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes
AT mesaoudiimen newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes
AT oueslatiafefelloumi newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes
AT lachirizied newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes
AT kharratmaher newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes