Cargando…
New methodology for repetitive sequences identification in human X and Y chromosomes
Repetitive DNA sequences occupy the major proportion of DNA in the human genome and even in the other species’ genomes. The importance of each repetitive DNA type depends on many factors: structural and functional roles, positions, lengths and numbers of these repetitions are clear examples. Conserv...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Elsevier Ltd.
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7572123/ https://www.ncbi.nlm.nih.gov/pubmed/33101452 http://dx.doi.org/10.1016/j.bspc.2020.102207 |
_version_ | 1783597285137973248 |
---|---|
author | Touati, Rabeb Tajouri, Asma Mesaoudi, Imen Oueslati, Afef Elloumi Lachiri, Zied Kharrat, Maher |
author_facet | Touati, Rabeb Tajouri, Asma Mesaoudi, Imen Oueslati, Afef Elloumi Lachiri, Zied Kharrat, Maher |
author_sort | Touati, Rabeb |
collection | PubMed |
description | Repetitive DNA sequences occupy the major proportion of DNA in the human genome and even in the other species’ genomes. The importance of each repetitive DNA type depends on many factors: structural and functional roles, positions, lengths and numbers of these repetitions are clear examples. Conserving such DNA sequences or not in different locations in the chromosome remains a challenge for researchers in biology. Detecting their location despite their great variability and finding novel repetitive sequences remains a challenging task. To side-step this problem, we developed a new method based on signal and image processing tools. In fact, using this method we could find repetitive patterns in DNA images regardless of the repetition length. This new technique seems to be more efficient in detecting new repetitive sequences than bioinformatics tools. In fact, the classical tools present limited performances especially in case of mutations (insertion or deletion). However, modifying one or a few numbers of pixels in the image doesn’t affect the global form of the repetitive pattern. As a consequence, we generated a new repetitive patterns database which contains tandem and dispersed repeated sequences. The highly repetitive sequences, we have identified in X and Y chromosomes, are shown to be located in other human chromosomes or in other genomes. The data we have generated is then taken as input to a Convolutional neural network classifier in order to classify them. The system we have constructed is efficient and gives an average of 94.4% as recognition score. |
format | Online Article Text |
id | pubmed-7572123 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Elsevier Ltd. |
record_format | MEDLINE/PubMed |
spelling | pubmed-75721232020-10-20 New methodology for repetitive sequences identification in human X and Y chromosomes Touati, Rabeb Tajouri, Asma Mesaoudi, Imen Oueslati, Afef Elloumi Lachiri, Zied Kharrat, Maher Biomed Signal Process Control Article Repetitive DNA sequences occupy the major proportion of DNA in the human genome and even in the other species’ genomes. The importance of each repetitive DNA type depends on many factors: structural and functional roles, positions, lengths and numbers of these repetitions are clear examples. Conserving such DNA sequences or not in different locations in the chromosome remains a challenge for researchers in biology. Detecting their location despite their great variability and finding novel repetitive sequences remains a challenging task. To side-step this problem, we developed a new method based on signal and image processing tools. In fact, using this method we could find repetitive patterns in DNA images regardless of the repetition length. This new technique seems to be more efficient in detecting new repetitive sequences than bioinformatics tools. In fact, the classical tools present limited performances especially in case of mutations (insertion or deletion). However, modifying one or a few numbers of pixels in the image doesn’t affect the global form of the repetitive pattern. As a consequence, we generated a new repetitive patterns database which contains tandem and dispersed repeated sequences. The highly repetitive sequences, we have identified in X and Y chromosomes, are shown to be located in other human chromosomes or in other genomes. The data we have generated is then taken as input to a Convolutional neural network classifier in order to classify them. The system we have constructed is efficient and gives an average of 94.4% as recognition score. Elsevier Ltd. 2021-02 2020-10-19 /pmc/articles/PMC7572123/ /pubmed/33101452 http://dx.doi.org/10.1016/j.bspc.2020.102207 Text en © 2020 Elsevier Ltd. All rights reserved. Since January 2020 Elsevier has created a COVID-19 resource centre with free information in English and Mandarin on the novel coronavirus COVID-19. The COVID-19 resource centre is hosted on Elsevier Connect, the company's public news and information website. Elsevier hereby grants permission to make all its COVID-19-related research that is available on the COVID-19 resource centre - including this research content - immediately available in PubMed Central and other publicly funded repositories, such as the WHO COVID database with rights for unrestricted research re-use and analyses in any form or by any means with acknowledgement of the original source. These permissions are granted for free by Elsevier for as long as the COVID-19 resource centre remains active. |
spellingShingle | Article Touati, Rabeb Tajouri, Asma Mesaoudi, Imen Oueslati, Afef Elloumi Lachiri, Zied Kharrat, Maher New methodology for repetitive sequences identification in human X and Y chromosomes |
title | New methodology for repetitive sequences identification in human X and Y chromosomes |
title_full | New methodology for repetitive sequences identification in human X and Y chromosomes |
title_fullStr | New methodology for repetitive sequences identification in human X and Y chromosomes |
title_full_unstemmed | New methodology for repetitive sequences identification in human X and Y chromosomes |
title_short | New methodology for repetitive sequences identification in human X and Y chromosomes |
title_sort | new methodology for repetitive sequences identification in human x and y chromosomes |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7572123/ https://www.ncbi.nlm.nih.gov/pubmed/33101452 http://dx.doi.org/10.1016/j.bspc.2020.102207 |
work_keys_str_mv | AT touatirabeb newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes AT tajouriasma newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes AT mesaoudiimen newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes AT oueslatiafefelloumi newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes AT lachirizied newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes AT kharratmaher newmethodologyforrepetitivesequencesidentificationinhumanxandychromosomes |