Cargando…
A Novel Signal Processing Measure to Identify Exact and Inexact Tandem Repeat Patterns in DNA Sequences
The identification and analysis of repetitive patterns are active areas of biological and computational research. Tandem repeats in telomeres play a role in cancer and hypervariable trinucleotide tandem repeats are linked to over a dozen major neurodegenerative genetic disorders. In this paper, we p...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171338/ https://www.ncbi.nlm.nih.gov/pubmed/17713591 http://dx.doi.org/10.1155/2007/43596 |
_version_ | 1782211740002418688 |
---|---|
author | Gupta, Ravi Sarthi, Divya Mittal, Ankush Singh, Kuldip |
author_facet | Gupta, Ravi Sarthi, Divya Mittal, Ankush Singh, Kuldip |
author_sort | Gupta, Ravi |
collection | PubMed |
description | The identification and analysis of repetitive patterns are active areas of biological and computational research. Tandem repeats in telomeres play a role in cancer and hypervariable trinucleotide tandem repeats are linked to over a dozen major neurodegenerative genetic disorders. In this paper, we present an algorithm to identify the exact and inexact repeat patterns in DNA sequences based on orthogonal exactly periodic subspace decomposition technique. Using the new measure our algorithm resolves the problems like whether the repeat pattern is of period [Image: see text] or its multiple (i.e., 2[Image: see text], 3[Image: see text], etc.), and several other problems that were present in previous signal-processing-based algorithms. We present an efficient algorithm of [Image: see text], where [Image: see text] is the length of DNA sequence and [Image: see text] is the window length, for identifying repeats. The algorithm operates in two stages. In the first stage, each nucleotide is analyzed separately for periodicity, and in the second stage, the periodic information of each nucleotide is combined together to identify the tandem repeats. Datasets having exact and inexact repeats were taken up for the experimental purpose. The experimental result shows the effectiveness of the approach. |
format | Online Article Text |
id | pubmed-3171338 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Springer |
record_format | MEDLINE/PubMed |
spelling | pubmed-31713382011-09-13 A Novel Signal Processing Measure to Identify Exact and Inexact Tandem Repeat Patterns in DNA Sequences Gupta, Ravi Sarthi, Divya Mittal, Ankush Singh, Kuldip EURASIP J Bioinform Syst Biol Research Article The identification and analysis of repetitive patterns are active areas of biological and computational research. Tandem repeats in telomeres play a role in cancer and hypervariable trinucleotide tandem repeats are linked to over a dozen major neurodegenerative genetic disorders. In this paper, we present an algorithm to identify the exact and inexact repeat patterns in DNA sequences based on orthogonal exactly periodic subspace decomposition technique. Using the new measure our algorithm resolves the problems like whether the repeat pattern is of period [Image: see text] or its multiple (i.e., 2[Image: see text], 3[Image: see text], etc.), and several other problems that were present in previous signal-processing-based algorithms. We present an efficient algorithm of [Image: see text], where [Image: see text] is the length of DNA sequence and [Image: see text] is the window length, for identifying repeats. The algorithm operates in two stages. In the first stage, each nucleotide is analyzed separately for periodicity, and in the second stage, the periodic information of each nucleotide is combined together to identify the tandem repeats. Datasets having exact and inexact repeats were taken up for the experimental purpose. The experimental result shows the effectiveness of the approach. Springer 2007-03-13 /pmc/articles/PMC3171338/ /pubmed/17713591 http://dx.doi.org/10.1155/2007/43596 Text en Copyright © 2007 Ravi Gupta et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Gupta, Ravi Sarthi, Divya Mittal, Ankush Singh, Kuldip A Novel Signal Processing Measure to Identify Exact and Inexact Tandem Repeat Patterns in DNA Sequences |
title | A Novel Signal Processing Measure to Identify Exact and Inexact Tandem Repeat Patterns in DNA Sequences |
title_full | A Novel Signal Processing Measure to Identify Exact and Inexact Tandem Repeat Patterns in DNA Sequences |
title_fullStr | A Novel Signal Processing Measure to Identify Exact and Inexact Tandem Repeat Patterns in DNA Sequences |
title_full_unstemmed | A Novel Signal Processing Measure to Identify Exact and Inexact Tandem Repeat Patterns in DNA Sequences |
title_short | A Novel Signal Processing Measure to Identify Exact and Inexact Tandem Repeat Patterns in DNA Sequences |
title_sort | novel signal processing measure to identify exact and inexact tandem repeat patterns in dna sequences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3171338/ https://www.ncbi.nlm.nih.gov/pubmed/17713591 http://dx.doi.org/10.1155/2007/43596 |
work_keys_str_mv | AT guptaravi anovelsignalprocessingmeasuretoidentifyexactandinexacttandemrepeatpatternsindnasequences AT sarthidivya anovelsignalprocessingmeasuretoidentifyexactandinexacttandemrepeatpatternsindnasequences AT mittalankush anovelsignalprocessingmeasuretoidentifyexactandinexacttandemrepeatpatternsindnasequences AT singhkuldip anovelsignalprocessingmeasuretoidentifyexactandinexacttandemrepeatpatternsindnasequences AT guptaravi novelsignalprocessingmeasuretoidentifyexactandinexacttandemrepeatpatternsindnasequences AT sarthidivya novelsignalprocessingmeasuretoidentifyexactandinexacttandemrepeatpatternsindnasequences AT mittalankush novelsignalprocessingmeasuretoidentifyexactandinexacttandemrepeatpatternsindnasequences AT singhkuldip novelsignalprocessingmeasuretoidentifyexactandinexacttandemrepeatpatternsindnasequences |