Cargando…

detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation

Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures – hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable,...

Descripción completa

Detalles Bibliográficos
Autores principales: Ye, Congting, Ji, Guoli, Li, Lei, Liang, Chun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4237412/
https://www.ncbi.nlm.nih.gov/pubmed/25409465
http://dx.doi.org/10.1371/journal.pone.0113349
_version_ 1782345342605328384
author Ye, Congting
Ji, Guoli
Li, Lei
Liang, Chun
author_facet Ye, Congting
Ji, Guoli
Li, Lei
Liang, Chun
author_sort Ye, Congting
collection PubMed
description Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures – hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable, because existing tools are often less accurate and time consuming, sometimes incapable of dealing with genome-scale input data. Here, we present a MATLAB-based program called detectIR for the perfect and imperfect inverted repeat detection that utilizes complex numbers and vector calculation and allows genome-scale data inputs. A novel algorithm is adopted in detectIR to convert the conventional sequence string comparison in inverted repeat detection into vector calculation of complex numbers, allowing non-complementary pairs (mismatches) in the pairing stem and a non-palindromic spacer (loop or gaps) in the middle of inverted repeats. Compared with existing popular tools, our program performs with significantly higher accuracy and efficiency. Using genome sequence data from HIV-1, Arabidopsis thaliana, Homo sapiens and Zea mays for comparison, detectIR can find lots of inverted repeats missed by existing tools whose outputs often contain many invalid cases. detectIR is open source and its source code is freely available at: https://sourceforge.net/projects/detectir.
format Online
Article
Text
id pubmed-4237412
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-42374122014-11-21 detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation Ye, Congting Ji, Guoli Li, Lei Liang, Chun PLoS One Research Article Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures – hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable, because existing tools are often less accurate and time consuming, sometimes incapable of dealing with genome-scale input data. Here, we present a MATLAB-based program called detectIR for the perfect and imperfect inverted repeat detection that utilizes complex numbers and vector calculation and allows genome-scale data inputs. A novel algorithm is adopted in detectIR to convert the conventional sequence string comparison in inverted repeat detection into vector calculation of complex numbers, allowing non-complementary pairs (mismatches) in the pairing stem and a non-palindromic spacer (loop or gaps) in the middle of inverted repeats. Compared with existing popular tools, our program performs with significantly higher accuracy and efficiency. Using genome sequence data from HIV-1, Arabidopsis thaliana, Homo sapiens and Zea mays for comparison, detectIR can find lots of inverted repeats missed by existing tools whose outputs often contain many invalid cases. detectIR is open source and its source code is freely available at: https://sourceforge.net/projects/detectir. Public Library of Science 2014-11-19 /pmc/articles/PMC4237412/ /pubmed/25409465 http://dx.doi.org/10.1371/journal.pone.0113349 Text en © 2014 Ye et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ye, Congting
Ji, Guoli
Li, Lei
Liang, Chun
detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation
title detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation
title_full detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation
title_fullStr detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation
title_full_unstemmed detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation
title_short detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation
title_sort detectir: a novel program for detecting perfect and imperfect inverted repeats using complex numbers and vector calculation
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4237412/
https://www.ncbi.nlm.nih.gov/pubmed/25409465
http://dx.doi.org/10.1371/journal.pone.0113349
work_keys_str_mv AT yecongting detectiranovelprogramfordetectingperfectandimperfectinvertedrepeatsusingcomplexnumbersandvectorcalculation
AT jiguoli detectiranovelprogramfordetectingperfectandimperfectinvertedrepeatsusingcomplexnumbersandvectorcalculation
AT lilei detectiranovelprogramfordetectingperfectandimperfectinvertedrepeatsusingcomplexnumbersandvectorcalculation
AT liangchun detectiranovelprogramfordetectingperfectandimperfectinvertedrepeatsusingcomplexnumbersandvectorcalculation