Cargando…
detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation
Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures – hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable,...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4237412/ https://www.ncbi.nlm.nih.gov/pubmed/25409465 http://dx.doi.org/10.1371/journal.pone.0113349 |
_version_ | 1782345342605328384 |
---|---|
author | Ye, Congting Ji, Guoli Li, Lei Liang, Chun |
author_facet | Ye, Congting Ji, Guoli Li, Lei Liang, Chun |
author_sort | Ye, Congting |
collection | PubMed |
description | Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures – hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable, because existing tools are often less accurate and time consuming, sometimes incapable of dealing with genome-scale input data. Here, we present a MATLAB-based program called detectIR for the perfect and imperfect inverted repeat detection that utilizes complex numbers and vector calculation and allows genome-scale data inputs. A novel algorithm is adopted in detectIR to convert the conventional sequence string comparison in inverted repeat detection into vector calculation of complex numbers, allowing non-complementary pairs (mismatches) in the pairing stem and a non-palindromic spacer (loop or gaps) in the middle of inverted repeats. Compared with existing popular tools, our program performs with significantly higher accuracy and efficiency. Using genome sequence data from HIV-1, Arabidopsis thaliana, Homo sapiens and Zea mays for comparison, detectIR can find lots of inverted repeats missed by existing tools whose outputs often contain many invalid cases. detectIR is open source and its source code is freely available at: https://sourceforge.net/projects/detectir. |
format | Online Article Text |
id | pubmed-4237412 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-42374122014-11-21 detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation Ye, Congting Ji, Guoli Li, Lei Liang, Chun PLoS One Research Article Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures – hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable, because existing tools are often less accurate and time consuming, sometimes incapable of dealing with genome-scale input data. Here, we present a MATLAB-based program called detectIR for the perfect and imperfect inverted repeat detection that utilizes complex numbers and vector calculation and allows genome-scale data inputs. A novel algorithm is adopted in detectIR to convert the conventional sequence string comparison in inverted repeat detection into vector calculation of complex numbers, allowing non-complementary pairs (mismatches) in the pairing stem and a non-palindromic spacer (loop or gaps) in the middle of inverted repeats. Compared with existing popular tools, our program performs with significantly higher accuracy and efficiency. Using genome sequence data from HIV-1, Arabidopsis thaliana, Homo sapiens and Zea mays for comparison, detectIR can find lots of inverted repeats missed by existing tools whose outputs often contain many invalid cases. detectIR is open source and its source code is freely available at: https://sourceforge.net/projects/detectir. Public Library of Science 2014-11-19 /pmc/articles/PMC4237412/ /pubmed/25409465 http://dx.doi.org/10.1371/journal.pone.0113349 Text en © 2014 Ye et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Ye, Congting Ji, Guoli Li, Lei Liang, Chun detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation |
title | detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation |
title_full | detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation |
title_fullStr | detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation |
title_full_unstemmed | detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation |
title_short | detectIR: A Novel Program for Detecting Perfect and Imperfect Inverted Repeats Using Complex Numbers and Vector Calculation |
title_sort | detectir: a novel program for detecting perfect and imperfect inverted repeats using complex numbers and vector calculation |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4237412/ https://www.ncbi.nlm.nih.gov/pubmed/25409465 http://dx.doi.org/10.1371/journal.pone.0113349 |
work_keys_str_mv | AT yecongting detectiranovelprogramfordetectingperfectandimperfectinvertedrepeatsusingcomplexnumbersandvectorcalculation AT jiguoli detectiranovelprogramfordetectingperfectandimperfectinvertedrepeatsusingcomplexnumbersandvectorcalculation AT lilei detectiranovelprogramfordetectingperfectandimperfectinvertedrepeatsusingcomplexnumbersandvectorcalculation AT liangchun detectiranovelprogramfordetectingperfectandimperfectinvertedrepeatsusingcomplexnumbersandvectorcalculation |