Cargando…

iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition

Meiotic recombination is an important biological process. As a main driving force of evolution, recombination provides natural new combinations of genetic variations. Rather than randomly occurring across a genome, meiotic recombination takes place in some genomic regions (the so-called ‘hotspots’)...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Wei, Feng, Peng-Mian, Lin, Hao, Chou, Kuo-Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3616736/
https://www.ncbi.nlm.nih.gov/pubmed/23303794
http://dx.doi.org/10.1093/nar/gks1450
_version_ 1782265162485465088
author Chen, Wei
Feng, Peng-Mian
Lin, Hao
Chou, Kuo-Chen
author_facet Chen, Wei
Feng, Peng-Mian
Lin, Hao
Chou, Kuo-Chen
author_sort Chen, Wei
collection PubMed
description Meiotic recombination is an important biological process. As a main driving force of evolution, recombination provides natural new combinations of genetic variations. Rather than randomly occurring across a genome, meiotic recombination takes place in some genomic regions (the so-called ‘hotspots’) with higher frequencies, and in the other regions (the so-called ‘coldspots’) with lower frequencies. Therefore, the information of the hotspots and coldspots would provide useful insights for in-depth studying of the mechanism of recombination and the genome evolution process as well. So far, the recombination regions have been mainly determined by experiments, which are both expensive and time-consuming. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapidly and effectively identifying the recombination regions. In this study, a predictor, called ‘iRSpot-PseDNC’, was developed for identifying the recombination hotspots and coldspots. In the new predictor, the samples of DNA sequences are formulated by a novel feature vector, the so-called ‘pseudo dinucleotide composition’ (PseDNC), into which six local DNA structural properties, i.e. three angular parameters (twist, tilt and roll) and three translational parameters (shift, slide and rise), are incorporated. It was observed by the rigorous jackknife test that the overall success rate achieved by iRSpot-PseDNC was >82% in identifying recombination spots in Saccharomyces cerevisiae, indicating the new predictor is promising or at least may become a complementary tool to the existing methods in this area. Although the benchmark data set used to train and test the current method was from S. cerevisiae, the basic approaches can also be extended to deal with all the other genomes. Particularly, it has not escaped our notice that the PseDNC approach can be also used to study many other DNA-related problems. As a user-friendly web-server, iRSpot-PseDNC is freely accessible at http://lin.uestc.edu.cn/server/iRSpot-PseDNC.
format Online
Article
Text
id pubmed-3616736
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36167362013-04-04 iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition Chen, Wei Feng, Peng-Mian Lin, Hao Chou, Kuo-Chen Nucleic Acids Res Methods Online Meiotic recombination is an important biological process. As a main driving force of evolution, recombination provides natural new combinations of genetic variations. Rather than randomly occurring across a genome, meiotic recombination takes place in some genomic regions (the so-called ‘hotspots’) with higher frequencies, and in the other regions (the so-called ‘coldspots’) with lower frequencies. Therefore, the information of the hotspots and coldspots would provide useful insights for in-depth studying of the mechanism of recombination and the genome evolution process as well. So far, the recombination regions have been mainly determined by experiments, which are both expensive and time-consuming. With the avalanche of genome sequences generated in the postgenomic age, it is highly desired to develop automated methods for rapidly and effectively identifying the recombination regions. In this study, a predictor, called ‘iRSpot-PseDNC’, was developed for identifying the recombination hotspots and coldspots. In the new predictor, the samples of DNA sequences are formulated by a novel feature vector, the so-called ‘pseudo dinucleotide composition’ (PseDNC), into which six local DNA structural properties, i.e. three angular parameters (twist, tilt and roll) and three translational parameters (shift, slide and rise), are incorporated. It was observed by the rigorous jackknife test that the overall success rate achieved by iRSpot-PseDNC was >82% in identifying recombination spots in Saccharomyces cerevisiae, indicating the new predictor is promising or at least may become a complementary tool to the existing methods in this area. Although the benchmark data set used to train and test the current method was from S. cerevisiae, the basic approaches can also be extended to deal with all the other genomes. Particularly, it has not escaped our notice that the PseDNC approach can be also used to study many other DNA-related problems. As a user-friendly web-server, iRSpot-PseDNC is freely accessible at http://lin.uestc.edu.cn/server/iRSpot-PseDNC. Oxford University Press 2013-04 2013-01-08 /pmc/articles/PMC3616736/ /pubmed/23303794 http://dx.doi.org/10.1093/nar/gks1450 Text en © The Author(s) 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods Online
Chen, Wei
Feng, Peng-Mian
Lin, Hao
Chou, Kuo-Chen
iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition
title iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition
title_full iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition
title_fullStr iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition
title_full_unstemmed iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition
title_short iRSpot-PseDNC: identify recombination spots with pseudo dinucleotide composition
title_sort irspot-psednc: identify recombination spots with pseudo dinucleotide composition
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3616736/
https://www.ncbi.nlm.nih.gov/pubmed/23303794
http://dx.doi.org/10.1093/nar/gks1450
work_keys_str_mv AT chenwei irspotpsedncidentifyrecombinationspotswithpseudodinucleotidecomposition
AT fengpengmian irspotpsedncidentifyrecombinationspotswithpseudodinucleotidecomposition
AT linhao irspotpsedncidentifyrecombinationspotswithpseudodinucleotidecomposition
AT choukuochen irspotpsedncidentifyrecombinationspotswithpseudodinucleotidecomposition