Cargando…

RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features

In the yeast, meiotic recombination is initiated by double-strand DNA breaks (DSBs) which occur at relatively high frequencies in some genomic regions (hotspots) and relatively low frequencies in others (coldspots). Although observations concerning individual hot/cold spots have given clues as to th...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Peng, Wu, Haonan, Wei, Jiawei, Sang, Fei, Sun, Xiao, Lu, Zuhong
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933199/
https://www.ncbi.nlm.nih.gov/pubmed/17478517
http://dx.doi.org/10.1093/nar/gkm217
_version_ 1782134308815765504
author Jiang, Peng
Wu, Haonan
Wei, Jiawei
Sang, Fei
Sun, Xiao
Lu, Zuhong
author_facet Jiang, Peng
Wu, Haonan
Wei, Jiawei
Sang, Fei
Sun, Xiao
Lu, Zuhong
author_sort Jiang, Peng
collection PubMed
description In the yeast, meiotic recombination is initiated by double-strand DNA breaks (DSBs) which occur at relatively high frequencies in some genomic regions (hotspots) and relatively low frequencies in others (coldspots). Although observations concerning individual hot/cold spots have given clues as to the mechanism of recombination initiation, the prediction of hot/cold spots from DNA sequence information is a challenging task. In this article, we introduce a random forest (RF) prediction model to detect recombination hot/cold spots from yeast genome. The out-of-bag (OOB) estimation of the model indicated that the RF classifier achieved high prediction performance with 82.05% total accuracy and 0.638 Mattew's correlation coefficient (MCC) value. Compared with an alternative machine-learning algorithm, support vector machine (SVM), the RF method outperforms it in both sensitivity and specificity. The prediction model is implemented as a web server (RF-DYMHC) and it is freely available at http://www.bioinf.seu.edu.cn/Recombination/rf_dymhc.htm. Given a yeast genome and prediction parameters (RI-value and non-overlapping window scan size), the program reports the predicted hot/cold spots and marks them in color.
format Text
id pubmed-1933199
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-19331992007-07-31 RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features Jiang, Peng Wu, Haonan Wei, Jiawei Sang, Fei Sun, Xiao Lu, Zuhong Nucleic Acids Res Articles In the yeast, meiotic recombination is initiated by double-strand DNA breaks (DSBs) which occur at relatively high frequencies in some genomic regions (hotspots) and relatively low frequencies in others (coldspots). Although observations concerning individual hot/cold spots have given clues as to the mechanism of recombination initiation, the prediction of hot/cold spots from DNA sequence information is a challenging task. In this article, we introduce a random forest (RF) prediction model to detect recombination hot/cold spots from yeast genome. The out-of-bag (OOB) estimation of the model indicated that the RF classifier achieved high prediction performance with 82.05% total accuracy and 0.638 Mattew's correlation coefficient (MCC) value. Compared with an alternative machine-learning algorithm, support vector machine (SVM), the RF method outperforms it in both sensitivity and specificity. The prediction model is implemented as a web server (RF-DYMHC) and it is freely available at http://www.bioinf.seu.edu.cn/Recombination/rf_dymhc.htm. Given a yeast genome and prediction parameters (RI-value and non-overlapping window scan size), the program reports the predicted hot/cold spots and marks them in color. Oxford University Press 2007-07 2007-05-03 /pmc/articles/PMC1933199/ /pubmed/17478517 http://dx.doi.org/10.1093/nar/gkm217 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Articles
Jiang, Peng
Wu, Haonan
Wei, Jiawei
Sang, Fei
Sun, Xiao
Lu, Zuhong
RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features
title RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features
title_full RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features
title_fullStr RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features
title_full_unstemmed RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features
title_short RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features
title_sort rf-dymhc: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features
topic Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933199/
https://www.ncbi.nlm.nih.gov/pubmed/17478517
http://dx.doi.org/10.1093/nar/gkm217
work_keys_str_mv AT jiangpeng rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures
AT wuhaonan rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures
AT weijiawei rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures
AT sangfei rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures
AT sunxiao rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures
AT luzuhong rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures