Cargando…
RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features
In the yeast, meiotic recombination is initiated by double-strand DNA breaks (DSBs) which occur at relatively high frequencies in some genomic regions (hotspots) and relatively low frequencies in others (coldspots). Although observations concerning individual hot/cold spots have given clues as to th...
Autores principales: | , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933199/ https://www.ncbi.nlm.nih.gov/pubmed/17478517 http://dx.doi.org/10.1093/nar/gkm217 |
_version_ | 1782134308815765504 |
---|---|
author | Jiang, Peng Wu, Haonan Wei, Jiawei Sang, Fei Sun, Xiao Lu, Zuhong |
author_facet | Jiang, Peng Wu, Haonan Wei, Jiawei Sang, Fei Sun, Xiao Lu, Zuhong |
author_sort | Jiang, Peng |
collection | PubMed |
description | In the yeast, meiotic recombination is initiated by double-strand DNA breaks (DSBs) which occur at relatively high frequencies in some genomic regions (hotspots) and relatively low frequencies in others (coldspots). Although observations concerning individual hot/cold spots have given clues as to the mechanism of recombination initiation, the prediction of hot/cold spots from DNA sequence information is a challenging task. In this article, we introduce a random forest (RF) prediction model to detect recombination hot/cold spots from yeast genome. The out-of-bag (OOB) estimation of the model indicated that the RF classifier achieved high prediction performance with 82.05% total accuracy and 0.638 Mattew's correlation coefficient (MCC) value. Compared with an alternative machine-learning algorithm, support vector machine (SVM), the RF method outperforms it in both sensitivity and specificity. The prediction model is implemented as a web server (RF-DYMHC) and it is freely available at http://www.bioinf.seu.edu.cn/Recombination/rf_dymhc.htm. Given a yeast genome and prediction parameters (RI-value and non-overlapping window scan size), the program reports the predicted hot/cold spots and marks them in color. |
format | Text |
id | pubmed-1933199 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-19331992007-07-31 RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features Jiang, Peng Wu, Haonan Wei, Jiawei Sang, Fei Sun, Xiao Lu, Zuhong Nucleic Acids Res Articles In the yeast, meiotic recombination is initiated by double-strand DNA breaks (DSBs) which occur at relatively high frequencies in some genomic regions (hotspots) and relatively low frequencies in others (coldspots). Although observations concerning individual hot/cold spots have given clues as to the mechanism of recombination initiation, the prediction of hot/cold spots from DNA sequence information is a challenging task. In this article, we introduce a random forest (RF) prediction model to detect recombination hot/cold spots from yeast genome. The out-of-bag (OOB) estimation of the model indicated that the RF classifier achieved high prediction performance with 82.05% total accuracy and 0.638 Mattew's correlation coefficient (MCC) value. Compared with an alternative machine-learning algorithm, support vector machine (SVM), the RF method outperforms it in both sensitivity and specificity. The prediction model is implemented as a web server (RF-DYMHC) and it is freely available at http://www.bioinf.seu.edu.cn/Recombination/rf_dymhc.htm. Given a yeast genome and prediction parameters (RI-value and non-overlapping window scan size), the program reports the predicted hot/cold spots and marks them in color. Oxford University Press 2007-07 2007-05-03 /pmc/articles/PMC1933199/ /pubmed/17478517 http://dx.doi.org/10.1093/nar/gkm217 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Articles Jiang, Peng Wu, Haonan Wei, Jiawei Sang, Fei Sun, Xiao Lu, Zuhong RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features |
title | RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features |
title_full | RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features |
title_fullStr | RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features |
title_full_unstemmed | RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features |
title_short | RF-DYMHC: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features |
title_sort | rf-dymhc: detecting the yeast meiotic recombination hotspots and coldspots by random forest model using gapped dinucleotide composition features |
topic | Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1933199/ https://www.ncbi.nlm.nih.gov/pubmed/17478517 http://dx.doi.org/10.1093/nar/gkm217 |
work_keys_str_mv | AT jiangpeng rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures AT wuhaonan rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures AT weijiawei rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures AT sangfei rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures AT sunxiao rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures AT luzuhong rfdymhcdetectingtheyeastmeioticrecombinationhotspotsandcoldspotsbyrandomforestmodelusinggappeddinucleotidecompositionfeatures |