Cargando…

iRO-PsekGCC: Identify DNA Replication Origins Based on Pseudo k-Tuple GC Composition

Summary: Identification of replication origins is playing a key role in understanding the mechanism of DNA replication. This task is of great significance in DNA sequence analysis. Because of its importance, some computational approaches have been introduced. Among these predictors, the iRO-3wPseKNC...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Bin, Chen, Shengyu, Yan, Ke, Weng, Fan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6759546/
https://www.ncbi.nlm.nih.gov/pubmed/31620165
http://dx.doi.org/10.3389/fgene.2019.00842
_version_ 1783453712879976448
author Liu, Bin
Chen, Shengyu
Yan, Ke
Weng, Fan
author_facet Liu, Bin
Chen, Shengyu
Yan, Ke
Weng, Fan
author_sort Liu, Bin
collection PubMed
description Summary: Identification of replication origins is playing a key role in understanding the mechanism of DNA replication. This task is of great significance in DNA sequence analysis. Because of its importance, some computational approaches have been introduced. Among these predictors, the iRO-3wPseKNC predictor is the first discriminative method that is able to correctly identify the entire replication origins. For further improving its predictive performance, we proposed the Pseudo k-tuple GC Composition (PsekGCC) approach to capture the “GC asymmetry bias” of yeast species by considering both the GC skew and the sequence order effects of k-tuple GC Composition (k-GCC) in this study. Based on PseKGCC, we proposed a new predictor called iRO-PsekGCC to identify the DNA replication origins. Rigorous jackknife test on two yeast species benchmark datasets (Saccharomyces cerevisiae, Pichia pastoris) indicated that iRO-PsekGCC outperformed iRO-3wPseKNC. It can be anticipated that iRO-PsekGCC will be a useful tool for DNA replication origin identification. Availability and implementation: The web-server for the iRO-PsekGCC predictor was established, and it can be accessed at http://bliulab.net/iRO-PsekGCC/.
format Online
Article
Text
id pubmed-6759546
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67595462019-10-16 iRO-PsekGCC: Identify DNA Replication Origins Based on Pseudo k-Tuple GC Composition Liu, Bin Chen, Shengyu Yan, Ke Weng, Fan Front Genet Genetics Summary: Identification of replication origins is playing a key role in understanding the mechanism of DNA replication. This task is of great significance in DNA sequence analysis. Because of its importance, some computational approaches have been introduced. Among these predictors, the iRO-3wPseKNC predictor is the first discriminative method that is able to correctly identify the entire replication origins. For further improving its predictive performance, we proposed the Pseudo k-tuple GC Composition (PsekGCC) approach to capture the “GC asymmetry bias” of yeast species by considering both the GC skew and the sequence order effects of k-tuple GC Composition (k-GCC) in this study. Based on PseKGCC, we proposed a new predictor called iRO-PsekGCC to identify the DNA replication origins. Rigorous jackknife test on two yeast species benchmark datasets (Saccharomyces cerevisiae, Pichia pastoris) indicated that iRO-PsekGCC outperformed iRO-3wPseKNC. It can be anticipated that iRO-PsekGCC will be a useful tool for DNA replication origin identification. Availability and implementation: The web-server for the iRO-PsekGCC predictor was established, and it can be accessed at http://bliulab.net/iRO-PsekGCC/. Frontiers Media S.A. 2019-09-18 /pmc/articles/PMC6759546/ /pubmed/31620165 http://dx.doi.org/10.3389/fgene.2019.00842 Text en Copyright © 2019 Liu, Chen, Yan and Weng http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Liu, Bin
Chen, Shengyu
Yan, Ke
Weng, Fan
iRO-PsekGCC: Identify DNA Replication Origins Based on Pseudo k-Tuple GC Composition
title iRO-PsekGCC: Identify DNA Replication Origins Based on Pseudo k-Tuple GC Composition
title_full iRO-PsekGCC: Identify DNA Replication Origins Based on Pseudo k-Tuple GC Composition
title_fullStr iRO-PsekGCC: Identify DNA Replication Origins Based on Pseudo k-Tuple GC Composition
title_full_unstemmed iRO-PsekGCC: Identify DNA Replication Origins Based on Pseudo k-Tuple GC Composition
title_short iRO-PsekGCC: Identify DNA Replication Origins Based on Pseudo k-Tuple GC Composition
title_sort iro-psekgcc: identify dna replication origins based on pseudo k-tuple gc composition
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6759546/
https://www.ncbi.nlm.nih.gov/pubmed/31620165
http://dx.doi.org/10.3389/fgene.2019.00842
work_keys_str_mv AT liubin iropsekgccidentifydnareplicationoriginsbasedonpseudoktuplegccomposition
AT chenshengyu iropsekgccidentifydnareplicationoriginsbasedonpseudoktuplegccomposition
AT yanke iropsekgccidentifydnareplicationoriginsbasedonpseudoktuplegccomposition
AT wengfan iropsekgccidentifydnareplicationoriginsbasedonpseudoktuplegccomposition