Cargando…

Identifying a few foot-and-mouth disease virus signature nucleotide strings for computational genotyping

BACKGROUND: Serotypes of the Foot-and-Mouth disease viruses (FMDVs) were generally determined by biological experiments. The computational genotyping is not well studied even with the availability of whole viral genomes, due to uneven evolution among genes as well as frequent genetic recombination....

Descripción completa

Detalles Bibliográficos
Autores principales: Lin, Guohui, Cai, Zhipeng, Wu, Junfeng, Wan, Xiu-Feng, Xu, Lizhe, Goebel, Randy
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2438327/
https://www.ncbi.nlm.nih.gov/pubmed/18554404
http://dx.doi.org/10.1186/1471-2105-9-279
_version_ 1782156507685584896
author Lin, Guohui
Cai, Zhipeng
Wu, Junfeng
Wan, Xiu-Feng
Xu, Lizhe
Goebel, Randy
author_facet Lin, Guohui
Cai, Zhipeng
Wu, Junfeng
Wan, Xiu-Feng
Xu, Lizhe
Goebel, Randy
author_sort Lin, Guohui
collection PubMed
description BACKGROUND: Serotypes of the Foot-and-Mouth disease viruses (FMDVs) were generally determined by biological experiments. The computational genotyping is not well studied even with the availability of whole viral genomes, due to uneven evolution among genes as well as frequent genetic recombination. Naively using sequence comparison for genotyping is only able to achieve a limited extent of success. RESULTS: We used 129 FMDV strains with known serotype as training strains to select as many as 140 most serotype-specific nucleotide strings. We then constructed a linear-kernel Support Vector Machine classifier using these 140 strings. Under the leave-one-out cross validation scheme, this classifier was able to assign correct serotype to 127 of these 129 strains, achieving 98.45% accuracy. It also assigned serotype correctly to an independent test set of 83 other FMDV strains downloaded separately from NCBI GenBank. CONCLUSION: Computational genotyping is much faster and much cheaper than the wet-lab based biological experiments, upon the availability of the detailed molecular sequences. The high accuracy of our proposed method suggests the potential of utilizing a few signature nucleotide strings instead of whole genomes to determine the serotypes of novel FMDV strains.
format Text
id pubmed-2438327
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-24383272008-06-25 Identifying a few foot-and-mouth disease virus signature nucleotide strings for computational genotyping Lin, Guohui Cai, Zhipeng Wu, Junfeng Wan, Xiu-Feng Xu, Lizhe Goebel, Randy BMC Bioinformatics Research Article BACKGROUND: Serotypes of the Foot-and-Mouth disease viruses (FMDVs) were generally determined by biological experiments. The computational genotyping is not well studied even with the availability of whole viral genomes, due to uneven evolution among genes as well as frequent genetic recombination. Naively using sequence comparison for genotyping is only able to achieve a limited extent of success. RESULTS: We used 129 FMDV strains with known serotype as training strains to select as many as 140 most serotype-specific nucleotide strings. We then constructed a linear-kernel Support Vector Machine classifier using these 140 strings. Under the leave-one-out cross validation scheme, this classifier was able to assign correct serotype to 127 of these 129 strains, achieving 98.45% accuracy. It also assigned serotype correctly to an independent test set of 83 other FMDV strains downloaded separately from NCBI GenBank. CONCLUSION: Computational genotyping is much faster and much cheaper than the wet-lab based biological experiments, upon the availability of the detailed molecular sequences. The high accuracy of our proposed method suggests the potential of utilizing a few signature nucleotide strings instead of whole genomes to determine the serotypes of novel FMDV strains. BioMed Central 2008-06-13 /pmc/articles/PMC2438327/ /pubmed/18554404 http://dx.doi.org/10.1186/1471-2105-9-279 Text en Copyright © 2008 Lin et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Lin, Guohui
Cai, Zhipeng
Wu, Junfeng
Wan, Xiu-Feng
Xu, Lizhe
Goebel, Randy
Identifying a few foot-and-mouth disease virus signature nucleotide strings for computational genotyping
title Identifying a few foot-and-mouth disease virus signature nucleotide strings for computational genotyping
title_full Identifying a few foot-and-mouth disease virus signature nucleotide strings for computational genotyping
title_fullStr Identifying a few foot-and-mouth disease virus signature nucleotide strings for computational genotyping
title_full_unstemmed Identifying a few foot-and-mouth disease virus signature nucleotide strings for computational genotyping
title_short Identifying a few foot-and-mouth disease virus signature nucleotide strings for computational genotyping
title_sort identifying a few foot-and-mouth disease virus signature nucleotide strings for computational genotyping
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2438327/
https://www.ncbi.nlm.nih.gov/pubmed/18554404
http://dx.doi.org/10.1186/1471-2105-9-279
work_keys_str_mv AT linguohui identifyingafewfootandmouthdiseasevirussignaturenucleotidestringsforcomputationalgenotyping
AT caizhipeng identifyingafewfootandmouthdiseasevirussignaturenucleotidestringsforcomputationalgenotyping
AT wujunfeng identifyingafewfootandmouthdiseasevirussignaturenucleotidestringsforcomputationalgenotyping
AT wanxiufeng identifyingafewfootandmouthdiseasevirussignaturenucleotidestringsforcomputationalgenotyping
AT xulizhe identifyingafewfootandmouthdiseasevirussignaturenucleotidestringsforcomputationalgenotyping
AT goebelrandy identifyingafewfootandmouthdiseasevirussignaturenucleotidestringsforcomputationalgenotyping