Cargando…

Inferring the hosts of coronavirus using dual statistical models based on nucleotide composition

Many coronaviruses are capable of interspecies transmission. Some of them have caused worldwide panic as emerging human pathogens in recent years, e.g., severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV). In order to assess their thre...

Descripción completa

Detalles Bibliográficos
Autores principales: Tang, Qin, Song, Yulong, Shi, Mijuan, Cheng, Yingyin, Zhang, Wanting, Xia, Xiao-Qin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4660426/
https://www.ncbi.nlm.nih.gov/pubmed/26607834
http://dx.doi.org/10.1038/srep17155
_version_ 1782402799763456000
author Tang, Qin
Song, Yulong
Shi, Mijuan
Cheng, Yingyin
Zhang, Wanting
Xia, Xiao-Qin
author_facet Tang, Qin
Song, Yulong
Shi, Mijuan
Cheng, Yingyin
Zhang, Wanting
Xia, Xiao-Qin
author_sort Tang, Qin
collection PubMed
description Many coronaviruses are capable of interspecies transmission. Some of them have caused worldwide panic as emerging human pathogens in recent years, e.g., severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV). In order to assess their threat to humans, we explored to infer the potential hosts of coronaviruses using a dual-model approach based on nineteen parameters computed from spike genes of coronaviruses. Both the support vector machine (SVM) model and the Mahalanobis distance (MD) discriminant model achieved high accuracies in leave-one-out cross-validation of training data consisting of 730 representative coronaviruses (99.86% and 98.08% respectively). Predictions on 47 additional coronaviruses precisely conformed to conclusions or speculations by other researchers. Our approach is implemented as a web server that can be accessed at http://bioinfo.ihb.ac.cn/seq2hosts.
format Online
Article
Text
id pubmed-4660426
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-46604262015-12-02 Inferring the hosts of coronavirus using dual statistical models based on nucleotide composition Tang, Qin Song, Yulong Shi, Mijuan Cheng, Yingyin Zhang, Wanting Xia, Xiao-Qin Sci Rep Article Many coronaviruses are capable of interspecies transmission. Some of them have caused worldwide panic as emerging human pathogens in recent years, e.g., severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV). In order to assess their threat to humans, we explored to infer the potential hosts of coronaviruses using a dual-model approach based on nineteen parameters computed from spike genes of coronaviruses. Both the support vector machine (SVM) model and the Mahalanobis distance (MD) discriminant model achieved high accuracies in leave-one-out cross-validation of training data consisting of 730 representative coronaviruses (99.86% and 98.08% respectively). Predictions on 47 additional coronaviruses precisely conformed to conclusions or speculations by other researchers. Our approach is implemented as a web server that can be accessed at http://bioinfo.ihb.ac.cn/seq2hosts. Nature Publishing Group 2015-11-26 /pmc/articles/PMC4660426/ /pubmed/26607834 http://dx.doi.org/10.1038/srep17155 Text en Copyright © 2015, Macmillan Publishers Limited http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Tang, Qin
Song, Yulong
Shi, Mijuan
Cheng, Yingyin
Zhang, Wanting
Xia, Xiao-Qin
Inferring the hosts of coronavirus using dual statistical models based on nucleotide composition
title Inferring the hosts of coronavirus using dual statistical models based on nucleotide composition
title_full Inferring the hosts of coronavirus using dual statistical models based on nucleotide composition
title_fullStr Inferring the hosts of coronavirus using dual statistical models based on nucleotide composition
title_full_unstemmed Inferring the hosts of coronavirus using dual statistical models based on nucleotide composition
title_short Inferring the hosts of coronavirus using dual statistical models based on nucleotide composition
title_sort inferring the hosts of coronavirus using dual statistical models based on nucleotide composition
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4660426/
https://www.ncbi.nlm.nih.gov/pubmed/26607834
http://dx.doi.org/10.1038/srep17155
work_keys_str_mv AT tangqin inferringthehostsofcoronavirususingdualstatisticalmodelsbasedonnucleotidecomposition
AT songyulong inferringthehostsofcoronavirususingdualstatisticalmodelsbasedonnucleotidecomposition
AT shimijuan inferringthehostsofcoronavirususingdualstatisticalmodelsbasedonnucleotidecomposition
AT chengyingyin inferringthehostsofcoronavirususingdualstatisticalmodelsbasedonnucleotidecomposition
AT zhangwanting inferringthehostsofcoronavirususingdualstatisticalmodelsbasedonnucleotidecomposition
AT xiaxiaoqin inferringthehostsofcoronavirususingdualstatisticalmodelsbasedonnucleotidecomposition