Cargando…

Risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data

Despite the various existing studies about nonsynonymous single nucleotide polymorphisms (nsSNPs), genome-wide studies based on nsSNPs are rare. NsSNPs alter amino acid sequences, affect protein structure and function, and have deleterious effects. By predicting the deleterious effect of nsSNPs, we...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Young-Sup, Won, KyeongHye, Shin, Donghyun, Oh, Jae-Don
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7781907/
https://www.ncbi.nlm.nih.gov/pubmed/33456716
http://dx.doi.org/10.1080/19768354.2020.1860125
_version_ 1783631776013352960
author Lee, Young-Sup
Won, KyeongHye
Shin, Donghyun
Oh, Jae-Don
author_facet Lee, Young-Sup
Won, KyeongHye
Shin, Donghyun
Oh, Jae-Don
author_sort Lee, Young-Sup
collection PubMed
description Despite the various existing studies about nonsynonymous single nucleotide polymorphisms (nsSNPs), genome-wide studies based on nsSNPs are rare. NsSNPs alter amino acid sequences, affect protein structure and function, and have deleterious effects. By predicting the deleterious effect of nsSNPs, we determined the total risk score per individual. Additionally, the machine learning technique was utilized to find an optimal nsSNP subset that best explains the complete nsSNP effect. A total of 16,100 nsSNPs were selected as the best representatives among 89,519 regressed nsSNPs. In the gene ontology analysis encompassing the 16,100 nsSNPs, DNA metabolic process, chemokine- and immune-related, and reproduction were the most enriched terms. We expect that our risk score prediction and nsSNP marker selection will contribute to future development of extant genome-wide association studies and breeding science more broadly.
format Online
Article
Text
id pubmed-7781907
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-77819072021-01-14 Risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data Lee, Young-Sup Won, KyeongHye Shin, Donghyun Oh, Jae-Don Anim Cells Syst (Seoul) Genes & Genomics Despite the various existing studies about nonsynonymous single nucleotide polymorphisms (nsSNPs), genome-wide studies based on nsSNPs are rare. NsSNPs alter amino acid sequences, affect protein structure and function, and have deleterious effects. By predicting the deleterious effect of nsSNPs, we determined the total risk score per individual. Additionally, the machine learning technique was utilized to find an optimal nsSNP subset that best explains the complete nsSNP effect. A total of 16,100 nsSNPs were selected as the best representatives among 89,519 regressed nsSNPs. In the gene ontology analysis encompassing the 16,100 nsSNPs, DNA metabolic process, chemokine- and immune-related, and reproduction were the most enriched terms. We expect that our risk score prediction and nsSNP marker selection will contribute to future development of extant genome-wide association studies and breeding science more broadly. Taylor & Francis 2020-12-24 /pmc/articles/PMC7781907/ /pubmed/33456716 http://dx.doi.org/10.1080/19768354.2020.1860125 Text en © 2020 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Genes & Genomics
Lee, Young-Sup
Won, KyeongHye
Shin, Donghyun
Oh, Jae-Don
Risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data
title Risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data
title_full Risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data
title_fullStr Risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data
title_full_unstemmed Risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data
title_short Risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data
title_sort risk prediction and marker selection in nonsynonymous single nucleotide polymorphisms using whole genome sequencing data
topic Genes & Genomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7781907/
https://www.ncbi.nlm.nih.gov/pubmed/33456716
http://dx.doi.org/10.1080/19768354.2020.1860125
work_keys_str_mv AT leeyoungsup riskpredictionandmarkerselectioninnonsynonymoussinglenucleotidepolymorphismsusingwholegenomesequencingdata
AT wonkyeonghye riskpredictionandmarkerselectioninnonsynonymoussinglenucleotidepolymorphismsusingwholegenomesequencingdata
AT shindonghyun riskpredictionandmarkerselectioninnonsynonymoussinglenucleotidepolymorphismsusingwholegenomesequencingdata
AT ohjaedon riskpredictionandmarkerselectioninnonsynonymoussinglenucleotidepolymorphismsusingwholegenomesequencingdata