Cargando…

Detecting SNP markers discriminating horse breeds by deep learning

The assignment of an individual to the true population of origin using a low-panel of discriminant SNP markers is one of the most important applications of genomic data for practical use. The aim of this study was to evaluate the potential of different Artificial Neural Networks (ANNs) approaches co...

Descripción completa

Detalles Bibliográficos
Autores principales: Manzoori, Siavash, Farahani, Amir Hossein Khaltabadi, Moradi, Mohammad Hossein, Kazemi-Bonchenari, Mehdi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10354035/
https://www.ncbi.nlm.nih.gov/pubmed/37464049
http://dx.doi.org/10.1038/s41598-023-38601-z
_version_ 1785074833600544768
author Manzoori, Siavash
Farahani, Amir Hossein Khaltabadi
Moradi, Mohammad Hossein
Kazemi-Bonchenari, Mehdi
author_facet Manzoori, Siavash
Farahani, Amir Hossein Khaltabadi
Moradi, Mohammad Hossein
Kazemi-Bonchenari, Mehdi
author_sort Manzoori, Siavash
collection PubMed
description The assignment of an individual to the true population of origin using a low-panel of discriminant SNP markers is one of the most important applications of genomic data for practical use. The aim of this study was to evaluate the potential of different Artificial Neural Networks (ANNs) approaches consisting Deep Neural Networks (DNN), Garson and Olden methods for feature selection of informative SNP markers from high-throughput genotyping data, that would be able to trace the true breed of unknown samples. The total of 795 animals from 37 breeds, genotyped by using the Illumina SNP 50k Bead chip were used in the current study and principal component analysis (PCA), log-likelihood ratios (LLR) and Neighbor-Joining (NJ) were applied to assess the performance of different assignment methods. The results revealed that the DNN, Garson, and Olden methods are able to assign individuals to true populations with 4270, 4937, and 7999 SNP markers, respectively. The PCA was used to determine how the animals allocated to the groups using all genotyped markers available on 50k Bead chip and the subset of SNP markers identified with different methods. The results indicated that all SNP panels are able to assign individuals into their true breeds. The success percentage of genetic assignment for different methods assessed by different levels of LLR showed that the success rate of 70% in the analysis was obtained by three methods with the number of markers of 110, 208, and 178 tags for DNN, Garson, and Olden methods, respectively. Also the results showed that DNN performed better than other two approaches by achieving 93% accuracy at the most stringent threshold. Finally, the identified SNPs were successfully used in independent out-group breeds consisting 120 individuals from eight breeds and the results indicated that these markers are able to correctly allocate all unknown samples to true population of origin. Furthermore, the NJ tree of allele-sharing distances on the validation dataset showed that the DNN has a high potential for feature selection. In general, the results of this study indicated that the DNN technique represents an efficient strategy for selecting a reduced pool of highly discriminant markers for assigning individuals to the true population of origin.
format Online
Article
Text
id pubmed-10354035
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-103540352023-07-20 Detecting SNP markers discriminating horse breeds by deep learning Manzoori, Siavash Farahani, Amir Hossein Khaltabadi Moradi, Mohammad Hossein Kazemi-Bonchenari, Mehdi Sci Rep Article The assignment of an individual to the true population of origin using a low-panel of discriminant SNP markers is one of the most important applications of genomic data for practical use. The aim of this study was to evaluate the potential of different Artificial Neural Networks (ANNs) approaches consisting Deep Neural Networks (DNN), Garson and Olden methods for feature selection of informative SNP markers from high-throughput genotyping data, that would be able to trace the true breed of unknown samples. The total of 795 animals from 37 breeds, genotyped by using the Illumina SNP 50k Bead chip were used in the current study and principal component analysis (PCA), log-likelihood ratios (LLR) and Neighbor-Joining (NJ) were applied to assess the performance of different assignment methods. The results revealed that the DNN, Garson, and Olden methods are able to assign individuals to true populations with 4270, 4937, and 7999 SNP markers, respectively. The PCA was used to determine how the animals allocated to the groups using all genotyped markers available on 50k Bead chip and the subset of SNP markers identified with different methods. The results indicated that all SNP panels are able to assign individuals into their true breeds. The success percentage of genetic assignment for different methods assessed by different levels of LLR showed that the success rate of 70% in the analysis was obtained by three methods with the number of markers of 110, 208, and 178 tags for DNN, Garson, and Olden methods, respectively. Also the results showed that DNN performed better than other two approaches by achieving 93% accuracy at the most stringent threshold. Finally, the identified SNPs were successfully used in independent out-group breeds consisting 120 individuals from eight breeds and the results indicated that these markers are able to correctly allocate all unknown samples to true population of origin. Furthermore, the NJ tree of allele-sharing distances on the validation dataset showed that the DNN has a high potential for feature selection. In general, the results of this study indicated that the DNN technique represents an efficient strategy for selecting a reduced pool of highly discriminant markers for assigning individuals to the true population of origin. Nature Publishing Group UK 2023-07-18 /pmc/articles/PMC10354035/ /pubmed/37464049 http://dx.doi.org/10.1038/s41598-023-38601-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Manzoori, Siavash
Farahani, Amir Hossein Khaltabadi
Moradi, Mohammad Hossein
Kazemi-Bonchenari, Mehdi
Detecting SNP markers discriminating horse breeds by deep learning
title Detecting SNP markers discriminating horse breeds by deep learning
title_full Detecting SNP markers discriminating horse breeds by deep learning
title_fullStr Detecting SNP markers discriminating horse breeds by deep learning
title_full_unstemmed Detecting SNP markers discriminating horse breeds by deep learning
title_short Detecting SNP markers discriminating horse breeds by deep learning
title_sort detecting snp markers discriminating horse breeds by deep learning
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10354035/
https://www.ncbi.nlm.nih.gov/pubmed/37464049
http://dx.doi.org/10.1038/s41598-023-38601-z
work_keys_str_mv AT manzoorisiavash detectingsnpmarkersdiscriminatinghorsebreedsbydeeplearning
AT farahaniamirhosseinkhaltabadi detectingsnpmarkersdiscriminatinghorsebreedsbydeeplearning
AT moradimohammadhossein detectingsnpmarkersdiscriminatinghorsebreedsbydeeplearning
AT kazemibonchenarimehdi detectingsnpmarkersdiscriminatinghorsebreedsbydeeplearning