Cargando…
Detecting SNP markers discriminating horse breeds by deep learning
The assignment of an individual to the true population of origin using a low-panel of discriminant SNP markers is one of the most important applications of genomic data for practical use. The aim of this study was to evaluate the potential of different Artificial Neural Networks (ANNs) approaches co...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10354035/ https://www.ncbi.nlm.nih.gov/pubmed/37464049 http://dx.doi.org/10.1038/s41598-023-38601-z |
_version_ | 1785074833600544768 |
---|---|
author | Manzoori, Siavash Farahani, Amir Hossein Khaltabadi Moradi, Mohammad Hossein Kazemi-Bonchenari, Mehdi |
author_facet | Manzoori, Siavash Farahani, Amir Hossein Khaltabadi Moradi, Mohammad Hossein Kazemi-Bonchenari, Mehdi |
author_sort | Manzoori, Siavash |
collection | PubMed |
description | The assignment of an individual to the true population of origin using a low-panel of discriminant SNP markers is one of the most important applications of genomic data for practical use. The aim of this study was to evaluate the potential of different Artificial Neural Networks (ANNs) approaches consisting Deep Neural Networks (DNN), Garson and Olden methods for feature selection of informative SNP markers from high-throughput genotyping data, that would be able to trace the true breed of unknown samples. The total of 795 animals from 37 breeds, genotyped by using the Illumina SNP 50k Bead chip were used in the current study and principal component analysis (PCA), log-likelihood ratios (LLR) and Neighbor-Joining (NJ) were applied to assess the performance of different assignment methods. The results revealed that the DNN, Garson, and Olden methods are able to assign individuals to true populations with 4270, 4937, and 7999 SNP markers, respectively. The PCA was used to determine how the animals allocated to the groups using all genotyped markers available on 50k Bead chip and the subset of SNP markers identified with different methods. The results indicated that all SNP panels are able to assign individuals into their true breeds. The success percentage of genetic assignment for different methods assessed by different levels of LLR showed that the success rate of 70% in the analysis was obtained by three methods with the number of markers of 110, 208, and 178 tags for DNN, Garson, and Olden methods, respectively. Also the results showed that DNN performed better than other two approaches by achieving 93% accuracy at the most stringent threshold. Finally, the identified SNPs were successfully used in independent out-group breeds consisting 120 individuals from eight breeds and the results indicated that these markers are able to correctly allocate all unknown samples to true population of origin. Furthermore, the NJ tree of allele-sharing distances on the validation dataset showed that the DNN has a high potential for feature selection. In general, the results of this study indicated that the DNN technique represents an efficient strategy for selecting a reduced pool of highly discriminant markers for assigning individuals to the true population of origin. |
format | Online Article Text |
id | pubmed-10354035 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-103540352023-07-20 Detecting SNP markers discriminating horse breeds by deep learning Manzoori, Siavash Farahani, Amir Hossein Khaltabadi Moradi, Mohammad Hossein Kazemi-Bonchenari, Mehdi Sci Rep Article The assignment of an individual to the true population of origin using a low-panel of discriminant SNP markers is one of the most important applications of genomic data for practical use. The aim of this study was to evaluate the potential of different Artificial Neural Networks (ANNs) approaches consisting Deep Neural Networks (DNN), Garson and Olden methods for feature selection of informative SNP markers from high-throughput genotyping data, that would be able to trace the true breed of unknown samples. The total of 795 animals from 37 breeds, genotyped by using the Illumina SNP 50k Bead chip were used in the current study and principal component analysis (PCA), log-likelihood ratios (LLR) and Neighbor-Joining (NJ) were applied to assess the performance of different assignment methods. The results revealed that the DNN, Garson, and Olden methods are able to assign individuals to true populations with 4270, 4937, and 7999 SNP markers, respectively. The PCA was used to determine how the animals allocated to the groups using all genotyped markers available on 50k Bead chip and the subset of SNP markers identified with different methods. The results indicated that all SNP panels are able to assign individuals into their true breeds. The success percentage of genetic assignment for different methods assessed by different levels of LLR showed that the success rate of 70% in the analysis was obtained by three methods with the number of markers of 110, 208, and 178 tags for DNN, Garson, and Olden methods, respectively. Also the results showed that DNN performed better than other two approaches by achieving 93% accuracy at the most stringent threshold. Finally, the identified SNPs were successfully used in independent out-group breeds consisting 120 individuals from eight breeds and the results indicated that these markers are able to correctly allocate all unknown samples to true population of origin. Furthermore, the NJ tree of allele-sharing distances on the validation dataset showed that the DNN has a high potential for feature selection. In general, the results of this study indicated that the DNN technique represents an efficient strategy for selecting a reduced pool of highly discriminant markers for assigning individuals to the true population of origin. Nature Publishing Group UK 2023-07-18 /pmc/articles/PMC10354035/ /pubmed/37464049 http://dx.doi.org/10.1038/s41598-023-38601-z Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Manzoori, Siavash Farahani, Amir Hossein Khaltabadi Moradi, Mohammad Hossein Kazemi-Bonchenari, Mehdi Detecting SNP markers discriminating horse breeds by deep learning |
title | Detecting SNP markers discriminating horse breeds by deep learning |
title_full | Detecting SNP markers discriminating horse breeds by deep learning |
title_fullStr | Detecting SNP markers discriminating horse breeds by deep learning |
title_full_unstemmed | Detecting SNP markers discriminating horse breeds by deep learning |
title_short | Detecting SNP markers discriminating horse breeds by deep learning |
title_sort | detecting snp markers discriminating horse breeds by deep learning |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10354035/ https://www.ncbi.nlm.nih.gov/pubmed/37464049 http://dx.doi.org/10.1038/s41598-023-38601-z |
work_keys_str_mv | AT manzoorisiavash detectingsnpmarkersdiscriminatinghorsebreedsbydeeplearning AT farahaniamirhosseinkhaltabadi detectingsnpmarkersdiscriminatinghorsebreedsbydeeplearning AT moradimohammadhossein detectingsnpmarkersdiscriminatinghorsebreedsbydeeplearning AT kazemibonchenarimehdi detectingsnpmarkersdiscriminatinghorsebreedsbydeeplearning |