Cargando…

Tracing Cattle Breeds with Principal Components Analysis Ancestry Informative SNPs

The recent release of the Bovine HapMap dataset represents the most detailed survey of bovine genetic diversity to date, providing an important resource for the design and development of livestock production. We studied this dataset, comprising more than 30,000 Single Nucleotide Polymorphisms (SNPs)...

Descripción completa

Detalles Bibliográficos
Autores principales: Lewis, Jamey, Abas, Zafiris, Dadousis, Christos, Lykidis, Dimitrios, Paschou, Peristera, Drineas, Petros
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3072384/
https://www.ncbi.nlm.nih.gov/pubmed/21490966
http://dx.doi.org/10.1371/journal.pone.0018007
_version_ 1782201545866084352
author Lewis, Jamey
Abas, Zafiris
Dadousis, Christos
Lykidis, Dimitrios
Paschou, Peristera
Drineas, Petros
author_facet Lewis, Jamey
Abas, Zafiris
Dadousis, Christos
Lykidis, Dimitrios
Paschou, Peristera
Drineas, Petros
author_sort Lewis, Jamey
collection PubMed
description The recent release of the Bovine HapMap dataset represents the most detailed survey of bovine genetic diversity to date, providing an important resource for the design and development of livestock production. We studied this dataset, comprising more than 30,000 Single Nucleotide Polymorphisms (SNPs) for 19 breeds (13 taurine, three zebu, and three hybrid breeds), seeking to identify small panels of genetic markers that can be used to trace the breed of unknown cattle samples. Taking advantage of the power of Principal Components Analysis and algorithms that we have recently described for the selection of Ancestry Informative Markers from genomewide datasets, we present a decision-tree which can be used to accurately infer the origin of individual cattle. In doing so, we present a thorough examination of population genetic structure in modern bovine breeds. Performing extensive cross-validation experiments, we demonstrate that 250-500 carefully selected SNPs suffice in order to achieve close to 100% prediction accuracy of individual ancestry, when this particular set of 19 breeds is considered. Our methods, coupled with the dense genotypic data that is becoming increasingly available, have the potential to become a valuable tool and have considerable impact in worldwide livestock production. They can be used to inform the design of studies of the genetic basis of economically important traits in cattle, as well as breeding programs and efforts to conserve biodiversity. Furthermore, the SNPs that we have identified can provide a reliable solution for the traceability of breed-specific branded products.
format Text
id pubmed-3072384
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30723842011-04-13 Tracing Cattle Breeds with Principal Components Analysis Ancestry Informative SNPs Lewis, Jamey Abas, Zafiris Dadousis, Christos Lykidis, Dimitrios Paschou, Peristera Drineas, Petros PLoS One Research Article The recent release of the Bovine HapMap dataset represents the most detailed survey of bovine genetic diversity to date, providing an important resource for the design and development of livestock production. We studied this dataset, comprising more than 30,000 Single Nucleotide Polymorphisms (SNPs) for 19 breeds (13 taurine, three zebu, and three hybrid breeds), seeking to identify small panels of genetic markers that can be used to trace the breed of unknown cattle samples. Taking advantage of the power of Principal Components Analysis and algorithms that we have recently described for the selection of Ancestry Informative Markers from genomewide datasets, we present a decision-tree which can be used to accurately infer the origin of individual cattle. In doing so, we present a thorough examination of population genetic structure in modern bovine breeds. Performing extensive cross-validation experiments, we demonstrate that 250-500 carefully selected SNPs suffice in order to achieve close to 100% prediction accuracy of individual ancestry, when this particular set of 19 breeds is considered. Our methods, coupled with the dense genotypic data that is becoming increasingly available, have the potential to become a valuable tool and have considerable impact in worldwide livestock production. They can be used to inform the design of studies of the genetic basis of economically important traits in cattle, as well as breeding programs and efforts to conserve biodiversity. Furthermore, the SNPs that we have identified can provide a reliable solution for the traceability of breed-specific branded products. Public Library of Science 2011-04-07 /pmc/articles/PMC3072384/ /pubmed/21490966 http://dx.doi.org/10.1371/journal.pone.0018007 Text en Lewis et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Lewis, Jamey
Abas, Zafiris
Dadousis, Christos
Lykidis, Dimitrios
Paschou, Peristera
Drineas, Petros
Tracing Cattle Breeds with Principal Components Analysis Ancestry Informative SNPs
title Tracing Cattle Breeds with Principal Components Analysis Ancestry Informative SNPs
title_full Tracing Cattle Breeds with Principal Components Analysis Ancestry Informative SNPs
title_fullStr Tracing Cattle Breeds with Principal Components Analysis Ancestry Informative SNPs
title_full_unstemmed Tracing Cattle Breeds with Principal Components Analysis Ancestry Informative SNPs
title_short Tracing Cattle Breeds with Principal Components Analysis Ancestry Informative SNPs
title_sort tracing cattle breeds with principal components analysis ancestry informative snps
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3072384/
https://www.ncbi.nlm.nih.gov/pubmed/21490966
http://dx.doi.org/10.1371/journal.pone.0018007
work_keys_str_mv AT lewisjamey tracingcattlebreedswithprincipalcomponentsanalysisancestryinformativesnps
AT abaszafiris tracingcattlebreedswithprincipalcomponentsanalysisancestryinformativesnps
AT dadousischristos tracingcattlebreedswithprincipalcomponentsanalysisancestryinformativesnps
AT lykidisdimitrios tracingcattlebreedswithprincipalcomponentsanalysisancestryinformativesnps
AT paschouperistera tracingcattlebreedswithprincipalcomponentsanalysisancestryinformativesnps
AT drineaspetros tracingcattlebreedswithprincipalcomponentsanalysisancestryinformativesnps