Cargando…

High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data

Multiple methods to detect copy number variants (CNV) relying on different types of data have been developed and CNV have been shown to have an impact on phenotypes of numerous traits of economic importance in cattle, such as reproduction and immunity. Further improvements in CNV detection are still...

Descripción completa

Detalles Bibliográficos
Autores principales: Butty, Adrien M., Chud, Tatiane C. S., Miglior, Filippo, Schenkel, Flavio S., Kommadath, Arun, Krivushin, Kirill, Grant, Jason R., Häfliger, Irene M., Drögemüller, Cord, Cánovas, Angela, Stothard, Paul, Baes, Christine F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7229195/
https://www.ncbi.nlm.nih.gov/pubmed/32415111
http://dx.doi.org/10.1038/s41598-020-64680-3
_version_ 1783534711876878336
author Butty, Adrien M.
Chud, Tatiane C. S.
Miglior, Filippo
Schenkel, Flavio S.
Kommadath, Arun
Krivushin, Kirill
Grant, Jason R.
Häfliger, Irene M.
Drögemüller, Cord
Cánovas, Angela
Stothard, Paul
Baes, Christine F.
author_facet Butty, Adrien M.
Chud, Tatiane C. S.
Miglior, Filippo
Schenkel, Flavio S.
Kommadath, Arun
Krivushin, Kirill
Grant, Jason R.
Häfliger, Irene M.
Drögemüller, Cord
Cánovas, Angela
Stothard, Paul
Baes, Christine F.
author_sort Butty, Adrien M.
collection PubMed
description Multiple methods to detect copy number variants (CNV) relying on different types of data have been developed and CNV have been shown to have an impact on phenotypes of numerous traits of economic importance in cattle, such as reproduction and immunity. Further improvements in CNV detection are still needed in regard to the trade-off between high-true and low-false positive variant identification rates. Instead of improving single CNV detection methods, variants can be identified in silico with high confidence when multiple methods and datasets are combined. Here, CNV were identified from whole-genome sequences (WGS) and genotype array (GEN) data on 96 Holstein animals. After CNV detection, two sets of high confidence CNV regions (CNVR) were created that contained variants found in both WGS and GEN data following an animal-based (n = 52) and a population-based (n = 36) pipeline. Furthermore, the change in false positive CNV identification rates using different GEN marker densities was evaluated. The population-based approach characterized CNVR, which were more often shared among animals (average 40% more samples per CNVR) and were more often linked to putative functions (48 vs 56% of CNVR) than CNV identified with the animal-based approach. Moreover, false positive identification rates up to 22% were estimated on GEN information. Further research using larger datasets should use a population-wide approach to identify high confidence CNVR.
format Online
Article
Text
id pubmed-7229195
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-72291952020-05-26 High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data Butty, Adrien M. Chud, Tatiane C. S. Miglior, Filippo Schenkel, Flavio S. Kommadath, Arun Krivushin, Kirill Grant, Jason R. Häfliger, Irene M. Drögemüller, Cord Cánovas, Angela Stothard, Paul Baes, Christine F. Sci Rep Article Multiple methods to detect copy number variants (CNV) relying on different types of data have been developed and CNV have been shown to have an impact on phenotypes of numerous traits of economic importance in cattle, such as reproduction and immunity. Further improvements in CNV detection are still needed in regard to the trade-off between high-true and low-false positive variant identification rates. Instead of improving single CNV detection methods, variants can be identified in silico with high confidence when multiple methods and datasets are combined. Here, CNV were identified from whole-genome sequences (WGS) and genotype array (GEN) data on 96 Holstein animals. After CNV detection, two sets of high confidence CNV regions (CNVR) were created that contained variants found in both WGS and GEN data following an animal-based (n = 52) and a population-based (n = 36) pipeline. Furthermore, the change in false positive CNV identification rates using different GEN marker densities was evaluated. The population-based approach characterized CNVR, which were more often shared among animals (average 40% more samples per CNVR) and were more often linked to putative functions (48 vs 56% of CNVR) than CNV identified with the animal-based approach. Moreover, false positive identification rates up to 22% were estimated on GEN information. Further research using larger datasets should use a population-wide approach to identify high confidence CNVR. Nature Publishing Group UK 2020-05-15 /pmc/articles/PMC7229195/ /pubmed/32415111 http://dx.doi.org/10.1038/s41598-020-64680-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Butty, Adrien M.
Chud, Tatiane C. S.
Miglior, Filippo
Schenkel, Flavio S.
Kommadath, Arun
Krivushin, Kirill
Grant, Jason R.
Häfliger, Irene M.
Drögemüller, Cord
Cánovas, Angela
Stothard, Paul
Baes, Christine F.
High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data
title High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data
title_full High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data
title_fullStr High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data
title_full_unstemmed High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data
title_short High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data
title_sort high confidence copy number variants identified in holstein dairy cattle from whole genome sequence and genotype array data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7229195/
https://www.ncbi.nlm.nih.gov/pubmed/32415111
http://dx.doi.org/10.1038/s41598-020-64680-3
work_keys_str_mv AT buttyadrienm highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT chudtatianecs highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT migliorfilippo highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT schenkelflavios highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT kommadatharun highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT krivushinkirill highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT grantjasonr highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT hafligerirenem highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT drogemullercord highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT canovasangela highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT stothardpaul highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata
AT baeschristinef highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata