Cargando…
High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data
Multiple methods to detect copy number variants (CNV) relying on different types of data have been developed and CNV have been shown to have an impact on phenotypes of numerous traits of economic importance in cattle, such as reproduction and immunity. Further improvements in CNV detection are still...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7229195/ https://www.ncbi.nlm.nih.gov/pubmed/32415111 http://dx.doi.org/10.1038/s41598-020-64680-3 |
_version_ | 1783534711876878336 |
---|---|
author | Butty, Adrien M. Chud, Tatiane C. S. Miglior, Filippo Schenkel, Flavio S. Kommadath, Arun Krivushin, Kirill Grant, Jason R. Häfliger, Irene M. Drögemüller, Cord Cánovas, Angela Stothard, Paul Baes, Christine F. |
author_facet | Butty, Adrien M. Chud, Tatiane C. S. Miglior, Filippo Schenkel, Flavio S. Kommadath, Arun Krivushin, Kirill Grant, Jason R. Häfliger, Irene M. Drögemüller, Cord Cánovas, Angela Stothard, Paul Baes, Christine F. |
author_sort | Butty, Adrien M. |
collection | PubMed |
description | Multiple methods to detect copy number variants (CNV) relying on different types of data have been developed and CNV have been shown to have an impact on phenotypes of numerous traits of economic importance in cattle, such as reproduction and immunity. Further improvements in CNV detection are still needed in regard to the trade-off between high-true and low-false positive variant identification rates. Instead of improving single CNV detection methods, variants can be identified in silico with high confidence when multiple methods and datasets are combined. Here, CNV were identified from whole-genome sequences (WGS) and genotype array (GEN) data on 96 Holstein animals. After CNV detection, two sets of high confidence CNV regions (CNVR) were created that contained variants found in both WGS and GEN data following an animal-based (n = 52) and a population-based (n = 36) pipeline. Furthermore, the change in false positive CNV identification rates using different GEN marker densities was evaluated. The population-based approach characterized CNVR, which were more often shared among animals (average 40% more samples per CNVR) and were more often linked to putative functions (48 vs 56% of CNVR) than CNV identified with the animal-based approach. Moreover, false positive identification rates up to 22% were estimated on GEN information. Further research using larger datasets should use a population-wide approach to identify high confidence CNVR. |
format | Online Article Text |
id | pubmed-7229195 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-72291952020-05-26 High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data Butty, Adrien M. Chud, Tatiane C. S. Miglior, Filippo Schenkel, Flavio S. Kommadath, Arun Krivushin, Kirill Grant, Jason R. Häfliger, Irene M. Drögemüller, Cord Cánovas, Angela Stothard, Paul Baes, Christine F. Sci Rep Article Multiple methods to detect copy number variants (CNV) relying on different types of data have been developed and CNV have been shown to have an impact on phenotypes of numerous traits of economic importance in cattle, such as reproduction and immunity. Further improvements in CNV detection are still needed in regard to the trade-off between high-true and low-false positive variant identification rates. Instead of improving single CNV detection methods, variants can be identified in silico with high confidence when multiple methods and datasets are combined. Here, CNV were identified from whole-genome sequences (WGS) and genotype array (GEN) data on 96 Holstein animals. After CNV detection, two sets of high confidence CNV regions (CNVR) were created that contained variants found in both WGS and GEN data following an animal-based (n = 52) and a population-based (n = 36) pipeline. Furthermore, the change in false positive CNV identification rates using different GEN marker densities was evaluated. The population-based approach characterized CNVR, which were more often shared among animals (average 40% more samples per CNVR) and were more often linked to putative functions (48 vs 56% of CNVR) than CNV identified with the animal-based approach. Moreover, false positive identification rates up to 22% were estimated on GEN information. Further research using larger datasets should use a population-wide approach to identify high confidence CNVR. Nature Publishing Group UK 2020-05-15 /pmc/articles/PMC7229195/ /pubmed/32415111 http://dx.doi.org/10.1038/s41598-020-64680-3 Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Butty, Adrien M. Chud, Tatiane C. S. Miglior, Filippo Schenkel, Flavio S. Kommadath, Arun Krivushin, Kirill Grant, Jason R. Häfliger, Irene M. Drögemüller, Cord Cánovas, Angela Stothard, Paul Baes, Christine F. High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data |
title | High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data |
title_full | High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data |
title_fullStr | High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data |
title_full_unstemmed | High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data |
title_short | High confidence copy number variants identified in Holstein dairy cattle from whole genome sequence and genotype array data |
title_sort | high confidence copy number variants identified in holstein dairy cattle from whole genome sequence and genotype array data |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7229195/ https://www.ncbi.nlm.nih.gov/pubmed/32415111 http://dx.doi.org/10.1038/s41598-020-64680-3 |
work_keys_str_mv | AT buttyadrienm highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT chudtatianecs highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT migliorfilippo highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT schenkelflavios highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT kommadatharun highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT krivushinkirill highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT grantjasonr highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT hafligerirenem highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT drogemullercord highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT canovasangela highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT stothardpaul highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata AT baeschristinef highconfidencecopynumbervariantsidentifiedinholsteindairycattlefromwholegenomesequenceandgenotypearraydata |