Cargando…
Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances
BACKGROUND: Runs of homozygosity (ROH) islands are stretches of homozygous sequence in the genome of a large proportion of individuals in a population. Algorithms for the detection of ROH depend on the similarity of haplotypes. Coverage gaps and copy number variants (CNV) may result in incorrect ide...
Autores principales: | , , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6106898/ https://www.ncbi.nlm.nih.gov/pubmed/30134820 http://dx.doi.org/10.1186/s12711-018-0414-x |
_version_ | 1783349866842292224 |
---|---|
author | Nandolo, Wilson Utsunomiya, Yuri T. Mészáros, Gábor Wurzinger, Maria Khayadzadeh, Negar Torrecilha, Rafaela B. P. Mulindwa, Henry A. Gondwe, Timothy N. Waldmann, Patrik Ferenčaković, Maja Garcia, José F. Rosen, Benjamin D. Bickhart, Derek van Tassell, Curt P. Curik, Ino Sölkner, Johann |
author_facet | Nandolo, Wilson Utsunomiya, Yuri T. Mészáros, Gábor Wurzinger, Maria Khayadzadeh, Negar Torrecilha, Rafaela B. P. Mulindwa, Henry A. Gondwe, Timothy N. Waldmann, Patrik Ferenčaković, Maja Garcia, José F. Rosen, Benjamin D. Bickhart, Derek van Tassell, Curt P. Curik, Ino Sölkner, Johann |
author_sort | Nandolo, Wilson |
collection | PubMed |
description | BACKGROUND: Runs of homozygosity (ROH) islands are stretches of homozygous sequence in the genome of a large proportion of individuals in a population. Algorithms for the detection of ROH depend on the similarity of haplotypes. Coverage gaps and copy number variants (CNV) may result in incorrect identification of such similarity, leading to the detection of ROH islands where none exists. Misidentified hemizygous regions will also appear as homozygous based on sequence variation alone. Our aim was to identify ROH islands influenced by marker coverage gaps or CNV, using Illumina BovineHD BeadChip (777 K) single nucleotide polymorphism (SNP) data for Austrian Brown Swiss, Tyrol Grey and Pinzgauer cattle. METHODS: ROH were detected using clustering, and ROH islands were determined from population inbreeding levels for each marker. CNV were detected using a multivariate copy number analysis method and a hidden Markov model. SNP coverage gaps were defined as genomic regions with intermarker distances on average longer than 9.24 kb. ROH islands that overlapped CNV regions (CNVR) or SNP coverage gaps were considered as potential artefacts. Permutation tests were used to determine if overlaps between CNVR with copy losses and ROH islands were due to chance. Diversity of the haplotypes in the ROH islands was assessed by haplotype analyses. RESULTS: In Brown Swiss, Tyrol Grey and Pinzgauer, we identified 13, 22, and 24 ROH islands covering 26.6, 389.0 and 35.8 Mb, respectively, and we detected 30, 50 and 71 CNVR derived from CNV by using both algorithms, respectively. Overlaps between ROH islands, CNVR or coverage gaps occurred for 7, 14 and 16 ROH islands, respectively. About 37, 44 and 52% of the ROH islands coverage in Brown Swiss, Tyrol Grey and Pinzgauer, respectively, were affected by copy loss. Intersections between ROH islands and CNVR were small, but significantly larger compared to ROH islands at random locations across the genome, implying an association between ROH islands and CNVR. Haplotype diversity for reliable ROH islands was lower than for ROH islands that intersected with copy loss CNVR. CONCLUSIONS: Our findings show that a significant proportion of the ROH islands in the bovine genome are artefacts due to CNV or SNP coverage gaps. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-018-0414-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6106898 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-61068982018-08-29 Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances Nandolo, Wilson Utsunomiya, Yuri T. Mészáros, Gábor Wurzinger, Maria Khayadzadeh, Negar Torrecilha, Rafaela B. P. Mulindwa, Henry A. Gondwe, Timothy N. Waldmann, Patrik Ferenčaković, Maja Garcia, José F. Rosen, Benjamin D. Bickhart, Derek van Tassell, Curt P. Curik, Ino Sölkner, Johann Genet Sel Evol Research Article BACKGROUND: Runs of homozygosity (ROH) islands are stretches of homozygous sequence in the genome of a large proportion of individuals in a population. Algorithms for the detection of ROH depend on the similarity of haplotypes. Coverage gaps and copy number variants (CNV) may result in incorrect identification of such similarity, leading to the detection of ROH islands where none exists. Misidentified hemizygous regions will also appear as homozygous based on sequence variation alone. Our aim was to identify ROH islands influenced by marker coverage gaps or CNV, using Illumina BovineHD BeadChip (777 K) single nucleotide polymorphism (SNP) data for Austrian Brown Swiss, Tyrol Grey and Pinzgauer cattle. METHODS: ROH were detected using clustering, and ROH islands were determined from population inbreeding levels for each marker. CNV were detected using a multivariate copy number analysis method and a hidden Markov model. SNP coverage gaps were defined as genomic regions with intermarker distances on average longer than 9.24 kb. ROH islands that overlapped CNV regions (CNVR) or SNP coverage gaps were considered as potential artefacts. Permutation tests were used to determine if overlaps between CNVR with copy losses and ROH islands were due to chance. Diversity of the haplotypes in the ROH islands was assessed by haplotype analyses. RESULTS: In Brown Swiss, Tyrol Grey and Pinzgauer, we identified 13, 22, and 24 ROH islands covering 26.6, 389.0 and 35.8 Mb, respectively, and we detected 30, 50 and 71 CNVR derived from CNV by using both algorithms, respectively. Overlaps between ROH islands, CNVR or coverage gaps occurred for 7, 14 and 16 ROH islands, respectively. About 37, 44 and 52% of the ROH islands coverage in Brown Swiss, Tyrol Grey and Pinzgauer, respectively, were affected by copy loss. Intersections between ROH islands and CNVR were small, but significantly larger compared to ROH islands at random locations across the genome, implying an association between ROH islands and CNVR. Haplotype diversity for reliable ROH islands was lower than for ROH islands that intersected with copy loss CNVR. CONCLUSIONS: Our findings show that a significant proportion of the ROH islands in the bovine genome are artefacts due to CNV or SNP coverage gaps. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-018-0414-x) contains supplementary material, which is available to authorized users. BioMed Central 2018-08-22 /pmc/articles/PMC6106898/ /pubmed/30134820 http://dx.doi.org/10.1186/s12711-018-0414-x Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Nandolo, Wilson Utsunomiya, Yuri T. Mészáros, Gábor Wurzinger, Maria Khayadzadeh, Negar Torrecilha, Rafaela B. P. Mulindwa, Henry A. Gondwe, Timothy N. Waldmann, Patrik Ferenčaković, Maja Garcia, José F. Rosen, Benjamin D. Bickhart, Derek van Tassell, Curt P. Curik, Ino Sölkner, Johann Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances |
title | Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances |
title_full | Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances |
title_fullStr | Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances |
title_full_unstemmed | Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances |
title_short | Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances |
title_sort | misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6106898/ https://www.ncbi.nlm.nih.gov/pubmed/30134820 http://dx.doi.org/10.1186/s12711-018-0414-x |
work_keys_str_mv | AT nandolowilson misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT utsunomiyayurit misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT meszarosgabor misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT wurzingermaria misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT khayadzadehnegar misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT torrecilharafaelabp misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT mulindwahenrya misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT gondwetimothyn misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT waldmannpatrik misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT ferencakovicmaja misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT garciajosef misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT rosenbenjamind misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT bickhartderek misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT vantassellcurtp misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT curikino misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances AT solknerjohann misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances |