Cargando…

Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances

BACKGROUND: Runs of homozygosity (ROH) islands are stretches of homozygous sequence in the genome of a large proportion of individuals in a population. Algorithms for the detection of ROH depend on the similarity of haplotypes. Coverage gaps and copy number variants (CNV) may result in incorrect ide...

Descripción completa

Detalles Bibliográficos
Autores principales: Nandolo, Wilson, Utsunomiya, Yuri T., Mészáros, Gábor, Wurzinger, Maria, Khayadzadeh, Negar, Torrecilha, Rafaela B. P., Mulindwa, Henry A., Gondwe, Timothy N., Waldmann, Patrik, Ferenčaković, Maja, Garcia, José F., Rosen, Benjamin D., Bickhart, Derek, van Tassell, Curt P., Curik, Ino, Sölkner, Johann
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6106898/
https://www.ncbi.nlm.nih.gov/pubmed/30134820
http://dx.doi.org/10.1186/s12711-018-0414-x
_version_ 1783349866842292224
author Nandolo, Wilson
Utsunomiya, Yuri T.
Mészáros, Gábor
Wurzinger, Maria
Khayadzadeh, Negar
Torrecilha, Rafaela B. P.
Mulindwa, Henry A.
Gondwe, Timothy N.
Waldmann, Patrik
Ferenčaković, Maja
Garcia, José F.
Rosen, Benjamin D.
Bickhart, Derek
van Tassell, Curt P.
Curik, Ino
Sölkner, Johann
author_facet Nandolo, Wilson
Utsunomiya, Yuri T.
Mészáros, Gábor
Wurzinger, Maria
Khayadzadeh, Negar
Torrecilha, Rafaela B. P.
Mulindwa, Henry A.
Gondwe, Timothy N.
Waldmann, Patrik
Ferenčaković, Maja
Garcia, José F.
Rosen, Benjamin D.
Bickhart, Derek
van Tassell, Curt P.
Curik, Ino
Sölkner, Johann
author_sort Nandolo, Wilson
collection PubMed
description BACKGROUND: Runs of homozygosity (ROH) islands are stretches of homozygous sequence in the genome of a large proportion of individuals in a population. Algorithms for the detection of ROH depend on the similarity of haplotypes. Coverage gaps and copy number variants (CNV) may result in incorrect identification of such similarity, leading to the detection of ROH islands where none exists. Misidentified hemizygous regions will also appear as homozygous based on sequence variation alone. Our aim was to identify ROH islands influenced by marker coverage gaps or CNV, using Illumina BovineHD BeadChip (777 K) single nucleotide polymorphism (SNP) data for Austrian Brown Swiss, Tyrol Grey and Pinzgauer cattle. METHODS: ROH were detected using clustering, and ROH islands were determined from population inbreeding levels for each marker. CNV were detected using a multivariate copy number analysis method and a hidden Markov model. SNP coverage gaps were defined as genomic regions with intermarker distances on average longer than 9.24 kb. ROH islands that overlapped CNV regions (CNVR) or SNP coverage gaps were considered as potential artefacts. Permutation tests were used to determine if overlaps between CNVR with copy losses and ROH islands were due to chance. Diversity of the haplotypes in the ROH islands was assessed by haplotype analyses. RESULTS: In Brown Swiss, Tyrol Grey and Pinzgauer, we identified 13, 22, and 24 ROH islands covering 26.6, 389.0 and 35.8 Mb, respectively, and we detected 30, 50 and 71 CNVR derived from CNV by using both algorithms, respectively. Overlaps between ROH islands, CNVR or coverage gaps occurred for 7, 14 and 16 ROH islands, respectively. About 37, 44 and 52% of the ROH islands coverage in Brown Swiss, Tyrol Grey and Pinzgauer, respectively, were affected by copy loss. Intersections between ROH islands and CNVR were small, but significantly larger compared to ROH islands at random locations across the genome, implying an association between ROH islands and CNVR. Haplotype diversity for reliable ROH islands was lower than for ROH islands that intersected with copy loss CNVR. CONCLUSIONS: Our findings show that a significant proportion of the ROH islands in the bovine genome are artefacts due to CNV or SNP coverage gaps. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-018-0414-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6106898
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-61068982018-08-29 Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances Nandolo, Wilson Utsunomiya, Yuri T. Mészáros, Gábor Wurzinger, Maria Khayadzadeh, Negar Torrecilha, Rafaela B. P. Mulindwa, Henry A. Gondwe, Timothy N. Waldmann, Patrik Ferenčaković, Maja Garcia, José F. Rosen, Benjamin D. Bickhart, Derek van Tassell, Curt P. Curik, Ino Sölkner, Johann Genet Sel Evol Research Article BACKGROUND: Runs of homozygosity (ROH) islands are stretches of homozygous sequence in the genome of a large proportion of individuals in a population. Algorithms for the detection of ROH depend on the similarity of haplotypes. Coverage gaps and copy number variants (CNV) may result in incorrect identification of such similarity, leading to the detection of ROH islands where none exists. Misidentified hemizygous regions will also appear as homozygous based on sequence variation alone. Our aim was to identify ROH islands influenced by marker coverage gaps or CNV, using Illumina BovineHD BeadChip (777 K) single nucleotide polymorphism (SNP) data for Austrian Brown Swiss, Tyrol Grey and Pinzgauer cattle. METHODS: ROH were detected using clustering, and ROH islands were determined from population inbreeding levels for each marker. CNV were detected using a multivariate copy number analysis method and a hidden Markov model. SNP coverage gaps were defined as genomic regions with intermarker distances on average longer than 9.24 kb. ROH islands that overlapped CNV regions (CNVR) or SNP coverage gaps were considered as potential artefacts. Permutation tests were used to determine if overlaps between CNVR with copy losses and ROH islands were due to chance. Diversity of the haplotypes in the ROH islands was assessed by haplotype analyses. RESULTS: In Brown Swiss, Tyrol Grey and Pinzgauer, we identified 13, 22, and 24 ROH islands covering 26.6, 389.0 and 35.8 Mb, respectively, and we detected 30, 50 and 71 CNVR derived from CNV by using both algorithms, respectively. Overlaps between ROH islands, CNVR or coverage gaps occurred for 7, 14 and 16 ROH islands, respectively. About 37, 44 and 52% of the ROH islands coverage in Brown Swiss, Tyrol Grey and Pinzgauer, respectively, were affected by copy loss. Intersections between ROH islands and CNVR were small, but significantly larger compared to ROH islands at random locations across the genome, implying an association between ROH islands and CNVR. Haplotype diversity for reliable ROH islands was lower than for ROH islands that intersected with copy loss CNVR. CONCLUSIONS: Our findings show that a significant proportion of the ROH islands in the bovine genome are artefacts due to CNV or SNP coverage gaps. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12711-018-0414-x) contains supplementary material, which is available to authorized users. BioMed Central 2018-08-22 /pmc/articles/PMC6106898/ /pubmed/30134820 http://dx.doi.org/10.1186/s12711-018-0414-x Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Nandolo, Wilson
Utsunomiya, Yuri T.
Mészáros, Gábor
Wurzinger, Maria
Khayadzadeh, Negar
Torrecilha, Rafaela B. P.
Mulindwa, Henry A.
Gondwe, Timothy N.
Waldmann, Patrik
Ferenčaković, Maja
Garcia, José F.
Rosen, Benjamin D.
Bickhart, Derek
van Tassell, Curt P.
Curik, Ino
Sölkner, Johann
Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances
title Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances
title_full Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances
title_fullStr Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances
title_full_unstemmed Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances
title_short Misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances
title_sort misidentification of runs of homozygosity islands in cattle caused by interference with copy number variation or large intermarker distances
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6106898/
https://www.ncbi.nlm.nih.gov/pubmed/30134820
http://dx.doi.org/10.1186/s12711-018-0414-x
work_keys_str_mv AT nandolowilson misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT utsunomiyayurit misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT meszarosgabor misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT wurzingermaria misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT khayadzadehnegar misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT torrecilharafaelabp misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT mulindwahenrya misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT gondwetimothyn misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT waldmannpatrik misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT ferencakovicmaja misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT garciajosef misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT rosenbenjamind misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT bickhartderek misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT vantassellcurtp misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT curikino misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances
AT solknerjohann misidentificationofrunsofhomozygosityislandsincattlecausedbyinterferencewithcopynumbervariationorlargeintermarkerdistances