
Fine mapping of copy number variations on two cattle genome assemblies using high density SNP array

BACKGROUND: Btau_4.0 and UMD3.1 are two distinct cattle reference genome assemblies. In our previous study using the low density BovineSNP50 array, we reported a copy number variation (CNV) analysis on Btau_4.0 with 521 animals of 21 cattle breeds, yielding 682 CNV regions with a total length of 139...


Bibliographic Details
Main Authors: Hou, Yali, Bickhart, Derek M, Hvinden, Miranda L, Li, Congjun, Song, Jiuzhou, Boichard, Didier A, Fritz, Sébastien, Eggen, André, DeNise, Sue, Wiggans, George R, Sonstegard, Tad S, Van Tassell, Curtis P, Liu, George E
Format: Online Article Text
Language: English
Published: BioMed Central 2012
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3583728/
https://www.ncbi.nlm.nih.gov/pubmed/22866901
http://dx.doi.org/10.1186/1471-2164-13-376
author Hou, Yali
Bickhart, Derek M
Hvinden, Miranda L
Li, Congjun
Song, Jiuzhou
Boichard, Didier A
Fritz, Sébastien
Eggen, André
DeNise, Sue
Wiggans, George R
Sonstegard, Tad S
Van Tassell, Curtis P
Liu, George E
author_sort Hou, Yali
collection PubMed
description BACKGROUND: Btau_4.0 and UMD3.1 are two distinct cattle reference genome assemblies. In our previous study using the low density BovineSNP50 array, we reported a copy number variation (CNV) analysis on Btau_4.0 with 521 animals of 21 cattle breeds, yielding 682 CNV regions with a total length of 139.8 megabases. RESULTS: In this study using the high density BovineHD SNP array, we performed high resolution CNV analyses on both Btau_4.0 and UMD3.1 with 674 animals of 27 cattle breeds. We first compared CNV results derived from these two different SNP array platforms on Btau_4.0. With two thirds of the animals shared between studies, on Btau_4.0 we identified 3,346 candidate CNV regions representing 142.7 megabases (~4.70%) of the genome. With a similar total length but about 5 times as many events, the average CNV region (CNVR) length of the current Btau_4.0 dataset is significantly shorter than that of the previous one (42.7 kb vs. 205 kb). Although subsets of these two results overlapped, 64% (91.6 megabases) of the current dataset was not present in the previous study. We also performed similar analyses on UMD3.1 using these BovineHD SNP array results. Approximately 50% more and 20% longer CNVs were called on UMD3.1 as compared to those on Btau_4.0. However, a comparable set of CNVRs (3,438 regions with a total length of 146.9 megabases) was obtained. We suspect that these results are due to the UMD3.1 assembly's efforts to place unplaced contigs and remove unmerged alleles. Selected CNVs were further experimentally validated, achieving a 73% PCR validation rate, which is considerably higher than the previous validation rate. About 20-45% of CNV regions overlapped with cattle RefSeq genes and Ensembl genes. Panther and IPA analyses indicated that these genes participate in a wide spectrum of biological processes involving the immune system, lipid metabolism, and cell, organism and system development.
CONCLUSION: We present a comprehensive set of cattle CNVs at a higher resolution and sensitivity. We identified over 3,000 candidate CNV regions on both Btau_4.0 and UMD3.1, further compared the current datasets with previous results, and examined the impacts of genome assemblies on CNV calling.
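The summary figures in the abstract can be cross-checked with simple arithmetic. The sketch below is only an illustration and not part of the study's analysis. It recomputes the average CNV region length, the ratio of region counts, and the share of sequence not seen in the previous study, using the rounded totals quoted above. The variable and function names are hypothetical, and small differences from the published values are expected because the inputs are already rounded.

```python
# Rounded figures quoted in the abstract: (total length in megabases, region count).
# The dataset labels are illustrative names, not identifiers from the paper.
datasets = {
    "BovineSNP50 / Btau_4.0": (139.8, 682),
    "BovineHD / Btau_4.0": (142.7, 3346),
    "BovineHD / UMD3.1": (146.9, 3438),
}


def mean_cnvr_length_kb(total_mb, n_regions):
    """Average CNV region length in kilobases (1 Mb = 1000 kb)."""
    return total_mb * 1000.0 / n_regions


mean_kb = {
    name: mean_cnvr_length_kb(total_mb, n_regions)
    for name, (total_mb, n_regions) in datasets.items()
}

# Ratio of CNV region counts between the two Btau_4.0 datasets.
count_ratio = 3346 / 682

# Share of the current Btau_4.0 dataset (by length) absent from the previous study.
novel_fraction = 91.6 / 142.7

for name, kb in mean_kb.items():
    print(f"{name}: mean CNVR length ~ {kb:.1f} kb")
print(f"Region count ratio (HD vs. SNP50 on Btau_4.0) ~ {count_ratio:.1f}x")
print(f"Novel fraction of current Btau_4.0 dataset ~ {novel_fraction:.0%}")
```

Under these rounded inputs, the recomputed averages land close to the reported 205 kb and 42.7 kb, the count ratio is close to 5, and the novel share is close to 64%.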
format Online
Article
Text
id pubmed-3583728
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3583728 2013-02-28 BMC Genomics Research Article BioMed Central 2012-08-06 /pmc/articles/PMC3583728/ /pubmed/22866901 http://dx.doi.org/10.1186/1471-2164-13-376 Text en Copyright ©2012 Hou et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Fine mapping of copy number variations on two cattle genome assemblies using high density SNP array
topic Research Article