Cargando…

Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping

BACKGROUND: Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. RESULTS: We report the integration of the whole genome...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhan, Bujie, Fadista, João, Thomsen, Bo, Hedegaard, Jakob, Panitz, Frank, Bendixen, Christian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3248099/
https://www.ncbi.nlm.nih.gov/pubmed/22082336
http://dx.doi.org/10.1186/1471-2164-12-557
_version_ 1782220194592063488
author Zhan, Bujie
Fadista, João
Thomsen, Bo
Hedegaard, Jakob
Panitz, Frank
Bendixen, Christian
author_facet Zhan, Bujie
Fadista, João
Thomsen, Bo
Hedegaard, Jakob
Panitz, Frank
Bendixen, Christian
author_sort Zhan, Bujie
collection PubMed
description BACKGROUND: Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. RESULTS: We report the integration of the whole genome sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were identified to be either in identity by descent (IBD) or in copy number variation (CNV) with results from SNP array genotyping. Coding insertions and deletions (indels) were found to be enriched for size in multiples of 3 and were located near the N- and C-termini of proteins. For larger indels, a combination of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. CONCLUSIONS: Our results provide high resolution mapping of diverse classes of genomic variation in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found to be instrumental for calculating resequencing SNP accuracy, while SNP detection within CNVs tended to be less reliable. CNV discovery was affected dramatically by platform resolution and coverage biases. The combined data for this study showed that at a moderate level of sequencing coverage, an ensemble of platforms and tools can be applied together to maximize the accurate detection of sequence and structural variants.
format Online
Article
Text
id pubmed-3248099
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32480992011-12-30 Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping Zhan, Bujie Fadista, João Thomsen, Bo Hedegaard, Jakob Panitz, Frank Bendixen, Christian BMC Genomics Research Article BACKGROUND: Integration of genomic variation with phenotypic information is an effective approach for uncovering genotype-phenotype associations. This requires an accurate identification of the different types of variation in individual genomes. RESULTS: We report the integration of the whole genome sequence of a single Holstein Friesian bull with data from single nucleotide polymorphism (SNP) and comparative genomic hybridization (CGH) array technologies to determine a comprehensive spectrum of genomic variation. The performance of resequencing SNP detection was assessed by combining SNPs that were identified to be either in identity by descent (IBD) or in copy number variation (CNV) with results from SNP array genotyping. Coding insertions and deletions (indels) were found to be enriched for size in multiples of 3 and were located near the N- and C-termini of proteins. For larger indels, a combination of split-read and read-pair approaches proved to be complementary in finding different signatures. CNVs were identified on the basis of the depth of sequenced reads, and by using SNP and CGH arrays. CONCLUSIONS: Our results provide high resolution mapping of diverse classes of genomic variation in an individual bovine genome and demonstrate that structural variation surpasses sequence variation as the main component of genomic variability. Better accuracy of SNP detection was achieved with little loss of sensitivity when algorithms that implemented mapping quality were used. IBD regions were found to be instrumental for calculating resequencing SNP accuracy, while SNP detection within CNVs tended to be less reliable. CNV discovery was affected dramatically by platform resolution and coverage biases. The combined data for this study showed that at a moderate level of sequencing coverage, an ensemble of platforms and tools can be applied together to maximize the accurate detection of sequence and structural variants. BioMed Central 2011-11-14 /pmc/articles/PMC3248099/ /pubmed/22082336 http://dx.doi.org/10.1186/1471-2164-12-557 Text en Copyright ©2011 Zhan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhan, Bujie
Fadista, João
Thomsen, Bo
Hedegaard, Jakob
Panitz, Frank
Bendixen, Christian
Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping
title Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping
title_full Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping
title_fullStr Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping
title_full_unstemmed Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping
title_short Global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping
title_sort global assessment of genomic variation in cattle by genome resequencing and high-throughput genotyping
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3248099/
https://www.ncbi.nlm.nih.gov/pubmed/22082336
http://dx.doi.org/10.1186/1471-2164-12-557
work_keys_str_mv AT zhanbujie globalassessmentofgenomicvariationincattlebygenomeresequencingandhighthroughputgenotyping
AT fadistajoao globalassessmentofgenomicvariationincattlebygenomeresequencingandhighthroughputgenotyping
AT thomsenbo globalassessmentofgenomicvariationincattlebygenomeresequencingandhighthroughputgenotyping
AT hedegaardjakob globalassessmentofgenomicvariationincattlebygenomeresequencingandhighthroughputgenotyping
AT panitzfrank globalassessmentofgenomicvariationincattlebygenomeresequencingandhighthroughputgenotyping
AT bendixenchristian globalassessmentofgenomicvariationincattlebygenomeresequencingandhighthroughputgenotyping