Cargando…

Towards a comprehensive structural variation map of an individual human genome

BACKGROUND: Several genomes have now been sequenced, with millions of genetic variants annotated. While significant progress has been made in mapping single nucleotide polymorphisms (SNPs) and small (<10 bp) insertion/deletions (indels), the annotation of larger structural variants has been less...

Descripción completa

Detalles Bibliográficos
Autores principales: Pang, Andy W, MacDonald, Jeffrey R, Pinto, Dalila, Wei, John, Rafiq, Muhammad A, Conrad, Donald F, Park, Hansoo, Hurles, Matthew E, Lee, Charles, Venter, J Craig, Kirkness, Ewen F, Levy, Samuel, Feuk, Lars, Scherer, Stephen W
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2898065/
https://www.ncbi.nlm.nih.gov/pubmed/20482838
http://dx.doi.org/10.1186/gb-2010-11-5-r52
_version_ 1782183468343492608
author Pang, Andy W
MacDonald, Jeffrey R
Pinto, Dalila
Wei, John
Rafiq, Muhammad A
Conrad, Donald F
Park, Hansoo
Hurles, Matthew E
Lee, Charles
Venter, J Craig
Kirkness, Ewen F
Levy, Samuel
Feuk, Lars
Scherer, Stephen W
author_facet Pang, Andy W
MacDonald, Jeffrey R
Pinto, Dalila
Wei, John
Rafiq, Muhammad A
Conrad, Donald F
Park, Hansoo
Hurles, Matthew E
Lee, Charles
Venter, J Craig
Kirkness, Ewen F
Levy, Samuel
Feuk, Lars
Scherer, Stephen W
author_sort Pang, Andy W
collection PubMed
description BACKGROUND: Several genomes have now been sequenced, with millions of genetic variants annotated. While significant progress has been made in mapping single nucleotide polymorphisms (SNPs) and small (<10 bp) insertion/deletions (indels), the annotation of larger structural variants has been less comprehensive. It is still unclear to what extent a typical genome differs from the reference assembly, and the analysis of the genomes sequenced to date have shown varying results for copy number variation (CNV) and inversions. RESULTS: We have combined computational re-analysis of existing whole genome sequence data with novel microarray-based analysis, and detect 12,178 structural variants covering 40.6 Mb that were not reported in the initial sequencing of the first published personal genome. We estimate a total non-SNP variation content of 48.8 Mb in a single genome. Our results indicate that this genome differs from the consensus reference sequence by approximately 1.2% when considering indels/CNVs, 0.1% by SNPs and approximately 0.3% by inversions. The structural variants impact 4,867 genes, and >24% of structural variants would not be imputed by SNP-association. CONCLUSIONS: Our results indicate that a large number of structural variants have been unreported in the individual genomes published to date. This significant extent and complexity of structural variants, as well as the growing recognition of their medical relevance, necessitate they be actively studied in health-related analyses of personal genomes. The new catalogue of structural variants generated for this genome provides a crucial resource for future comparison studies.
format Text
id pubmed-2898065
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28980652010-07-07 Towards a comprehensive structural variation map of an individual human genome Pang, Andy W MacDonald, Jeffrey R Pinto, Dalila Wei, John Rafiq, Muhammad A Conrad, Donald F Park, Hansoo Hurles, Matthew E Lee, Charles Venter, J Craig Kirkness, Ewen F Levy, Samuel Feuk, Lars Scherer, Stephen W Genome Biol Research BACKGROUND: Several genomes have now been sequenced, with millions of genetic variants annotated. While significant progress has been made in mapping single nucleotide polymorphisms (SNPs) and small (<10 bp) insertion/deletions (indels), the annotation of larger structural variants has been less comprehensive. It is still unclear to what extent a typical genome differs from the reference assembly, and the analysis of the genomes sequenced to date have shown varying results for copy number variation (CNV) and inversions. RESULTS: We have combined computational re-analysis of existing whole genome sequence data with novel microarray-based analysis, and detect 12,178 structural variants covering 40.6 Mb that were not reported in the initial sequencing of the first published personal genome. We estimate a total non-SNP variation content of 48.8 Mb in a single genome. Our results indicate that this genome differs from the consensus reference sequence by approximately 1.2% when considering indels/CNVs, 0.1% by SNPs and approximately 0.3% by inversions. The structural variants impact 4,867 genes, and >24% of structural variants would not be imputed by SNP-association. CONCLUSIONS: Our results indicate that a large number of structural variants have been unreported in the individual genomes published to date. This significant extent and complexity of structural variants, as well as the growing recognition of their medical relevance, necessitate they be actively studied in health-related analyses of personal genomes. The new catalogue of structural variants generated for this genome provides a crucial resource for future comparison studies. BioMed Central 2010 2010-05-19 /pmc/articles/PMC2898065/ /pubmed/20482838 http://dx.doi.org/10.1186/gb-2010-11-5-r52 Text en Copyright ©2010 Pang et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research
Pang, Andy W
MacDonald, Jeffrey R
Pinto, Dalila
Wei, John
Rafiq, Muhammad A
Conrad, Donald F
Park, Hansoo
Hurles, Matthew E
Lee, Charles
Venter, J Craig
Kirkness, Ewen F
Levy, Samuel
Feuk, Lars
Scherer, Stephen W
Towards a comprehensive structural variation map of an individual human genome
title Towards a comprehensive structural variation map of an individual human genome
title_full Towards a comprehensive structural variation map of an individual human genome
title_fullStr Towards a comprehensive structural variation map of an individual human genome
title_full_unstemmed Towards a comprehensive structural variation map of an individual human genome
title_short Towards a comprehensive structural variation map of an individual human genome
title_sort towards a comprehensive structural variation map of an individual human genome
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2898065/
https://www.ncbi.nlm.nih.gov/pubmed/20482838
http://dx.doi.org/10.1186/gb-2010-11-5-r52
work_keys_str_mv AT pangandyw towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT macdonaldjeffreyr towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT pintodalila towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT weijohn towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT rafiqmuhammada towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT conraddonaldf towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT parkhansoo towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT hurlesmatthewe towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT leecharles towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT venterjcraig towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT kirknessewenf towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT levysamuel towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT feuklars towardsacomprehensivestructuralvariationmapofanindividualhumangenome
AT schererstephenw towardsacomprehensivestructuralvariationmapofanindividualhumangenome