The Diploid Genome Sequence of an Individual Human
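Main authors: | Levy, Samuel; Sutton, Granger; Ng, Pauline C; Feuk, Lars; Halpern, Aaron L; Walenz, Brian P; Axelrod, Nelson; Huang, Jiaqi; Kirkness, Ewen F; Denisov, Gennady; Lin, Yuan; MacDonald, Jeffrey R; Pang, Andy Wing Chun; Shago, Mary; Stockwell, Timothy B; Tsiamouri, Alexia; Bafna, Vineet; Bansal, Vikas; Kravitz, Saul A; Busam, Dana A; Beeson, Karen Y; McIntosh, Tina C; Remington, Karin A; Abril, Josep F; Gill, John; Borman, Jon; Rogers, Yu-Hui; Frazier, Marvin E; Scherer, Stephen W; Strausberg, Robert L; Venter, J. Craig |
---|---|
Format: | Text |
Language: | English |
Published: | Public Library of Science 2007 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1964779/ https://www.ncbi.nlm.nih.gov/pubmed/17803354 http://dx.doi.org/10.1371/journal.pbio.0050254 |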
_version_ | 1782134658219114496 |
---|---|
author | Levy, Samuel; Sutton, Granger; Ng, Pauline C; Feuk, Lars; Halpern, Aaron L; Walenz, Brian P; Axelrod, Nelson; Huang, Jiaqi; Kirkness, Ewen F; Denisov, Gennady; Lin, Yuan; MacDonald, Jeffrey R; Pang, Andy Wing Chun; Shago, Mary; Stockwell, Timothy B; Tsiamouri, Alexia; Bafna, Vineet; Bansal, Vikas; Kravitz, Saul A; Busam, Dana A; Beeson, Karen Y; McIntosh, Tina C; Remington, Karin A; Abril, Josep F; Gill, John; Borman, Jon; Rogers, Yu-Hui; Frazier, Marvin E; Scherer, Stephen W; Strausberg, Robert L; Venter, J. Craig |
author_facet | Levy, Samuel; Sutton, Granger; Ng, Pauline C; Feuk, Lars; Halpern, Aaron L; Walenz, Brian P; Axelrod, Nelson; Huang, Jiaqi; Kirkness, Ewen F; Denisov, Gennady; Lin, Yuan; MacDonald, Jeffrey R; Pang, Andy Wing Chun; Shago, Mary; Stockwell, Timothy B; Tsiamouri, Alexia; Bafna, Vineet; Bansal, Vikas; Kravitz, Saul A; Busam, Dana A; Beeson, Karen Y; McIntosh, Tina C; Remington, Karin A; Abril, Josep F; Gill, John; Borman, Jon; Rogers, Yu-Hui; Frazier, Marvin E; Scherer, Stephen W; Strausberg, Robert L; Venter, J. Craig |
author_sort | Levy, Samuel |
collection | PubMed |
description | Presented here is a genome sequence of an individual human. It was produced from ∼32 million random DNA fragments, sequenced by Sanger dideoxy technology and assembled into 4,528 scaffolds, comprising 2,810 million bases (Mb) of contiguous sequence with approximately 7.5-fold coverage for any given region. We developed a modified version of the Celera assembler to facilitate the identification and comparison of alternate alleles within this individual diploid genome. Comparison of this genome and the National Center for Biotechnology Information human reference assembly revealed more than 4.1 million DNA variants, encompassing 12.3 Mb. These variants (of which 1,288,319 were novel) included 3,213,401 single nucleotide polymorphisms (SNPs), 53,823 block substitutions (2–206 bp), 292,102 heterozygous insertion/deletion events (indels) (1–571 bp), 559,473 homozygous indels (1–82,711 bp), 90 inversions, as well as numerous segmental duplications and copy number variation regions. Non-SNP DNA variation accounts for 22% of all events identified in the donor; however, these events involve 74% of all variant bases. This suggests an important role for non-SNP genetic alterations in defining the diploid genome structure. Moreover, 44% of genes were heterozygous for one or more variants. Using a novel haplotype assembly strategy, we were able to span 1.5 Gb of genome sequence in segments >200 kb, providing further precision to the diploid nature of the genome. These data depict a definitive molecular portrait of a diploid human genome that provides a starting point for future genome comparisons and enables an era of individualized genomic information. |
format | Text |
id | pubmed-1964779 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-1964779 2007-10-27 The Diploid Genome Sequence of an Individual Human PLoS Biol Research Article Public Library of Science 2007-10 2007-09-04 /pmc/articles/PMC1964779/ /pubmed/17803354 http://dx.doi.org/10.1371/journal.pbio.0050254 Text en © 2007 Levy et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
title | The Diploid Genome Sequence of an Individual Human |
title_full | The Diploid Genome Sequence of an Individual Human |
title_fullStr | The Diploid Genome Sequence of an Individual Human |
title_full_unstemmed | The Diploid Genome Sequence of an Individual Human |
title_short | The Diploid Genome Sequence of an Individual Human |
title_sort | diploid genome sequence of an individual human |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1964779/ https://www.ncbi.nlm.nih.gov/pubmed/17803354 http://dx.doi.org/10.1371/journal.pbio.0050254 |
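
The percentages quoted in the description field can be cross-checked against the variant counts it lists. The sketch below is only an arithmetic check, not the authors' method. It uses the five event classes for which the record gives counts (segmental duplications and copy number variation regions are left out because no counts are given), and it assumes that each SNP contributes exactly one variant base, which the record does not state.

```python
# Variant event counts as quoted in the record's description field.
counts = {
    "snp": 3_213_401,
    "block_substitution": 53_823,
    "heterozygous_indel": 292_102,
    "homozygous_indel": 559_473,
    "inversion": 90,
}

# Share of events that are not SNPs (the description quotes 22%).
total_events = sum(counts.values())
non_snp_events = total_events - counts["snp"]
non_snp_event_share = non_snp_events / total_events

# Share of variant bases that are not SNP bases (the description quotes 74%).
# Assumption, not stated in the record: one variant base per SNP.
total_variant_bases = 12.3e6  # "encompassing 12.3 Mb"
snp_bases = counts["snp"]
non_snp_base_share = (total_variant_bases - snp_bases) / total_variant_bases

print(f"total events: {total_events:,}")
print(f"non-SNP share of events: {non_snp_event_share:.1%}")
print(f"non-SNP share of variant bases: {non_snp_base_share:.1%}")
```

Under these assumptions the listed events sum to just over 4.1 million, and both shares round to the figures quoted in the description.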