Cargando…
Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual
In recent years, improved sequencing technology and computational tools have made de novo genome assembly more accessible. Many approaches, however, generate either an unphased or only partially resolved representation of a diploid genome, in which polymorphisms are detected but not assigned to one...
Autores principales: | , , , , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7466960/ https://www.ncbi.nlm.nih.gov/pubmed/32631951 http://dx.doi.org/10.1534/g3.119.400995 |
_version_ | 1783577922387312640 |
---|---|
author | Soifer, llya Fong, Nicole L. Yi, Nelda Ireland, Andrea T. Lam, Irene Sooknah, Matthew Paw, Jonathan S. Peluso, Paul Concepcion, Gregory T. Rank, David Hastie, Alex R. Jojic, Vladimir Ruby, J. Graham Botstein, David Roy, Margaret A. |
author_facet | Soifer, llya Fong, Nicole L. Yi, Nelda Ireland, Andrea T. Lam, Irene Sooknah, Matthew Paw, Jonathan S. Peluso, Paul Concepcion, Gregory T. Rank, David Hastie, Alex R. Jojic, Vladimir Ruby, J. Graham Botstein, David Roy, Margaret A. |
author_sort | Soifer, llya |
collection | PubMed |
description | In recent years, improved sequencing technology and computational tools have made de novo genome assembly more accessible. Many approaches, however, generate either an unphased or only partially resolved representation of a diploid genome, in which polymorphisms are detected but not assigned to one or the other of the homologous chromosomes. Yet chromosomal phase information is invaluable for the understanding of phenotypic trait inheritance in the cases of compound heterozygosity, allele-specific expression or cis-acting variants. Here we use a combination of tools and sequencing technologies to generate a de novo diploid assembly of the human primary cell line WI-38. First, data from PacBio single molecule sequencing and Bionano Genomics optical mapping were combined to generate an unphased assembly. Next, 10x Genomics linked reads were combined with the hybrid assembly to generate a partially phased assembly. Lastly, we developed and optimized methods to use short-read (Illumina) sequencing of flow cytometry-sorted metaphase chromosomes to provide phase information. The final genome assembly was almost fully (94%) phased with the addition of approximately 2.5-fold coverage of Illumina data from the sequenced metaphase chromosomes. The diploid nature of the final de novo genome assembly improved the resolution of structural variants between the WI-38 genome and the human reference genome. The phased WI-38 sequence data are available for browsing and download at wi38.research.calicolabs.com. Our work shows that assembling a completely phased diploid genome de novo from the DNA of a single individual is now readily achievable. |
format | Online Article Text |
id | pubmed-7466960 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-74669602020-09-14 Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual Soifer, llya Fong, Nicole L. Yi, Nelda Ireland, Andrea T. Lam, Irene Sooknah, Matthew Paw, Jonathan S. Peluso, Paul Concepcion, Gregory T. Rank, David Hastie, Alex R. Jojic, Vladimir Ruby, J. Graham Botstein, David Roy, Margaret A. G3 (Bethesda) Genome Report In recent years, improved sequencing technology and computational tools have made de novo genome assembly more accessible. Many approaches, however, generate either an unphased or only partially resolved representation of a diploid genome, in which polymorphisms are detected but not assigned to one or the other of the homologous chromosomes. Yet chromosomal phase information is invaluable for the understanding of phenotypic trait inheritance in the cases of compound heterozygosity, allele-specific expression or cis-acting variants. Here we use a combination of tools and sequencing technologies to generate a de novo diploid assembly of the human primary cell line WI-38. First, data from PacBio single molecule sequencing and Bionano Genomics optical mapping were combined to generate an unphased assembly. Next, 10x Genomics linked reads were combined with the hybrid assembly to generate a partially phased assembly. Lastly, we developed and optimized methods to use short-read (Illumina) sequencing of flow cytometry-sorted metaphase chromosomes to provide phase information. The final genome assembly was almost fully (94%) phased with the addition of approximately 2.5-fold coverage of Illumina data from the sequenced metaphase chromosomes. The diploid nature of the final de novo genome assembly improved the resolution of structural variants between the WI-38 genome and the human reference genome. The phased WI-38 sequence data are available for browsing and download at wi38.research.calicolabs.com. Our work shows that assembling a completely phased diploid genome de novo from the DNA of a single individual is now readily achievable. Genetics Society of America 2020-07-06 /pmc/articles/PMC7466960/ /pubmed/32631951 http://dx.doi.org/10.1534/g3.119.400995 Text en Copyright © 2020 Soifer et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Genome Report Soifer, llya Fong, Nicole L. Yi, Nelda Ireland, Andrea T. Lam, Irene Sooknah, Matthew Paw, Jonathan S. Peluso, Paul Concepcion, Gregory T. Rank, David Hastie, Alex R. Jojic, Vladimir Ruby, J. Graham Botstein, David Roy, Margaret A. Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual |
title | Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual |
title_full | Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual |
title_fullStr | Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual |
title_full_unstemmed | Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual |
title_short | Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual |
title_sort | fully phased sequence of a diploid human genome determined de novo from the dna of a single individual |
topic | Genome Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7466960/ https://www.ncbi.nlm.nih.gov/pubmed/32631951 http://dx.doi.org/10.1534/g3.119.400995 |
work_keys_str_mv | AT soiferllya fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT fongnicolel fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT yinelda fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT irelandandreat fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT lamirene fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT sooknahmatthew fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT pawjonathans fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT pelusopaul fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT concepciongregoryt fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT rankdavid fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT hastiealexr fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT jojicvladimir fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT rubyjgraham fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT botsteindavid fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual AT roymargareta fullyphasedsequenceofadiploidhumangenomedetermineddenovofromthednaofasingleindividual |