
Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual

In recent years, improved sequencing technology and computational tools have made de novo genome assembly more accessible. Many approaches, however, generate either an unphased or only partially resolved representation of a diploid genome, in which polymorphisms are detected but not assigned to one...


Bibliographic Details
Main Authors: Soifer, Ilya; Fong, Nicole L.; Yi, Nelda; Ireland, Andrea T.; Lam, Irene; Sooknah, Matthew; Paw, Jonathan S.; Peluso, Paul; Concepcion, Gregory T.; Rank, David; Hastie, Alex R.; Jojic, Vladimir; Ruby, J. Graham; Botstein, David; Roy, Margaret A.
Format: Online Article Text
Language: English
Published: Genetics Society of America 2020
Subjects: Genome Report
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7466960/
https://www.ncbi.nlm.nih.gov/pubmed/32631951
http://dx.doi.org/10.1534/g3.119.400995
_version_ 1783577922387312640
author Soifer, Ilya
Fong, Nicole L.
Yi, Nelda
Ireland, Andrea T.
Lam, Irene
Sooknah, Matthew
Paw, Jonathan S.
Peluso, Paul
Concepcion, Gregory T.
Rank, David
Hastie, Alex R.
Jojic, Vladimir
Ruby, J. Graham
Botstein, David
Roy, Margaret A.
author_sort Soifer, Ilya
collection PubMed
description In recent years, improved sequencing technology and computational tools have made de novo genome assembly more accessible. Many approaches, however, generate either an unphased or only partially resolved representation of a diploid genome, in which polymorphisms are detected but not assigned to one or the other of the homologous chromosomes. Yet chromosomal phase information is invaluable for the understanding of phenotypic trait inheritance in the cases of compound heterozygosity, allele-specific expression or cis-acting variants. Here we use a combination of tools and sequencing technologies to generate a de novo diploid assembly of the human primary cell line WI-38. First, data from PacBio single molecule sequencing and Bionano Genomics optical mapping were combined to generate an unphased assembly. Next, 10x Genomics linked reads were combined with the hybrid assembly to generate a partially phased assembly. Lastly, we developed and optimized methods to use short-read (Illumina) sequencing of flow cytometry-sorted metaphase chromosomes to provide phase information. The final genome assembly was almost fully (94%) phased with the addition of approximately 2.5-fold coverage of Illumina data from the sequenced metaphase chromosomes. The diploid nature of the final de novo genome assembly improved the resolution of structural variants between the WI-38 genome and the human reference genome. The phased WI-38 sequence data are available for browsing and download at wi38.research.calicolabs.com. Our work shows that assembling a completely phased diploid genome de novo from the DNA of a single individual is now readily achievable.
format Online
Article
Text
id pubmed-7466960
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-7466960 2020-09-14 Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual Soifer, Ilya Fong, Nicole L. Yi, Nelda Ireland, Andrea T. Lam, Irene Sooknah, Matthew Paw, Jonathan S. Peluso, Paul Concepcion, Gregory T. Rank, David Hastie, Alex R. Jojic, Vladimir Ruby, J. Graham Botstein, David Roy, Margaret A. G3 (Bethesda) Genome Report Genetics Society of America 2020-07-06 /pmc/articles/PMC7466960/ /pubmed/32631951 http://dx.doi.org/10.1534/g3.119.400995 Text en Copyright © 2020 Soifer et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Fully Phased Sequence of a Diploid Human Genome Determined de Novo from the DNA of a Single Individual
topic Genome Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7466960/
https://www.ncbi.nlm.nih.gov/pubmed/32631951
http://dx.doi.org/10.1534/g3.119.400995