Cargando…

Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562

K562 is widely used in biomedical research. It is one of three tier-one cell lines of ENCODE and also most commonly used for large-scale CRISPR/Cas9 screens. Although its functional genomic and epigenomic characteristics have been extensively studied, its genome sequence and genomic structural featu...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Bo, Ho, Steve S., Greer, Stephanie U., Zhu, Xiaowei, Bell, John M., Arthur, Joseph G., Spies, Noah, Zhang, Xianglong, Byeon, Seunggyu, Pattni, Reenal, Ben-Efraim, Noa, Haney, Michael S., Haraksingh, Rajini R., Song, Giltae, Ji, Hanlee P., Perrin, Dimitri, Wong, Wing H., Abyzov, Alexej, Urban, Alexander E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6396411/
https://www.ncbi.nlm.nih.gov/pubmed/30737237
http://dx.doi.org/10.1101/gr.234948.118
_version_ 1783399244796788736
author Zhou, Bo
Ho, Steve S.
Greer, Stephanie U.
Zhu, Xiaowei
Bell, John M.
Arthur, Joseph G.
Spies, Noah
Zhang, Xianglong
Byeon, Seunggyu
Pattni, Reenal
Ben-Efraim, Noa
Haney, Michael S.
Haraksingh, Rajini R.
Song, Giltae
Ji, Hanlee P.
Perrin, Dimitri
Wong, Wing H.
Abyzov, Alexej
Urban, Alexander E.
author_facet Zhou, Bo
Ho, Steve S.
Greer, Stephanie U.
Zhu, Xiaowei
Bell, John M.
Arthur, Joseph G.
Spies, Noah
Zhang, Xianglong
Byeon, Seunggyu
Pattni, Reenal
Ben-Efraim, Noa
Haney, Michael S.
Haraksingh, Rajini R.
Song, Giltae
Ji, Hanlee P.
Perrin, Dimitri
Wong, Wing H.
Abyzov, Alexej
Urban, Alexander E.
author_sort Zhou, Bo
collection PubMed
description K562 is widely used in biomedical research. It is one of three tier-one cell lines of ENCODE and also most commonly used for large-scale CRISPR/Cas9 screens. Although its functional genomic and epigenomic characteristics have been extensively studied, its genome sequence and genomic structural features have never been comprehensively analyzed. Such information is essential for the correct interpretation and understanding of the vast troves of existing functional genomics and epigenomics data for K562. We performed and integrated deep-coverage whole-genome (short-insert), mate-pair, and linked-read sequencing as well as karyotyping and array CGH analysis to identify a wide spectrum of genome characteristics in K562: copy numbers (CN) of aneuploid chromosome segments at high-resolution, SNVs and indels (both corrected for CN in aneuploid regions), loss of heterozygosity, megabase-scale phased haplotypes often spanning entire chromosome arms, structural variants (SVs), including small and large-scale complex SVs and nonreference retrotransposon insertions. Many SVs were phased, assembled, and experimentally validated. We identified multiple allele-specific deletions and duplications within the tumor suppressor gene FHIT. Taking aneuploidy into account, we reanalyzed K562 RNA-seq and whole-genome bisulfite sequencing data for allele-specific expression and allele-specific DNA methylation. We also show examples of how deeper insights into regulatory complexity are gained by integrating genomic variant information and structural context with functional genomics and epigenomics data. Furthermore, using K562 haplotype information, we produced an allele-specific CRISPR targeting map. This comprehensive whole-genome analysis serves as a resource for future studies that utilize K562 as well as a framework for the analysis of other cancer genomes.
format Online
Article
Text
id pubmed-6396411
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Cold Spring Harbor Laboratory Press
record_format MEDLINE/PubMed
spelling pubmed-63964112019-09-01 Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562 Zhou, Bo Ho, Steve S. Greer, Stephanie U. Zhu, Xiaowei Bell, John M. Arthur, Joseph G. Spies, Noah Zhang, Xianglong Byeon, Seunggyu Pattni, Reenal Ben-Efraim, Noa Haney, Michael S. Haraksingh, Rajini R. Song, Giltae Ji, Hanlee P. Perrin, Dimitri Wong, Wing H. Abyzov, Alexej Urban, Alexander E. Genome Res Resource K562 is widely used in biomedical research. It is one of three tier-one cell lines of ENCODE and also most commonly used for large-scale CRISPR/Cas9 screens. Although its functional genomic and epigenomic characteristics have been extensively studied, its genome sequence and genomic structural features have never been comprehensively analyzed. Such information is essential for the correct interpretation and understanding of the vast troves of existing functional genomics and epigenomics data for K562. We performed and integrated deep-coverage whole-genome (short-insert), mate-pair, and linked-read sequencing as well as karyotyping and array CGH analysis to identify a wide spectrum of genome characteristics in K562: copy numbers (CN) of aneuploid chromosome segments at high-resolution, SNVs and indels (both corrected for CN in aneuploid regions), loss of heterozygosity, megabase-scale phased haplotypes often spanning entire chromosome arms, structural variants (SVs), including small and large-scale complex SVs and nonreference retrotransposon insertions. Many SVs were phased, assembled, and experimentally validated. We identified multiple allele-specific deletions and duplications within the tumor suppressor gene FHIT. Taking aneuploidy into account, we reanalyzed K562 RNA-seq and whole-genome bisulfite sequencing data for allele-specific expression and allele-specific DNA methylation. We also show examples of how deeper insights into regulatory complexity are gained by integrating genomic variant information and structural context with functional genomics and epigenomics data. Furthermore, using K562 haplotype information, we produced an allele-specific CRISPR targeting map. This comprehensive whole-genome analysis serves as a resource for future studies that utilize K562 as well as a framework for the analysis of other cancer genomes. Cold Spring Harbor Laboratory Press 2019-03 /pmc/articles/PMC6396411/ /pubmed/30737237 http://dx.doi.org/10.1101/gr.234948.118 Text en © 2019 Zhou et al.; Published by Cold Spring Harbor Laboratory Press http://creativecommons.org/licenses/by-nc/4.0/ This article is distributed exclusively by Cold Spring Harbor Laboratory Press for the first six months after the full-issue publication date (see http://genome.cshlp.org/site/misc/terms.xhtml). After six months, it is available under a Creative Commons License (Attribution-NonCommercial 4.0 International), as described at http://creativecommons.org/licenses/by-nc/4.0/.
spellingShingle Resource
Zhou, Bo
Ho, Steve S.
Greer, Stephanie U.
Zhu, Xiaowei
Bell, John M.
Arthur, Joseph G.
Spies, Noah
Zhang, Xianglong
Byeon, Seunggyu
Pattni, Reenal
Ben-Efraim, Noa
Haney, Michael S.
Haraksingh, Rajini R.
Song, Giltae
Ji, Hanlee P.
Perrin, Dimitri
Wong, Wing H.
Abyzov, Alexej
Urban, Alexander E.
Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562
title Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562
title_full Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562
title_fullStr Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562
title_full_unstemmed Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562
title_short Comprehensive, integrated, and phased whole-genome analysis of the primary ENCODE cell line K562
title_sort comprehensive, integrated, and phased whole-genome analysis of the primary encode cell line k562
topic Resource
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6396411/
https://www.ncbi.nlm.nih.gov/pubmed/30737237
http://dx.doi.org/10.1101/gr.234948.118
work_keys_str_mv AT zhoubo comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT hosteves comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT greerstephanieu comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT zhuxiaowei comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT belljohnm comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT arthurjosephg comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT spiesnoah comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT zhangxianglong comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT byeonseunggyu comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT pattnireenal comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT benefraimnoa comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT haneymichaels comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT haraksinghrajinir comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT songgiltae comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT jihanleep comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT perrindimitri comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT wongwingh comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT abyzovalexej comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562
AT urbanalexandere comprehensiveintegratedandphasedwholegenomeanalysisoftheprimaryencodecelllinek562