Cargando…

U87MG Decoded: The Genomic Sequence of a Cytogenetically Aberrant Human Cancer Cell Line

U87MG is a commonly studied grade IV glioma cell line that has been analyzed in at least 1,700 publications over four decades. In order to comprehensively characterize the genome of this cell line and to serve as a model of broad cancer genome sequencing, we have generated greater than 30× genomic s...

Descripción completa

Detalles Bibliográficos
Autores principales: Clark, Michael James, Homer, Nils, O'Connor, Brian D., Chen, Zugen, Eskin, Ascia, Lee, Hane, Merriman, Barry, Nelson, Stanley F.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2813426/
https://www.ncbi.nlm.nih.gov/pubmed/20126413
http://dx.doi.org/10.1371/journal.pgen.1000832
_version_ 1782176913657167872
author Clark, Michael James
Homer, Nils
O'Connor, Brian D.
Chen, Zugen
Eskin, Ascia
Lee, Hane
Merriman, Barry
Nelson, Stanley F.
author_facet Clark, Michael James
Homer, Nils
O'Connor, Brian D.
Chen, Zugen
Eskin, Ascia
Lee, Hane
Merriman, Barry
Nelson, Stanley F.
author_sort Clark, Michael James
collection PubMed
description U87MG is a commonly studied grade IV glioma cell line that has been analyzed in at least 1,700 publications over four decades. In order to comprehensively characterize the genome of this cell line and to serve as a model of broad cancer genome sequencing, we have generated greater than 30× genomic sequence coverage using a novel 50-base mate paired strategy with a 1.4kb mean insert library. A total of 1,014,984,286 mate-end and 120,691,623 single-end two-base encoded reads were generated from five slides. All data were aligned using a custom designed tool called BFAST, allowing optimal color space read alignment and accurate identification of DNA variants. The aligned sequence reads and mate-pair information identified 35 interchromosomal translocation events, 1,315 structural variations (>100 bp), 191,743 small (<21 bp) insertions and deletions (indels), and 2,384,470 single nucleotide variations (SNVs). Among these observations, the known homozygous mutation in PTEN was robustly identified, and genes involved in cell adhesion were overrepresented in the mutated gene list. Data were compared to 219,187 heterozygous single nucleotide polymorphisms assayed by Illumina 1M Duo genotyping array to assess accuracy: 93.83% of all SNPs were reliably detected at filtering thresholds that yield greater than 99.99% sequence accuracy. Protein coding sequences were disrupted predominantly in this cancer cell line due to small indels, large deletions, and translocations. In total, 512 genes were homozygously mutated, including 154 by SNVs, 178 by small indels, 145 by large microdeletions, and 35 by interchromosomal translocations to reveal a highly mutated cell line genome. Of the small homozygously mutated variants, 8 SNVs and 99 indels were novel events not present in dbSNP. These data demonstrate that routine generation of broad cancer genome sequence is possible outside of genome centers. The sequence analysis of U87MG provides an unparalleled level of mutational resolution compared to any cell line to date.
format Text
id pubmed-2813426
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-28134262010-02-02 U87MG Decoded: The Genomic Sequence of a Cytogenetically Aberrant Human Cancer Cell Line Clark, Michael James Homer, Nils O'Connor, Brian D. Chen, Zugen Eskin, Ascia Lee, Hane Merriman, Barry Nelson, Stanley F. PLoS Genet Research Article U87MG is a commonly studied grade IV glioma cell line that has been analyzed in at least 1,700 publications over four decades. In order to comprehensively characterize the genome of this cell line and to serve as a model of broad cancer genome sequencing, we have generated greater than 30× genomic sequence coverage using a novel 50-base mate paired strategy with a 1.4kb mean insert library. A total of 1,014,984,286 mate-end and 120,691,623 single-end two-base encoded reads were generated from five slides. All data were aligned using a custom designed tool called BFAST, allowing optimal color space read alignment and accurate identification of DNA variants. The aligned sequence reads and mate-pair information identified 35 interchromosomal translocation events, 1,315 structural variations (>100 bp), 191,743 small (<21 bp) insertions and deletions (indels), and 2,384,470 single nucleotide variations (SNVs). Among these observations, the known homozygous mutation in PTEN was robustly identified, and genes involved in cell adhesion were overrepresented in the mutated gene list. Data were compared to 219,187 heterozygous single nucleotide polymorphisms assayed by Illumina 1M Duo genotyping array to assess accuracy: 93.83% of all SNPs were reliably detected at filtering thresholds that yield greater than 99.99% sequence accuracy. Protein coding sequences were disrupted predominantly in this cancer cell line due to small indels, large deletions, and translocations. In total, 512 genes were homozygously mutated, including 154 by SNVs, 178 by small indels, 145 by large microdeletions, and 35 by interchromosomal translocations to reveal a highly mutated cell line genome. Of the small homozygously mutated variants, 8 SNVs and 99 indels were novel events not present in dbSNP. These data demonstrate that routine generation of broad cancer genome sequence is possible outside of genome centers. The sequence analysis of U87MG provides an unparalleled level of mutational resolution compared to any cell line to date. Public Library of Science 2010-01-29 /pmc/articles/PMC2813426/ /pubmed/20126413 http://dx.doi.org/10.1371/journal.pgen.1000832 Text en Clark et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Clark, Michael James
Homer, Nils
O'Connor, Brian D.
Chen, Zugen
Eskin, Ascia
Lee, Hane
Merriman, Barry
Nelson, Stanley F.
U87MG Decoded: The Genomic Sequence of a Cytogenetically Aberrant Human Cancer Cell Line
title U87MG Decoded: The Genomic Sequence of a Cytogenetically Aberrant Human Cancer Cell Line
title_full U87MG Decoded: The Genomic Sequence of a Cytogenetically Aberrant Human Cancer Cell Line
title_fullStr U87MG Decoded: The Genomic Sequence of a Cytogenetically Aberrant Human Cancer Cell Line
title_full_unstemmed U87MG Decoded: The Genomic Sequence of a Cytogenetically Aberrant Human Cancer Cell Line
title_short U87MG Decoded: The Genomic Sequence of a Cytogenetically Aberrant Human Cancer Cell Line
title_sort u87mg decoded: the genomic sequence of a cytogenetically aberrant human cancer cell line
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2813426/
https://www.ncbi.nlm.nih.gov/pubmed/20126413
http://dx.doi.org/10.1371/journal.pgen.1000832
work_keys_str_mv AT clarkmichaeljames u87mgdecodedthegenomicsequenceofacytogeneticallyaberranthumancancercellline
AT homernils u87mgdecodedthegenomicsequenceofacytogeneticallyaberranthumancancercellline
AT oconnorbriand u87mgdecodedthegenomicsequenceofacytogeneticallyaberranthumancancercellline
AT chenzugen u87mgdecodedthegenomicsequenceofacytogeneticallyaberranthumancancercellline
AT eskinascia u87mgdecodedthegenomicsequenceofacytogeneticallyaberranthumancancercellline
AT leehane u87mgdecodedthegenomicsequenceofacytogeneticallyaberranthumancancercellline
AT merrimanbarry u87mgdecodedthegenomicsequenceofacytogeneticallyaberranthumancancercellline
AT nelsonstanleyf u87mgdecodedthegenomicsequenceofacytogeneticallyaberranthumancancercellline