Cargando…
Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis
Protein primary structure is a potential critical quality attribute for biotherapeutics. Identifying and characterizing any sequence variants present is essential for product development. A sequence variant ~11 kDa larger than the expected IgG mass was observed by size-exclusion chromatography and t...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Taylor & Francis
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6816433/ https://www.ncbi.nlm.nih.gov/pubmed/31570042 http://dx.doi.org/10.1080/19420862.2019.1667740 |
_version_ | 1783463368790638592 |
---|---|
author | Harris, Claire Xu, Weichen Grassi, Luigi Wang, Chunlei Markle, Abigail Hardman, Colin Stevens, Richard Miro-Quesada, Guillermo Hatton, Diane Wang, Jihong |
author_facet | Harris, Claire Xu, Weichen Grassi, Luigi Wang, Chunlei Markle, Abigail Hardman, Colin Stevens, Richard Miro-Quesada, Guillermo Hatton, Diane Wang, Jihong |
author_sort | Harris, Claire |
collection | PubMed |
description | Protein primary structure is a potential critical quality attribute for biotherapeutics. Identifying and characterizing any sequence variants present is essential for product development. A sequence variant ~11 kDa larger than the expected IgG mass was observed by size-exclusion chromatography and two-dimensional liquid chromatography coupled with online mass spectrometry. Further characterization indicated that the 11 kDa was added to the heavy chain (HC) Fc domain. Despite the relatively large mass addition, only one unknown peptide was detected by peptide mapping. To decipher the sequence, the transcriptome of the manufacturing cell line was characterized by Illumina RNA-seq. Transcriptome reconstruction detected an aberrant fusion transcript, where the light chain (LC) constant domain sequence was fused to the 3ʹ end of the HC transcript. Translation of this fusion transcript generated an extended peptide sequence at the HC C-terminus corresponding to the observed 11 kDa mass addition. Nanopore-based genome sequencing showed multiple copies of the plasmid had integrated in tandem with one copy missing the 5ʹ end of the plasmid, deleting the LC variable domain. The fusion transcript was due to read-through of the HC terminator sequence into the adjacent partial LC gene and an unexpected splicing event between a cryptic splice-donor site at the 3ʹ end of the HC and the splice acceptor site at the 5ʹ end of the LC constant domain. Our study demonstrates that combining protein physicochemical characterization with genomic and transcriptomic analysis of the manufacturing cell line greatly improves the identification of sequence variants and understanding of the underlying molecular mechanisms. |
format | Online Article Text |
id | pubmed-6816433 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Taylor & Francis |
record_format | MEDLINE/PubMed |
spelling | pubmed-68164332019-11-05 Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis Harris, Claire Xu, Weichen Grassi, Luigi Wang, Chunlei Markle, Abigail Hardman, Colin Stevens, Richard Miro-Quesada, Guillermo Hatton, Diane Wang, Jihong MAbs Report Protein primary structure is a potential critical quality attribute for biotherapeutics. Identifying and characterizing any sequence variants present is essential for product development. A sequence variant ~11 kDa larger than the expected IgG mass was observed by size-exclusion chromatography and two-dimensional liquid chromatography coupled with online mass spectrometry. Further characterization indicated that the 11 kDa was added to the heavy chain (HC) Fc domain. Despite the relatively large mass addition, only one unknown peptide was detected by peptide mapping. To decipher the sequence, the transcriptome of the manufacturing cell line was characterized by Illumina RNA-seq. Transcriptome reconstruction detected an aberrant fusion transcript, where the light chain (LC) constant domain sequence was fused to the 3ʹ end of the HC transcript. Translation of this fusion transcript generated an extended peptide sequence at the HC C-terminus corresponding to the observed 11 kDa mass addition. Nanopore-based genome sequencing showed multiple copies of the plasmid had integrated in tandem with one copy missing the 5ʹ end of the plasmid, deleting the LC variable domain. The fusion transcript was due to read-through of the HC terminator sequence into the adjacent partial LC gene and an unexpected splicing event between a cryptic splice-donor site at the 3ʹ end of the HC and the splice acceptor site at the 5ʹ end of the LC constant domain. Our study demonstrates that combining protein physicochemical characterization with genomic and transcriptomic analysis of the manufacturing cell line greatly improves the identification of sequence variants and understanding of the underlying molecular mechanisms. Taylor & Francis 2019-10-01 /pmc/articles/PMC6816433/ /pubmed/31570042 http://dx.doi.org/10.1080/19420862.2019.1667740 Text en © 2019 The Author(s). Published with license by Taylor & Francis Group, LLC. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way. |
spellingShingle | Report Harris, Claire Xu, Weichen Grassi, Luigi Wang, Chunlei Markle, Abigail Hardman, Colin Stevens, Richard Miro-Quesada, Guillermo Hatton, Diane Wang, Jihong Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis |
title | Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis |
title_full | Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis |
title_fullStr | Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis |
title_full_unstemmed | Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis |
title_short | Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis |
title_sort | identification and characterization of an igg sequence variant with an 11 kda heavy chain c-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis |
topic | Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6816433/ https://www.ncbi.nlm.nih.gov/pubmed/31570042 http://dx.doi.org/10.1080/19420862.2019.1667740 |
work_keys_str_mv | AT harrisclaire identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis AT xuweichen identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis AT grassiluigi identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis AT wangchunlei identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis AT markleabigail identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis AT hardmancolin identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis AT stevensrichard identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis AT miroquesadaguillermo identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis AT hattondiane identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis AT wangjihong identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis |