Cargando…

Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis

Protein primary structure is a potential critical quality attribute for biotherapeutics. Identifying and characterizing any sequence variants present is essential for product development. A sequence variant ~11 kDa larger than the expected IgG mass was observed by size-exclusion chromatography and t...

Descripción completa

Detalles Bibliográficos
Autores principales: Harris, Claire, Xu, Weichen, Grassi, Luigi, Wang, Chunlei, Markle, Abigail, Hardman, Colin, Stevens, Richard, Miro-Quesada, Guillermo, Hatton, Diane, Wang, Jihong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Taylor & Francis 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6816433/
https://www.ncbi.nlm.nih.gov/pubmed/31570042
http://dx.doi.org/10.1080/19420862.2019.1667740
_version_ 1783463368790638592
author Harris, Claire
Xu, Weichen
Grassi, Luigi
Wang, Chunlei
Markle, Abigail
Hardman, Colin
Stevens, Richard
Miro-Quesada, Guillermo
Hatton, Diane
Wang, Jihong
author_facet Harris, Claire
Xu, Weichen
Grassi, Luigi
Wang, Chunlei
Markle, Abigail
Hardman, Colin
Stevens, Richard
Miro-Quesada, Guillermo
Hatton, Diane
Wang, Jihong
author_sort Harris, Claire
collection PubMed
description Protein primary structure is a potential critical quality attribute for biotherapeutics. Identifying and characterizing any sequence variants present is essential for product development. A sequence variant ~11 kDa larger than the expected IgG mass was observed by size-exclusion chromatography and two-dimensional liquid chromatography coupled with online mass spectrometry. Further characterization indicated that the 11 kDa was added to the heavy chain (HC) Fc domain. Despite the relatively large mass addition, only one unknown peptide was detected by peptide mapping. To decipher the sequence, the transcriptome of the manufacturing cell line was characterized by Illumina RNA-seq. Transcriptome reconstruction detected an aberrant fusion transcript, where the light chain (LC) constant domain sequence was fused to the 3ʹ end of the HC transcript. Translation of this fusion transcript generated an extended peptide sequence at the HC C-terminus corresponding to the observed 11 kDa mass addition. Nanopore-based genome sequencing showed multiple copies of the plasmid had integrated in tandem with one copy missing the 5ʹ end of the plasmid, deleting the LC variable domain. The fusion transcript was due to read-through of the HC terminator sequence into the adjacent partial LC gene and an unexpected splicing event between a cryptic splice-donor site at the 3ʹ end of the HC and the splice acceptor site at the 5ʹ end of the LC constant domain. Our study demonstrates that combining protein physicochemical characterization with genomic and transcriptomic analysis of the manufacturing cell line greatly improves the identification of sequence variants and understanding of the underlying molecular mechanisms.
format Online
Article
Text
id pubmed-6816433
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Taylor & Francis
record_format MEDLINE/PubMed
spelling pubmed-68164332019-11-05 Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis Harris, Claire Xu, Weichen Grassi, Luigi Wang, Chunlei Markle, Abigail Hardman, Colin Stevens, Richard Miro-Quesada, Guillermo Hatton, Diane Wang, Jihong MAbs Report Protein primary structure is a potential critical quality attribute for biotherapeutics. Identifying and characterizing any sequence variants present is essential for product development. A sequence variant ~11 kDa larger than the expected IgG mass was observed by size-exclusion chromatography and two-dimensional liquid chromatography coupled with online mass spectrometry. Further characterization indicated that the 11 kDa was added to the heavy chain (HC) Fc domain. Despite the relatively large mass addition, only one unknown peptide was detected by peptide mapping. To decipher the sequence, the transcriptome of the manufacturing cell line was characterized by Illumina RNA-seq. Transcriptome reconstruction detected an aberrant fusion transcript, where the light chain (LC) constant domain sequence was fused to the 3ʹ end of the HC transcript. Translation of this fusion transcript generated an extended peptide sequence at the HC C-terminus corresponding to the observed 11 kDa mass addition. Nanopore-based genome sequencing showed multiple copies of the plasmid had integrated in tandem with one copy missing the 5ʹ end of the plasmid, deleting the LC variable domain. The fusion transcript was due to read-through of the HC terminator sequence into the adjacent partial LC gene and an unexpected splicing event between a cryptic splice-donor site at the 3ʹ end of the HC and the splice acceptor site at the 5ʹ end of the LC constant domain. Our study demonstrates that combining protein physicochemical characterization with genomic and transcriptomic analysis of the manufacturing cell line greatly improves the identification of sequence variants and understanding of the underlying molecular mechanisms. Taylor & Francis 2019-10-01 /pmc/articles/PMC6816433/ /pubmed/31570042 http://dx.doi.org/10.1080/19420862.2019.1667740 Text en © 2019 The Author(s). Published with license by Taylor & Francis Group, LLC. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivatives License (http://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited, and is not altered, transformed, or built upon in any way.
spellingShingle Report
Harris, Claire
Xu, Weichen
Grassi, Luigi
Wang, Chunlei
Markle, Abigail
Hardman, Colin
Stevens, Richard
Miro-Quesada, Guillermo
Hatton, Diane
Wang, Jihong
Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis
title Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis
title_full Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis
title_fullStr Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis
title_full_unstemmed Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis
title_short Identification and characterization of an IgG sequence variant with an 11 kDa heavy chain C-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis
title_sort identification and characterization of an igg sequence variant with an 11 kda heavy chain c-terminal extension using a combination of mass spectrometry and high-throughput sequencing analysis
topic Report
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6816433/
https://www.ncbi.nlm.nih.gov/pubmed/31570042
http://dx.doi.org/10.1080/19420862.2019.1667740
work_keys_str_mv AT harrisclaire identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis
AT xuweichen identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis
AT grassiluigi identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis
AT wangchunlei identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis
AT markleabigail identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis
AT hardmancolin identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis
AT stevensrichard identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis
AT miroquesadaguillermo identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis
AT hattondiane identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis
AT wangjihong identificationandcharacterizationofaniggsequencevariantwithan11kdaheavychaincterminalextensionusingacombinationofmassspectrometryandhighthroughputsequencinganalysis