Cargando…

Low-cost and clinically applicable copy number profiling using repeat DNA

BACKGROUND: Somatic copy number alterations (SCNAs) are an important class of genomic alteration in cancer. They are frequently observed in cancer samples, with studies showing that, on average, SCNAs affect 34% of a cancer cell’s genome. Furthermore, SCNAs have been shown to be major drivers of tum...

Descripción completa

Detalles Bibliográficos
Autores principales: Abujudeh, Sam, Zeki, Sebastian S., van Lanschot, Meta C.J., Pusung, Mark, Weaver, Jamie M.J., Li, Xiaodun, Noorani, Ayesha, Metz, Andrew J., Bornschein, Jan, Bower, Lawrence, Miremadi, Ahmad, Fitzgerald, Rebecca C., Morrissey, Edward R., Lynch, Andy G.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9386984/
https://www.ncbi.nlm.nih.gov/pubmed/35978291
http://dx.doi.org/10.1186/s12864-022-08681-8
_version_ 1784769929321381888
author Abujudeh, Sam
Zeki, Sebastian S.
van Lanschot, Meta C.J.
Pusung, Mark
Weaver, Jamie M.J.
Li, Xiaodun
Noorani, Ayesha
Metz, Andrew J.
Bornschein, Jan
Bower, Lawrence
Miremadi, Ahmad
Fitzgerald, Rebecca C.
Morrissey, Edward R.
Lynch, Andy G.
author_facet Abujudeh, Sam
Zeki, Sebastian S.
van Lanschot, Meta C.J.
Pusung, Mark
Weaver, Jamie M.J.
Li, Xiaodun
Noorani, Ayesha
Metz, Andrew J.
Bornschein, Jan
Bower, Lawrence
Miremadi, Ahmad
Fitzgerald, Rebecca C.
Morrissey, Edward R.
Lynch, Andy G.
author_sort Abujudeh, Sam
collection PubMed
description BACKGROUND: Somatic copy number alterations (SCNAs) are an important class of genomic alteration in cancer. They are frequently observed in cancer samples, with studies showing that, on average, SCNAs affect 34% of a cancer cell’s genome. Furthermore, SCNAs have been shown to be major drivers of tumour development and have been associated with response to therapy and prognosis. Large-scale cancer genome studies suggest that tumours are driven by somatic copy number alterations (SCNAs) or single-nucleotide variants (SNVs). Despite the frequency of SCNAs and their clinical relevance, the use of genomics assays in the clinic is biased towards targeted gene panels, which identify SNVs but provide limited scope to detect SCNAs throughout the genome. There is a need for a comparably low-cost and simple method for high-resolution SCNA profiling. RESULTS: We present conliga, a fully probabilistic method that infers SCNA profiles from a low-cost, simple, and clinically-relevant assay (FAST-SeqS). When applied to 11 high-purity oesophageal adenocarcinoma samples, we obtain good agreement (Spearman’s rank correlation coefficient, r(s)=0.94) between conliga’s inferred SCNA profiles using FAST-SeqS data (approximately £14 per sample) and those inferred by ASCAT using high-coverage WGS (gold-standard). We find that conliga outperforms CNVkit (r(s)=0.89), also applied to FAST-SeqS data, and is comparable to QDNAseq (r(s)=0.96) applied to low-coverage WGS, which is approximately four-fold more expensive, more laborious and less clinically-relevant. By performing an in silico dilution series experiment, we find that conliga is particularly suited to detecting SCNAs in low tumour purity samples. At two million reads per sample, conliga is able to detect SCNAs in all nine samples at 3% tumour purity and as low as 0.5% purity in one sample. Crucially, we show that conliga’s hidden state information can be used to decide when a sample is abnormal or normal, whereas CNVkit and QDNAseq cannot provide this critical information. CONCLUSIONS: We show that conliga provides high-resolution SCNA profiles using a convenient, low-cost assay. We believe conliga makes FAST-SeqS a more clinically valuable assay as well as a useful research tool, enabling inexpensive and fast copy number profiling of pre-malignant and cancer samples. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-022-08681-8).
format Online
Article
Text
id pubmed-9386984
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-93869842022-08-19 Low-cost and clinically applicable copy number profiling using repeat DNA Abujudeh, Sam Zeki, Sebastian S. van Lanschot, Meta C.J. Pusung, Mark Weaver, Jamie M.J. Li, Xiaodun Noorani, Ayesha Metz, Andrew J. Bornschein, Jan Bower, Lawrence Miremadi, Ahmad Fitzgerald, Rebecca C. Morrissey, Edward R. Lynch, Andy G. BMC Genomics Research Article BACKGROUND: Somatic copy number alterations (SCNAs) are an important class of genomic alteration in cancer. They are frequently observed in cancer samples, with studies showing that, on average, SCNAs affect 34% of a cancer cell’s genome. Furthermore, SCNAs have been shown to be major drivers of tumour development and have been associated with response to therapy and prognosis. Large-scale cancer genome studies suggest that tumours are driven by somatic copy number alterations (SCNAs) or single-nucleotide variants (SNVs). Despite the frequency of SCNAs and their clinical relevance, the use of genomics assays in the clinic is biased towards targeted gene panels, which identify SNVs but provide limited scope to detect SCNAs throughout the genome. There is a need for a comparably low-cost and simple method for high-resolution SCNA profiling. RESULTS: We present conliga, a fully probabilistic method that infers SCNA profiles from a low-cost, simple, and clinically-relevant assay (FAST-SeqS). When applied to 11 high-purity oesophageal adenocarcinoma samples, we obtain good agreement (Spearman’s rank correlation coefficient, r(s)=0.94) between conliga’s inferred SCNA profiles using FAST-SeqS data (approximately £14 per sample) and those inferred by ASCAT using high-coverage WGS (gold-standard). We find that conliga outperforms CNVkit (r(s)=0.89), also applied to FAST-SeqS data, and is comparable to QDNAseq (r(s)=0.96) applied to low-coverage WGS, which is approximately four-fold more expensive, more laborious and less clinically-relevant. By performing an in silico dilution series experiment, we find that conliga is particularly suited to detecting SCNAs in low tumour purity samples. At two million reads per sample, conliga is able to detect SCNAs in all nine samples at 3% tumour purity and as low as 0.5% purity in one sample. Crucially, we show that conliga’s hidden state information can be used to decide when a sample is abnormal or normal, whereas CNVkit and QDNAseq cannot provide this critical information. CONCLUSIONS: We show that conliga provides high-resolution SCNA profiles using a convenient, low-cost assay. We believe conliga makes FAST-SeqS a more clinically valuable assay as well as a useful research tool, enabling inexpensive and fast copy number profiling of pre-malignant and cancer samples. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at (10.1186/s12864-022-08681-8). BioMed Central 2022-08-17 /pmc/articles/PMC9386984/ /pubmed/35978291 http://dx.doi.org/10.1186/s12864-022-08681-8 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Abujudeh, Sam
Zeki, Sebastian S.
van Lanschot, Meta C.J.
Pusung, Mark
Weaver, Jamie M.J.
Li, Xiaodun
Noorani, Ayesha
Metz, Andrew J.
Bornschein, Jan
Bower, Lawrence
Miremadi, Ahmad
Fitzgerald, Rebecca C.
Morrissey, Edward R.
Lynch, Andy G.
Low-cost and clinically applicable copy number profiling using repeat DNA
title Low-cost and clinically applicable copy number profiling using repeat DNA
title_full Low-cost and clinically applicable copy number profiling using repeat DNA
title_fullStr Low-cost and clinically applicable copy number profiling using repeat DNA
title_full_unstemmed Low-cost and clinically applicable copy number profiling using repeat DNA
title_short Low-cost and clinically applicable copy number profiling using repeat DNA
title_sort low-cost and clinically applicable copy number profiling using repeat dna
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9386984/
https://www.ncbi.nlm.nih.gov/pubmed/35978291
http://dx.doi.org/10.1186/s12864-022-08681-8
work_keys_str_mv AT abujudehsam lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT zekisebastians lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT vanlanschotmetacj lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT pusungmark lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT weaverjamiemj lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT lixiaodun lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT nooraniayesha lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT metzandrewj lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT bornscheinjan lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT bowerlawrence lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT miremadiahmad lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT fitzgeraldrebeccac lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT morrisseyedwardr lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna
AT lynchandyg lowcostandclinicallyapplicablecopynumberprofilingusingrepeatdna