Cargando…

Genetic Ancestry Inference from Cancer-Derived Molecular Data across Genomic and Transcriptomic Platforms

Genetic ancestry–oriented cancer research requires the ability to perform accurate and robust genetic ancestry inference from existing cancer-derived data, including whole-exome sequencing, transcriptome sequencing, and targeted gene panels, very often in the absence of matching cancer-free genomic...

Descripción completa

Detalles Bibliográficos
Autores principales: Belleau, Pascal, Deschênes, Astrid, Chambwe, Nyasha, Tuveson, David A., Krasnitz, Alexander
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Association for Cancer Research 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9811156/
https://www.ncbi.nlm.nih.gov/pubmed/36351074
http://dx.doi.org/10.1158/0008-5472.CAN-22-0682
_version_ 1784863470596915200
author Belleau, Pascal
Deschênes, Astrid
Chambwe, Nyasha
Tuveson, David A.
Krasnitz, Alexander
author_facet Belleau, Pascal
Deschênes, Astrid
Chambwe, Nyasha
Tuveson, David A.
Krasnitz, Alexander
author_sort Belleau, Pascal
collection PubMed
description Genetic ancestry–oriented cancer research requires the ability to perform accurate and robust genetic ancestry inference from existing cancer-derived data, including whole-exome sequencing, transcriptome sequencing, and targeted gene panels, very often in the absence of matching cancer-free genomic data. Here we examined the feasibility and accuracy of computational inference of genetic ancestry relying exclusively on cancer-derived data. A data synthesis framework was developed to optimize and assess the performance of the ancestry inference for any given input cancer-derived molecular profile. In its core procedure, the ancestral background of the profiled patient is replaced with one of any number of individuals with known ancestry. The data synthesis framework is applicable to multiple profiling platforms, making it possible to assess the performance of inference specifically for a given molecular profile and separately for each continental-level ancestry; this ability extends to all ancestries, including those without statistically sufficient representation in the existing cancer data. The inference procedure was demonstrated to be accurate and robust in a wide range of sequencing depths. Testing of the approach in four representative cancer types and across three molecular profiling modalities showed that continental-level ancestry of patients can be inferred with high accuracy, as quantified by its agreement with the gold standard of deriving ancestry from matching cancer-free molecular data. This study demonstrates that vast amounts of existing cancer-derived molecular data are potentially amenable to ancestry-oriented studies of the disease without requiring matching cancer-free genomes or patient self-reported ancestry. SIGNIFICANCE: The development of a computational approach that enables accurate and robust ancestry inference from cancer-derived molecular profiles without matching cancer-free data provides a valuable methodology for genetic ancestry–oriented cancer research.
format Online
Article
Text
id pubmed-9811156
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher American Association for Cancer Research
record_format MEDLINE/PubMed
spelling pubmed-98111562023-01-05 Genetic Ancestry Inference from Cancer-Derived Molecular Data across Genomic and Transcriptomic Platforms Belleau, Pascal Deschênes, Astrid Chambwe, Nyasha Tuveson, David A. Krasnitz, Alexander Cancer Res Genome and Epigenome Genetic ancestry–oriented cancer research requires the ability to perform accurate and robust genetic ancestry inference from existing cancer-derived data, including whole-exome sequencing, transcriptome sequencing, and targeted gene panels, very often in the absence of matching cancer-free genomic data. Here we examined the feasibility and accuracy of computational inference of genetic ancestry relying exclusively on cancer-derived data. A data synthesis framework was developed to optimize and assess the performance of the ancestry inference for any given input cancer-derived molecular profile. In its core procedure, the ancestral background of the profiled patient is replaced with one of any number of individuals with known ancestry. The data synthesis framework is applicable to multiple profiling platforms, making it possible to assess the performance of inference specifically for a given molecular profile and separately for each continental-level ancestry; this ability extends to all ancestries, including those without statistically sufficient representation in the existing cancer data. The inference procedure was demonstrated to be accurate and robust in a wide range of sequencing depths. Testing of the approach in four representative cancer types and across three molecular profiling modalities showed that continental-level ancestry of patients can be inferred with high accuracy, as quantified by its agreement with the gold standard of deriving ancestry from matching cancer-free molecular data. This study demonstrates that vast amounts of existing cancer-derived molecular data are potentially amenable to ancestry-oriented studies of the disease without requiring matching cancer-free genomes or patient self-reported ancestry. SIGNIFICANCE: The development of a computational approach that enables accurate and robust ancestry inference from cancer-derived molecular profiles without matching cancer-free data provides a valuable methodology for genetic ancestry–oriented cancer research. American Association for Cancer Research 2023-01-04 2022-11-09 /pmc/articles/PMC9811156/ /pubmed/36351074 http://dx.doi.org/10.1158/0008-5472.CAN-22-0682 Text en ©2022 The Authors; Published by the American Association for Cancer Research https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0) license.
spellingShingle Genome and Epigenome
Belleau, Pascal
Deschênes, Astrid
Chambwe, Nyasha
Tuveson, David A.
Krasnitz, Alexander
Genetic Ancestry Inference from Cancer-Derived Molecular Data across Genomic and Transcriptomic Platforms
title Genetic Ancestry Inference from Cancer-Derived Molecular Data across Genomic and Transcriptomic Platforms
title_full Genetic Ancestry Inference from Cancer-Derived Molecular Data across Genomic and Transcriptomic Platforms
title_fullStr Genetic Ancestry Inference from Cancer-Derived Molecular Data across Genomic and Transcriptomic Platforms
title_full_unstemmed Genetic Ancestry Inference from Cancer-Derived Molecular Data across Genomic and Transcriptomic Platforms
title_short Genetic Ancestry Inference from Cancer-Derived Molecular Data across Genomic and Transcriptomic Platforms
title_sort genetic ancestry inference from cancer-derived molecular data across genomic and transcriptomic platforms
topic Genome and Epigenome
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9811156/
https://www.ncbi.nlm.nih.gov/pubmed/36351074
http://dx.doi.org/10.1158/0008-5472.CAN-22-0682
work_keys_str_mv AT belleaupascal geneticancestryinferencefromcancerderivedmoleculardataacrossgenomicandtranscriptomicplatforms
AT deschenesastrid geneticancestryinferencefromcancerderivedmoleculardataacrossgenomicandtranscriptomicplatforms
AT chambwenyasha geneticancestryinferencefromcancerderivedmoleculardataacrossgenomicandtranscriptomicplatforms
AT tuvesondavida geneticancestryinferencefromcancerderivedmoleculardataacrossgenomicandtranscriptomicplatforms
AT krasnitzalexander geneticancestryinferencefromcancerderivedmoleculardataacrossgenomicandtranscriptomicplatforms