Cargando…

Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC

BACKGROUND: Early diagnosis is crucial for effective medical management of cancer patients. Tissue biopsy has been widely used for cancer diagnosis, but its invasive nature limits its application, especially when repeated biopsies are needed. Over the past few years, genomic explorations have led to...

Descripción completa

Detalles Bibliográficos
Autores principales: Goswami, Chitrita, Chawla, Smriti, Thakral, Deepshi, Pant, Himanshu, Verma, Pramod, Malik, Prabhat Singh, ▮, Jayadeva, Gupta, Ritu, Ahuja, Gaurav, Sengupta, Debarka
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7590669/
https://www.ncbi.nlm.nih.gov/pubmed/33287695
http://dx.doi.org/10.1186/s12864-020-07147-z
_version_ 1783600849437589504
author Goswami, Chitrita
Chawla, Smriti
Thakral, Deepshi
Pant, Himanshu
Verma, Pramod
Malik, Prabhat Singh
▮, Jayadeva
Gupta, Ritu
Ahuja, Gaurav
Sengupta, Debarka
author_facet Goswami, Chitrita
Chawla, Smriti
Thakral, Deepshi
Pant, Himanshu
Verma, Pramod
Malik, Prabhat Singh
▮, Jayadeva
Gupta, Ritu
Ahuja, Gaurav
Sengupta, Debarka
author_sort Goswami, Chitrita
collection PubMed
description BACKGROUND: Early diagnosis is crucial for effective medical management of cancer patients. Tissue biopsy has been widely used for cancer diagnosis, but its invasive nature limits its application, especially when repeated biopsies are needed. Over the past few years, genomic explorations have led to the discovery of various blood-based biomarkers. Tumor Educated Platelets (TEPs) have, of late, generated considerable interest due to their ability to infer tumor existence and subtype accurately. So far, a majority of the studies involving TEPs have offered marker-panels consisting of several hundreds of genes. Profiling large numbers of genes incur a significant cost, impeding its diagnostic adoption. As such, it is important to construct minimalistic molecular signatures comprising a small number of genes. RESULTS: To address the aforesaid challenges, we analyzed publicly available TEP expression profiles and identified a panel of 11 platelet-genes that reliably discriminates between cancer and healthy samples. To validate its efficacy, we chose non-small cell lung cancer (NSCLC), the most prevalent type of lung malignancy. When applied to platelet-gene expression data from a published study, our machine learning model could accurately discriminate between non-metastatic NSCLC cases and healthy samples. We further experimentally validated the panel on an in-house cohort of metastatic NSCLC patients and healthy controls via real-time quantitative Polymerase Chain Reaction (RT-qPCR) (AUC = 0.97). Model performance was boosted significantly after artificial data-augmentation using the EigenSample method (AUC = 0.99). Lastly, we demonstrated the cancer-specificity of the proposed gene-panel by benchmarking it on platelet transcriptomes from patients with Myocardial Infarction (MI). CONCLUSION: We demonstrated an end-to-end bioinformatic plus experimental workflow for identifying a minimal set of TEP associated marker-genes that are predictive of the existence of cancers. We also discussed a strategy for boosting the predictive model performance by artificial augmentation of gene expression data.
format Online
Article
Text
id pubmed-7590669
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-75906692020-10-27 Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC Goswami, Chitrita Chawla, Smriti Thakral, Deepshi Pant, Himanshu Verma, Pramod Malik, Prabhat Singh ▮, Jayadeva Gupta, Ritu Ahuja, Gaurav Sengupta, Debarka BMC Genomics Research Article BACKGROUND: Early diagnosis is crucial for effective medical management of cancer patients. Tissue biopsy has been widely used for cancer diagnosis, but its invasive nature limits its application, especially when repeated biopsies are needed. Over the past few years, genomic explorations have led to the discovery of various blood-based biomarkers. Tumor Educated Platelets (TEPs) have, of late, generated considerable interest due to their ability to infer tumor existence and subtype accurately. So far, a majority of the studies involving TEPs have offered marker-panels consisting of several hundreds of genes. Profiling large numbers of genes incur a significant cost, impeding its diagnostic adoption. As such, it is important to construct minimalistic molecular signatures comprising a small number of genes. RESULTS: To address the aforesaid challenges, we analyzed publicly available TEP expression profiles and identified a panel of 11 platelet-genes that reliably discriminates between cancer and healthy samples. To validate its efficacy, we chose non-small cell lung cancer (NSCLC), the most prevalent type of lung malignancy. When applied to platelet-gene expression data from a published study, our machine learning model could accurately discriminate between non-metastatic NSCLC cases and healthy samples. We further experimentally validated the panel on an in-house cohort of metastatic NSCLC patients and healthy controls via real-time quantitative Polymerase Chain Reaction (RT-qPCR) (AUC = 0.97). Model performance was boosted significantly after artificial data-augmentation using the EigenSample method (AUC = 0.99). Lastly, we demonstrated the cancer-specificity of the proposed gene-panel by benchmarking it on platelet transcriptomes from patients with Myocardial Infarction (MI). CONCLUSION: We demonstrated an end-to-end bioinformatic plus experimental workflow for identifying a minimal set of TEP associated marker-genes that are predictive of the existence of cancers. We also discussed a strategy for boosting the predictive model performance by artificial augmentation of gene expression data. BioMed Central 2020-10-27 /pmc/articles/PMC7590669/ /pubmed/33287695 http://dx.doi.org/10.1186/s12864-020-07147-z Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Goswami, Chitrita
Chawla, Smriti
Thakral, Deepshi
Pant, Himanshu
Verma, Pramod
Malik, Prabhat Singh
▮, Jayadeva
Gupta, Ritu
Ahuja, Gaurav
Sengupta, Debarka
Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title_full Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title_fullStr Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title_full_unstemmed Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title_short Molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of NSCLC
title_sort molecular signature comprising 11 platelet-genes enables accurate blood-based diagnosis of nsclc
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7590669/
https://www.ncbi.nlm.nih.gov/pubmed/33287695
http://dx.doi.org/10.1186/s12864-020-07147-z
work_keys_str_mv AT goswamichitrita molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT chawlasmriti molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT thakraldeepshi molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT panthimanshu molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT vermapramod molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT malikprabhatsingh molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT jayadeva molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT guptaritu molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT ahujagaurav molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc
AT senguptadebarka molecularsignaturecomprising11plateletgenesenablesaccuratebloodbaseddiagnosisofnsclc