Cargando…

Identification of lncRNA biomarkers for lung cancer through integrative cross-platform data analyses

This study was designed to identify lncRNA biomarker candidates using lung cancer data from RNA-Seq and microarray platforms separately. Lung cancer datasets were obtained from the Gene Expression Omnibus (GEO, n = 287) and The Cancer Genome Atlas (TCGA, n = 216) repositories, only common lncRNAs we...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Tianying, Khadka, Vedbar Singh, Deng, Youping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Impact Journals 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7425463/
https://www.ncbi.nlm.nih.gov/pubmed/32675385
http://dx.doi.org/10.18632/aging.103496
_version_ 1783570499007152128
author Zhao, Tianying
Khadka, Vedbar Singh
Deng, Youping
author_facet Zhao, Tianying
Khadka, Vedbar Singh
Deng, Youping
author_sort Zhao, Tianying
collection PubMed
description This study was designed to identify lncRNA biomarker candidates using lung cancer data from RNA-Seq and microarray platforms separately. Lung cancer datasets were obtained from the Gene Expression Omnibus (GEO, n = 287) and The Cancer Genome Atlas (TCGA, n = 216) repositories, only common lncRNAs were used. Differentially expressed (DE) lncRNAs in tumors with respect to normal were selected from the Affymetrix and TCGA datasets. A training model consisting of the top 20 DE Affymetrix lncRNAs was used for validation in the TCGA and Agilent datasets. A second similar training model was generated using the TCGA dataset. First, a model using the top 20 DE lncRNAs from Affymetrix for training and validated using TCGA and Agilent, achieved high prediction accuracy for both training (98.5% AUC for Affymetrix) and validation (99.2% AUC for TCGA and 92.8% AUC for Agilent). A similar model using the top 20 DE lncRNAs from TCGA for training and validated using Affymetrix and Agilent, also achieved high prediction accuracy for both training (97.7% AUC for TCGA) and validation (96.5% AUC for Affymetrix and 80.9% AUC for Agilent). Eight lncRNAs were found to be overlapped from these two lists.
format Online
Article
Text
id pubmed-7425463
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Impact Journals
record_format MEDLINE/PubMed
spelling pubmed-74254632020-08-25 Identification of lncRNA biomarkers for lung cancer through integrative cross-platform data analyses Zhao, Tianying Khadka, Vedbar Singh Deng, Youping Aging (Albany NY) Research Paper This study was designed to identify lncRNA biomarker candidates using lung cancer data from RNA-Seq and microarray platforms separately. Lung cancer datasets were obtained from the Gene Expression Omnibus (GEO, n = 287) and The Cancer Genome Atlas (TCGA, n = 216) repositories, only common lncRNAs were used. Differentially expressed (DE) lncRNAs in tumors with respect to normal were selected from the Affymetrix and TCGA datasets. A training model consisting of the top 20 DE Affymetrix lncRNAs was used for validation in the TCGA and Agilent datasets. A second similar training model was generated using the TCGA dataset. First, a model using the top 20 DE lncRNAs from Affymetrix for training and validated using TCGA and Agilent, achieved high prediction accuracy for both training (98.5% AUC for Affymetrix) and validation (99.2% AUC for TCGA and 92.8% AUC for Agilent). A similar model using the top 20 DE lncRNAs from TCGA for training and validated using Affymetrix and Agilent, also achieved high prediction accuracy for both training (97.7% AUC for TCGA) and validation (96.5% AUC for Affymetrix and 80.9% AUC for Agilent). Eight lncRNAs were found to be overlapped from these two lists. Impact Journals 2020-07-16 /pmc/articles/PMC7425463/ /pubmed/32675385 http://dx.doi.org/10.18632/aging.103496 Text en Copyright © 2020 Zhao et al. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY 3.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Paper
Zhao, Tianying
Khadka, Vedbar Singh
Deng, Youping
Identification of lncRNA biomarkers for lung cancer through integrative cross-platform data analyses
title Identification of lncRNA biomarkers for lung cancer through integrative cross-platform data analyses
title_full Identification of lncRNA biomarkers for lung cancer through integrative cross-platform data analyses
title_fullStr Identification of lncRNA biomarkers for lung cancer through integrative cross-platform data analyses
title_full_unstemmed Identification of lncRNA biomarkers for lung cancer through integrative cross-platform data analyses
title_short Identification of lncRNA biomarkers for lung cancer through integrative cross-platform data analyses
title_sort identification of lncrna biomarkers for lung cancer through integrative cross-platform data analyses
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7425463/
https://www.ncbi.nlm.nih.gov/pubmed/32675385
http://dx.doi.org/10.18632/aging.103496
work_keys_str_mv AT zhaotianying identificationoflncrnabiomarkersforlungcancerthroughintegrativecrossplatformdataanalyses
AT khadkavedbarsingh identificationoflncrnabiomarkersforlungcancerthroughintegrativecrossplatformdataanalyses
AT dengyouping identificationoflncrnabiomarkersforlungcancerthroughintegrativecrossplatformdataanalyses