Cargando…

A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset

Breast cancer research has traditionally centred on genomic alterations, hormone receptor status and changes in cancer-related proteins to provide new avenues for targeted therapies. Due to advances in next generation sequencing technologies, there has been the emergence of long, non-coding RNAs (ln...

Descripción completa

Detalles Bibliográficos
Autores principales: Zaheed, Oza, Samson, Julia, Dean, Kellie
Formato: Online Artículo Texto
Lenguaje:English
Publicado: KeAi Publishing 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7078458/
https://www.ncbi.nlm.nih.gov/pubmed/32206740
http://dx.doi.org/10.1016/j.ncrna.2020.02.004
_version_ 1783507624071790592
author Zaheed, Oza
Samson, Julia
Dean, Kellie
author_facet Zaheed, Oza
Samson, Julia
Dean, Kellie
author_sort Zaheed, Oza
collection PubMed
description Breast cancer research has traditionally centred on genomic alterations, hormone receptor status and changes in cancer-related proteins to provide new avenues for targeted therapies. Due to advances in next generation sequencing technologies, there has been the emergence of long, non-coding RNAs (lncRNAs) as regulators of normal cellular events, with links to various disease states, including breast cancer. Here we describe our bioinformatic analyses of a previously published RNA sequencing (RNA-seq) dataset to identify lncRNAs with altered expression levels in a subset of breast cancer cell lines. Using a previously published RNA-seq dataset of 675 cancer cell lines, a subset of 18 cell lines was selected for our analyses that included 16 breast cancer lines, one ductal carcinoma in situ line and one normal-like breast epithelial cell line. Principal component analysis demonstrated correlation with well-established categorisation methods of breast cancer (i.e. luminal A/B, HER2 enriched and basal-like A/B). Through detailed comparison of differentially expressed lncRNAs in each breast cancer sub-type with normal-like breast epithelial cells, we identified 15 lncRNAs with consistently altered expression, including three uncharacterised lncRNAs. Utilising data from The Cancer Genome Atlas (TCGA) and The Genotype Tissue Expression (GETx) project via Gene Expression Profiling Interactive Analysis (GEPIA2), we assessed clinical relevance of several identified lncRNAs with invasive breast cancer. Lastly, we determined the relative expression level of six lncRNAs across a spectrum of breast cancer cell lines to experimentally confirm the findings of our bioinformatic analyses. Overall, we show that the use of existing RNA-seq datasets, if re-analysed with modern bioinformatic tools, can provide a valuable resource to identify lncRNAs that could have important biological roles in oncogenesis and tumour progression.
format Online
Article
Text
id pubmed-7078458
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher KeAi Publishing
record_format MEDLINE/PubMed
spelling pubmed-70784582020-03-23 A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset Zaheed, Oza Samson, Julia Dean, Kellie Noncoding RNA Res Article Breast cancer research has traditionally centred on genomic alterations, hormone receptor status and changes in cancer-related proteins to provide new avenues for targeted therapies. Due to advances in next generation sequencing technologies, there has been the emergence of long, non-coding RNAs (lncRNAs) as regulators of normal cellular events, with links to various disease states, including breast cancer. Here we describe our bioinformatic analyses of a previously published RNA sequencing (RNA-seq) dataset to identify lncRNAs with altered expression levels in a subset of breast cancer cell lines. Using a previously published RNA-seq dataset of 675 cancer cell lines, a subset of 18 cell lines was selected for our analyses that included 16 breast cancer lines, one ductal carcinoma in situ line and one normal-like breast epithelial cell line. Principal component analysis demonstrated correlation with well-established categorisation methods of breast cancer (i.e. luminal A/B, HER2 enriched and basal-like A/B). Through detailed comparison of differentially expressed lncRNAs in each breast cancer sub-type with normal-like breast epithelial cells, we identified 15 lncRNAs with consistently altered expression, including three uncharacterised lncRNAs. Utilising data from The Cancer Genome Atlas (TCGA) and The Genotype Tissue Expression (GETx) project via Gene Expression Profiling Interactive Analysis (GEPIA2), we assessed clinical relevance of several identified lncRNAs with invasive breast cancer. Lastly, we determined the relative expression level of six lncRNAs across a spectrum of breast cancer cell lines to experimentally confirm the findings of our bioinformatic analyses. Overall, we show that the use of existing RNA-seq datasets, if re-analysed with modern bioinformatic tools, can provide a valuable resource to identify lncRNAs that could have important biological roles in oncogenesis and tumour progression. KeAi Publishing 2020-02-24 /pmc/articles/PMC7078458/ /pubmed/32206740 http://dx.doi.org/10.1016/j.ncrna.2020.02.004 Text en © 2020 Production and hosting by Elsevier B.V. on behalf of KeAi Communications Co., Ltd. http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Zaheed, Oza
Samson, Julia
Dean, Kellie
A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset
title A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset
title_full A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset
title_fullStr A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset
title_full_unstemmed A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset
title_short A bioinformatics approach to identify novel long, non-coding RNAs in breast cancer cell lines from an existing RNA-sequencing dataset
title_sort bioinformatics approach to identify novel long, non-coding rnas in breast cancer cell lines from an existing rna-sequencing dataset
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7078458/
https://www.ncbi.nlm.nih.gov/pubmed/32206740
http://dx.doi.org/10.1016/j.ncrna.2020.02.004
work_keys_str_mv AT zaheedoza abioinformaticsapproachtoidentifynovellongnoncodingrnasinbreastcancercelllinesfromanexistingrnasequencingdataset
AT samsonjulia abioinformaticsapproachtoidentifynovellongnoncodingrnasinbreastcancercelllinesfromanexistingrnasequencingdataset
AT deankellie abioinformaticsapproachtoidentifynovellongnoncodingrnasinbreastcancercelllinesfromanexistingrnasequencingdataset
AT zaheedoza bioinformaticsapproachtoidentifynovellongnoncodingrnasinbreastcancercelllinesfromanexistingrnasequencingdataset
AT samsonjulia bioinformaticsapproachtoidentifynovellongnoncodingrnasinbreastcancercelllinesfromanexistingrnasequencingdataset
AT deankellie bioinformaticsapproachtoidentifynovellongnoncodingrnasinbreastcancercelllinesfromanexistingrnasequencingdataset