Cargando…

A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression

Transcriptional profiling of pre- and post-malignant colorectal cancer (CRC) lesions enable temporal monitoring of molecular events underlying neoplastic progression. However, the most widely used transcriptomic dataset for CRC, TCGA-COAD, is devoid of adenoma samples, which increases reliance on an...

Descripción completa

Detalles Bibliográficos
Autores principales: Rohr, Michael, Beardsley, Jordan, Nakkina, Sai Preethi, Zhu, Xiang, Aljabban, Jihad, Hadley, Dexter, Altomare, Deborah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8358057/
https://www.ncbi.nlm.nih.gov/pubmed/34381057
http://dx.doi.org/10.1038/s41597-021-00998-5
_version_ 1783737256221081600
author Rohr, Michael
Beardsley, Jordan
Nakkina, Sai Preethi
Zhu, Xiang
Aljabban, Jihad
Hadley, Dexter
Altomare, Deborah
author_facet Rohr, Michael
Beardsley, Jordan
Nakkina, Sai Preethi
Zhu, Xiang
Aljabban, Jihad
Hadley, Dexter
Altomare, Deborah
author_sort Rohr, Michael
collection PubMed
description Transcriptional profiling of pre- and post-malignant colorectal cancer (CRC) lesions enable temporal monitoring of molecular events underlying neoplastic progression. However, the most widely used transcriptomic dataset for CRC, TCGA-COAD, is devoid of adenoma samples, which increases reliance on an assortment of disparate microarray studies and hinders consensus building. To address this, we developed a microarray meta-dataset comprising 231 healthy, 132 adenoma, and 342 CRC tissue samples from twelve independent studies. Utilizing a stringent analytic framework, select datasets were downloaded from the Gene Expression Omnibus, normalized by frozen robust multiarray averaging and subsequently merged. Batch effects were then identified and removed by empirical Bayes estimation (ComBat). Finally, the meta-dataset was filtered for low variant probes, enabling downstream differential expression as well as quantitative and functional validation through cross-platform correlation and enrichment analyses, respectively. Overall, our meta-dataset provides a robust tool for investigating colorectal adenoma formation and malignant transformation at the transcriptional level with a pipeline that is modular and readily adaptable for similar analyses in other cancer types.
format Online
Article
Text
id pubmed-8358057
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-83580572021-08-19 A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression Rohr, Michael Beardsley, Jordan Nakkina, Sai Preethi Zhu, Xiang Aljabban, Jihad Hadley, Dexter Altomare, Deborah Sci Data Data Descriptor Transcriptional profiling of pre- and post-malignant colorectal cancer (CRC) lesions enable temporal monitoring of molecular events underlying neoplastic progression. However, the most widely used transcriptomic dataset for CRC, TCGA-COAD, is devoid of adenoma samples, which increases reliance on an assortment of disparate microarray studies and hinders consensus building. To address this, we developed a microarray meta-dataset comprising 231 healthy, 132 adenoma, and 342 CRC tissue samples from twelve independent studies. Utilizing a stringent analytic framework, select datasets were downloaded from the Gene Expression Omnibus, normalized by frozen robust multiarray averaging and subsequently merged. Batch effects were then identified and removed by empirical Bayes estimation (ComBat). Finally, the meta-dataset was filtered for low variant probes, enabling downstream differential expression as well as quantitative and functional validation through cross-platform correlation and enrichment analyses, respectively. Overall, our meta-dataset provides a robust tool for investigating colorectal adenoma formation and malignant transformation at the transcriptional level with a pipeline that is modular and readily adaptable for similar analyses in other cancer types. Nature Publishing Group UK 2021-08-11 /pmc/articles/PMC8358057/ /pubmed/34381057 http://dx.doi.org/10.1038/s41597-021-00998-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) applies to the metadata files associated with this article.
spellingShingle Data Descriptor
Rohr, Michael
Beardsley, Jordan
Nakkina, Sai Preethi
Zhu, Xiang
Aljabban, Jihad
Hadley, Dexter
Altomare, Deborah
A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression
title A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression
title_full A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression
title_fullStr A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression
title_full_unstemmed A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression
title_short A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression
title_sort merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression
topic Data Descriptor
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8358057/
https://www.ncbi.nlm.nih.gov/pubmed/34381057
http://dx.doi.org/10.1038/s41597-021-00998-5
work_keys_str_mv AT rohrmichael amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT beardsleyjordan amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT nakkinasaipreethi amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT zhuxiang amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT aljabbanjihad amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT hadleydexter amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT altomaredeborah amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT rohrmichael mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT beardsleyjordan mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT nakkinasaipreethi mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT zhuxiang mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT aljabbanjihad mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT hadleydexter mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression
AT altomaredeborah mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression