Cargando…
A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression
Transcriptional profiling of pre- and post-malignant colorectal cancer (CRC) lesions enable temporal monitoring of molecular events underlying neoplastic progression. However, the most widely used transcriptomic dataset for CRC, TCGA-COAD, is devoid of adenoma samples, which increases reliance on an...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8358057/ https://www.ncbi.nlm.nih.gov/pubmed/34381057 http://dx.doi.org/10.1038/s41597-021-00998-5 |
_version_ | 1783737256221081600 |
---|---|
author | Rohr, Michael Beardsley, Jordan Nakkina, Sai Preethi Zhu, Xiang Aljabban, Jihad Hadley, Dexter Altomare, Deborah |
author_facet | Rohr, Michael Beardsley, Jordan Nakkina, Sai Preethi Zhu, Xiang Aljabban, Jihad Hadley, Dexter Altomare, Deborah |
author_sort | Rohr, Michael |
collection | PubMed |
description | Transcriptional profiling of pre- and post-malignant colorectal cancer (CRC) lesions enable temporal monitoring of molecular events underlying neoplastic progression. However, the most widely used transcriptomic dataset for CRC, TCGA-COAD, is devoid of adenoma samples, which increases reliance on an assortment of disparate microarray studies and hinders consensus building. To address this, we developed a microarray meta-dataset comprising 231 healthy, 132 adenoma, and 342 CRC tissue samples from twelve independent studies. Utilizing a stringent analytic framework, select datasets were downloaded from the Gene Expression Omnibus, normalized by frozen robust multiarray averaging and subsequently merged. Batch effects were then identified and removed by empirical Bayes estimation (ComBat). Finally, the meta-dataset was filtered for low variant probes, enabling downstream differential expression as well as quantitative and functional validation through cross-platform correlation and enrichment analyses, respectively. Overall, our meta-dataset provides a robust tool for investigating colorectal adenoma formation and malignant transformation at the transcriptional level with a pipeline that is modular and readily adaptable for similar analyses in other cancer types. |
format | Online Article Text |
id | pubmed-8358057 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-83580572021-08-19 A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression Rohr, Michael Beardsley, Jordan Nakkina, Sai Preethi Zhu, Xiang Aljabban, Jihad Hadley, Dexter Altomare, Deborah Sci Data Data Descriptor Transcriptional profiling of pre- and post-malignant colorectal cancer (CRC) lesions enable temporal monitoring of molecular events underlying neoplastic progression. However, the most widely used transcriptomic dataset for CRC, TCGA-COAD, is devoid of adenoma samples, which increases reliance on an assortment of disparate microarray studies and hinders consensus building. To address this, we developed a microarray meta-dataset comprising 231 healthy, 132 adenoma, and 342 CRC tissue samples from twelve independent studies. Utilizing a stringent analytic framework, select datasets were downloaded from the Gene Expression Omnibus, normalized by frozen robust multiarray averaging and subsequently merged. Batch effects were then identified and removed by empirical Bayes estimation (ComBat). Finally, the meta-dataset was filtered for low variant probes, enabling downstream differential expression as well as quantitative and functional validation through cross-platform correlation and enrichment analyses, respectively. Overall, our meta-dataset provides a robust tool for investigating colorectal adenoma formation and malignant transformation at the transcriptional level with a pipeline that is modular and readily adaptable for similar analyses in other cancer types. Nature Publishing Group UK 2021-08-11 /pmc/articles/PMC8358057/ /pubmed/34381057 http://dx.doi.org/10.1038/s41597-021-00998-5 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) applies to the metadata files associated with this article. |
spellingShingle | Data Descriptor Rohr, Michael Beardsley, Jordan Nakkina, Sai Preethi Zhu, Xiang Aljabban, Jihad Hadley, Dexter Altomare, Deborah A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression |
title | A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression |
title_full | A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression |
title_fullStr | A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression |
title_full_unstemmed | A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression |
title_short | A merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression |
title_sort | merged microarray meta-dataset for transcriptionally profiling colorectal neoplasm formation and progression |
topic | Data Descriptor |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8358057/ https://www.ncbi.nlm.nih.gov/pubmed/34381057 http://dx.doi.org/10.1038/s41597-021-00998-5 |
work_keys_str_mv | AT rohrmichael amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT beardsleyjordan amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT nakkinasaipreethi amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT zhuxiang amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT aljabbanjihad amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT hadleydexter amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT altomaredeborah amergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT rohrmichael mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT beardsleyjordan mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT nakkinasaipreethi mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT zhuxiang mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT aljabbanjihad mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT hadleydexter mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression AT altomaredeborah mergedmicroarraymetadatasetfortranscriptionallyprofilingcolorectalneoplasmformationandprogression |