Cargando…

Teaching genomics to life science undergraduates using cloud computing platforms with open datasets

The final year of a biochemistry degree is usually a time to experience research. However, laboratory‐based research projects were not possible during COVID‐19. Instead, we used open datasets to provide computational research projects in metagenomics to biochemistry undergraduates (80 students with...

Descripción completa

Detalles Bibliográficos
Autores principales: Poolman, Toryn M., Townsend‐Nicholson, Andrea, Cain, Amanda
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9804627/
https://www.ncbi.nlm.nih.gov/pubmed/35972192
http://dx.doi.org/10.1002/bmb.21646
_version_ 1784862153742745600
author Poolman, Toryn M.
Townsend‐Nicholson, Andrea
Cain, Amanda
author_facet Poolman, Toryn M.
Townsend‐Nicholson, Andrea
Cain, Amanda
author_sort Poolman, Toryn M.
collection PubMed
description The final year of a biochemistry degree is usually a time to experience research. However, laboratory‐based research projects were not possible during COVID‐19. Instead, we used open datasets to provide computational research projects in metagenomics to biochemistry undergraduates (80 students with limited computing experience). We aimed to give the students a chance to explore any dataset, rather than use a small number of artificial datasets (~60 published datasets were used). To achieve this, we utilized Google Colaboratory (Colab), a virtual computing environment. Colab was used as a framework to retrieve raw sequencing data (analyzed with QIIME2) and generate visualizations. Setting up the environment requires no prior experience; all students have the same drive structure and notebooks can be shared (for synchronous sessions). We also used the platform to combine multiple datasets, perform a meta‐analysis, and allowed the students to analyze large datasets with 1000s of subjects and factors. Projects that required increased computational resources were integrated with Google Cloud Compute. In future, all research projects can include some aspects of reanalyzing public data, providing students with data science experience. Colab is also an excellent environment in which to develop data skills in multiple languages (e.g., Perl, Python, Julia).
format Online
Article
Text
id pubmed-9804627
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-98046272023-01-06 Teaching genomics to life science undergraduates using cloud computing platforms with open datasets Poolman, Toryn M. Townsend‐Nicholson, Andrea Cain, Amanda Biochem Mol Biol Educ Covid‐19 The final year of a biochemistry degree is usually a time to experience research. However, laboratory‐based research projects were not possible during COVID‐19. Instead, we used open datasets to provide computational research projects in metagenomics to biochemistry undergraduates (80 students with limited computing experience). We aimed to give the students a chance to explore any dataset, rather than use a small number of artificial datasets (~60 published datasets were used). To achieve this, we utilized Google Colaboratory (Colab), a virtual computing environment. Colab was used as a framework to retrieve raw sequencing data (analyzed with QIIME2) and generate visualizations. Setting up the environment requires no prior experience; all students have the same drive structure and notebooks can be shared (for synchronous sessions). We also used the platform to combine multiple datasets, perform a meta‐analysis, and allowed the students to analyze large datasets with 1000s of subjects and factors. Projects that required increased computational resources were integrated with Google Cloud Compute. In future, all research projects can include some aspects of reanalyzing public data, providing students with data science experience. Colab is also an excellent environment in which to develop data skills in multiple languages (e.g., Perl, Python, Julia). John Wiley & Sons, Inc. 2022-08-16 2022 /pmc/articles/PMC9804627/ /pubmed/35972192 http://dx.doi.org/10.1002/bmb.21646 Text en © 2022 The Authors. Biochemistry and Molecular Biology Education published by Wiley Periodicals LLC on behalf of International Union of Biochemistry and Molecular Biology. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Covid‐19
Poolman, Toryn M.
Townsend‐Nicholson, Andrea
Cain, Amanda
Teaching genomics to life science undergraduates using cloud computing platforms with open datasets
title Teaching genomics to life science undergraduates using cloud computing platforms with open datasets
title_full Teaching genomics to life science undergraduates using cloud computing platforms with open datasets
title_fullStr Teaching genomics to life science undergraduates using cloud computing platforms with open datasets
title_full_unstemmed Teaching genomics to life science undergraduates using cloud computing platforms with open datasets
title_short Teaching genomics to life science undergraduates using cloud computing platforms with open datasets
title_sort teaching genomics to life science undergraduates using cloud computing platforms with open datasets
topic Covid‐19
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9804627/
https://www.ncbi.nlm.nih.gov/pubmed/35972192
http://dx.doi.org/10.1002/bmb.21646
work_keys_str_mv AT poolmantorynm teachinggenomicstolifescienceundergraduatesusingcloudcomputingplatformswithopendatasets
AT townsendnicholsonandrea teachinggenomicstolifescienceundergraduatesusingcloudcomputingplatformswithopendatasets
AT cainamanda teachinggenomicstolifescienceundergraduatesusingcloudcomputingplatformswithopendatasets