Cargando…
Teaching genomics to life science undergraduates using cloud computing platforms with open datasets
The final year of a biochemistry degree is usually a time to experience research. However, laboratory‐based research projects were not possible during COVID‐19. Instead, we used open datasets to provide computational research projects in metagenomics to biochemistry undergraduates (80 students with...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley & Sons, Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9804627/ https://www.ncbi.nlm.nih.gov/pubmed/35972192 http://dx.doi.org/10.1002/bmb.21646 |
_version_ | 1784862153742745600 |
---|---|
author | Poolman, Toryn M. Townsend‐Nicholson, Andrea Cain, Amanda |
author_facet | Poolman, Toryn M. Townsend‐Nicholson, Andrea Cain, Amanda |
author_sort | Poolman, Toryn M. |
collection | PubMed |
description | The final year of a biochemistry degree is usually a time to experience research. However, laboratory‐based research projects were not possible during COVID‐19. Instead, we used open datasets to provide computational research projects in metagenomics to biochemistry undergraduates (80 students with limited computing experience). We aimed to give the students a chance to explore any dataset, rather than use a small number of artificial datasets (~60 published datasets were used). To achieve this, we utilized Google Colaboratory (Colab), a virtual computing environment. Colab was used as a framework to retrieve raw sequencing data (analyzed with QIIME2) and generate visualizations. Setting up the environment requires no prior experience; all students have the same drive structure and notebooks can be shared (for synchronous sessions). We also used the platform to combine multiple datasets, perform a meta‐analysis, and allowed the students to analyze large datasets with 1000s of subjects and factors. Projects that required increased computational resources were integrated with Google Cloud Compute. In future, all research projects can include some aspects of reanalyzing public data, providing students with data science experience. Colab is also an excellent environment in which to develop data skills in multiple languages (e.g., Perl, Python, Julia). |
format | Online Article Text |
id | pubmed-9804627 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | John Wiley & Sons, Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-98046272023-01-06 Teaching genomics to life science undergraduates using cloud computing platforms with open datasets Poolman, Toryn M. Townsend‐Nicholson, Andrea Cain, Amanda Biochem Mol Biol Educ Covid‐19 The final year of a biochemistry degree is usually a time to experience research. However, laboratory‐based research projects were not possible during COVID‐19. Instead, we used open datasets to provide computational research projects in metagenomics to biochemistry undergraduates (80 students with limited computing experience). We aimed to give the students a chance to explore any dataset, rather than use a small number of artificial datasets (~60 published datasets were used). To achieve this, we utilized Google Colaboratory (Colab), a virtual computing environment. Colab was used as a framework to retrieve raw sequencing data (analyzed with QIIME2) and generate visualizations. Setting up the environment requires no prior experience; all students have the same drive structure and notebooks can be shared (for synchronous sessions). We also used the platform to combine multiple datasets, perform a meta‐analysis, and allowed the students to analyze large datasets with 1000s of subjects and factors. Projects that required increased computational resources were integrated with Google Cloud Compute. In future, all research projects can include some aspects of reanalyzing public data, providing students with data science experience. Colab is also an excellent environment in which to develop data skills in multiple languages (e.g., Perl, Python, Julia). John Wiley & Sons, Inc. 2022-08-16 2022 /pmc/articles/PMC9804627/ /pubmed/35972192 http://dx.doi.org/10.1002/bmb.21646 Text en © 2022 The Authors. Biochemistry and Molecular Biology Education published by Wiley Periodicals LLC on behalf of International Union of Biochemistry and Molecular Biology. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Covid‐19 Poolman, Toryn M. Townsend‐Nicholson, Andrea Cain, Amanda Teaching genomics to life science undergraduates using cloud computing platforms with open datasets |
title | Teaching genomics to life science undergraduates using cloud computing platforms with open datasets |
title_full | Teaching genomics to life science undergraduates using cloud computing platforms with open datasets |
title_fullStr | Teaching genomics to life science undergraduates using cloud computing platforms with open datasets |
title_full_unstemmed | Teaching genomics to life science undergraduates using cloud computing platforms with open datasets |
title_short | Teaching genomics to life science undergraduates using cloud computing platforms with open datasets |
title_sort | teaching genomics to life science undergraduates using cloud computing platforms with open datasets |
topic | Covid‐19 |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9804627/ https://www.ncbi.nlm.nih.gov/pubmed/35972192 http://dx.doi.org/10.1002/bmb.21646 |
work_keys_str_mv | AT poolmantorynm teachinggenomicstolifescienceundergraduatesusingcloudcomputingplatformswithopendatasets AT townsendnicholsonandrea teachinggenomicstolifescienceundergraduatesusingcloudcomputingplatformswithopendatasets AT cainamanda teachinggenomicstolifescienceundergraduatesusingcloudcomputingplatformswithopendatasets |