Cargando…

Data science technology course: The design, assessment and computing environment perspectives

This article discusses the key elements of the Data Science Technology course offered to postgraduate students enrolled in the Master of Data Science program. This course complements the existing curriculum by providing the skills to handle the Big Data platform and tools, in addition to data scienc...

Descripción completa

Detalles Bibliográficos
Autores principales: Ismail, Azlan, Mutalib, Sofianita, Haron, Haryani
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9871418/
https://www.ncbi.nlm.nih.gov/pubmed/36714440
http://dx.doi.org/10.1007/s10639-022-11558-8
_version_ 1784877167316828160
author Ismail, Azlan
Mutalib, Sofianita
Haron, Haryani
author_facet Ismail, Azlan
Mutalib, Sofianita
Haron, Haryani
author_sort Ismail, Azlan
collection PubMed
description This article discusses the key elements of the Data Science Technology course offered to postgraduate students enrolled in the Master of Data Science program. This course complements the existing curriculum by providing the skills to handle the Big Data platform and tools, in addition to data science activities. We tackle the discussion about this course based on three main requirements, which are related to the need to exploit the key skills from two dimensions, namely, Data Science and Big Data, and the need for a cluster-based computing platform and its accessibility. We address these requirements by presenting the course design and its assessments, the configuration of the computing platform, and the strategy to enable flexible accessibility. In terms of course design, the offered course contributes to several innovative elements and has covered multiple key areas of the data science body of knowledge and multiple quadrants of the job and skills matrix. In the case of the computing platform, a stable deployment of a Hadoop cluster with flexible accessibility, triggered by the pandemic situation, has been established. Furthermore, through our experience with the implementation of the cluster, it has shown the ability of the cluster to handle computing problems with a larger dataset than the one used for the semesters within the scope of the study. We also provide some reflections and highlight future improvements.
format Online
Article
Text
id pubmed-9871418
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-98714182023-01-25 Data science technology course: The design, assessment and computing environment perspectives Ismail, Azlan Mutalib, Sofianita Haron, Haryani Educ Inf Technol (Dordr) Article This article discusses the key elements of the Data Science Technology course offered to postgraduate students enrolled in the Master of Data Science program. This course complements the existing curriculum by providing the skills to handle the Big Data platform and tools, in addition to data science activities. We tackle the discussion about this course based on three main requirements, which are related to the need to exploit the key skills from two dimensions, namely, Data Science and Big Data, and the need for a cluster-based computing platform and its accessibility. We address these requirements by presenting the course design and its assessments, the configuration of the computing platform, and the strategy to enable flexible accessibility. In terms of course design, the offered course contributes to several innovative elements and has covered multiple key areas of the data science body of knowledge and multiple quadrants of the job and skills matrix. In the case of the computing platform, a stable deployment of a Hadoop cluster with flexible accessibility, triggered by the pandemic situation, has been established. Furthermore, through our experience with the implementation of the cluster, it has shown the ability of the cluster to handle computing problems with a larger dataset than the one used for the semesters within the scope of the study. We also provide some reflections and highlight future improvements. Springer US 2023-01-24 /pmc/articles/PMC9871418/ /pubmed/36714440 http://dx.doi.org/10.1007/s10639-022-11558-8 Text en © The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Article
Ismail, Azlan
Mutalib, Sofianita
Haron, Haryani
Data science technology course: The design, assessment and computing environment perspectives
title Data science technology course: The design, assessment and computing environment perspectives
title_full Data science technology course: The design, assessment and computing environment perspectives
title_fullStr Data science technology course: The design, assessment and computing environment perspectives
title_full_unstemmed Data science technology course: The design, assessment and computing environment perspectives
title_short Data science technology course: The design, assessment and computing environment perspectives
title_sort data science technology course: the design, assessment and computing environment perspectives
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9871418/
https://www.ncbi.nlm.nih.gov/pubmed/36714440
http://dx.doi.org/10.1007/s10639-022-11558-8
work_keys_str_mv AT ismailazlan datasciencetechnologycoursethedesignassessmentandcomputingenvironmentperspectives
AT mutalibsofianita datasciencetechnologycoursethedesignassessmentandcomputingenvironmentperspectives
AT haronharyani datasciencetechnologycoursethedesignassessmentandcomputingenvironmentperspectives