Cargando…

A university map of course knowledge

Knowledge representation has gained in relevance as data from the ubiquitous digitization of behaviors amass and academia and industry seek methods to understand and reason about the information they encode. Success in this pursuit has emerged with data from natural language, where skip-grams and ot...

Descripción completa

Detalles Bibliográficos
Autores principales: Pardos, Zachary A., Nam, Andrew Joo Hun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7526902/
https://www.ncbi.nlm.nih.gov/pubmed/32997664
http://dx.doi.org/10.1371/journal.pone.0233207
_version_ 1783588943160147968
author Pardos, Zachary A.
Nam, Andrew Joo Hun
author_facet Pardos, Zachary A.
Nam, Andrew Joo Hun
author_sort Pardos, Zachary A.
collection PubMed
description Knowledge representation has gained in relevance as data from the ubiquitous digitization of behaviors amass and academia and industry seek methods to understand and reason about the information they encode. Success in this pursuit has emerged with data from natural language, where skip-grams and other linear connectionist models of distributed representation have surfaced scrutable relational structures which have also served as artifacts of anthropological interest. Natural language is, however, only a fraction of the big data deluge. Here we show that latent semantic structure can be informed by behavioral data and that domain knowledge can be extracted from this structure through visualization and a novel mapping of the text descriptions of elements onto this behaviorally informed representation. In this study, we use the course enrollment histories of 124,000 students at a public university to learn vector representations of its courses. From these course selection informed representations, a notable 88% of course attribute information was recovered, as well as 40% of course relationships constructed from prior domain knowledge and evaluated by analogy (e.g., Math 1B is to Honors Math 1B as Physics 7B is to Honors Physics 7B). To aid in interpretation of the learned structure, we create a semantic interpolation, translating course vectors to a bag-of-words of their respective catalog descriptions via regression. We find that representations learned from enrollment histories resolved courses to a level of semantic fidelity exceeding that of their catalog descriptions, revealing nuanced content differences between similar courses, as well as accurately describing departments the dataset had no course descriptions for. We end with a discussion of the possible mechanisms by which this semantic structure may be informed and implications for the nascent research and practice of data science.
format Online
Article
Text
id pubmed-7526902
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-75269022020-10-06 A university map of course knowledge Pardos, Zachary A. Nam, Andrew Joo Hun PLoS One Research Article Knowledge representation has gained in relevance as data from the ubiquitous digitization of behaviors amass and academia and industry seek methods to understand and reason about the information they encode. Success in this pursuit has emerged with data from natural language, where skip-grams and other linear connectionist models of distributed representation have surfaced scrutable relational structures which have also served as artifacts of anthropological interest. Natural language is, however, only a fraction of the big data deluge. Here we show that latent semantic structure can be informed by behavioral data and that domain knowledge can be extracted from this structure through visualization and a novel mapping of the text descriptions of elements onto this behaviorally informed representation. In this study, we use the course enrollment histories of 124,000 students at a public university to learn vector representations of its courses. From these course selection informed representations, a notable 88% of course attribute information was recovered, as well as 40% of course relationships constructed from prior domain knowledge and evaluated by analogy (e.g., Math 1B is to Honors Math 1B as Physics 7B is to Honors Physics 7B). To aid in interpretation of the learned structure, we create a semantic interpolation, translating course vectors to a bag-of-words of their respective catalog descriptions via regression. We find that representations learned from enrollment histories resolved courses to a level of semantic fidelity exceeding that of their catalog descriptions, revealing nuanced content differences between similar courses, as well as accurately describing departments the dataset had no course descriptions for. We end with a discussion of the possible mechanisms by which this semantic structure may be informed and implications for the nascent research and practice of data science. Public Library of Science 2020-09-30 /pmc/articles/PMC7526902/ /pubmed/32997664 http://dx.doi.org/10.1371/journal.pone.0233207 Text en © 2020 Pardos, Nam http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Pardos, Zachary A.
Nam, Andrew Joo Hun
A university map of course knowledge
title A university map of course knowledge
title_full A university map of course knowledge
title_fullStr A university map of course knowledge
title_full_unstemmed A university map of course knowledge
title_short A university map of course knowledge
title_sort university map of course knowledge
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7526902/
https://www.ncbi.nlm.nih.gov/pubmed/32997664
http://dx.doi.org/10.1371/journal.pone.0233207
work_keys_str_mv AT pardoszacharya auniversitymapofcourseknowledge
AT namandrewjoohun auniversitymapofcourseknowledge
AT pardoszacharya universitymapofcourseknowledge
AT namandrewjoohun universitymapofcourseknowledge