Cargando…

A Procedure for Extending Input Selection Algorithms to Low Quality Data in Modelling Problems with Application to the Automatic Grading of Uploaded Assignments

When selecting relevant inputs in modeling problems with low quality data, the ranking of the most informative inputs is also uncertain. In this paper, this issue is addressed through a new procedure that allows the extending of different crisp feature selection algorithms to vague data. The partial...

Descripción completa

Detalles Bibliográficos
Autores principales: Otero, José, Palacios, Ana, Suárez, Rosario, Junco, Luis, Couso, Inés, Sánchez, Luciano
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4119680/
https://www.ncbi.nlm.nih.gov/pubmed/25114967
http://dx.doi.org/10.1155/2014/468405
_version_ 1782328994346041344
author Otero, José
Palacios, Ana
Suárez, Rosario
Junco, Luis
Couso, Inés
Sánchez, Luciano
author_facet Otero, José
Palacios, Ana
Suárez, Rosario
Junco, Luis
Couso, Inés
Sánchez, Luciano
author_sort Otero, José
collection PubMed
description When selecting relevant inputs in modeling problems with low quality data, the ranking of the most informative inputs is also uncertain. In this paper, this issue is addressed through a new procedure that allows the extending of different crisp feature selection algorithms to vague data. The partial knowledge about the ordinal of each feature is modelled by means of a possibility distribution, and a ranking is hereby applied to sort these distributions. It will be shown that this technique makes the most use of the available information in some vague datasets. The approach is demonstrated in a real-world application. In the context of massive online computer science courses, methods are sought for automatically providing the student with a qualification through code metrics. Feature selection methods are used to find the metrics involved in the most meaningful predictions. In this study, 800 source code files, collected and revised by the authors in classroom Computer Science lectures taught between 2013 and 2014, are analyzed with the proposed technique, and the most relevant metrics for the automatic grading task are discussed.
format Online
Article
Text
id pubmed-4119680
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-41196802014-08-11 A Procedure for Extending Input Selection Algorithms to Low Quality Data in Modelling Problems with Application to the Automatic Grading of Uploaded Assignments Otero, José Palacios, Ana Suárez, Rosario Junco, Luis Couso, Inés Sánchez, Luciano ScientificWorldJournal Research Article When selecting relevant inputs in modeling problems with low quality data, the ranking of the most informative inputs is also uncertain. In this paper, this issue is addressed through a new procedure that allows the extending of different crisp feature selection algorithms to vague data. The partial knowledge about the ordinal of each feature is modelled by means of a possibility distribution, and a ranking is hereby applied to sort these distributions. It will be shown that this technique makes the most use of the available information in some vague datasets. The approach is demonstrated in a real-world application. In the context of massive online computer science courses, methods are sought for automatically providing the student with a qualification through code metrics. Feature selection methods are used to find the metrics involved in the most meaningful predictions. In this study, 800 source code files, collected and revised by the authors in classroom Computer Science lectures taught between 2013 and 2014, are analyzed with the proposed technique, and the most relevant metrics for the automatic grading task are discussed. Hindawi Publishing Corporation 2014 2014-07-07 /pmc/articles/PMC4119680/ /pubmed/25114967 http://dx.doi.org/10.1155/2014/468405 Text en Copyright © 2014 José Otero et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Otero, José
Palacios, Ana
Suárez, Rosario
Junco, Luis
Couso, Inés
Sánchez, Luciano
A Procedure for Extending Input Selection Algorithms to Low Quality Data in Modelling Problems with Application to the Automatic Grading of Uploaded Assignments
title A Procedure for Extending Input Selection Algorithms to Low Quality Data in Modelling Problems with Application to the Automatic Grading of Uploaded Assignments
title_full A Procedure for Extending Input Selection Algorithms to Low Quality Data in Modelling Problems with Application to the Automatic Grading of Uploaded Assignments
title_fullStr A Procedure for Extending Input Selection Algorithms to Low Quality Data in Modelling Problems with Application to the Automatic Grading of Uploaded Assignments
title_full_unstemmed A Procedure for Extending Input Selection Algorithms to Low Quality Data in Modelling Problems with Application to the Automatic Grading of Uploaded Assignments
title_short A Procedure for Extending Input Selection Algorithms to Low Quality Data in Modelling Problems with Application to the Automatic Grading of Uploaded Assignments
title_sort procedure for extending input selection algorithms to low quality data in modelling problems with application to the automatic grading of uploaded assignments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4119680/
https://www.ncbi.nlm.nih.gov/pubmed/25114967
http://dx.doi.org/10.1155/2014/468405
work_keys_str_mv AT oterojose aprocedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT palaciosana aprocedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT suarezrosario aprocedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT juncoluis aprocedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT cousoines aprocedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT sanchezluciano aprocedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT oterojose procedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT palaciosana procedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT suarezrosario procedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT juncoluis procedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT cousoines procedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments
AT sanchezluciano procedureforextendinginputselectionalgorithmstolowqualitydatainmodellingproblemswithapplicationtotheautomaticgradingofuploadedassignments