Cargando…

Hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models

DRC (Coltheart et al., 2001) and CDP++ (Perry et al., 2010) are two of the most successful models of reading aloud. These models differ primarily in how their sublexical systems convert letter strings into phonological codes. DRC adopts a set of grapheme-to-phoneme conversion rules (GPCs) while CDP+...

Descripción completa

Detalles Bibliográficos
Autores principales: Robidoux, Serje, Pritchard, Stephen C.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3978355/
https://www.ncbi.nlm.nih.gov/pubmed/24744745
http://dx.doi.org/10.3389/fpsyg.2014.00267
_version_ 1782310551948361728
author Robidoux, Serje
Pritchard, Stephen C.
author_facet Robidoux, Serje
Pritchard, Stephen C.
author_sort Robidoux, Serje
collection PubMed
description DRC (Coltheart et al., 2001) and CDP++ (Perry et al., 2010) are two of the most successful models of reading aloud. These models differ primarily in how their sublexical systems convert letter strings into phonological codes. DRC adopts a set of grapheme-to-phoneme conversion rules (GPCs) while CDP++ uses a simple trained network that has been exposed to a combination of rules and the spellings and pronunciations of known words. Thus far the debate between fixed rules and learned associations has largely emphasized reaction time experiments, error rates in dyslexias, and item-level variance from large-scale databases. Recently, Pritchard et al. (2012) examined the models' non-word reading in a new way. They compared responses produced by the models to those produced by 45 skilled readers. Their item-by-item analysis is informative, but leaves open some questions that can be addressed with a different technique. Using hierarchical clustering techniques, we first examined the subject data to identify if there are classes of subjects that are similar to each other in their overall response profiles. We found that there are indeed two groups of subject that differ in their pronunciations for certain consonant clusters. We also tested the possibility that CDP++ is modeling one set of subjects well, while DRC is modeling a different set of subjects. We found that CDP++ does not fit any human reader's response pattern very well, while DRC fits the human readers as well as or better than any other reader.
format Online
Article
Text
id pubmed-3978355
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-39783552014-04-17 Hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models Robidoux, Serje Pritchard, Stephen C. Front Psychol Psychology DRC (Coltheart et al., 2001) and CDP++ (Perry et al., 2010) are two of the most successful models of reading aloud. These models differ primarily in how their sublexical systems convert letter strings into phonological codes. DRC adopts a set of grapheme-to-phoneme conversion rules (GPCs) while CDP++ uses a simple trained network that has been exposed to a combination of rules and the spellings and pronunciations of known words. Thus far the debate between fixed rules and learned associations has largely emphasized reaction time experiments, error rates in dyslexias, and item-level variance from large-scale databases. Recently, Pritchard et al. (2012) examined the models' non-word reading in a new way. They compared responses produced by the models to those produced by 45 skilled readers. Their item-by-item analysis is informative, but leaves open some questions that can be addressed with a different technique. Using hierarchical clustering techniques, we first examined the subject data to identify if there are classes of subjects that are similar to each other in their overall response profiles. We found that there are indeed two groups of subject that differ in their pronunciations for certain consonant clusters. We also tested the possibility that CDP++ is modeling one set of subjects well, while DRC is modeling a different set of subjects. We found that CDP++ does not fit any human reader's response pattern very well, while DRC fits the human readers as well as or better than any other reader. Frontiers Media S.A. 2014-03-31 /pmc/articles/PMC3978355/ /pubmed/24744745 http://dx.doi.org/10.3389/fpsyg.2014.00267 Text en Copyright © 2014 Robidoux and Pritchard. http://creativecommons.org/licenses/by/3.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Robidoux, Serje
Pritchard, Stephen C.
Hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models
title Hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models
title_full Hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models
title_fullStr Hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models
title_full_unstemmed Hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models
title_short Hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models
title_sort hierarchical clustering analysis of reading aloud data: a new technique for evaluating the performance of computational models
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3978355/
https://www.ncbi.nlm.nih.gov/pubmed/24744745
http://dx.doi.org/10.3389/fpsyg.2014.00267
work_keys_str_mv AT robidouxserje hierarchicalclusteringanalysisofreadingalouddataanewtechniqueforevaluatingtheperformanceofcomputationalmodels
AT pritchardstephenc hierarchicalclusteringanalysisofreadingalouddataanewtechniqueforevaluatingtheperformanceofcomputationalmodels