Cargando…
Identifying dyslexia in school pupils from eye movement and demographic data using artificial intelligence
This paper represents our research results in the pursuit of the following objectives: (i) to introduce a novel multi-sources data set to tackle the shortcomings of the previous data sets, (ii) to propose a robust artificial intelligence-based solution to identify dyslexia in primary school pupils,...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10664902/ https://www.ncbi.nlm.nih.gov/pubmed/37992041 http://dx.doi.org/10.1371/journal.pone.0292047 |
_version_ | 1785148807764246528 |
---|---|
author | Shalileh, Soroosh Ignatov, Dmitry Lopukhina, Anastasiya Dragoy, Olga |
author_facet | Shalileh, Soroosh Ignatov, Dmitry Lopukhina, Anastasiya Dragoy, Olga |
author_sort | Shalileh, Soroosh |
collection | PubMed |
description | This paper represents our research results in the pursuit of the following objectives: (i) to introduce a novel multi-sources data set to tackle the shortcomings of the previous data sets, (ii) to propose a robust artificial intelligence-based solution to identify dyslexia in primary school pupils, (iii) to investigate our psycholinguistic knowledge by studying the importance of the features in identifying dyslexia by our best AI model. In order to achieve the first objective, we collected and annotated a new set of eye-movement-during-reading data. Furthermore, we collected demographic data, including the measure of non-verbal intelligence, to form our three data sources. Our data set is the largest eye-movement data set globally. Unlike the previously introduced binary-class data sets, it contains (A) three class labels and (B) reading speed. Concerning the second objective, we formulated the task of dyslexia prediction as regression and classification problems and scrutinized the performance of 12 classifications and eight regressions approaches. We exploited the Bayesian optimization method to fine-tune the hyperparameters of the models: and reported the average and the standard deviation of our evaluation metrics in a stratified ten-fold cross-validation. Our studies showed that multi-layer perceptron, random forest, gradient boosting, and k-nearest neighbor form the group having the most acceptable results. Moreover, we showed that although separately using each data source did not lead to accurate results, their combination led to a reliable solution. We also determined the importance of the features of our best classifier: our findings showed that the IQ, gender, and age are the top three important features; we also showed that fixation along the y-axis is more important than other fixation data. Dyslexia detection, eye fixation, eye movement, demographic, classification, regression, artificial intelligence. |
format | Online Article Text |
id | pubmed-10664902 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-106649022023-11-22 Identifying dyslexia in school pupils from eye movement and demographic data using artificial intelligence Shalileh, Soroosh Ignatov, Dmitry Lopukhina, Anastasiya Dragoy, Olga PLoS One Research Article This paper represents our research results in the pursuit of the following objectives: (i) to introduce a novel multi-sources data set to tackle the shortcomings of the previous data sets, (ii) to propose a robust artificial intelligence-based solution to identify dyslexia in primary school pupils, (iii) to investigate our psycholinguistic knowledge by studying the importance of the features in identifying dyslexia by our best AI model. In order to achieve the first objective, we collected and annotated a new set of eye-movement-during-reading data. Furthermore, we collected demographic data, including the measure of non-verbal intelligence, to form our three data sources. Our data set is the largest eye-movement data set globally. Unlike the previously introduced binary-class data sets, it contains (A) three class labels and (B) reading speed. Concerning the second objective, we formulated the task of dyslexia prediction as regression and classification problems and scrutinized the performance of 12 classifications and eight regressions approaches. We exploited the Bayesian optimization method to fine-tune the hyperparameters of the models: and reported the average and the standard deviation of our evaluation metrics in a stratified ten-fold cross-validation. Our studies showed that multi-layer perceptron, random forest, gradient boosting, and k-nearest neighbor form the group having the most acceptable results. Moreover, we showed that although separately using each data source did not lead to accurate results, their combination led to a reliable solution. We also determined the importance of the features of our best classifier: our findings showed that the IQ, gender, and age are the top three important features; we also showed that fixation along the y-axis is more important than other fixation data. Dyslexia detection, eye fixation, eye movement, demographic, classification, regression, artificial intelligence. Public Library of Science 2023-11-22 /pmc/articles/PMC10664902/ /pubmed/37992041 http://dx.doi.org/10.1371/journal.pone.0292047 Text en © 2023 Shalileh et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Shalileh, Soroosh Ignatov, Dmitry Lopukhina, Anastasiya Dragoy, Olga Identifying dyslexia in school pupils from eye movement and demographic data using artificial intelligence |
title | Identifying dyslexia in school pupils from eye movement and demographic data using artificial intelligence |
title_full | Identifying dyslexia in school pupils from eye movement and demographic data using artificial intelligence |
title_fullStr | Identifying dyslexia in school pupils from eye movement and demographic data using artificial intelligence |
title_full_unstemmed | Identifying dyslexia in school pupils from eye movement and demographic data using artificial intelligence |
title_short | Identifying dyslexia in school pupils from eye movement and demographic data using artificial intelligence |
title_sort | identifying dyslexia in school pupils from eye movement and demographic data using artificial intelligence |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10664902/ https://www.ncbi.nlm.nih.gov/pubmed/37992041 http://dx.doi.org/10.1371/journal.pone.0292047 |
work_keys_str_mv | AT shalilehsoroosh identifyingdyslexiainschoolpupilsfromeyemovementanddemographicdatausingartificialintelligence AT ignatovdmitry identifyingdyslexiainschoolpupilsfromeyemovementanddemographicdatausingartificialintelligence AT lopukhinaanastasiya identifyingdyslexiainschoolpupilsfromeyemovementanddemographicdatausingartificialintelligence AT dragoyolga identifyingdyslexiainschoolpupilsfromeyemovementanddemographicdatausingartificialintelligence |