Cargando…

Part of speech tagging of grammatical features related to L2 Chinese development: A case analysis of Stanza in the L2 writing context

Grammatical complexity has received extensive attention in second language acquisition. Although computational tools have been developed to analyze grammatical complexity, most relevant studies investigated this construct in the context of English as a second language. In response to an increasing n...

Descripción completa

Detalles Bibliográficos
Autores principales: Lan, Ge, Pan, Xiaofei, Sun, Yachao, Lu, Yuan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9976805/
https://www.ncbi.nlm.nih.gov/pubmed/36874797
http://dx.doi.org/10.3389/fpsyg.2023.1139703
_version_ 1784899160229543936
author Lan, Ge
Pan, Xiaofei
Sun, Yachao
Lu, Yuan
author_facet Lan, Ge
Pan, Xiaofei
Sun, Yachao
Lu, Yuan
author_sort Lan, Ge
collection PubMed
description Grammatical complexity has received extensive attention in second language acquisition. Although computational tools have been developed to analyze grammatical complexity, most relevant studies investigated this construct in the context of English as a second language. In response to an increasing number of L2 Chinese learners, it is important to extend the investigation of grammatical complexity in L2 Chinese. To promote relevant research, we evaluated the new computational tool, Stanza, on its accuracy of part-of-speech tagging for L2 Chinese writing. We particularly focused on eight grammatical features closely related to L2 Chinese development. Then, we reported the precisions, recalls, and F-scores for the individual grammatical features and offered a qualitative analysis of systematic tagging errors. In terms of the precision, three features have high rates, over 90% (i.e., ba and bei markers, classifiers, -de as noun modifier marker). For recall, four features have high rates, over 90% (i.e., aspect markers, ba and bei markers, classifiers, -de as noun modifier marker). Overall, based on the F-scores, Stanza has a good tagging performance on ba and bei markers, classifiers, and -de as a noun modifier marker. This evaluation provides research implications for scholars who plan to use this computational tool to study L2 Chinese development in second language acquisition or applied linguistics in general.
format Online
Article
Text
id pubmed-9976805
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-99768052023-03-02 Part of speech tagging of grammatical features related to L2 Chinese development: A case analysis of Stanza in the L2 writing context Lan, Ge Pan, Xiaofei Sun, Yachao Lu, Yuan Front Psychol Psychology Grammatical complexity has received extensive attention in second language acquisition. Although computational tools have been developed to analyze grammatical complexity, most relevant studies investigated this construct in the context of English as a second language. In response to an increasing number of L2 Chinese learners, it is important to extend the investigation of grammatical complexity in L2 Chinese. To promote relevant research, we evaluated the new computational tool, Stanza, on its accuracy of part-of-speech tagging for L2 Chinese writing. We particularly focused on eight grammatical features closely related to L2 Chinese development. Then, we reported the precisions, recalls, and F-scores for the individual grammatical features and offered a qualitative analysis of systematic tagging errors. In terms of the precision, three features have high rates, over 90% (i.e., ba and bei markers, classifiers, -de as noun modifier marker). For recall, four features have high rates, over 90% (i.e., aspect markers, ba and bei markers, classifiers, -de as noun modifier marker). Overall, based on the F-scores, Stanza has a good tagging performance on ba and bei markers, classifiers, and -de as a noun modifier marker. This evaluation provides research implications for scholars who plan to use this computational tool to study L2 Chinese development in second language acquisition or applied linguistics in general. Frontiers Media S.A. 2023-02-15 /pmc/articles/PMC9976805/ /pubmed/36874797 http://dx.doi.org/10.3389/fpsyg.2023.1139703 Text en Copyright © 2023 Lan, Pan, Sun and Lu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Psychology
Lan, Ge
Pan, Xiaofei
Sun, Yachao
Lu, Yuan
Part of speech tagging of grammatical features related to L2 Chinese development: A case analysis of Stanza in the L2 writing context
title Part of speech tagging of grammatical features related to L2 Chinese development: A case analysis of Stanza in the L2 writing context
title_full Part of speech tagging of grammatical features related to L2 Chinese development: A case analysis of Stanza in the L2 writing context
title_fullStr Part of speech tagging of grammatical features related to L2 Chinese development: A case analysis of Stanza in the L2 writing context
title_full_unstemmed Part of speech tagging of grammatical features related to L2 Chinese development: A case analysis of Stanza in the L2 writing context
title_short Part of speech tagging of grammatical features related to L2 Chinese development: A case analysis of Stanza in the L2 writing context
title_sort part of speech tagging of grammatical features related to l2 chinese development: a case analysis of stanza in the l2 writing context
topic Psychology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9976805/
https://www.ncbi.nlm.nih.gov/pubmed/36874797
http://dx.doi.org/10.3389/fpsyg.2023.1139703
work_keys_str_mv AT lange partofspeechtaggingofgrammaticalfeaturesrelatedtol2chinesedevelopmentacaseanalysisofstanzainthel2writingcontext
AT panxiaofei partofspeechtaggingofgrammaticalfeaturesrelatedtol2chinesedevelopmentacaseanalysisofstanzainthel2writingcontext
AT sunyachao partofspeechtaggingofgrammaticalfeaturesrelatedtol2chinesedevelopmentacaseanalysisofstanzainthel2writingcontext
AT luyuan partofspeechtaggingofgrammaticalfeaturesrelatedtol2chinesedevelopmentacaseanalysisofstanzainthel2writingcontext