Cargando…

A transformer fine-tuning strategy for text dialect identification

Online medical consultation can significantly improve the efficiency of primary health care. Recently, many online medical question–answer services have been developed that connect the patients with relevant medical consultants based on their questions. Considering the linguistic variety in their qu...

Descripción completa

Detalles Bibliográficos
Autores principales: Humayun, Mohammad Ali, Yassin, Hayati, Shuja, Junaid, Alourani, Abdullah, Abas, Pg Emeroylariffion
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer London 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9665018/
https://www.ncbi.nlm.nih.gov/pubmed/36408287
http://dx.doi.org/10.1007/s00521-022-07944-5
_version_ 1784831206052855808
author Humayun, Mohammad Ali
Yassin, Hayati
Shuja, Junaid
Alourani, Abdullah
Abas, Pg Emeroylariffion
author_facet Humayun, Mohammad Ali
Yassin, Hayati
Shuja, Junaid
Alourani, Abdullah
Abas, Pg Emeroylariffion
author_sort Humayun, Mohammad Ali
collection PubMed
description Online medical consultation can significantly improve the efficiency of primary health care. Recently, many online medical question–answer services have been developed that connect the patients with relevant medical consultants based on their questions. Considering the linguistic variety in their question, social background identification of patients can improve the referral system by selecting a medical consultant with a similar social origin for efficient communication. This paper has proposed a novel fine-tuning strategy for the pre-trained transformers to identify the social origin of text authors. When fused with the existing adapter model, the proposed methods achieve an overall accuracy of 53.96% for the Arabic dialect identification task on the Nuanced Arabic Dialect Identification (NADI) dataset. The overall accuracy is 0.54% higher than the previous best for the same dataset, which establishes the utility of custom fine-tuning strategies for pre-trained transformer models.
format Online
Article
Text
id pubmed-9665018
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Springer London
record_format MEDLINE/PubMed
spelling pubmed-96650182022-11-16 A transformer fine-tuning strategy for text dialect identification Humayun, Mohammad Ali Yassin, Hayati Shuja, Junaid Alourani, Abdullah Abas, Pg Emeroylariffion Neural Comput Appl Original Article Online medical consultation can significantly improve the efficiency of primary health care. Recently, many online medical question–answer services have been developed that connect the patients with relevant medical consultants based on their questions. Considering the linguistic variety in their question, social background identification of patients can improve the referral system by selecting a medical consultant with a similar social origin for efficient communication. This paper has proposed a novel fine-tuning strategy for the pre-trained transformers to identify the social origin of text authors. When fused with the existing adapter model, the proposed methods achieve an overall accuracy of 53.96% for the Arabic dialect identification task on the Nuanced Arabic Dialect Identification (NADI) dataset. The overall accuracy is 0.54% higher than the previous best for the same dataset, which establishes the utility of custom fine-tuning strategies for pre-trained transformer models. Springer London 2022-11-15 2023 /pmc/articles/PMC9665018/ /pubmed/36408287 http://dx.doi.org/10.1007/s00521-022-07944-5 Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic.
spellingShingle Original Article
Humayun, Mohammad Ali
Yassin, Hayati
Shuja, Junaid
Alourani, Abdullah
Abas, Pg Emeroylariffion
A transformer fine-tuning strategy for text dialect identification
title A transformer fine-tuning strategy for text dialect identification
title_full A transformer fine-tuning strategy for text dialect identification
title_fullStr A transformer fine-tuning strategy for text dialect identification
title_full_unstemmed A transformer fine-tuning strategy for text dialect identification
title_short A transformer fine-tuning strategy for text dialect identification
title_sort transformer fine-tuning strategy for text dialect identification
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9665018/
https://www.ncbi.nlm.nih.gov/pubmed/36408287
http://dx.doi.org/10.1007/s00521-022-07944-5
work_keys_str_mv AT humayunmohammadali atransformerfinetuningstrategyfortextdialectidentification
AT yassinhayati atransformerfinetuningstrategyfortextdialectidentification
AT shujajunaid atransformerfinetuningstrategyfortextdialectidentification
AT alouraniabdullah atransformerfinetuningstrategyfortextdialectidentification
AT abaspgemeroylariffion atransformerfinetuningstrategyfortextdialectidentification
AT humayunmohammadali transformerfinetuningstrategyfortextdialectidentification
AT yassinhayati transformerfinetuningstrategyfortextdialectidentification
AT shujajunaid transformerfinetuningstrategyfortextdialectidentification
AT alouraniabdullah transformerfinetuningstrategyfortextdialectidentification
AT abaspgemeroylariffion transformerfinetuningstrategyfortextdialectidentification