Cargando…
A transformer fine-tuning strategy for text dialect identification
Online medical consultation can significantly improve the efficiency of primary health care. Recently, many online medical question–answer services have been developed that connect the patients with relevant medical consultants based on their questions. Considering the linguistic variety in their qu...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer London
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9665018/ https://www.ncbi.nlm.nih.gov/pubmed/36408287 http://dx.doi.org/10.1007/s00521-022-07944-5 |
_version_ | 1784831206052855808 |
---|---|
author | Humayun, Mohammad Ali Yassin, Hayati Shuja, Junaid Alourani, Abdullah Abas, Pg Emeroylariffion |
author_facet | Humayun, Mohammad Ali Yassin, Hayati Shuja, Junaid Alourani, Abdullah Abas, Pg Emeroylariffion |
author_sort | Humayun, Mohammad Ali |
collection | PubMed |
description | Online medical consultation can significantly improve the efficiency of primary health care. Recently, many online medical question–answer services have been developed that connect the patients with relevant medical consultants based on their questions. Considering the linguistic variety in their question, social background identification of patients can improve the referral system by selecting a medical consultant with a similar social origin for efficient communication. This paper has proposed a novel fine-tuning strategy for the pre-trained transformers to identify the social origin of text authors. When fused with the existing adapter model, the proposed methods achieve an overall accuracy of 53.96% for the Arabic dialect identification task on the Nuanced Arabic Dialect Identification (NADI) dataset. The overall accuracy is 0.54% higher than the previous best for the same dataset, which establishes the utility of custom fine-tuning strategies for pre-trained transformer models. |
format | Online Article Text |
id | pubmed-9665018 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Springer London |
record_format | MEDLINE/PubMed |
spelling | pubmed-96650182022-11-16 A transformer fine-tuning strategy for text dialect identification Humayun, Mohammad Ali Yassin, Hayati Shuja, Junaid Alourani, Abdullah Abas, Pg Emeroylariffion Neural Comput Appl Original Article Online medical consultation can significantly improve the efficiency of primary health care. Recently, many online medical question–answer services have been developed that connect the patients with relevant medical consultants based on their questions. Considering the linguistic variety in their question, social background identification of patients can improve the referral system by selecting a medical consultant with a similar social origin for efficient communication. This paper has proposed a novel fine-tuning strategy for the pre-trained transformers to identify the social origin of text authors. When fused with the existing adapter model, the proposed methods achieve an overall accuracy of 53.96% for the Arabic dialect identification task on the Nuanced Arabic Dialect Identification (NADI) dataset. The overall accuracy is 0.54% higher than the previous best for the same dataset, which establishes the utility of custom fine-tuning strategies for pre-trained transformer models. Springer London 2022-11-15 2023 /pmc/articles/PMC9665018/ /pubmed/36408287 http://dx.doi.org/10.1007/s00521-022-07944-5 Text en © The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2022, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law. This article is made available via the PMC Open Access Subset for unrestricted research re-use and secondary analysis in any form or by any means with acknowledgement of the original source. These permissions are granted for the duration of the World Health Organization (WHO) declaration of COVID-19 as a global pandemic. |
spellingShingle | Original Article Humayun, Mohammad Ali Yassin, Hayati Shuja, Junaid Alourani, Abdullah Abas, Pg Emeroylariffion A transformer fine-tuning strategy for text dialect identification |
title | A transformer fine-tuning strategy for text dialect identification |
title_full | A transformer fine-tuning strategy for text dialect identification |
title_fullStr | A transformer fine-tuning strategy for text dialect identification |
title_full_unstemmed | A transformer fine-tuning strategy for text dialect identification |
title_short | A transformer fine-tuning strategy for text dialect identification |
title_sort | transformer fine-tuning strategy for text dialect identification |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9665018/ https://www.ncbi.nlm.nih.gov/pubmed/36408287 http://dx.doi.org/10.1007/s00521-022-07944-5 |
work_keys_str_mv | AT humayunmohammadali atransformerfinetuningstrategyfortextdialectidentification AT yassinhayati atransformerfinetuningstrategyfortextdialectidentification AT shujajunaid atransformerfinetuningstrategyfortextdialectidentification AT alouraniabdullah atransformerfinetuningstrategyfortextdialectidentification AT abaspgemeroylariffion atransformerfinetuningstrategyfortextdialectidentification AT humayunmohammadali transformerfinetuningstrategyfortextdialectidentification AT yassinhayati transformerfinetuningstrategyfortextdialectidentification AT shujajunaid transformerfinetuningstrategyfortextdialectidentification AT alouraniabdullah transformerfinetuningstrategyfortextdialectidentification AT abaspgemeroylariffion transformerfinetuningstrategyfortextdialectidentification |