Cargando…

Development of Hausa dataset a baseline for speech recognition

The Hausa language read-speech dataset was created by recording native Hausa speakers. The recording took place at Nile university of Nigeria audio studio and radio broadcasting studio. The recorded dataset was segmented into unigram and bigram. The Hausa speech dataset contain 47hr of recorded audi...

Descripción completa

Detalles Bibliográficos
Autores principales: Ibrahim, Umar Adam, Boukar, Moussa Mahamat, Suleiman, Muhammed Aliyu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792404/
https://www.ncbi.nlm.nih.gov/pubmed/35242895
http://dx.doi.org/10.1016/j.dib.2022.107820
_version_ 1784640354811641856
author Ibrahim, Umar Adam
Boukar, Moussa Mahamat
Suleiman, Muhammed Aliyu
author_facet Ibrahim, Umar Adam
Boukar, Moussa Mahamat
Suleiman, Muhammed Aliyu
author_sort Ibrahim, Umar Adam
collection PubMed
description The Hausa language read-speech dataset was created by recording native Hausa speakers. The recording took place at Nile university of Nigeria audio studio and radio broadcasting studio. The recorded dataset was segmented into unigram and bigram. The Hausa speech dataset contain 47hr of recorded audio speech. The dataset can be used for automatic speech recognition, speech synthesis, Text-to-Speech and speech-to-text application.
format Online
Article
Text
id pubmed-8792404
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-87924042022-03-02 Development of Hausa dataset a baseline for speech recognition Ibrahim, Umar Adam Boukar, Moussa Mahamat Suleiman, Muhammed Aliyu Data Brief Data Article The Hausa language read-speech dataset was created by recording native Hausa speakers. The recording took place at Nile university of Nigeria audio studio and radio broadcasting studio. The recorded dataset was segmented into unigram and bigram. The Hausa speech dataset contain 47hr of recorded audio speech. The dataset can be used for automatic speech recognition, speech synthesis, Text-to-Speech and speech-to-text application. Elsevier 2022-01-10 /pmc/articles/PMC8792404/ /pubmed/35242895 http://dx.doi.org/10.1016/j.dib.2022.107820 Text en © 2022 The Author(s). Published by Elsevier Inc. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Data Article
Ibrahim, Umar Adam
Boukar, Moussa Mahamat
Suleiman, Muhammed Aliyu
Development of Hausa dataset a baseline for speech recognition
title Development of Hausa dataset a baseline for speech recognition
title_full Development of Hausa dataset a baseline for speech recognition
title_fullStr Development of Hausa dataset a baseline for speech recognition
title_full_unstemmed Development of Hausa dataset a baseline for speech recognition
title_short Development of Hausa dataset a baseline for speech recognition
title_sort development of hausa dataset a baseline for speech recognition
topic Data Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792404/
https://www.ncbi.nlm.nih.gov/pubmed/35242895
http://dx.doi.org/10.1016/j.dib.2022.107820
work_keys_str_mv AT ibrahimumaradam developmentofhausadatasetabaselineforspeechrecognition
AT boukarmoussamahamat developmentofhausadatasetabaselineforspeechrecognition
AT suleimanmuhammedaliyu developmentofhausadatasetabaselineforspeechrecognition