Development of Hausa dataset a baseline for speech recognition
The Hausa language read-speech dataset was created by recording native Hausa speakers. The recordings took place in the audio studio and radio broadcasting studio of Nile University of Nigeria. The recorded dataset was segmented into unigrams and bigrams. The Hausa speech dataset contains 47 hr of recorded audi...
Main Authors: | Ibrahim, Umar Adam, Boukar, Moussa Mahamat, Suleiman, Muhammed Aliyu |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Elsevier 2022 |
Subjects: | |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792404/ https://www.ncbi.nlm.nih.gov/pubmed/35242895 http://dx.doi.org/10.1016/j.dib.2022.107820 |
_version_ | 1784640354811641856 |
---|---|
author | Ibrahim, Umar Adam Boukar, Moussa Mahamat Suleiman, Muhammed Aliyu |
author_facet | Ibrahim, Umar Adam Boukar, Moussa Mahamat Suleiman, Muhammed Aliyu |
author_sort | Ibrahim, Umar Adam |
collection | PubMed |
description | The Hausa language read-speech dataset was created by recording native Hausa speakers. The recordings took place in the audio studio and radio broadcasting studio of Nile University of Nigeria. The recorded dataset was segmented into unigrams and bigrams. The Hausa speech dataset contains 47 hr of recorded audio speech. The dataset can be used for automatic speech recognition, speech synthesis, text-to-speech, and speech-to-text applications. |
format | Online Article Text |
id | pubmed-8792404 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Elsevier |
record_format | MEDLINE/PubMed |
spelling | pubmed-8792404 2022-03-02 Development of Hausa dataset a baseline for speech recognition Ibrahim, Umar Adam Boukar, Moussa Mahamat Suleiman, Muhammed Aliyu Data Brief Data Article The Hausa language read-speech dataset was created by recording native Hausa speakers. The recordings took place in the audio studio and radio broadcasting studio of Nile University of Nigeria. The recorded dataset was segmented into unigrams and bigrams. The Hausa speech dataset contains 47 hr of recorded audio speech. The dataset can be used for automatic speech recognition, speech synthesis, text-to-speech, and speech-to-text applications. Elsevier 2022-01-10 /pmc/articles/PMC8792404/ /pubmed/35242895 http://dx.doi.org/10.1016/j.dib.2022.107820 Text en © 2022 The Author(s). Published by Elsevier Inc. https://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Data Article Ibrahim, Umar Adam Boukar, Moussa Mahamat Suleiman, Muhammed Aliyu Development of Hausa dataset a baseline for speech recognition |
title | Development of Hausa dataset a baseline for speech recognition |
title_full | Development of Hausa dataset a baseline for speech recognition |
title_fullStr | Development of Hausa dataset a baseline for speech recognition |
title_full_unstemmed | Development of Hausa dataset a baseline for speech recognition |
title_short | Development of Hausa dataset a baseline for speech recognition |
title_sort | development of hausa dataset a baseline for speech recognition |
topic | Data Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792404/ https://www.ncbi.nlm.nih.gov/pubmed/35242895 http://dx.doi.org/10.1016/j.dib.2022.107820 |
work_keys_str_mv | AT ibrahimumaradam developmentofhausadatasetabaselineforspeechrecognition AT boukarmoussamahamat developmentofhausadatasetabaselineforspeechrecognition AT suleimanmuhammedaliyu developmentofhausadatasetabaselineforspeechrecognition |