Cargando…

Development of Hausa dataset a baseline for speech recognition

The Hausa language read-speech dataset was created by recording native Hausa speakers. The recording took place at Nile university of Nigeria audio studio and radio broadcasting studio. The recorded dataset was segmented into unigram and bigram. The Hausa speech dataset contain 47hr of recorded audi...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ibrahim, Umar Adam, Boukar, Moussa Mahamat, Suleiman, Muhammed Aliyu
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier 2022
Materias:	Data Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792404/ https://www.ncbi.nlm.nih.gov/pubmed/35242895 http://dx.doi.org/10.1016/j.dib.2022.107820

_version_	1784640354811641856
author	Ibrahim, Umar Adam Boukar, Moussa Mahamat Suleiman, Muhammed Aliyu
author_facet	Ibrahim, Umar Adam Boukar, Moussa Mahamat Suleiman, Muhammed Aliyu
author_sort	Ibrahim, Umar Adam
collection	PubMed
description	The Hausa language read-speech dataset was created by recording native Hausa speakers. The recording took place at Nile university of Nigeria audio studio and radio broadcasting studio. The recorded dataset was segmented into unigram and bigram. The Hausa speech dataset contain 47hr of recorded audio speech. The dataset can be used for automatic speech recognition, speech synthesis, Text-to-Speech and speech-to-text application.
format	Online Article Text
id	pubmed-8792404
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-87924042022-03-02 Development of Hausa dataset a baseline for speech recognition Ibrahim, Umar Adam Boukar, Moussa Mahamat Suleiman, Muhammed Aliyu Data Brief Data Article The Hausa language read-speech dataset was created by recording native Hausa speakers. The recording took place at Nile university of Nigeria audio studio and radio broadcasting studio. The recorded dataset was segmented into unigram and bigram. The Hausa speech dataset contain 47hr of recorded audio speech. The dataset can be used for automatic speech recognition, speech synthesis, Text-to-Speech and speech-to-text application. Elsevier 2022-01-10 /pmc/articles/PMC8792404/ /pubmed/35242895 http://dx.doi.org/10.1016/j.dib.2022.107820 Text en © 2022 The Author(s). Published by Elsevier Inc. https://creativecommons.org/licenses/by/4.0/This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Data Article Ibrahim, Umar Adam Boukar, Moussa Mahamat Suleiman, Muhammed Aliyu Development of Hausa dataset a baseline for speech recognition
title	Development of Hausa dataset a baseline for speech recognition
title_full	Development of Hausa dataset a baseline for speech recognition
title_fullStr	Development of Hausa dataset a baseline for speech recognition
title_full_unstemmed	Development of Hausa dataset a baseline for speech recognition
title_short	Development of Hausa dataset a baseline for speech recognition
title_sort	development of hausa dataset a baseline for speech recognition
topic	Data Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8792404/ https://www.ncbi.nlm.nih.gov/pubmed/35242895 http://dx.doi.org/10.1016/j.dib.2022.107820
work_keys_str_mv	AT ibrahimumaradam developmentofhausadatasetabaselineforspeechrecognition AT boukarmoussamahamat developmentofhausadatasetabaselineforspeechrecognition AT suleimanmuhammedaliyu developmentofhausadatasetabaselineforspeechrecognition

Development of Hausa dataset a baseline for speech recognition

Ejemplares similares