Cargando…

A dataset of histograms of original and fake voice recordings (H-Voice)

This paper presents H-Voice, a dataset of 6672 histograms of original and fake voice recordings obtained by the Imitation [1,2] and the Deep Voice [3] methods. The dataset is organized into six directories: Training_fake, Training_original, Validation_fake, Validation_original, External_test1, and E...

Descripción completa

Detalles Bibliográficos
Autores principales:	Ballesteros, Dora M., Rodriguez, Yohanna, Renza, Diego
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Elsevier 2020
Materias:	Computer Science
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7058910/ https://www.ncbi.nlm.nih.gov/pubmed/32154354 http://dx.doi.org/10.1016/j.dib.2020.105331

_version_	1783503944312422400
author	Ballesteros, Dora M. Rodriguez, Yohanna Renza, Diego
author_facet	Ballesteros, Dora M. Rodriguez, Yohanna Renza, Diego
author_sort	Ballesteros, Dora M.
collection	PubMed
description	This paper presents H-Voice, a dataset of 6672 histograms of original and fake voice recordings obtained by the Imitation [1,2] and the Deep Voice [3] methods. The dataset is organized into six directories: Training_fake, Training_original, Validation_fake, Validation_original, External_test1, and External_test2. The training directories include 2088 histograms of fake voice recordings and 2020 histograms of original voice recordings. Each validation directory has 864 histograms obtained from fake voice recordings and original voice recordings. Finally, External_test1 has 760 histograms (380 from fake voice recordings obtained by the Imitation method and 380 from original voice recordings), and External_test2 has 76 histograms (72 from fake voice recordings obtained by the Deep Voice method and 4 from original voice recordings). With this dataset, the researchers can train, cross-validate and test classification models using machine learning techniques to identify fake voice recordings.
format	Online Article Text
id	pubmed-7058910
institution	National Center for Biotechnology Information
language	English
publishDate	2020
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-70589102020-03-09 A dataset of histograms of original and fake voice recordings (H-Voice) Ballesteros, Dora M. Rodriguez, Yohanna Renza, Diego Data Brief Computer Science This paper presents H-Voice, a dataset of 6672 histograms of original and fake voice recordings obtained by the Imitation [1,2] and the Deep Voice [3] methods. The dataset is organized into six directories: Training_fake, Training_original, Validation_fake, Validation_original, External_test1, and External_test2. The training directories include 2088 histograms of fake voice recordings and 2020 histograms of original voice recordings. Each validation directory has 864 histograms obtained from fake voice recordings and original voice recordings. Finally, External_test1 has 760 histograms (380 from fake voice recordings obtained by the Imitation method and 380 from original voice recordings), and External_test2 has 76 histograms (72 from fake voice recordings obtained by the Deep Voice method and 4 from original voice recordings). With this dataset, the researchers can train, cross-validate and test classification models using machine learning techniques to identify fake voice recordings. Elsevier 2020-02-26 /pmc/articles/PMC7058910/ /pubmed/32154354 http://dx.doi.org/10.1016/j.dib.2020.105331 Text en © 2020 The Author(s) http://creativecommons.org/licenses/by/4.0/ This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Computer Science Ballesteros, Dora M. Rodriguez, Yohanna Renza, Diego A dataset of histograms of original and fake voice recordings (H-Voice)
title	A dataset of histograms of original and fake voice recordings (H-Voice)
title_full	A dataset of histograms of original and fake voice recordings (H-Voice)
title_fullStr	A dataset of histograms of original and fake voice recordings (H-Voice)
title_full_unstemmed	A dataset of histograms of original and fake voice recordings (H-Voice)
title_short	A dataset of histograms of original and fake voice recordings (H-Voice)
title_sort	dataset of histograms of original and fake voice recordings (h-voice)
topic	Computer Science
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7058910/ https://www.ncbi.nlm.nih.gov/pubmed/32154354 http://dx.doi.org/10.1016/j.dib.2020.105331
work_keys_str_mv	AT ballesterosdoram adatasetofhistogramsoforiginalandfakevoicerecordingshvoice AT rodriguezyohanna adatasetofhistogramsoforiginalandfakevoicerecordingshvoice AT renzadiego adatasetofhistogramsoforiginalandfakevoicerecordingshvoice AT ballesterosdoram datasetofhistogramsoforiginalandfakevoicerecordingshvoice AT rodriguezyohanna datasetofhistogramsoforiginalandfakevoicerecordingshvoice AT renzadiego datasetofhistogramsoforiginalandfakevoicerecordingshvoice

A dataset of histograms of original and fake voice recordings (H-Voice)

Ejemplares similares