pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis

Audio information plays a rather important role in the increasing digital content that is available today, resulting in a need for methodologies that automatically analyze such content: audio event recognition for home automations and surveillance systems, speech recognition, music information retri...

Bibliographic Details
Main Author: Giannakopoulos, Theodoros
Format: Online Article Text
Language: English
Published: Public Library of Science 2015
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676707/
https://www.ncbi.nlm.nih.gov/pubmed/26656189
http://dx.doi.org/10.1371/journal.pone.0144610
author Giannakopoulos, Theodoros
collection PubMed
description Audio information plays an important role in the growing volume of digital content available today, resulting in a need for methodologies that automatically analyze such content: audio event recognition for home automation and surveillance systems, speech recognition, music information retrieval, multimodal analysis (e.g. audio-visual analysis of online videos for content-based recommendation), etc. This paper presents pyAudioAnalysis, an open-source Python library that provides a wide range of audio analysis procedures, including feature extraction, classification of audio signals, supervised and unsupervised segmentation, and content visualization. pyAudioAnalysis is licensed under the Apache License and is available on GitHub (https://github.com/tyiannak/pyAudioAnalysis/). Here we present the theoretical background behind the implemented methodologies, along with evaluation metrics for some of the methods. pyAudioAnalysis has already been used in several audio analysis research applications: smart-home functionalities through audio event detection, speech emotion recognition, depression classification based on audio-visual features, music segmentation, multimodal content-based movie recommendation, and health applications (e.g. monitoring eating habits). Feedback from these applications has led to practical enhancements of the library.
format Online
Article
Text
id pubmed-4676707
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-4676707 2015-12-31 pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis Giannakopoulos, Theodoros PLoS One Research Article
Public Library of Science 2015-12-11 /pmc/articles/PMC4676707/ /pubmed/26656189 http://dx.doi.org/10.1371/journal.pone.0144610 Text en © 2015 Theodoros Giannakopoulos http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
title pyAudioAnalysis: An Open-Source Python Library for Audio Signal Analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4676707/
https://www.ncbi.nlm.nih.gov/pubmed/26656189
http://dx.doi.org/10.1371/journal.pone.0144610