Smartphone Application for the Analysis of Prosodic Features in Running Speech with a Focus on Bipolar Disorders: System Performance Evaluation and Case Study

Bibliographic Details
Main authors: Guidi, Andrea; Salvi, Sergio; Ottaviano, Manuel; Gentili, Claudio; Bertschy, Gilles; de Rossi, Danilo; Scilingo, Enzo Pasquale; Vanello, Nicola
Format: Online Article Text
Language: English
Published: MDPI 2015
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4701269/
https://www.ncbi.nlm.nih.gov/pubmed/26561811
http://dx.doi.org/10.3390/s151128070
Description
Summary: Bipolar disorder is one of the most common mood disorders and is characterized by large, disabling mood swings. Several projects focus on developing decision support systems that monitor and advise patients as well as clinicians. Voice monitoring and speech signal analysis can be exploited to reach this goal. In this study, an Android application was designed for analyzing running speech on a smartphone. The application can record audio samples and estimate the speech fundamental frequency, F0, and its changes. F0-related features are estimated locally on the smartphone, which offers advantages over remote processing in terms of privacy protection and reduced upload costs. The raw features can be sent to a central server for further processing. The quality of the audio recordings, the reliability of the algorithm, and the performance of the overall system were evaluated in terms of voiced segment detection and feature estimation. The results demonstrate that the mean F0 of each voiced segment can be reliably estimated, thus describing prosodic features across the speech sample. In contrast, features related to F0 variability within each voiced segment performed poorly. A case study performed on a bipolar patient is presented.
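
To make the two feature types in the summary concrete, the following is a minimal Python sketch of per-frame fundamental frequency estimation and per-segment statistics. It is not the authors' algorithm, which this record does not describe: the autocorrelation method, the function names estimate_f0 and summarize_voiced_segments, the 75-500 Hz search range, and the 0.5 voicing threshold are all assumptions chosen for illustration.

```python
import math
from statistics import mean, pstdev


def estimate_f0(frame, sample_rate, fmin=75.0, fmax=500.0, voicing_threshold=0.5):
    """Estimate F0 (Hz) of one frame by autocorrelation; None if judged unvoiced.

    The search range and threshold are illustrative assumptions,
    not values taken from the article.
    """
    n = len(frame)
    energy = sum(x * x for x in frame)
    if energy == 0.0:
        return None  # silent frame
    min_lag = int(sample_rate / fmax)
    max_lag = min(int(sample_rate / fmin), n - 1)
    best_lag, best_corr = None, 0.0
    for lag in range(min_lag, max_lag + 1):
        # Autocorrelation at this lag, normalized by the frame energy.
        corr = sum(frame[i] * frame[i + lag] for i in range(n - lag)) / energy
        if corr > best_corr:
            best_lag, best_corr = lag, corr
    if best_lag is None or best_corr < voicing_threshold:
        return None  # no sufficiently periodic structure found
    return sample_rate / best_lag


def summarize_voiced_segments(f0_track):
    """Group consecutive voiced frames; return (mean F0, F0 std) per segment."""
    segments, current = [], []
    for f0 in f0_track:
        if f0 is not None:
            current.append(f0)
        elif current:
            segments.append(current)
            current = []
    if current:
        segments.append(current)
    return [(mean(seg), pstdev(seg)) for seg in segments]


# Usage on synthetic data: a 50 ms frame of a 200 Hz sine sampled at 8 kHz.
sample_rate = 8000
sine_frame = [math.sin(2 * math.pi * 200 * i / sample_rate) for i in range(400)]
f0_of_sine = estimate_f0(sine_frame, sample_rate)

# A hand-made F0 track with two voiced segments separated by unvoiced frames.
segment_stats = summarize_voiced_segments([None, 100.0, 110.0, None, None, 200.0])
```

In this sketch the first element of each tuple corresponds to the mean F0 per voiced segment, and the second to a within-segment variability measure, the kind of feature the summary reports as performing poorly.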