Speech as an Emotional Load Biomarker in Clinical Applications

Detalhes bibliográficos
Autor(a) principal: Coelho,L. F.
Data de Publicação: 2024
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://scielo.pt/scielo.php?script=sci_arttext&pid=S0872-671X2024000300004
Resumo: Abstract Introduction: Healthcare professionals often contend with significant emotional burdens in their work, including the impact of negative emotions, such as stress and anxiety, which can have profound consequences on immediate and long-term healthcare delivery. In this paper a stress estimation algorithm is proposed based on the classification of negative valence emotions in speech recordings. Methods: An end-to-end machine learning pipeline is proposed. Two distinct decision models are considered, VGG-16 and SqueezeNet, while sharing a common constant Q power spectrogram input for acoustic representation. The system is trained and evaluated using the RAVDESS and TESS emotional speech datasets. Results: The system was evaluated for individual emotion classification (multiclass problem) and also for negative and neutral or positive emotion classification (binary problem). The results achieved are comparable to previously reported systems, with the SqueezeNet model offering a significantly smaller footprint, enabling versatile applications. Further exploration of the model's parameter space holds promise for enhanced performance. Conclusion: The proposed system can constitute a feasible approach for the estimation of a low-cost non-invasive biomarker for negative emotions. This allows to raise alerts and develop mitigating actions to the burden of negative emotions, being an additional management tool for healthcare services that allows to maintain quality and maximize availability.
id RCAP_424cd3a921957a02e2e128d124e506e5
oai_identifier_str oai:scielo:S0872-671X2024000300004
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Speech as an Emotional Load Biomarker in Clinical ApplicationsBiomarkersEmotionsMachine LearningSpeech.Abstract Introduction: Healthcare professionals often contend with significant emotional burdens in their work, including the impact of negative emotions, such as stress and anxiety, which can have profound consequences on immediate and long-term healthcare delivery. In this paper a stress estimation algorithm is proposed based on the classification of negative valence emotions in speech recordings. Methods: An end-to-end machine learning pipeline is proposed. Two distinct decision models are considered, VGG-16 and SqueezeNet, while sharing a common constant Q power spectrogram input for acoustic representation. The system is trained and evaluated using the RAVDESS and TESS emotional speech datasets. Results: The system was evaluated for individual emotion classification (multiclass problem) and also for negative and neutral or positive emotion classification (binary problem). The results achieved are comparable to previously reported systems, with the SqueezeNet model offering a significantly smaller footprint, enabling versatile applications. Further exploration of the model's parameter space holds promise for enhanced performance. Conclusion: The proposed system can constitute a feasible approach for the estimation of a low-cost non-invasive biomarker for negative emotions. This allows to raise alerts and develop mitigating actions to the burden of negative emotions, being an additional management tool for healthcare services that allows to maintain quality and maximize availability.Sociedade Portuguesa de Medicina Interna2024-05-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articletext/htmlhttp://scielo.pt/scielo.php?script=sci_arttext&pid=S0872-671X2024000300004Medicina Interna v.31 suppl.spe1 2024reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAPenghttp://scielo.pt/scielo.php?script=sci_arttext&pid=S0872-671X2024000300004Coelho,L. F.info:eu-repo/semantics/openAccess2024-10-24T23:01:55Zoai:scielo:S0872-671X2024000300004Portal AgregadorONGhttps://www.rcaap.pt/oai/openairemluisa.alvim@gmail.comopendoar:71602024-10-24T23:01:55Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Speech as an Emotional Load Biomarker in Clinical Applications
title Speech as an Emotional Load Biomarker in Clinical Applications
spellingShingle Speech as an Emotional Load Biomarker in Clinical Applications
Coelho,L. F.
Biomarkers
Emotions
Machine Learning
Speech.
title_short Speech as an Emotional Load Biomarker in Clinical Applications
title_full Speech as an Emotional Load Biomarker in Clinical Applications
title_fullStr Speech as an Emotional Load Biomarker in Clinical Applications
title_full_unstemmed Speech as an Emotional Load Biomarker in Clinical Applications
title_sort Speech as an Emotional Load Biomarker in Clinical Applications
author Coelho,L. F.
author_facet Coelho,L. F.
author_role author
dc.contributor.author.fl_str_mv Coelho,L. F.
dc.subject.por.fl_str_mv Biomarkers
Emotions
Machine Learning
Speech.
topic Biomarkers
Emotions
Machine Learning
Speech.
description Abstract Introduction: Healthcare professionals often contend with significant emotional burdens in their work, including the impact of negative emotions, such as stress and anxiety, which can have profound consequences on immediate and long-term healthcare delivery. In this paper a stress estimation algorithm is proposed based on the classification of negative valence emotions in speech recordings. Methods: An end-to-end machine learning pipeline is proposed. Two distinct decision models are considered, VGG-16 and SqueezeNet, while sharing a common constant Q power spectrogram input for acoustic representation. The system is trained and evaluated using the RAVDESS and TESS emotional speech datasets. Results: The system was evaluated for individual emotion classification (multiclass problem) and also for negative and neutral or positive emotion classification (binary problem). The results achieved are comparable to previously reported systems, with the SqueezeNet model offering a significantly smaller footprint, enabling versatile applications. Further exploration of the model's parameter space holds promise for enhanced performance. Conclusion: The proposed system can constitute a feasible approach for the estimation of a low-cost non-invasive biomarker for negative emotions. This allows to raise alerts and develop mitigating actions to the burden of negative emotions, being an additional management tool for healthcare services that allows to maintain quality and maximize availability.
publishDate 2024
dc.date.none.fl_str_mv 2024-05-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://scielo.pt/scielo.php?script=sci_arttext&pid=S0872-671X2024000300004
url http://scielo.pt/scielo.php?script=sci_arttext&pid=S0872-671X2024000300004
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv http://scielo.pt/scielo.php?script=sci_arttext&pid=S0872-671X2024000300004
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Sociedade Portuguesa de Medicina Interna
publisher.none.fl_str_mv Sociedade Portuguesa de Medicina Interna
dc.source.none.fl_str_mv Medicina Interna v.31 suppl.spe1 2024
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv mluisa.alvim@gmail.com
_version_ 1817548618103521280