Automatic audio signal analysis for the detection of anomalies in calls

Detalhes bibliográficos
Autor(a) principal: Justo, Inês Zita Nogueira
Data de Publicação: 2021
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10773/32331
Resumo: Machine Learning (ML) is one of the fastest growing technologies in recent years, having the ability to add value to workflows and to existing technologies. An aspect of ML that is present in many enterprise applications is the detection of anomalies. This project aims to create a system for call anomaly detection in a contact center context, more precisely detection of audio cuts. For this, it was researched the best features and models for the solution, by understanding the original data set; re-labeling it to improve the data representation; extracting relevant features, to distinguish the classes; and selecting the most relevant ones to the system. The models used to create the system were the Support Vector Classifier (SVC) and the Random Forest Classifier, the last one having shown the best performance. A clustering-based approach was also performed on the class that represented the calls with worse audio quality, through the implementation of the K-means algorithm, revealing the possible stratification of two different types of calls within this class. The results showed that the Random Forest was the best performing model, so it was used in the final solution. This solution was inserted into a web app for integration in a business context allowing to improve the Quality of Service (QoS) in contact centers.
id RCAP_528009f731cc51f021ec4ef39352a0dc
oai_identifier_str oai:ria.ua.pt:10773/32331
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Automatic audio signal analysis for the detection of anomalies in callsSignal processingAudio processingMachine learningContact centersAnomaly detectionMachine Learning (ML) is one of the fastest growing technologies in recent years, having the ability to add value to workflows and to existing technologies. An aspect of ML that is present in many enterprise applications is the detection of anomalies. This project aims to create a system for call anomaly detection in a contact center context, more precisely detection of audio cuts. For this, it was researched the best features and models for the solution, by understanding the original data set; re-labeling it to improve the data representation; extracting relevant features, to distinguish the classes; and selecting the most relevant ones to the system. The models used to create the system were the Support Vector Classifier (SVC) and the Random Forest Classifier, the last one having shown the best performance. A clustering-based approach was also performed on the class that represented the calls with worse audio quality, through the implementation of the K-means algorithm, revealing the possible stratification of two different types of calls within this class. The results showed that the Random Forest was the best performing model, so it was used in the final solution. This solution was inserted into a web app for integration in a business context allowing to improve the Quality of Service (QoS) in contact centers.A Aprendizagem Automática (AA) é uma das tecnologias de mais rápido crescimento nos últimos anos, tendo a capacidade de acrescentar valor aos fluxos de trabalho e às tecnologias existentes. Uma vertente da AA que está presente em muitas aplicações empresariais é a detecção de anomalias. Este projecto visa a criação de um sistema de detecção de anomalias de chamadas no contexto de contact centers, mais precisamente a detecção de cortes no áudio. Para tal, foram investigadas as melhores características e modelos para a solução, através da compreensão do conjunto de dados original; criação de novas classes para melhorar a representação dos dados; extracção de características relevantes, para distinguir as classes; e selecção das mais relevantes para o sistema. Os modelos utilizados para criar o sistema foram o Support Vector Classifier (SVC) e o Random Forest Classifier, tendo este último mostrado o melhor desempenho. Foi também realizada uma abordagem baseada em clustering na classe que representava as chamadas com pior qualidade de áudio, através da implementação do algoritmo K-means, revelando a possível estratificação de dois tipos diferentes de chamadas dentro desta classe. Os resultados mostraram que o Random Forest era o modelo com melhor desempenho, pelo que foi utilizado na solução final. Esta solução foi inserida numa aplicação web para integração num contexto empresarial permitindo melhorar a Qualidade de Serviço nos Contact Centers.2022-07-29T00:00:00Z2021-07-23T00:00:00Z2021-07-23info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10773/32331engJusto, Inês Zita Nogueirainfo:eu-repo/semantics/embargoedAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-02-22T12:02:26Zoai:ria.ua.pt:10773/32331Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T03:04:04.071816Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Automatic audio signal analysis for the detection of anomalies in calls
title Automatic audio signal analysis for the detection of anomalies in calls
spellingShingle Automatic audio signal analysis for the detection of anomalies in calls
Justo, Inês Zita Nogueira
Signal processing
Audio processing
Machine learning
Contact centers
Anomaly detection
title_short Automatic audio signal analysis for the detection of anomalies in calls
title_full Automatic audio signal analysis for the detection of anomalies in calls
title_fullStr Automatic audio signal analysis for the detection of anomalies in calls
title_full_unstemmed Automatic audio signal analysis for the detection of anomalies in calls
title_sort Automatic audio signal analysis for the detection of anomalies in calls
author Justo, Inês Zita Nogueira
author_facet Justo, Inês Zita Nogueira
author_role author
dc.contributor.author.fl_str_mv Justo, Inês Zita Nogueira
dc.subject.por.fl_str_mv Signal processing
Audio processing
Machine learning
Contact centers
Anomaly detection
topic Signal processing
Audio processing
Machine learning
Contact centers
Anomaly detection
description Machine Learning (ML) is one of the fastest growing technologies in recent years, having the ability to add value to workflows and to existing technologies. An aspect of ML that is present in many enterprise applications is the detection of anomalies. This project aims to create a system for call anomaly detection in a contact center context, more precisely detection of audio cuts. For this, it was researched the best features and models for the solution, by understanding the original data set; re-labeling it to improve the data representation; extracting relevant features, to distinguish the classes; and selecting the most relevant ones to the system. The models used to create the system were the Support Vector Classifier (SVC) and the Random Forest Classifier, the last one having shown the best performance. A clustering-based approach was also performed on the class that represented the calls with worse audio quality, through the implementation of the K-means algorithm, revealing the possible stratification of two different types of calls within this class. The results showed that the Random Forest was the best performing model, so it was used in the final solution. This solution was inserted into a web app for integration in a business context allowing to improve the Quality of Service (QoS) in contact centers.
publishDate 2021
dc.date.none.fl_str_mv 2021-07-23T00:00:00Z
2021-07-23
2022-07-29T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10773/32331
url http://hdl.handle.net/10773/32331
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/embargoedAccess
eu_rights_str_mv embargoedAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799137695812812800