Enhancing multimodal silent speech interfaces with feature selection

Detalhes bibliográficos
Autor(a) principal: Freitas, J.
Data de Publicação: 2014
Outros Autores: Teixeira, A., Dias, J., Ferreira, A., Figueiredo, M.
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10071/25831
Resumo: In research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusion
id RCAP_c9ee2983e6bf40335f23b840889ded92
oai_identifier_str oai:repositorio.iscte-iul.pt:10071/25831
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Enhancing multimodal silent speech interfaces with feature selectionMultimodalSilent speech interfacesSupervised classificationFeature extractionIn research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusionSpeech and Communication Association2022-07-15T11:07:47Z2014-01-01T00:00:00Z20142022-06-29T10:24:27Zconference objectinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/10071/25831eng2308-457XFreitas, J.Teixeira, A.Dias, J.Ferreira, A.Figueiredo, M.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-07-07T03:30:00Zoai:repositorio.iscte-iul.pt:10071/25831Portal AgregadorONGhttps://www.rcaap.pt/oai/openairemluisa.alvim@gmail.comopendoar:71602024-07-07T03:30Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Enhancing multimodal silent speech interfaces with feature selection
title Enhancing multimodal silent speech interfaces with feature selection
spellingShingle Enhancing multimodal silent speech interfaces with feature selection
Freitas, J.
Multimodal
Silent speech interfaces
Supervised classification
Feature extraction
title_short Enhancing multimodal silent speech interfaces with feature selection
title_full Enhancing multimodal silent speech interfaces with feature selection
title_fullStr Enhancing multimodal silent speech interfaces with feature selection
title_full_unstemmed Enhancing multimodal silent speech interfaces with feature selection
title_sort Enhancing multimodal silent speech interfaces with feature selection
author Freitas, J.
author_facet Freitas, J.
Teixeira, A.
Dias, J.
Ferreira, A.
Figueiredo, M.
author_role author
author2 Teixeira, A.
Dias, J.
Ferreira, A.
Figueiredo, M.
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Freitas, J.
Teixeira, A.
Dias, J.
Ferreira, A.
Figueiredo, M.
dc.subject.por.fl_str_mv Multimodal
Silent speech interfaces
Supervised classification
Feature extraction
topic Multimodal
Silent speech interfaces
Supervised classification
Feature extraction
description In research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusion
publishDate 2014
dc.date.none.fl_str_mv 2014-01-01T00:00:00Z
2014
2022-07-15T11:07:47Z
2022-06-29T10:24:27Z
dc.type.driver.fl_str_mv conference object
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10071/25831
url http://hdl.handle.net/10071/25831
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 2308-457X
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Speech and Communication Association
publisher.none.fl_str_mv Speech and Communication Association
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv mluisa.alvim@gmail.com
_version_ 1817546484558594048