Enhancing multimodal silent speech interfaces with feature selection

Freitas, J.; Teixeira, A.; Dias, J.; Ferreira, A.; Figueiredo, M.

Enhancing multimodal silent speech interfaces with feature selection

Detalhes bibliográficos
Autor(a) principal:	Freitas, J.
Data de Publicação:	2014
Outros Autores:	Teixeira, A., Dias, J., Ferreira, A., Figueiredo, M.
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10071/25831
Resumo:	In research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusion

Metadados do item

id	RCAP_c9ee2983e6bf40335f23b840889ded92
oai_identifier_str	oai:repositorio.iscte-iul.pt:10071/25831
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Enhancing multimodal silent speech interfaces with feature selectionMultimodalSilent speech interfacesSupervised classificationFeature extractionIn research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusionSpeech and Communication Association2022-07-15T11:07:47Z2014-01-01T00:00:00Z20142022-06-29T10:24:27Zconference objectinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/10071/25831eng2308-457XFreitas, J.Teixeira, A.Dias, J.Ferreira, A.Figueiredo, M.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-07-07T03:30:00Zoai:repositorio.iscte-iul.pt:10071/25831Portal AgregadorONGhttps://www.rcaap.pt/oai/openairemluisa.alvim@gmail.comopendoar:71602024-07-07T03:30Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Enhancing multimodal silent speech interfaces with feature selection
title	Enhancing multimodal silent speech interfaces with feature selection
spellingShingle	Enhancing multimodal silent speech interfaces with feature selection Freitas, J. Multimodal Silent speech interfaces Supervised classification Feature extraction
title_short	Enhancing multimodal silent speech interfaces with feature selection
title_full	Enhancing multimodal silent speech interfaces with feature selection
title_fullStr	Enhancing multimodal silent speech interfaces with feature selection
title_full_unstemmed	Enhancing multimodal silent speech interfaces with feature selection
title_sort	Enhancing multimodal silent speech interfaces with feature selection
author	Freitas, J.
author_facet	Freitas, J. Teixeira, A. Dias, J. Ferreira, A. Figueiredo, M.
author_role	author
author2	Teixeira, A. Dias, J. Ferreira, A. Figueiredo, M.
author2_role	author author author author
dc.contributor.author.fl_str_mv	Freitas, J. Teixeira, A. Dias, J. Ferreira, A. Figueiredo, M.
dc.subject.por.fl_str_mv	Multimodal Silent speech interfaces Supervised classification Feature extraction
topic	Multimodal Silent speech interfaces Supervised classification Feature extraction
description	In research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusion
publishDate	2014
dc.date.none.fl_str_mv	2014-01-01T00:00:00Z 2014 2022-07-15T11:07:47Z 2022-06-29T10:24:27Z
dc.type.driver.fl_str_mv	conference object
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10071/25831
url	http://hdl.handle.net/10071/25831
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	2308-457X
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Speech and Communication Association
publisher.none.fl_str_mv	Speech and Communication Association
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv	mluisa.alvim@gmail.com
_version_	1817546484558594048

Enhancing multimodal silent speech interfaces with feature selection

Registros relacionados