Enhancing multimodal silent speech interfaces with feature selection

Freitas, J.; Teixeira, A.; Dias, J.; Ferreira, A.; Figueiredo, M.

Enhancing multimodal silent speech interfaces with feature selection

Bibliographic Details
Main Author:	Freitas, J.
Publication Date:	2014
Other Authors:	Teixeira, A., Dias, J., Ferreira, A., Figueiredo, M.
Language:	eng
Source:	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full:	http://hdl.handle.net/10071/25831
Summary:	In research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusion

Item metadata

id	RCAP_c9ee2983e6bf40335f23b840889ded92
oai_identifier_str	oai:repositorio.iscte-iul.pt:10071/25831
network_acronym_str	RCAP
network_name_str	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str	https://opendoar.ac.uk/repository/7160
spelling	Enhancing multimodal silent speech interfaces with feature selectionMultimodalSilent speech interfacesSupervised classificationFeature extractionIn research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusionSpeech and Communication Association2022-07-15T11:07:47Z2014-01-01T00:00:00Z20142022-06-29T10:24:27Zconference objectinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/10071/25831eng2308-457XFreitas, J.Teixeira, A.Dias, J.Ferreira, A.Figueiredo, M.info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-07-07T03:30:00Zoai:repositorio.iscte-iul.pt:10071/25831Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T18:25:33.706327Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv	Enhancing multimodal silent speech interfaces with feature selection
title	Enhancing multimodal silent speech interfaces with feature selection
spellingShingle	Enhancing multimodal silent speech interfaces with feature selection Freitas, J. Multimodal Silent speech interfaces Supervised classification Feature extraction
title_short	Enhancing multimodal silent speech interfaces with feature selection
title_full	Enhancing multimodal silent speech interfaces with feature selection
title_fullStr	Enhancing multimodal silent speech interfaces with feature selection
title_full_unstemmed	Enhancing multimodal silent speech interfaces with feature selection
title_sort	Enhancing multimodal silent speech interfaces with feature selection
author	Freitas, J.
author_facet	Freitas, J. Teixeira, A. Dias, J. Ferreira, A. Figueiredo, M.
author_role	author
author2	Teixeira, A. Dias, J. Ferreira, A. Figueiredo, M.
author2_role	author author author author
dc.contributor.author.fl_str_mv	Freitas, J. Teixeira, A. Dias, J. Ferreira, A. Figueiredo, M.
dc.subject.por.fl_str_mv	Multimodal Silent speech interfaces Supervised classification Feature extraction
topic	Multimodal Silent speech interfaces Supervised classification Feature extraction
description	In research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusion
publishDate	2014
dc.date.none.fl_str_mv	2014-01-01T00:00:00Z 2014 2022-07-15T11:07:47Z 2022-06-29T10:24:27Z
dc.type.driver.fl_str_mv	conference object
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10071/25831
url	http://hdl.handle.net/10071/25831
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	2308-457X
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Speech and Communication Association
publisher.none.fl_str_mv	Speech and Communication Association
dc.source.none.fl_str_mv	reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP
instname_str	FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv	info@rcaap.pt
_version_	1833597401791725568

Enhancing multimodal silent speech interfaces with feature selection

Similar Items