Enhancing multimodal silent speech interfaces with feature selection

Bibliographic Details
Main Author: Freitas, J.
Publication Date: 2014
Other Authors: Teixeira, A., Dias, J., Ferreira, A., Figueiredo, M.
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/10071/25831
Summary: In research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusion
id RCAP_c9ee2983e6bf40335f23b840889ded92
oai_identifier_str oai:repositorio.iscte-iul.pt:10071/25831
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Enhancing multimodal silent speech interfaces with feature selectionMultimodalSilent speech interfacesSupervised classificationFeature extractionIn research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusionSpeech and Communication Association2022-07-15T11:07:47Z2014-01-01T00:00:00Z20142022-06-29T10:24:27Zconference objectinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/10071/25831eng2308-457XFreitas, J.Teixeira, A.Dias, J.Ferreira, A.Figueiredo, M.info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-07-07T03:30:00Zoai:repositorio.iscte-iul.pt:10071/25831Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T18:25:33.706327Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Enhancing multimodal silent speech interfaces with feature selection
title Enhancing multimodal silent speech interfaces with feature selection
spellingShingle Enhancing multimodal silent speech interfaces with feature selection
Freitas, J.
Multimodal
Silent speech interfaces
Supervised classification
Feature extraction
title_short Enhancing multimodal silent speech interfaces with feature selection
title_full Enhancing multimodal silent speech interfaces with feature selection
title_fullStr Enhancing multimodal silent speech interfaces with feature selection
title_full_unstemmed Enhancing multimodal silent speech interfaces with feature selection
title_sort Enhancing multimodal silent speech interfaces with feature selection
author Freitas, J.
author_facet Freitas, J.
Teixeira, A.
Dias, J.
Ferreira, A.
Figueiredo, M.
author_role author
author2 Teixeira, A.
Dias, J.
Ferreira, A.
Figueiredo, M.
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Freitas, J.
Teixeira, A.
Dias, J.
Ferreira, A.
Figueiredo, M.
dc.subject.por.fl_str_mv Multimodal
Silent speech interfaces
Supervised classification
Feature extraction
topic Multimodal
Silent speech interfaces
Supervised classification
Feature extraction
description In research on Silent Speech Interfaces (SSI), different sources of information (modalities) have been combined, aiming at obtaining better performance than the individual modalities. However, when combining these modalities, the dimensionality of the feature space rapidly increases, yielding the well-known "curse of dimensionality". As a consequence, in order to extract useful information from this data, one has to resort to feature selection (FS) techniques to lower the dimensionality of the learning space. In this paper, we assess the impact of FS techniques for silent speech data, in a dataset with 4 non-invasive and promising modalities, namely: video, depth, ultrasonic Doppler sensing, and surface electromyography. We consider two supervised (mutual information and Fisher's ratio) and two unsupervised (meanmedian and arithmetic mean geometric mean) FS filters. The evaluation was made by assessing the classification accuracy (word recognition error) of three well-known classifiers (knearest neighbors, support vector machines, and dynamic time warping). The key results of this study show that both unsupervised and supervised FS techniques improve on the classification accuracy on both individual and combined modalities. For instance, on the video component, we attain relative performance gains of 36.2% in error rates. FS is also useful as pre-processing for feature fusion
publishDate 2014
dc.date.none.fl_str_mv 2014-01-01T00:00:00Z
2014
2022-07-15T11:07:47Z
2022-06-29T10:24:27Z
dc.type.driver.fl_str_mv conference object
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10071/25831
url http://hdl.handle.net/10071/25831
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 2308-457X
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Speech and Communication Association
publisher.none.fl_str_mv Speech and Communication Association
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833597401791725568