Text mining applications to facilitate economic and food safety law enforcement
Main Author: | |
---|---|
Publication Date: | 2019 |
Other Authors: | , , |
Language: | eng |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | http://hdl.handle.net/10400.22/25451 |
Summary: | Economic and Food Safety Authority receives on a daily basis reports and complaints regarding infractions, delicts and possible food and economic crimes. These reports and complaints can be in different forms, such as e-mails, online forms, letters, phone calls and complaint books present in every establishment. This paper aims to apply text mining and classification algorithms to textual data extracted from these reports and complains in order to help identify if the responsible entity to analyze the content is, in fact, the Economic and Food Safety Authority. The paper describes text preprocessing and feature extraction procedures applied to Portuguese text data. Supervised multi-class classification methods such as Naïve Bayes and Support Vector Machine Classifiers are employed in the task. We show that a non-semantical text mining approach can achieve good results, scoring around 70% of accuracy. |
id |
RCAP_9c816bafead4faf1a2ac02dfab5b34ef |
---|---|
oai_identifier_str |
oai:recipp.ipp.pt:10400.22/25451 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
Text mining applications to facilitate economic and food safety law enforcementText miningEconomic and food safetyNatural language processingText classificationMulti-class classificationEconomic and Food Safety Authority receives on a daily basis reports and complaints regarding infractions, delicts and possible food and economic crimes. These reports and complaints can be in different forms, such as e-mails, online forms, letters, phone calls and complaint books present in every establishment. This paper aims to apply text mining and classification algorithms to textual data extracted from these reports and complains in order to help identify if the responsible entity to analyze the content is, in fact, the Economic and Food Safety Authority. The paper describes text preprocessing and feature extraction procedures applied to Portuguese text data. Supervised multi-class classification methods such as Naïve Bayes and Support Vector Machine Classifiers are employed in the task. We show that a non-semantical text mining approach can achieve good results, scoring around 70% of accuracy.IADISREPOSITÓRIO P.PORTOMagalhães, GustavoFaria, Brígida MónicaReis, Luís PauloCardoso, Henrique Lopes2024-05-03T09:14:23Z20192019-01-01T00:00:00Zconference objectinfo:eu-repo/semantics/publishedVersionapplication/pdfhttp://hdl.handle.net/10400.22/25451eng978-989-8533-92-0info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-07T10:03:30Zoai:recipp.ipp.pt:10400.22/25451Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T00:29:33.253708Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
Text mining applications to facilitate economic and food safety law enforcement |
title |
Text mining applications to facilitate economic and food safety law enforcement |
spellingShingle |
Text mining applications to facilitate economic and food safety law enforcement Magalhães, Gustavo Text mining Economic and food safety Natural language processing Text classification Multi-class classification |
title_short |
Text mining applications to facilitate economic and food safety law enforcement |
title_full |
Text mining applications to facilitate economic and food safety law enforcement |
title_fullStr |
Text mining applications to facilitate economic and food safety law enforcement |
title_full_unstemmed |
Text mining applications to facilitate economic and food safety law enforcement |
title_sort |
Text mining applications to facilitate economic and food safety law enforcement |
author |
Magalhães, Gustavo |
author_facet |
Magalhães, Gustavo Faria, Brígida Mónica Reis, Luís Paulo Cardoso, Henrique Lopes |
author_role |
author |
author2 |
Faria, Brígida Mónica Reis, Luís Paulo Cardoso, Henrique Lopes |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
REPOSITÓRIO P.PORTO |
dc.contributor.author.fl_str_mv |
Magalhães, Gustavo Faria, Brígida Mónica Reis, Luís Paulo Cardoso, Henrique Lopes |
dc.subject.por.fl_str_mv |
Text mining Economic and food safety Natural language processing Text classification Multi-class classification |
topic |
Text mining Economic and food safety Natural language processing Text classification Multi-class classification |
description |
Economic and Food Safety Authority receives on a daily basis reports and complaints regarding infractions, delicts and possible food and economic crimes. These reports and complaints can be in different forms, such as e-mails, online forms, letters, phone calls and complaint books present in every establishment. This paper aims to apply text mining and classification algorithms to textual data extracted from these reports and complains in order to help identify if the responsible entity to analyze the content is, in fact, the Economic and Food Safety Authority. The paper describes text preprocessing and feature extraction procedures applied to Portuguese text data. Supervised multi-class classification methods such as Naïve Bayes and Support Vector Machine Classifiers are employed in the task. We show that a non-semantical text mining approach can achieve good results, scoring around 70% of accuracy. |
publishDate |
2019 |
dc.date.none.fl_str_mv |
2019 2019-01-01T00:00:00Z 2024-05-03T09:14:23Z |
dc.type.driver.fl_str_mv |
conference object |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.22/25451 |
url |
http://hdl.handle.net/10400.22/25451 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
978-989-8533-92-0 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
IADIS |
publisher.none.fl_str_mv |
IADIS |
dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833600556777603072 |