Email Classification: a case study
Autor(a) principal: | |
---|---|
Data de Publicação: | 2016 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Texto Completo: | https://hdl.handle.net/10216/88362 |
Resumo: | Internet dependance on email has been frequent since its early days. In the present days, electronic mail is widely used in a professional and personal context. Although this service was developed as a way of communication, nowadays it serves many other purposes. The majority of services available online will require an email address in order to authenticate or as a bridge of communication between the user and the service. The average number of emails sent and received, by the average user, is in the order of the hundreds per day, and these emails can be of varying categories: social, professional, notifications, marketing, transactional, emails which warrant no response, emails to send files, emails requiring response, among others with different purposes. This originates an information overload problem, that proves difficult to be completely solved manually by the email address owner. Therefore, there is a growing need to develop systems that can automatically learn and recommend users effective ways to organize their email information, which can aggregate emails into smaller groups to be easily interpreted by the user, expediting the process of reading and consulting the mailbox. To alleviate this information overload problem there are several possible approaches and techniques, such as machine learning to help on email classification and clustering, in order to find new subsets of emails in the massive inboxes we all have, now or in the future. After a careful review of the state of the art on email classification and grouping techniques, this work will enumerate and select the most effective approaches for the problem at hand, and will adapt them to a very concrete case study, a desktop email client under development at Mailcube Lda. The approach in mind will follow a competitive learning paradigm, which means that emails will compete with each other in order to find subsets in the inbox. It will also follow a reinforcement learning paradigm to add sensitiveness to user profile and interaction history. At the end, the resulting system is expected to suggest the user to organize his inbox into relevant groups of emails, based on learning users' interactions and continuously adapting to the arrival of new emails, improving the overall user experience and saving precious time for the users. |
id |
RCAP_9608094d1f02fb0af9d69bca892833ad |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/88362 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
Email Classification: a case studyCiências da computação e da informaçãoComputer and information sciencesInternet dependance on email has been frequent since its early days. In the present days, electronic mail is widely used in a professional and personal context. Although this service was developed as a way of communication, nowadays it serves many other purposes. The majority of services available online will require an email address in order to authenticate or as a bridge of communication between the user and the service. The average number of emails sent and received, by the average user, is in the order of the hundreds per day, and these emails can be of varying categories: social, professional, notifications, marketing, transactional, emails which warrant no response, emails to send files, emails requiring response, among others with different purposes. This originates an information overload problem, that proves difficult to be completely solved manually by the email address owner. Therefore, there is a growing need to develop systems that can automatically learn and recommend users effective ways to organize their email information, which can aggregate emails into smaller groups to be easily interpreted by the user, expediting the process of reading and consulting the mailbox. To alleviate this information overload problem there are several possible approaches and techniques, such as machine learning to help on email classification and clustering, in order to find new subsets of emails in the massive inboxes we all have, now or in the future. After a careful review of the state of the art on email classification and grouping techniques, this work will enumerate and select the most effective approaches for the problem at hand, and will adapt them to a very concrete case study, a desktop email client under development at Mailcube Lda. The approach in mind will follow a competitive learning paradigm, which means that emails will compete with each other in order to find subsets in the inbox. It will also follow a reinforcement learning paradigm to add sensitiveness to user profile and interaction history. At the end, the resulting system is expected to suggest the user to organize his inbox into relevant groups of emails, based on learning users' interactions and continuously adapting to the arrival of new emails, improving the overall user experience and saving precious time for the users.2016-07-062016-07-06T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/88362TID:201296845engAndré Ricardo Azevedo Gonçalves da Silvainfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-02-27T16:58:25Zoai:repositorio-aberto.up.pt:10216/88362Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T21:59:08.622117Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
Email Classification: a case study |
title |
Email Classification: a case study |
spellingShingle |
Email Classification: a case study André Ricardo Azevedo Gonçalves da Silva Ciências da computação e da informação Computer and information sciences |
title_short |
Email Classification: a case study |
title_full |
Email Classification: a case study |
title_fullStr |
Email Classification: a case study |
title_full_unstemmed |
Email Classification: a case study |
title_sort |
Email Classification: a case study |
author |
André Ricardo Azevedo Gonçalves da Silva |
author_facet |
André Ricardo Azevedo Gonçalves da Silva |
author_role |
author |
dc.contributor.author.fl_str_mv |
André Ricardo Azevedo Gonçalves da Silva |
dc.subject.por.fl_str_mv |
Ciências da computação e da informação Computer and information sciences |
topic |
Ciências da computação e da informação Computer and information sciences |
description |
Internet dependance on email has been frequent since its early days. In the present days, electronic mail is widely used in a professional and personal context. Although this service was developed as a way of communication, nowadays it serves many other purposes. The majority of services available online will require an email address in order to authenticate or as a bridge of communication between the user and the service. The average number of emails sent and received, by the average user, is in the order of the hundreds per day, and these emails can be of varying categories: social, professional, notifications, marketing, transactional, emails which warrant no response, emails to send files, emails requiring response, among others with different purposes. This originates an information overload problem, that proves difficult to be completely solved manually by the email address owner. Therefore, there is a growing need to develop systems that can automatically learn and recommend users effective ways to organize their email information, which can aggregate emails into smaller groups to be easily interpreted by the user, expediting the process of reading and consulting the mailbox. To alleviate this information overload problem there are several possible approaches and techniques, such as machine learning to help on email classification and clustering, in order to find new subsets of emails in the massive inboxes we all have, now or in the future. After a careful review of the state of the art on email classification and grouping techniques, this work will enumerate and select the most effective approaches for the problem at hand, and will adapt them to a very concrete case study, a desktop email client under development at Mailcube Lda. The approach in mind will follow a competitive learning paradigm, which means that emails will compete with each other in order to find subsets in the inbox. It will also follow a reinforcement learning paradigm to add sensitiveness to user profile and interaction history. At the end, the resulting system is expected to suggest the user to organize his inbox into relevant groups of emails, based on learning users' interactions and continuously adapting to the arrival of new emails, improving the overall user experience and saving precious time for the users. |
publishDate |
2016 |
dc.date.none.fl_str_mv |
2016-07-06 2016-07-06T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/88362 TID:201296845 |
url |
https://hdl.handle.net/10216/88362 |
identifier_str_mv |
TID:201296845 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833599505659854849 |