Email Classification: a case study

Detalhes bibliográficos
Autor(a) principal: André Ricardo Azevedo Gonçalves da Silva
Data de Publicação: 2016
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Texto Completo: https://hdl.handle.net/10216/88362
Resumo: Internet dependance on email has been frequent since its early days. In the present days, electronic mail is widely used in a professional and personal context. Although this service was developed as a way of communication, nowadays it serves many other purposes. The majority of services available online will require an email address in order to authenticate or as a bridge of communication between the user and the service. The average number of emails sent and received, by the average user, is in the order of the hundreds per day, and these emails can be of varying categories: social, professional, notifications, marketing, transactional, emails which warrant no response, emails to send files, emails requiring response, among others with different purposes. This originates an information overload problem, that proves difficult to be completely solved manually by the email address owner. Therefore, there is a growing need to develop systems that can automatically learn and recommend users effective ways to organize their email information, which can aggregate emails into smaller groups to be easily interpreted by the user, expediting the process of reading and consulting the mailbox. To alleviate this information overload problem there are several possible approaches and techniques, such as machine learning to help on email classification and clustering, in order to find new subsets of emails in the massive inboxes we all have, now or in the future. After a careful review of the state of the art on email classification and grouping techniques, this work will enumerate and select the most effective approaches for the problem at hand, and will adapt them to a very concrete case study, a desktop email client under development at Mailcube Lda. The approach in mind will follow a competitive learning paradigm, which means that emails will compete with each other in order to find subsets in the inbox. It will also follow a reinforcement learning paradigm to add sensitiveness to user profile and interaction history. At the end, the resulting system is expected to suggest the user to organize his inbox into relevant groups of emails, based on learning users' interactions and continuously adapting to the arrival of new emails, improving the overall user experience and saving precious time for the users.
id RCAP_9608094d1f02fb0af9d69bca892833ad
oai_identifier_str oai:repositorio-aberto.up.pt:10216/88362
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Email Classification: a case studyCiências da computação e da informaçãoComputer and information sciencesInternet dependance on email has been frequent since its early days. In the present days, electronic mail is widely used in a professional and personal context. Although this service was developed as a way of communication, nowadays it serves many other purposes. The majority of services available online will require an email address in order to authenticate or as a bridge of communication between the user and the service. The average number of emails sent and received, by the average user, is in the order of the hundreds per day, and these emails can be of varying categories: social, professional, notifications, marketing, transactional, emails which warrant no response, emails to send files, emails requiring response, among others with different purposes. This originates an information overload problem, that proves difficult to be completely solved manually by the email address owner. Therefore, there is a growing need to develop systems that can automatically learn and recommend users effective ways to organize their email information, which can aggregate emails into smaller groups to be easily interpreted by the user, expediting the process of reading and consulting the mailbox. To alleviate this information overload problem there are several possible approaches and techniques, such as machine learning to help on email classification and clustering, in order to find new subsets of emails in the massive inboxes we all have, now or in the future. After a careful review of the state of the art on email classification and grouping techniques, this work will enumerate and select the most effective approaches for the problem at hand, and will adapt them to a very concrete case study, a desktop email client under development at Mailcube Lda. The approach in mind will follow a competitive learning paradigm, which means that emails will compete with each other in order to find subsets in the inbox. It will also follow a reinforcement learning paradigm to add sensitiveness to user profile and interaction history. At the end, the resulting system is expected to suggest the user to organize his inbox into relevant groups of emails, based on learning users' interactions and continuously adapting to the arrival of new emails, improving the overall user experience and saving precious time for the users.2016-07-062016-07-06T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/88362TID:201296845engAndré Ricardo Azevedo Gonçalves da Silvainfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-02-27T16:58:25Zoai:repositorio-aberto.up.pt:10216/88362Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T21:59:08.622117Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Email Classification: a case study
title Email Classification: a case study
spellingShingle Email Classification: a case study
André Ricardo Azevedo Gonçalves da Silva
Ciências da computação e da informação
Computer and information sciences
title_short Email Classification: a case study
title_full Email Classification: a case study
title_fullStr Email Classification: a case study
title_full_unstemmed Email Classification: a case study
title_sort Email Classification: a case study
author André Ricardo Azevedo Gonçalves da Silva
author_facet André Ricardo Azevedo Gonçalves da Silva
author_role author
dc.contributor.author.fl_str_mv André Ricardo Azevedo Gonçalves da Silva
dc.subject.por.fl_str_mv Ciências da computação e da informação
Computer and information sciences
topic Ciências da computação e da informação
Computer and information sciences
description Internet dependance on email has been frequent since its early days. In the present days, electronic mail is widely used in a professional and personal context. Although this service was developed as a way of communication, nowadays it serves many other purposes. The majority of services available online will require an email address in order to authenticate or as a bridge of communication between the user and the service. The average number of emails sent and received, by the average user, is in the order of the hundreds per day, and these emails can be of varying categories: social, professional, notifications, marketing, transactional, emails which warrant no response, emails to send files, emails requiring response, among others with different purposes. This originates an information overload problem, that proves difficult to be completely solved manually by the email address owner. Therefore, there is a growing need to develop systems that can automatically learn and recommend users effective ways to organize their email information, which can aggregate emails into smaller groups to be easily interpreted by the user, expediting the process of reading and consulting the mailbox. To alleviate this information overload problem there are several possible approaches and techniques, such as machine learning to help on email classification and clustering, in order to find new subsets of emails in the massive inboxes we all have, now or in the future. After a careful review of the state of the art on email classification and grouping techniques, this work will enumerate and select the most effective approaches for the problem at hand, and will adapt them to a very concrete case study, a desktop email client under development at Mailcube Lda. The approach in mind will follow a competitive learning paradigm, which means that emails will compete with each other in order to find subsets in the inbox. It will also follow a reinforcement learning paradigm to add sensitiveness to user profile and interaction history. At the end, the resulting system is expected to suggest the user to organize his inbox into relevant groups of emails, based on learning users' interactions and continuously adapting to the arrival of new emails, improving the overall user experience and saving precious time for the users.
publishDate 2016
dc.date.none.fl_str_mv 2016-07-06
2016-07-06T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10216/88362
TID:201296845
url https://hdl.handle.net/10216/88362
identifier_str_mv TID:201296845
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833599505659854849