Sentence embedding approach using LSTM auto-encoder for discussion threads summarization

Bibliographic Details
Main Author: Khan, Abdul Wali
Publication Date: 2023
Other Authors: Al-Obeidat, Feras, Khalid, Afsheen, Adnan, Amin, Moreira, Fernando
Format: Article
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/11328/5061
https://doi.org/10.2298/CSIS221210055K
Summary: Online discussion forums are repositories of valuable information where users interact and articulate their ideas, opinions, and share experiences about nu merous topics. They are internet-based online communities where users can ask for help and find the solution to a problem. On online discussion forums, a new user becomes exhausted from reading the significant number of replies in a discussion. An automated discussion thread summarizing system (DTS) is necessary to create a candid view of the entire discussion of a query. Most of the previous approaches for automated DTS use the continuous bag of words (CBOW) model as a sentence embedding tool, which is poor at capturing the overall meaning of the sentence and is unable to grasp word dependency. To overcome this limitation, we introduce the LSTM Auto-encoder as a sentence embedding technique to improve the per formance of DTS. The empirical result in the context of average precision, recall, and F-measure of the proposed approach with respect to ROGUE-1 and ROUGE-2 of two standard experimental datasets proves the effectiveness and efficiency of the proposed approach and outperforms the state-of-the-art CBOW model in sentence embedding tasks by boosting the performance of the automated DTS model.
id RCAP_4b34f8eb46c63a3e4149e349257bc6d4
oai_identifier_str oai:repositorio.upt.pt:11328/5061
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Sentence embedding approach using LSTM auto-encoder for discussion threads summarizationSentence embeddingLSTM Auto-encoderCBOWDeep learningMachine learningNLPOnline discussion forums are repositories of valuable information where users interact and articulate their ideas, opinions, and share experiences about nu merous topics. They are internet-based online communities where users can ask for help and find the solution to a problem. On online discussion forums, a new user becomes exhausted from reading the significant number of replies in a discussion. An automated discussion thread summarizing system (DTS) is necessary to create a candid view of the entire discussion of a query. Most of the previous approaches for automated DTS use the continuous bag of words (CBOW) model as a sentence embedding tool, which is poor at capturing the overall meaning of the sentence and is unable to grasp word dependency. To overcome this limitation, we introduce the LSTM Auto-encoder as a sentence embedding technique to improve the per formance of DTS. The empirical result in the context of average precision, recall, and F-measure of the proposed approach with respect to ROGUE-1 and ROUGE-2 of two standard experimental datasets proves the effectiveness and efficiency of the proposed approach and outperforms the state-of-the-art CBOW model in sentence embedding tasks by boosting the performance of the automated DTS model.ComSIS Consortium2023-09-01T14:15:45Z2023-09-012023-08-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfKhan, A. W., Al-Obeidat, F., Khalid, A., Adnan, A., & Moreira, F. (2023). Sentence embedding approach using LSTM auto-encoder for discussion threads summarization. Computer Science and Information Systems, OnLine-First, Issue 00, pp. 1-21. Repositório Institucional UPT. http://hdl.handle.net/11328/5061http://hdl.handle.net/11328/5061Khan, A. W., Al-Obeidat, F., Khalid, A., Adnan, A., & Moreira, F. (2023). Sentence embedding approach using LSTM auto-encoder for discussion threads summarization. Computer Science and Information Systems, OnLine-First, Issue 00, pp. 1-21. Repositório Institucional UPT. http://hdl.handle.net/11328/5061http://hdl.handle.net/11328/5061https://doi.org/10.2298/CSIS221210055Keng2683-3867https://doiserbia.nb.rs/Article.aspx?ID=1820-02142300055Khttp://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessKhan, Abdul WaliAl-Obeidat, FerasKhalid, AfsheenAdnan, AminMoreira, Fernandoreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-01-09T02:07:08Zoai:repositorio.upt.pt:11328/5061Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T19:27:48.876909Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Sentence embedding approach using LSTM auto-encoder for discussion threads summarization
title Sentence embedding approach using LSTM auto-encoder for discussion threads summarization
spellingShingle Sentence embedding approach using LSTM auto-encoder for discussion threads summarization
Khan, Abdul Wali
Sentence embedding
LSTM Auto-encoder
CBOW
Deep learning
Machine learning
NLP
title_short Sentence embedding approach using LSTM auto-encoder for discussion threads summarization
title_full Sentence embedding approach using LSTM auto-encoder for discussion threads summarization
title_fullStr Sentence embedding approach using LSTM auto-encoder for discussion threads summarization
title_full_unstemmed Sentence embedding approach using LSTM auto-encoder for discussion threads summarization
title_sort Sentence embedding approach using LSTM auto-encoder for discussion threads summarization
author Khan, Abdul Wali
author_facet Khan, Abdul Wali
Al-Obeidat, Feras
Khalid, Afsheen
Adnan, Amin
Moreira, Fernando
author_role author
author2 Al-Obeidat, Feras
Khalid, Afsheen
Adnan, Amin
Moreira, Fernando
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Khan, Abdul Wali
Al-Obeidat, Feras
Khalid, Afsheen
Adnan, Amin
Moreira, Fernando
dc.subject.por.fl_str_mv Sentence embedding
LSTM Auto-encoder
CBOW
Deep learning
Machine learning
NLP
topic Sentence embedding
LSTM Auto-encoder
CBOW
Deep learning
Machine learning
NLP
description Online discussion forums are repositories of valuable information where users interact and articulate their ideas, opinions, and share experiences about nu merous topics. They are internet-based online communities where users can ask for help and find the solution to a problem. On online discussion forums, a new user becomes exhausted from reading the significant number of replies in a discussion. An automated discussion thread summarizing system (DTS) is necessary to create a candid view of the entire discussion of a query. Most of the previous approaches for automated DTS use the continuous bag of words (CBOW) model as a sentence embedding tool, which is poor at capturing the overall meaning of the sentence and is unable to grasp word dependency. To overcome this limitation, we introduce the LSTM Auto-encoder as a sentence embedding technique to improve the per formance of DTS. The empirical result in the context of average precision, recall, and F-measure of the proposed approach with respect to ROGUE-1 and ROUGE-2 of two standard experimental datasets proves the effectiveness and efficiency of the proposed approach and outperforms the state-of-the-art CBOW model in sentence embedding tasks by boosting the performance of the automated DTS model.
publishDate 2023
dc.date.none.fl_str_mv 2023-09-01T14:15:45Z
2023-09-01
2023-08-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv Khan, A. W., Al-Obeidat, F., Khalid, A., Adnan, A., & Moreira, F. (2023). Sentence embedding approach using LSTM auto-encoder for discussion threads summarization. Computer Science and Information Systems, OnLine-First, Issue 00, pp. 1-21. Repositório Institucional UPT. http://hdl.handle.net/11328/5061
http://hdl.handle.net/11328/5061
Khan, A. W., Al-Obeidat, F., Khalid, A., Adnan, A., & Moreira, F. (2023). Sentence embedding approach using LSTM auto-encoder for discussion threads summarization. Computer Science and Information Systems, OnLine-First, Issue 00, pp. 1-21. Repositório Institucional UPT. http://hdl.handle.net/11328/5061
http://hdl.handle.net/11328/5061
https://doi.org/10.2298/CSIS221210055K
identifier_str_mv Khan, A. W., Al-Obeidat, F., Khalid, A., Adnan, A., & Moreira, F. (2023). Sentence embedding approach using LSTM auto-encoder for discussion threads summarization. Computer Science and Information Systems, OnLine-First, Issue 00, pp. 1-21. Repositório Institucional UPT. http://hdl.handle.net/11328/5061
url http://hdl.handle.net/11328/5061
https://doi.org/10.2298/CSIS221210055K
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 2683-3867
https://doiserbia.nb.rs/Article.aspx?ID=1820-02142300055K
dc.rights.driver.fl_str_mv http://creativecommons.org/licenses/by/4.0/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by/4.0/
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv ComSIS Consortium
publisher.none.fl_str_mv ComSIS Consortium
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833598114491006976