Sentence embedding approach using LSTM auto-encoder for discussion threads summarization
Main Author: | |
---|---|
Publication Date: | 2023 |
Other Authors: | , , , |
Format: | Article |
Language: | eng |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | http://hdl.handle.net/11328/5061 https://doi.org/10.2298/CSIS221210055K |
Summary: | Online discussion forums are repositories of valuable information where users interact and articulate their ideas, opinions, and share experiences about nu merous topics. They are internet-based online communities where users can ask for help and find the solution to a problem. On online discussion forums, a new user becomes exhausted from reading the significant number of replies in a discussion. An automated discussion thread summarizing system (DTS) is necessary to create a candid view of the entire discussion of a query. Most of the previous approaches for automated DTS use the continuous bag of words (CBOW) model as a sentence embedding tool, which is poor at capturing the overall meaning of the sentence and is unable to grasp word dependency. To overcome this limitation, we introduce the LSTM Auto-encoder as a sentence embedding technique to improve the per formance of DTS. The empirical result in the context of average precision, recall, and F-measure of the proposed approach with respect to ROGUE-1 and ROUGE-2 of two standard experimental datasets proves the effectiveness and efficiency of the proposed approach and outperforms the state-of-the-art CBOW model in sentence embedding tasks by boosting the performance of the automated DTS model. |
id |
RCAP_4b34f8eb46c63a3e4149e349257bc6d4 |
---|---|
oai_identifier_str |
oai:repositorio.upt.pt:11328/5061 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
Sentence embedding approach using LSTM auto-encoder for discussion threads summarizationSentence embeddingLSTM Auto-encoderCBOWDeep learningMachine learningNLPOnline discussion forums are repositories of valuable information where users interact and articulate their ideas, opinions, and share experiences about nu merous topics. They are internet-based online communities where users can ask for help and find the solution to a problem. On online discussion forums, a new user becomes exhausted from reading the significant number of replies in a discussion. An automated discussion thread summarizing system (DTS) is necessary to create a candid view of the entire discussion of a query. Most of the previous approaches for automated DTS use the continuous bag of words (CBOW) model as a sentence embedding tool, which is poor at capturing the overall meaning of the sentence and is unable to grasp word dependency. To overcome this limitation, we introduce the LSTM Auto-encoder as a sentence embedding technique to improve the per formance of DTS. The empirical result in the context of average precision, recall, and F-measure of the proposed approach with respect to ROGUE-1 and ROUGE-2 of two standard experimental datasets proves the effectiveness and efficiency of the proposed approach and outperforms the state-of-the-art CBOW model in sentence embedding tasks by boosting the performance of the automated DTS model.ComSIS Consortium2023-09-01T14:15:45Z2023-09-012023-08-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfKhan, A. W., Al-Obeidat, F., Khalid, A., Adnan, A., & Moreira, F. (2023). Sentence embedding approach using LSTM auto-encoder for discussion threads summarization. Computer Science and Information Systems, OnLine-First, Issue 00, pp. 1-21. Repositório Institucional UPT. http://hdl.handle.net/11328/5061http://hdl.handle.net/11328/5061Khan, A. W., Al-Obeidat, F., Khalid, A., Adnan, A., & Moreira, F. (2023). Sentence embedding approach using LSTM auto-encoder for discussion threads summarization. Computer Science and Information Systems, OnLine-First, Issue 00, pp. 1-21. Repositório Institucional UPT. http://hdl.handle.net/11328/5061http://hdl.handle.net/11328/5061https://doi.org/10.2298/CSIS221210055Keng2683-3867https://doiserbia.nb.rs/Article.aspx?ID=1820-02142300055Khttp://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessKhan, Abdul WaliAl-Obeidat, FerasKhalid, AfsheenAdnan, AminMoreira, Fernandoreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-01-09T02:07:08Zoai:repositorio.upt.pt:11328/5061Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T19:27:48.876909Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
Sentence embedding approach using LSTM auto-encoder for discussion threads summarization |
title |
Sentence embedding approach using LSTM auto-encoder for discussion threads summarization |
spellingShingle |
Sentence embedding approach using LSTM auto-encoder for discussion threads summarization Khan, Abdul Wali Sentence embedding LSTM Auto-encoder CBOW Deep learning Machine learning NLP |
title_short |
Sentence embedding approach using LSTM auto-encoder for discussion threads summarization |
title_full |
Sentence embedding approach using LSTM auto-encoder for discussion threads summarization |
title_fullStr |
Sentence embedding approach using LSTM auto-encoder for discussion threads summarization |
title_full_unstemmed |
Sentence embedding approach using LSTM auto-encoder for discussion threads summarization |
title_sort |
Sentence embedding approach using LSTM auto-encoder for discussion threads summarization |
author |
Khan, Abdul Wali |
author_facet |
Khan, Abdul Wali Al-Obeidat, Feras Khalid, Afsheen Adnan, Amin Moreira, Fernando |
author_role |
author |
author2 |
Al-Obeidat, Feras Khalid, Afsheen Adnan, Amin Moreira, Fernando |
author2_role |
author author author author |
dc.contributor.author.fl_str_mv |
Khan, Abdul Wali Al-Obeidat, Feras Khalid, Afsheen Adnan, Amin Moreira, Fernando |
dc.subject.por.fl_str_mv |
Sentence embedding LSTM Auto-encoder CBOW Deep learning Machine learning NLP |
topic |
Sentence embedding LSTM Auto-encoder CBOW Deep learning Machine learning NLP |
description |
Online discussion forums are repositories of valuable information where users interact and articulate their ideas, opinions, and share experiences about nu merous topics. They are internet-based online communities where users can ask for help and find the solution to a problem. On online discussion forums, a new user becomes exhausted from reading the significant number of replies in a discussion. An automated discussion thread summarizing system (DTS) is necessary to create a candid view of the entire discussion of a query. Most of the previous approaches for automated DTS use the continuous bag of words (CBOW) model as a sentence embedding tool, which is poor at capturing the overall meaning of the sentence and is unable to grasp word dependency. To overcome this limitation, we introduce the LSTM Auto-encoder as a sentence embedding technique to improve the per formance of DTS. The empirical result in the context of average precision, recall, and F-measure of the proposed approach with respect to ROGUE-1 and ROUGE-2 of two standard experimental datasets proves the effectiveness and efficiency of the proposed approach and outperforms the state-of-the-art CBOW model in sentence embedding tasks by boosting the performance of the automated DTS model. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-09-01T14:15:45Z 2023-09-01 2023-08-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
Khan, A. W., Al-Obeidat, F., Khalid, A., Adnan, A., & Moreira, F. (2023). Sentence embedding approach using LSTM auto-encoder for discussion threads summarization. Computer Science and Information Systems, OnLine-First, Issue 00, pp. 1-21. Repositório Institucional UPT. http://hdl.handle.net/11328/5061 http://hdl.handle.net/11328/5061 Khan, A. W., Al-Obeidat, F., Khalid, A., Adnan, A., & Moreira, F. (2023). Sentence embedding approach using LSTM auto-encoder for discussion threads summarization. Computer Science and Information Systems, OnLine-First, Issue 00, pp. 1-21. Repositório Institucional UPT. http://hdl.handle.net/11328/5061 http://hdl.handle.net/11328/5061 https://doi.org/10.2298/CSIS221210055K |
identifier_str_mv |
Khan, A. W., Al-Obeidat, F., Khalid, A., Adnan, A., & Moreira, F. (2023). Sentence embedding approach using LSTM auto-encoder for discussion threads summarization. Computer Science and Information Systems, OnLine-First, Issue 00, pp. 1-21. Repositório Institucional UPT. http://hdl.handle.net/11328/5061 |
url |
http://hdl.handle.net/11328/5061 https://doi.org/10.2298/CSIS221210055K |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
2683-3867 https://doiserbia.nb.rs/Article.aspx?ID=1820-02142300055K |
dc.rights.driver.fl_str_mv |
http://creativecommons.org/licenses/by/4.0/ info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
http://creativecommons.org/licenses/by/4.0/ |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
ComSIS Consortium |
publisher.none.fl_str_mv |
ComSIS Consortium |
dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833598114491006976 |