Bengali Abstractive News Summarization(BANS): A Neural Attention Approach

Bhattacharjee, Prithwiraj; Mallick, Avi; Islam, Md Saiful; Marium-E-Jannat

doi:10.1007/978-981-33-4673-4_4

Computer Science > Computation and Language

arXiv:2012.01747 (cs)

[Submitted on 3 Dec 2020]

Title:Bengali Abstractive News Summarization(BANS): A Neural Attention Approach

Authors:Prithwiraj Bhattacharjee, Avi Mallick, Md Saiful Islam, Marium-E-Jannat

View PDF

Abstract:Abstractive summarization is the process of generating novel sentences based on the information extracted from the original text document while retaining the context. Due to abstractive summarization's underlying complexities, most of the past research work has been done on the extractive summarization approach. Nevertheless, with the triumph of the sequence-to-sequence (seq2seq) model, abstractive summarization becomes more viable. Although a significant number of notable research has been done in the English language based on abstractive summarization, only a couple of works have been done on Bengali abstractive news summarization (BANS). In this article, we presented a seq2seq based Long Short-Term Memory (LSTM) network model with attention at encoder-decoder. Our proposed system deploys a local attention-based model that produces a long sequence of words with lucid and human-like generated sentences with noteworthy information of the original document. We also prepared a dataset of more than 19k articles and corresponding human-written summaries collected from bangla.bdnews24.com1 which is till now the most extensive dataset for Bengali news document summarization and publicly published in Kaggle2. We evaluated our model qualitatively and quantitatively and compared it with other published results. It showed significant improvement in terms of human evaluation scores with state-of-the-art approaches for BANS.

Comments:	10 Pages, 2 figures, 4 tables, 2nd International Conference on Trends in Computational and Cognitive Engineering(TCCE-2020)
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2012.01747 [cs.CL]
	(or arXiv:2012.01747v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2012.01747
Journal reference:	2nd International Conference on Trends in Computational and Cognitive Engineering, 2020
Related DOI:	https://doi.org/10.1007/978-981-33-4673-4_4

Submission history

From: Prithwiraj Bhattacharjee [view email]
[v1] Thu, 3 Dec 2020 08:17:31 UTC (462 KB)

Computer Science > Computation and Language

Title:Bengali Abstractive News Summarization(BANS): A Neural Attention Approach

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Bengali Abstractive News Summarization(BANS): A Neural Attention Approach

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators