Improve Performance of Abstractive Text Summarization using Bidirectional LSTM with Content-based Attention

Journal: GRENZE International Journal of Engineering and Technology
Authors: Dakshata Argade, Vaishali Khairnar
Volume: 10 Issue: 2
Grenze ID: 01.GIJET.10.2.405 Pages: 5259-5266

Abstract

Easy access of the internet generating tremendous amount of data every day. The data is generated at high volume with high velocity and to know the essence of data automatic text summarization is needed. The purpose of research is to design Abstractive Text Summarizer using deep learning techniques. In this study, we model the summarization task as a Sequenceto- Sequence (Seq2Seq) problem. The paper experiments with encoder decoder based different variants of RNN such as vanilla RNN, Long Short Term Memory (LSTM) and Gated Recurrent Unit (GRU). The problem with existing approach is that the context vector contains information from the last recent part of the sentence and forgets the relevance of the initial time step input. To solve this problem, the content-based attention mechanism is applied which uses cosine similarity to generate context vector at each time step. The CNN/Daily Mail dataset is used for the study. The performance of models is measured using the ROUGE score to select the ideal model for summarization. The experimental results show bidirectional LSTM combined with a content-based attention mechanism enhances summarization performance. The experimental data demonstrates that content-based attention outperforms over dot product based attention.

Download Now << BACK

GIJET