IMPROVING COVERAGE AND NOVELTY OF ABSTRACTIVE TEXT SUMMARIZATION USING TRANSFER LEARNING AND DIVIDE AND CONQUER APPROACHES

Authors

  • Ayham Alomari Universiti Malaya
  • Norisma Idris Department of Artificial Intelligence Faculty of Computer Science and Information Technology
  • Aznul Qalid Department of Artificial Intelligence Faculty of Computer Science and Information Technology
  • Izzat Alsmadi Department of Computing and Cyber Security Science and Technology

DOI:

https://doi.org/10.22452/mjcs.vol36no3.4

Keywords:

Abstractive Summarization, Novelty, Coverage, Warm-Started Models, Transfer Learning, Deep Learning

Abstract

Automatic Text Summarization (ATS) models yield outcomes with insufficient coverage of crucial details and poor degrees of novelty. The first issue resulted from the lengthy input, while the second problem resulted from the characteristics of the training dataset itself. This research employs the divide-and-conquer approach to address the first issue by breaking the lengthy input into smaller pieces to be summarized, followed by the conquest of the results in order to cover more significant details. For the second challenge, these chunks are summarized by models trained on datasets with higher novelty levels in order to produce more human-like and concise summaries with more novel words that do not appear in the input article. The results demonstrate an improvement in both coverage and novelty levels. Moreover, we defined a new metric to measure the novelty of the summary. Finally, we investigated the findings to discover whether the novelty is influenced more by the dataset itself, as in CNN/DM, or by the training model and its training objective, as in Pegasus.

 

Downloads

Download data is not yet available.

Author Biographies

Ayham Alomari, Universiti Malaya

 

 

Norisma Idris, Department of Artificial Intelligence Faculty of Computer Science and Information Technology

 

 

Izzat Alsmadi, Department of Computing and Cyber Security Science and Technology

 

 

Downloads

Published

2023-07-31

How to Cite

Ayham Alomari, Norisma Idris, Qalid, A., & Izzat Alsmadi. (2023). IMPROVING COVERAGE AND NOVELTY OF ABSTRACTIVE TEXT SUMMARIZATION USING TRANSFER LEARNING AND DIVIDE AND CONQUER APPROACHES. Malaysian Journal of Computer Science, 36(3), 271–288. https://doi.org/10.22452/mjcs.vol36no3.4