Research on Text Abstract Generation Based on T5 PEGASUS and DeepKE 

  • 1. Henan Vocational College of Water Conservancy and Environment; 2. North China University of Water Resources and Electric Power

Published online: 2024-11-01

Abstract

To reduce the false information and repetition found in summaries generated by the T5 PEGASUS model, this paper proposes T5 PEGASUS-DK, a text summarization model that combines T5 PEGASUS with the DeepKE framework. First, the Pkuseg segmentation method is used to improve word segmentation. Then, the DeepKE framework is used to extract triples from the text. Finally, the word vectors of the triples are concatenated with the representation vector of the text. By establishing a mapping between the text and its triples, the model can draw on factual knowledge and select information that is more consistent with the original content for the summary. Experimental results show that T5 PEGASUS-DK achieves the highest ROUGE scores and that its generated summaries are more faithful, more coherent, and more consistent with the original content.
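
The concatenation step described above can be pictured with a toy sketch. This is not the authors' implementation. It assumes the text has already been segmented (for example by Pkuseg) and the triples have already been extracted (for example by DeepKE), it replaces learned embeddings with a hash-based stand-in, and it assumes the two vector sets are joined along the sequence axis. All function names, the embedding width, and the sample data are hypothetical.

```python
import hashlib
from typing import List, Tuple

Triple = Tuple[str, str, str]  # (head, relation, tail)
DIM = 8  # toy embedding width (assumption, not from the paper)


def toy_embed(token: str, dim: int = DIM) -> List[float]:
    """Deterministic stand-in for a learned embedding lookup."""
    digest = hashlib.sha256(token.encode("utf-8")).digest()
    return [b / 255.0 for b in digest[:dim]]


def embed_sequence(tokens: List[str]) -> List[List[float]]:
    """Embed each token independently."""
    return [toy_embed(t) for t in tokens]


def flatten_triples(triples: List[Triple]) -> List[str]:
    """Turn [(h, r, t), ...] into a flat token list [h, r, t, ...]."""
    return [element for triple in triples for element in triple]


def fuse(text_tokens: List[str], triples: List[Triple]) -> List[List[float]]:
    """Concatenate text vectors and triple vectors along the sequence axis."""
    text_vecs = embed_sequence(text_tokens)
    triple_vecs = embed_sequence(flatten_triples(triples))
    return text_vecs + triple_vecs


# Made-up sample input: pre-segmented tokens and one pre-extracted triple.
tokens = ["The", "Yellow River", "flows", "through", "Henan"]
triples = [("Yellow River", "flows_through", "Henan")]
fused = fuse(tokens, triples)
```

In a real system the fused sequence would be passed to the encoder, so that the decoder can attend to both the source tokens and the extracted facts.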

Cite this article

ZHANG Qi, WANG Ling, SHEN Jie. Research on Text Abstract Generation Based on T5 PEGASUS and DeepKE [J]. Computer & Telecommunication, 2024, 1(6): 62-67. DOI: 10.15966/j.cnki.dnydx.2024.06.016
