Please use this identifier to cite or link to this item: https://dl.ucsc.cmb.ac.lk/jspui/handle/123456789/4609
Title: Automatic Sinhala Text Summarization for Government Gazettes using Abstractive and Extractive Methods
Authors: Jayawardane, H.M.R.Y
Issue Date: 1-Jul-2022
Abstract: During this new era, information is very accessible and the amount of text data from various sources has increased dramatically. However, most of them can distract the reader from the most important information due to using larger paragraphs, examples, complex arguments, grammar, and some vocabularies. Since time is one of the most important facts in the 21st century, people want to summarize these contexts and retrieve only the important information in a shorter time. The Gazettes are important to people in different way and there was no attempt on summarization solution for the area, this research emphases on the summarizing gazettes in the Sinhala language. This research solution is to provide summarized output for Sinhala gazettes by identifying the most important and relevant sentences based on linguistic and statistical features of a given text, using an abstractive and extractive approaches. Even though there are very few attempts done on the Sinhala Summarization this is the first attempt on summarizing Sinhala gazettes. The project was evaluated by machine summaries with the summaries created by the author. The system has been tested with 450 actual Sinhala gazettes and final results were attached in the Appendix section. Further, this provides a turning point for future researches on automatic text summarization in Sinhala language.
URI: https://dl.ucsc.cmb.ac.lk/jspui/handle/123456789/4609
Appears in Collections:2021

Files in This Item:
File Description SizeFormat 
2017 MCS 040.pdf2.23 MBAdobe PDFView/Open


Items in UCSC Digital Library are protected by copyright, with all rights reserved, unless otherwise indicated.