Please use this identifier to cite or link to this item:
https://dl.ucsc.cmb.ac.lk/jspui/handle/123456789/4922
Title: | Enhanced Tamil Morphological Analysis through Deep Learning and Embedding-Based Similarity Techniques |
Authors: | Nareashkaan, V. |
Issue Date: | 25-Apr-2025 |
Abstract: | Abstract Morphological analysis is vital in NLP, especially for morphologically rich languages like Tamil, which pose challenges due to complex inflectional and derivational forms. Traditional rule-based methods struggle with scalability and unseen words, while deep learning remains underexplored for Tamil. This research proposes a hybrid approach combining deep learning for lemma prediction and an embedding-based similarity method for grammatical feature prediction. Various architectures—including Recurrent Neural Network (RNN), Long Short term memory (LSTM) and Gradient recurrent unit (GRU)—are evaluated for lemma prediction, while FastText embeddings enable effective feature transfer for unseen words, addressing the out-of-vocabulary problem. The model is trained on curated word-lemma pairs and grammatical annotations, demonstrating high accuracy and generalization. This work offers a scalable, low-resourcefriendly solution for Tamil morphological analysis and contributes to advancing Tamil NLP. |
URI: | https://dl.ucsc.cmb.ac.lk/jspui/handle/123456789/4922 |
Appears in Collections: | 2025 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
20001193-V.Nareashkaan - Mr. NAREASHKAAN V..pdf | 1.83 MB | Adobe PDF | View/Open |
Items in UCSC Digital Library are protected by copyright, with all rights reserved, unless otherwise indicated.