#3837. Low resource language specific pre-processing and features for sentiment analysis task

September 2026publication date
Proposal available till 11-05-2025
4 total number of authors per manuscript0 $

The title of the journal is available only for the authors who have already paid for
Journal’s subject area:
Language and Linguistics;
Linguistics and Language;
Library and Information Sciences;
Places in the authors’ list:
place 1place 2place 3place 4
FreeFreeFreeFree
2350 $1200 $1050 $900 $
Contract3837.1 Contract3837.2 Contract3837.3 Contract3837.4
1 place - free (for sale)
2 place - free (for sale)
3 place - free (for sale)
4 place - free (for sale)

Abstract:
Sentiment analysis is a classification task where polarity of textual data is identified, i.e. to analyze whether a sentence or document expresses a negative, positive or neutral sentiment. In this work, we report the sentiment analysis for a language using different types of machine learning based approaches. The dataset used in our work is collected from newspapers. The novelty of this work is that we carry out language specific pre-processing tasks such as transliteration, building negative morpheme based lexicon and filtering of noisy words. Using them as additional linguistic features in our models improves the classification result in terms of precision, recall and F-score. Based on this result, we attempt to classify the sentiment of news articles during a certain period of time. Further, we report the finding of deep learning based approaches on the same dataset.
Keywords:
BM25; Deep learning; Ensembled classifier; Low resource; Machine learning; Morphology; Pre-processing; Sentiment analysis

Contacts :
0