Deep Learning Model for Sentiment Analysis on Short Informal Texts

Poetro B.S.W., Haviana S.F.C.

Abstract

This paper proposes a classification model to classify short informal texts. Those short informal texts were texts that were noisy, typos, irregular, and could consist of a very small number of words or even only a single word. The proposed model was trained using a dataset collected from student comments from an application called Evaluasi Dosen Oleh Mahasiswa (EDOM). This application assesses the lecturers using questionnaires filled out by students. It also records the student's comments but is not part of the evaluation calculation, therefore this work makes the data possible to be part of the assessment through sentiment analysis. This work focuses on building suitable preprocessing algorithm and building a simple deep learning network. The preprocessing algorithm was based on multiple word n-gram and Term Frequency-Inverse Document Frequency (TF-IDF) vectorization, and the network was built with a relatively shallow network. To evaluate the model in real usage, an application was built. The results were very convincing, reaching 0.979 in accuracy and 0.63 in F1-Score. Nonetheless, the imbalanced dataset was the only factor that needed to be investigated further for better overall performance.

Journal
Indonesian Journal of Electrical Engineering and Informatics
Page Range
82-89
Publication date
2022
Total citations
Sentiment analysis of short informal texts

Kiritchenko S., Mohammad S.M., Zhu X.

Hoax classification and sentiment analysis of Indonesian news using Naive Bayes optimization

Basuki R.S., Nugraha A., Nugroho A.A., Rachmawanto E.H., Santoso H.A., Setiadi D.R.I.M.

Sentiment Analysis of Cyberbullying on Instagram User Comments

Bimantara A. A., Larasati A., Naf'an M. Z., Nugraha N. A. S., Risondang E. M.

Sarcasm Detection For Sentiment Analysis in Indonesian Tweets

Musdholifah A., Sari A. K., Yunitasari Y.

Adam: A method for stochastic optimization

Ba J.L., Kingma D.P.

Rectified linear units improve Restricted Boltzmann machines

Hinton G.E., Nair V.

No Title

Chen T., Li M., Wang N., Xu B.

Dropout: A simple way to prevent neural networks from overfitting

Hinton G., Krizhevsky A., Salakhutdinov R., Srivastava N., Sutskever I.

Delving deep into rectifiers: Surpassing human-level performance on imagenet classification

He K., Ren S., Sun J., Zhang X.

Rectifier nonlinearities improve neural network acoustic models

Hannun A. Y., Maas A. L., Ng A. Y.

BERT-BiLSTM model for hierarchical Arabic text classification

Doumi N., Bouziane A., Hamzaoui B., Bouchiha D.

Indonesian Journal of Electrical Engineering and Informatics

Access to Document