Tweets Topic Classification and Sentiment Analysis Based on Transformer-Based Language Models

Loading...
Thumbnail Image
File version

Version of Record (VoR)

Author(s)
Mandal, Ranju
Chen, Jinyan
Becken, Susanne
Stantic, Bela
Primary Supervisor
Other Supervisors
Editor(s)
Date
2022
Size
File type(s)
Location
Abstract

People provide information on their thoughts, perceptions, and activities through a wide range of channels, including social media. The wide acceptance of social media results in vast volume of valuable data, in variety of format as well as veracity. Analysis of such 'big data' allows organizations and analysts to make better and faster decisions. However, this data had to be quantified and information has to be extracted, which can be very challenging because of possible data ambiguity and complexity. To address information extraction, many analytic techniques, such as text mining, machine learning, predictive analytics, and diverse natural language processing, have been proposed in the literature. Recent advances in Natural Language Understanding-based techniques more specifically transformer-based architectures can solve sequence-to-sequence modeling tasks while handling long-range dependencies effi- ciently. In this work, we applied transformer-based sequence modeling on short texts' topic classification and sentiment analysis from user-posted tweets. Applicability of models is investigated on posts from the Great Barrier Reef tweet dataset and obtained findings are encouraging providing insight that can be valuable for researchers working on classification of large datasets as well as large number of target classes.

Journal Title

Vietnam Journal of Computer Science

Conference Title
Book Title
Edition
Volume
Issue
Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© The Author(s) 2022.

This is an Open Access article published by World Scientific Publishing Company. It is distributed under the terms of the Creative Commons Attribution 4.0 (CC BY) License which permits use, distribution and reproduction in any medium, provided the original work is properly cited.

Item Access Status
Note

This publication has been entered in Griffith Research Online as an advanced online version.

Access the data
Related item(s)
Subject

Information systems

Science & Technology

Technology

Computer Science, Artificial Intelligence

Computer Science, Information Systems

Computer Science, Theory & Methods

Persistent link to this record
Citation

Mandal, R; Chen, J; Becken, S; Stantic, B, Tweets Topic Classification and Sentiment Analysis Based on Transformer-Based Language Models, Vietnam Journal of Computer Science, 2022

Collections