Evaluation of Cyberbullying using Optimized Multi-Stage ML Framework and NLP

Ketsbaia, Lida; Issac, Biju; Chen, Xiaomin

Evaluation of Cyberbullying using Optimized Multi-Stage ML Framework and NLP

Files

DSDS21 Slides_Lida Ketsbaia.pdf (325.51 KB)

DSDS21 Paper_Lida Ketsbaia.pdf (236.36 KB)

Date published

2021-12-11T19:02:52Z

Authors

Ketsbaia, Lida
Issac, Biju
Chen, Xiaomin

Publisher

Cranfield University

Type

Presentation

URI

https://dspace.lib.cranfield.ac.uk/handle/1826/21390

Citation

Ketsbaia, Lida; Issac, Biju; Chen, Xiaomin (2021). Evaluation of Cyberbullying using Optimized Multi-Stage ML Framework and NLP. Cranfield Online Research Data (CORD). Conference contribution. https://doi.org/10.17862/cranfield.rd.17162282.v1

Abstract

Due to the evolution of technology, online hate is increasing, more specifically in areas of social media amongst the general population. Online hate has become a phenomenon that destructively impacts individuals, with victims suffering long-lasting mental and psychological issues. Since cyberhate is conveyed as an ever-growing social problem, researchers have tried to tackle the matter. One of the main methods researchers have focused on is through the means of Machine Learning to help classify whether a piece of textual data can be identified as cyberbullying or not. Therefore, the purpose of the research is to employ a multi-stage optimized Machine Learning Framework that will look at using a combination of two data balancing methods (RUS and SMOTE), the feature selection method PCA as well as the bio-inspired metaheuristic optimization techniques PSO and GA. The framework applied increases the performance of the Machine Learning Classifier Logistic Regression to help detect instances of cyberbullying. Furthermore, the paper will show the potential of using various NLP methods such as RoBERTa, XLNet and DistilBERT to find the most suitable model to use within the textual analysis of cyberhate.

Keywords

cyberbullying', 'optimised machine learning', 'natural language processing', 'DSDS21', 'DSDS21 Technical Paper', 'Knowledge Representation and Machine Learning', 'Information and Computing Sciences not elsewhere classified

DOI

10.17862/cranfield.rd.17162282.v1

Rights

CC BY 4.0

https://creativecommons.org/licenses/by/4.0/

Collections

DSDS 21

Full item page

Evaluation of Cyberbullying using Optimized Multi-Stage ML Framework and NLP

Files

Date published

Free to read from

Authors

Supervisor/s

Journal Title

Journal ISSN

Volume Title

Publisher

Department

Course name

Type

ISSN

Format

URI

Citation

Abstract

Description

Software Description

Software Language

Github

Keywords

DOI

Rights

Funder/s

Relationships

Relationships

Resources

Collections