Understanding the relevance of parallelising machine learning algorithms using CUDA for sentiment analysis

dc.contributor.authorChai, Dakun Mang
dc.contributor.authorMoulitsas, Irene
dc.contributor.authorBisandu, Desmond B.
dc.date.accessioned2025-04-14T11:36:45Z
dc.date.available2025-04-14T11:36:45Z
dc.date.freetoread2025-04-14
dc.date.issued2024-10-17
dc.date.pubOnline2025-03-03
dc.description.abstractSentiment classification is essential in natural language processing, leveraging machine learning algorithms to understand the sentiment expressed in textual data. Over the years, advancements in machine learning, particularly with Naive Bayes (NB) and Support Vector Machines (SVM), have tremendously improved sentiment classification. These models benefit from word embedding techniques such as Word2Vec and GloVe, which provide dense vector representations of words, capturing their semantic and syntactic relationships. This paper explores the parallelisation of NB and SVM models using CUDA on GPUs to enhance computational efficiency and performance. Despite the computational power offered by GPUs, the literature on parallelising machine learning methods, especially for sentiment classification, remains limited. Our work aims to fill this gap by comparing the performance of NB and SVM on CPU and GPU platforms, focusing on execution time and model accuracy. Our experiments demonstrate that NB outperforms SVM in execution time and overall efficiency, mainly when using GPU acceleration. The NB model consistently achieves higher accuracy, precision, and F1 scores with Word2Vec and GloVe embeddings. The results show the importance of leveraging GPU acceleration using varying numbers of threads per block for large-scale sentiment analysis and laying the foundation for parallelising sentiment classification tasks.
dc.description.conferencenameICAAI 2024: 2024 The 8th International Conference on Advances in Artificial Intelligence
dc.description.sponsorshipWe acknowledge the Petroleum Technology Development Fund (PTDF) Nigeria funding, which sponsors the first author's PhD research, with ID PTDF/ED/OSS/PHD/DMC/1972/22.
dc.format.extentpp. 28-38
dc.identifier.citationChai DM, Moulitsas I, Bisandu DB. (2024) Understanding the relevance of parallelising machine learning algorithms using CUDA for sentiment analysis. In: Proceedings of the 2024 8th International Conference on Advances in Artificial Intelligence ICAAI 2024, 17 - 19 Oct 2024, London, United Kingdom, pp. 28-38
dc.identifier.elementsID565705
dc.identifier.isbn979-8-4007-1801-4
dc.identifier.urihttps://doi.org/10.1145/3704137.3704142
dc.identifier.urihttps://dspace.lib.cranfield.ac.uk/handle/1826/23753
dc.language.isoen
dc.publisherAssociation for Computing Machinery (ACM)
dc.publisher.urihttps://dl.acm.org/doi/10.1145/3704137.3704142
dc.rightsAttribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subject46 Information and Computing Sciences
dc.subject4611 Machine Learning
dc.subjectBioengineering
dc.subjectMachine Learning and Artificial Intelligence
dc.subjectNetworking and Information Technology R&D (NITRD)
dc.subjectCUDA
dc.subjectMachine Learning
dc.subjectSentiment Analysis
dc.subjectWord Embedding
dc.titleUnderstanding the relevance of parallelising machine learning algorithms using CUDA for sentiment analysis
dc.typeConference paper
dcterms.coverageLondon, United Kingdom
dcterms.temporal.endDate19 Oct 2024
dcterms.temporal.startDate17 Oct 2024

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Understanding_the_relevance-2025.pdf
Size:
613.98 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.63 KB
Format:
Plain Text
Description: