Understanding the relevance of parallelising machine learning algorithms using CUDA for sentiment analysis
| dc.contributor.author | Chai, Dakun Mang | |
| dc.contributor.author | Moulitsas, Irene | |
| dc.contributor.author | Bisandu, Desmond B. | |
| dc.date.accessioned | 2025-04-14T11:36:45Z | |
| dc.date.available | 2025-04-14T11:36:45Z | |
| dc.date.freetoread | 2025-04-14 | |
| dc.date.issued | 2024-10-17 | |
| dc.date.pubOnline | 2025-03-03 | |
| dc.description.abstract | Sentiment classification is essential in natural language processing, leveraging machine learning algorithms to understand the sentiment expressed in textual data. Over the years, advancements in machine learning, particularly with Naive Bayes (NB) and Support Vector Machines (SVM), have tremendously improved sentiment classification. These models benefit from word embedding techniques such as Word2Vec and GloVe, which provide dense vector representations of words, capturing their semantic and syntactic relationships. This paper explores the parallelisation of NB and SVM models using CUDA on GPUs to enhance computational efficiency and performance. Despite the computational power offered by GPUs, the literature on parallelising machine learning methods, especially for sentiment classification, remains limited. Our work aims to fill this gap by comparing the performance of NB and SVM on CPU and GPU platforms, focusing on execution time and model accuracy. Our experiments demonstrate that NB outperforms SVM in execution time and overall efficiency, mainly when using GPU acceleration. The NB model consistently achieves higher accuracy, precision, and F1 scores with Word2Vec and GloVe embeddings. The results show the importance of leveraging GPU acceleration using varying numbers of threads per block for large-scale sentiment analysis and laying the foundation for parallelising sentiment classification tasks. | |
| dc.description.conferencename | ICAAI 2024: 2024 The 8th International Conference on Advances in Artificial Intelligence | |
| dc.description.sponsorship | We acknowledge the Petroleum Technology Development Fund (PTDF) Nigeria funding, which sponsors the first author's PhD research, with ID PTDF/ED/OSS/PHD/DMC/1972/22. | |
| dc.format.extent | pp. 28-38 | |
| dc.identifier.citation | Chai DM, Moulitsas I, Bisandu DB. (2024) Understanding the relevance of parallelising machine learning algorithms using CUDA for sentiment analysis. In: Proceedings of the 2024 8th International Conference on Advances in Artificial Intelligence ICAAI 2024, 17 - 19 Oct 2024, London, United Kingdom, pp. 28-38 | |
| dc.identifier.elementsID | 565705 | |
| dc.identifier.isbn | 979-8-4007-1801-4 | |
| dc.identifier.uri | https://doi.org/10.1145/3704137.3704142 | |
| dc.identifier.uri | https://dspace.lib.cranfield.ac.uk/handle/1826/23753 | |
| dc.language.iso | en | |
| dc.publisher | Association for Computing Machinery (ACM) | |
| dc.publisher.uri | https://dl.acm.org/doi/10.1145/3704137.3704142 | |
| dc.rights | Attribution 4.0 International | en |
| dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
| dc.subject | 46 Information and Computing Sciences | |
| dc.subject | 4611 Machine Learning | |
| dc.subject | Bioengineering | |
| dc.subject | Machine Learning and Artificial Intelligence | |
| dc.subject | Networking and Information Technology R&D (NITRD) | |
| dc.subject | CUDA | |
| dc.subject | Machine Learning | |
| dc.subject | Sentiment Analysis | |
| dc.subject | Word Embedding | |
| dc.title | Understanding the relevance of parallelising machine learning algorithms using CUDA for sentiment analysis | |
| dc.type | Conference paper | |
| dcterms.coverage | London, United Kingdom | |
| dcterms.temporal.endDate | 19 Oct 2024 | |
| dcterms.temporal.startDate | 17 Oct 2024 |