Modified word representation vector based scalar weight for contextual text classification

This thesis investigates contextual text classification, which is the process of categorising textual data into different classes or categories based on its meaning within a given context. Central to this process is the representation of words through vectors for computational interpretation. Curren...

詳細記述

書誌詳細
第一著者: Abbas Saliimi, Lokman
フォーマット: 学位論文
言語:英語
出版事項: 2024
主題:
オンライン・アクセス:http://umpir.ump.edu.my/id/eprint/44632/1/Modified%20word%20representation%20vector%20based%20scalar%20weight%20for%20contextual%20text%20classification.pdf
_version_ 1846216857719144448
author Abbas Saliimi, Lokman
author_facet Abbas Saliimi, Lokman
author_sort Abbas Saliimi, Lokman
description This thesis investigates contextual text classification, which is the process of categorising textual data into different classes or categories based on its meaning within a given context. Central to this process is the representation of words through vectors for computational interpretation. Current practices employ Large Language Models (LLMs) to generate contextualised word representation vectors, achieved through pre-training the LLM on vast corpora that enables it to grasp intricate language patterns and context. For contextual text classification, the pre-trained LLM is further train on classificationspecific labeled data in a process called fine-tuning. Although this approach is currently considered the most optimal in the field, it poses a notable challenge due to the substantial demand for computing resources stemming from the vast number of trainable parameters in LLMs. Furthermore, although pre-trained LLMs can generate contextualised word representation vectors, they lack the flexibility to modify the semantic significance of these vectors outside of the LLM, necessitating fine-tuning for the modification of word vectors. To bridge this gap, a five-phase research methodology is structured to propose and evaluate an algorithm enabling the external modification of LLM-generated word vectors using scalar values as the focus weightage. To validate this algorithm, the modified word vectors are compared with original LLM-generated word vectors to evaluate their reflection of the intended context. In addition, a contextual text classification experiment is conducted using benchmarked datasets to assess the performance of the modified word vectors in the targeted classification task. For this experiment, the modified word vectors serve as input to train a Machine Learning (ML) model for the text classification process, aiming for the developed ML model to have a significantly smaller parameter count. This experiment aims to determine the effectiveness of the modified word vectors in contextual text classification tasks, utilizing a more computationally efficient approach. Based on the acquired results, the experiments reveal that the modified word vectors algorithm can effectively alter original LLM-generated word vectors to reflect intended contexts and can outperform baseline scores in contextual text classification tasks. Evaluation metrics including Accuracy, Precision, Recall, and F1 score are employed in the evaluation process, with Accuracy and F1 score serving as primary metrics. The evaluation showcases significant improvements, with the test ML model achieving a best accuracy score of 0.571, a 46% increase from the baseline, and a best F1 score of 0.727, a 30% increment from the baseline. Overall, this thesis presents five contributions: the proposed modified word vectors algorithm, the new contextual classification dataset named QCoC, the efficient question-type classifier based on the feed-forward neural network algorithm, the potential transferability of the presented work to other domains, and the practical implications of the presented work towards cases where computational resources are limited or costly.
format Thesis
id oai:umpir.ump.edu.my:44632
institution Universiti Malaysia Pahang Al-Sultan Abdullah
language English
publishDate 2024
record_format eprints
spelling oai:umpir.ump.edu.my:446322025-05-30T02:36:09Z http://umpir.ump.edu.my/id/eprint/44632/ Modified word representation vector based scalar weight for contextual text classification Abbas Saliimi, Lokman QA75 Electronic computers. Computer science This thesis investigates contextual text classification, which is the process of categorising textual data into different classes or categories based on its meaning within a given context. Central to this process is the representation of words through vectors for computational interpretation. Current practices employ Large Language Models (LLMs) to generate contextualised word representation vectors, achieved through pre-training the LLM on vast corpora that enables it to grasp intricate language patterns and context. For contextual text classification, the pre-trained LLM is further train on classificationspecific labeled data in a process called fine-tuning. Although this approach is currently considered the most optimal in the field, it poses a notable challenge due to the substantial demand for computing resources stemming from the vast number of trainable parameters in LLMs. Furthermore, although pre-trained LLMs can generate contextualised word representation vectors, they lack the flexibility to modify the semantic significance of these vectors outside of the LLM, necessitating fine-tuning for the modification of word vectors. To bridge this gap, a five-phase research methodology is structured to propose and evaluate an algorithm enabling the external modification of LLM-generated word vectors using scalar values as the focus weightage. To validate this algorithm, the modified word vectors are compared with original LLM-generated word vectors to evaluate their reflection of the intended context. In addition, a contextual text classification experiment is conducted using benchmarked datasets to assess the performance of the modified word vectors in the targeted classification task. For this experiment, the modified word vectors serve as input to train a Machine Learning (ML) model for the text classification process, aiming for the developed ML model to have a significantly smaller parameter count. This experiment aims to determine the effectiveness of the modified word vectors in contextual text classification tasks, utilizing a more computationally efficient approach. Based on the acquired results, the experiments reveal that the modified word vectors algorithm can effectively alter original LLM-generated word vectors to reflect intended contexts and can outperform baseline scores in contextual text classification tasks. Evaluation metrics including Accuracy, Precision, Recall, and F1 score are employed in the evaluation process, with Accuracy and F1 score serving as primary metrics. The evaluation showcases significant improvements, with the test ML model achieving a best accuracy score of 0.571, a 46% increase from the baseline, and a best F1 score of 0.727, a 30% increment from the baseline. Overall, this thesis presents five contributions: the proposed modified word vectors algorithm, the new contextual classification dataset named QCoC, the efficient question-type classifier based on the feed-forward neural network algorithm, the potential transferability of the presented work to other domains, and the practical implications of the presented work towards cases where computational resources are limited or costly. 2024-06 Thesis NonPeerReviewed pdf en http://umpir.ump.edu.my/id/eprint/44632/1/Modified%20word%20representation%20vector%20based%20scalar%20weight%20for%20contextual%20text%20classification.pdf Abbas Saliimi, Lokman (2024) Modified word representation vector based scalar weight for contextual text classification. PhD thesis, Universti Malaysia Pahang Al-Sultan Abdullah (Contributors, Thesis advisor: Mohamed Ariff, Ameedeen).
spellingShingle QA75 Electronic computers. Computer science
Abbas Saliimi, Lokman
Modified word representation vector based scalar weight for contextual text classification
title Modified word representation vector based scalar weight for contextual text classification
title_full Modified word representation vector based scalar weight for contextual text classification
title_fullStr Modified word representation vector based scalar weight for contextual text classification
title_full_unstemmed Modified word representation vector based scalar weight for contextual text classification
title_short Modified word representation vector based scalar weight for contextual text classification
title_sort modified word representation vector based scalar weight for contextual text classification
topic QA75 Electronic computers. Computer science
url http://umpir.ump.edu.my/id/eprint/44632/1/Modified%20word%20representation%20vector%20based%20scalar%20weight%20for%20contextual%20text%20classification.pdf
url-record http://umpir.ump.edu.my/id/eprint/44632/
work_keys_str_mv AT abbassaliimilokman modifiedwordrepresentationvectorbasedscalarweightforcontextualtextclassification