Author
Contributions by role
Author 2
Khairullah Khan
Institute of Computer Science & Information Technology University of Science & Technology, Bannu, KP, Pakistan
Summary
Edited Journals
ICCK Contributions

Open Access | Research Article | 21 June 2024
Comparison of Machine Learning and Deep Learning Models for Part-of-Speech Tagging
ICCK Transactions on Advanced Computing and Systems | Volume 1, Issue 2: 106-116, 2024 | DOI: 10.62762/TACS.2024.493945
Abstract
The process of assigning grammatical categories, such as ``Noun'' and ``Verb,'' to every word in a text corpus is known as part-of-speech (POS) tagging. This technique is widely used in applications like sentiment analysis, machine translation, and other linguistic and computational tasks. However, the unique features of the Pashto language and its limited resources present significant challenges for POS tagging. This study explores the critical role of POS tagging in the Pashto language by employing six popular deep-learning and machine-learning techniques. Experimental results demonstrate machine learning methods' effectiveness in capturing Pashto text's grammatical patterns. The evaluatio... More >

Graphical Abstract
Comparison of Machine Learning and Deep Learning Models for Part-of-Speech Tagging

Open Access | Research Article | 26 May 2024
Comparing Fine-Tuned RoBERTa with Traditional Machine Learning Models for Stance Detection in Political Tweets
ICCK Transactions on Advanced Computing and Systems | Volume 1, Issue 2: 78-96, 2024 | DOI: 10.62762/TACS.2024.928069
Abstract
Stance detection identifies a text’s position or attitude toward a given subject. A major challenge in Roman Urdu is the lack of a publicly available dataset for political stance detection. To address this gap, we constructed a high-quality dataset of 8,374 political tweets and comments using the Twitter API, annotated with stance labels: agree, disagree, and unrelated. The dataset captures diverse political viewpoints and user interactions. For feature representation, we employed TF-IDF due to its effectiveness in handling high-dimensional, context-sensitive Roman Urdu text. Several machine learning classifiers were evaluated, with Random Forest achieving the highest accuracy of 95%. Addi... More >

Graphical Abstract
Comparing Fine-Tuned RoBERTa with Traditional Machine Learning Models for Stance Detection in Political Tweets