Enhancing Fake News Detection Models Through Modified Cosine Similarity (MCS) Using Sorensen-Dice Distance

Authors

  • Aliyu Shuaibu * Department of Computer Science, Faculty of Computing, Bayero University, Kano, Nigeria
  • Hadiza Ali Umar Department of Computer Science, Faculty of Computing, Bayero University, Kano, Nigeria
  • Yusuf Tijjani Baffa Department of Software Engineering, Faculty of Computing, Bayero University, Kano, Nigeria

DOI:

https://doi.org/10.54938/ijemdcsai.2025.04.1.545

Keywords:

Euclidian distance; Misinformation; Machine Learning Classifiers; Natural Language Processing; Sørensen-Dice distance

Abstract

The rapid spread of fake news undermines public trust and highlights the need for more reliable detection models. Traditional approaches, such as LSTM with standard cosine similarity using Euclidean distance, often fail to capture subtle textual relationships. This study introduces a Modified Cosine Similarity (MCS) that replaces Euclidean distance with Sørensen-Dice distance and evaluates its effectiveness across four models: Long Short-Term Memory (LSTM), K-Nearest Neighbor (KNN), Random Forest (RF), and Support Vector Machine (SVM). Baseline results using cosine similarity showed strong performance, with SVM achieving the highest accuracy (0.987) and F1-score (0.986), followed by RF (accuracy 0.979) and KNN (accuracy 0.973). However, enhanced models with MCS demonstrated substantial improvements. LSTM achieved the best results overall (accuracy 0.997, recall 0.998, F1-score 0.997) with reduced cross-entropy loss (0.016), false positive rate (0.005), and false negative rate (0.002). SVM and KNN also showed notable gains with accuracies of 0.995 and 0.991, respectively, while RF recorded high recall (0.995) and competitive performance across metrics. These findings confirm that integrating Sørensen-Dice distance into cosine similarity significantly boosts semantic representation and model performance, making MCS a robust similarity measure for advancing fake news detection.

Downloads

Published

2025-11-02

How to Cite

Aliyu Shuaibu *, Hadiza Ali Umar, & Yusuf Tijjani Baffa. (2025). Enhancing Fake News Detection Models Through Modified Cosine Similarity (MCS) Using Sorensen-Dice Distance. International Journal of Emerging Multidisciplinaries: Computer Science & Artificial Intelligence, 4(1), 17. https://doi.org/10.54938/ijemdcsai.2025.04.1.545

Issue

Section

Research Article