Advancing fake news detection: a comparative study of RNN, LSTM, and Bidirectional LSTM Architectures

Authors

  • Gregorius Airlangga Atma Jaya Catholic University of Indonesia

DOI:

https://doi.org/10.35335/cit.Vol16.2024.696.pp13-23

Keywords:

Comparative analysis, Data preprocessing, Fake news detection, LSTM model, Neural network architectures

Abstract

In the era of information overload, the exponential growth of digital content has coincided with the proliferation of 'fake news,' posing a critical challenge to online information credibility. This study addresses the pressing need for robust fake news detection systems by conducting a comparative analysis of three neural network architectures: Recurrent Neural Networks (RNN), Long Short-Term Memory (LSTM), and Bidirectional LSTM (BiLSTM). Our primary objective is to assess their effectiveness in identifying fake news in a binary classification setting. To achieve this goal, we employed advanced neural network models and a dataset of news titles. Our applied research method included data preprocessing and the utilization of RNN, LSTM, and BiLSTM models, each tailored to handle sequential data and capture temporal dependencies. we rigorously assessed the performance of RNN, LSTM, and BiLSTM models using a range of metrics, including accuracy, precision, recall, and F1-score. To achieve a comprehensive evaluation, we divided our dataset into training and testing subsets. Specifically, we allocated 67% of the data for training purposes and the remaining 33% for testing. Our research findings reveal that all three models consistently achieved high accuracy levels, approximately 91%, with slight variations in precision and recall. Notably, the LSTM model exhibited a marginal improvement in recall, which is crucial when the consequences of missing deceptive content outweigh false alarms. Conversely, the RNN model demonstrated slightly better precision, making it suitable for applications where minimizing false positives is paramount. Surprisingly, the BiLSTM model did not significantly outperform the unidirectional models, suggesting that, for our dataset, processing information bidirectionally may not be essential. In conclusion, our study contributes valuable insights to the field of fake news detection. It underscores the significance of model selection based on specific task requirements and dataset characteristics.

Downloads

Download data is not yet available.

References

J. W. Salmon, S. L. Thompson, J. W. Salmon, and S. L. Thompson, “Big data: information technology as control over the profession of medicine,” Corp. Am. Heal. Care Rise Corp. Hegemony Loss Prof. Auton., pp. 181–254, 2021.

D. Bawden and L. Robinson, “Information overload: An overview,” 2020.

G. Newton, K. Drysdale, M. Zappavigna, and C. E. Newman, “Truth, proof, sleuth: trust in direct-to-consumer DNA testing and other sources of identity information among Australian donor-conceived people,” Sociology, vol. 57, no. 1, pp. 36–53, 2023.

J. P. Baptista and A. Gradim, “Understanding fake news consumption: A review,” Soc. Sci., vol. 9, no. 10, p. 185, 2020.

J.-N. Kim and H. de Zúñiga, “Pseudo-information, media, publics, and the failing marketplace of ideas: Theory,” Am. Behav. Sci., vol. 65, no. 2, pp. 163–179, 2021.

M. Freeze et al., “Fake claims of fake news: Political misinformation, warnings, and the tainted truth effect,” Polit. Behav., vol. 43, pp. 1433–1465, 2021.

N. A. Al Shehab, “the dark side of social media: spreading misleading information during covid-19 crisis,” Adv. Data Sci. Intell. Data Commun. Technol. COVID-19 Innov. Solut. Against COVID-19, pp. 277–306, 2022.

H. N. Chua, Q. Khan, M. B. Jasser, and R. T. K. Wong, “Problem Understanding of Fake News Detection from a Data Mining Perspective,” in 2023 IEEE 13th International Conference on Control System, Computing and Engineering (ICCSCE), 2023, pp. 297–302.

N. Zaidi, M. Maurya, S. Grima, and P. Tyagi, “Unveiling AI’s Ethical Impact in Marketing Through Social Media’s Darker Influence,” in Building AI Driven Marketing Capabilities: Understand Customer Needs and Deliver Value Through AI, Springer, 2023, pp. 173–193.

Z. Guo, M. Schlichtkrull, and A. Vlachos, “A survey on automated fact-checking,” Trans. Assoc. Comput. Linguist., vol. 10, pp. 178–206, 2022.

Q. Su, M. Wan, X. Liu, C.-R. Huang, and others, “Motivations, methods and metrics of misinformation detection: an NLP perspective,” Nat. Lang. Process. Res., vol. 1, no. 1–2, pp. 1–13, 2020.

S. Adak et al., “Mining the online infosphere: A survey,” Wiley Interdiscip. Rev. Data Min. Knowl. Discov., vol. 12, no. 5, p. e1453, 2022.

R. Varma, Y. Verma, P. Vijayvargiya, and P. P. Churi, “A systematic survey on deep learning and machine learning approaches of fake news detection in the pre-and post-COVID-19 pandemic,” Int. J. Intell. Comput. Cybern., vol. 14, no. 4, pp. 617–646, 2021.

K. Chora? Micha?and Demestichas, A. Gie?czyk, Á. Herrero, K. Ksieniewicz Pawe?and Remoundou, D. Urda, and M. Wo?niak, “Advanced Machine Learning techniques for fake news (online disinformation) detection: A systematic mapping study,” Appl. Soft Comput., vol. 101, p. 107050, 2021.

A. Agarwal, M. Mittal, A. Pathak, and L. M. Goyal, “Fake news detection using a blend of neural networks: An application of deep learning,” SN Comput. Sci., vol. 1, pp. 1–9, 2020.

B. Jang, M. Kim, G. Harerimana, S. Kang, and J. W. Kim, “Bi-LSTM model to increase accuracy in text classification: Combining Word2vec CNN and attention mechanism,” Appl. Sci., vol. 10, no. 17, p. 5841, 2020.

I. Al-Nader, A. Lasebae, R. Raheem, and A. Khoshkholghi, “A Novel Scheduling Algorithm for Improved Performance of Multi-Objective Safety-Critical Wireless Sensor Networks Using Long Short-Term Memory,” Electronics, vol. 12, no. 23, p. 4766, 2023.

A. Altheneyan and A. Alhadlaq, “Big data ML-based fake news detection using distributed learning,” IEEE Access, vol. 11, pp. 29447–29463, 2023.

S. Shreyashree, P. Sunagar, S. Rajarajeswari, and A. Kanavalli, “BERT-Based Hybrid RNN Model for Multi-class Text Classification to Study the Effect of Pre-trained Word Embeddings,” Int. J. Adv. Comput. Sci. Appl., vol. 13, no. 9, 2022.

P. K. Verma, P. Agrawal, I. Amorim, and R. Prodan, “WELFake: word embedding over linguistic features for fake news detection,” IEEE Trans. Comput. Soc. Syst., vol. 8, no. 4, pp. 881–893, 2021.

M. Tajrian, A. Rahman, M. A. Kabir, and M. R. Islam, “A review of methodologies for fake news analysis,” IEEE Access, 2023.

W. Ceron, M.-F. de-Lima-Santos, and M. G. Quiles, “Fake news agenda in the era of COVID-19: Identifying trends through fact-checking content,” Online Soc. Networks Media, vol. 21, p. 100116, 2021.

H. Liao et al., “MUSER: A MUlti-Step Evidence Retrieval Enhancement Framework for Fake News Detection,” in Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023, pp. 4461–4472.

X. Zhou and R. Zafarani, “A survey of fake news: Fundamental theories, detection methods, and opportunities,” ACM Comput. Surv., vol. 53, no. 5, pp. 1–40, 2020.

W. Ansar and S. Goswami, “Combating the menace: A survey on characterization and detection of fake news from a data science perspective,” Int. J. Inf. Manag. Data Insights, vol. 1, no. 2, p. 100052, 2021.

J. Zeng, Y. Zhang, and X. Ma, “Fake news detection for epidemic emergencies via deep correlations between text and images,” Sustain. Cities Soc., vol. 66, p. 102652, 2021.

Downloads

Published

2024-03-30

How to Cite

Airlangga, G. (2024). Advancing fake news detection: a comparative study of RNN, LSTM, and Bidirectional LSTM Architectures. Jurnal Teknik Informatika C.I.T Medicom, 16(1), 13–23. https://doi.org/10.35335/cit.Vol16.2024.696.pp13-23

Issue

Section

OPTIMIZATION AND ARTIFICIAL INTELLIGENCE