A Novel Integration of Multiple Learning Methods for Detecting Misleading Information From Different Datasets During the Pandemic

Irmak, Muhammed Coskun; Aydin, Tolga; Yaganoglu, Mete

Date

2025

Authors

Irmak, Muhammed Coskun

Aydin, Tolga

Yaganoglu, Mete

Publisher

Pergamon-Elsevier Science Ltd

Abstract

Coronavirus Disease 2019 (COVID-19) was intensely and widely discussed on social media platforms during the pandemic because of uncertainty about the virus, especially as new variants emerged around the world. Unfortunately, many people shared posts about COVID-19 on their social media accounts without checking whether the content was true. In this way, intentionally or unintentionally, they strongly manipulated public opinion. The majority of these posts contained misleading information that negatively affected readers' cognitive and mental health, giving rise to a neologism associated with the pandemic: "infodemic." Therefore, the present study focuses on the classification of fake news disseminated during the pandemic to mislead people. To this end, models were first trained independently on five different datasets using natural language processing and machine learning methods, and the results were compared. Later, these datasets were combined according to different scenarios to improve model performance. According to the results, the highest accuracy of 98.1% was obtained with the Efficiently Learning an Encoder that Classifies Token Replacements Accurately (ELECTRA) model when the datasets were used independently. Similarly, the highest training accuracy of 94.12% was obtained with ELECTRA, and the highest test accuracy of 91.71% was obtained with the Random Forest method. In summary, ELECTRA, which is less commonly used than other pre-trained models, achieved the highest performance scores in all study-specific scenarios.

Keywords

Efficiently Learning an Encoder that Classifies Token Replacements Accurately, Coronavirus Disease 2019 Fake News, Natural Language Processing, Text Mining

WoS Q

Q1

Scopus Q

Q1

Volume

142

URI

https://doi.org/10.1016/j.engappai.2024.109944
https://hdl.handle.net/20.500.14720/11229

Collections

WoS Indexed Publications Collection
Scopus Indexed Publications Collection
