A Novel Integration of Multiple Learning Methods for Detecting Misleading Information From Different Datasets During the Pandemic

No Thumbnail Available

Date

2025

Journal Title

Journal ISSN

Volume Title

Publisher

Pergamon-elsevier Science Ltd

Abstract

Coronavirus Disease 2019 (COVID-19) was an intensely and commonly discussed topic on social media platforms during the pandemic due to uncertainty about the virus, especially as new variants of the virus emerged around the world. Unfortunately, during the pandemic, people shared many posts about COVID-19 on their social media accounts without paying attention or checking whether they were true or not. In this way, intentionally or unintentionally, they highly manipulated public opinion through their posts. The majority of these posts contained misleading information that negatively affected readers' cognitive and mental health, leading to a new neologism associated with the pandemic: "infodemic." Therefore, the present study focuses on the classification of Fake News disseminated during the pandemic to mislead people. To this end, five different datasets were first trained independently using natural language processing and machine learning methods, and the results obtained were compared. Later, these datasets were combined according to the different scenarios to improve the model performance. According to the results, the highest accuracy value of 98.1% was obtained with the model Efficiently Learning an Encoder that Classifies Token Replacements Accurately (ELECTRA) when the datasets were trained independently. Similarly, the highest training accuracy of 94.12% was obtained with the ELECTRA method and the highest test accuracy of 91.71% was obtained with the Random Forest method. In summary, the model ELECTRA, which is less preferred than other pre-trained models, achieved the highest performance scores in all study-specific scenarios.

Description

Keywords

Efficiently Learning An Encoder That Classifies, Token Replacements Accurately, Coronavirus Disease 2019 Fake News, Natural Language Processing, Text Mining

Turkish CoHE Thesis Center URL

WoS Q

Q1

Scopus Q

Q1

Source

Volume

142

Issue

Start Page

End Page

Google Scholar Logo
Google Scholar™