Fake News Analysis and Graph Classification on a COVID-19 Twitter Dataset

Publication Date

1-1-2021

Document Type

Conference Proceeding

Publication Title

Proceedings - IEEE 7th International Conference on Big Data Computing Service and Applications, BigDataService 2021

DOI

10.1109/BigDataService52369.2021.00013

First Page

60

Last Page

68

Abstract

In this work we aim to study the spread of fake news compared to real news in a social network. We do that by performing social network analysis to discover various characteristics, and formulate the problem as a binary classification one, where we have graphs modeling the spread of fake and real news. For our experiments we rely on how news are propagated through a popular social media service such as Twitter during the pandemic caused by the COVID-19 virus. In the past, several other approaches classify news as fake or real by deploying various graph embedding techniques and deep learning techniques. We focus on developing a dataset that contains tweets specific to COVID-19 by using the content of the tweets. Further, we create graphs of the fake and real news along with their retweets and followers and work on the graphs. We perform social network analysis and compare their characteristics. Additionally, we study the propagation of fake and real news among users using community detection algorithms on the graphs. Finally, we create a model by deploying the Weisfeiler Lehman graph kernel for graph classification on our labeled dataset. The model is able to predict whether a news article is real or fake based on how the corresponding graph of the retweets and followers are connected.

Keywords

Community detection, COVID-19, Fake news, Graph classification, Graph kernels, Weisfeiler Lehman kernel

Department

Computer Science

Share

COinS