Fake News Analysis and Graph Classification on a COVID-19 Twitter Dataset
Publication Date
1-1-2021
Document Type
Conference Proceeding
Publication Title
Proceedings - IEEE 7th International Conference on Big Data Computing Service and Applications, BigDataService 2021
DOI
10.1109/BigDataService52369.2021.00013
First Page
60
Last Page
68
Abstract
In this work we aim to study the spread of fake news compared to real news in a social network. We do that by performing social network analysis to discover various characteristics, and formulate the problem as a binary classification one, where we have graphs modeling the spread of fake and real news. For our experiments we rely on how news are propagated through a popular social media service such as Twitter during the pandemic caused by the COVID-19 virus. In the past, several other approaches classify news as fake or real by deploying various graph embedding techniques and deep learning techniques. We focus on developing a dataset that contains tweets specific to COVID-19 by using the content of the tweets. Further, we create graphs of the fake and real news along with their retweets and followers and work on the graphs. We perform social network analysis and compare their characteristics. Additionally, we study the propagation of fake and real news among users using community detection algorithms on the graphs. Finally, we create a model by deploying the Weisfeiler Lehman graph kernel for graph classification on our labeled dataset. The model is able to predict whether a news article is real or fake based on how the corresponding graph of the retweets and followers are connected.
Keywords
Community detection, COVID-19, Fake news, Graph classification, Graph kernels, Weisfeiler Lehman kernel
Department
Computer Science
Recommended Citation
Kriti Gupta and Katerina Potika. "Fake News Analysis and Graph Classification on a COVID-19 Twitter Dataset" Proceedings - IEEE 7th International Conference on Big Data Computing Service and Applications, BigDataService 2021 (2021): 60-68. https://doi.org/10.1109/BigDataService52369.2021.00013