COVIDLIES: Detecting COVID-19 Misinformation on Social Media
Skip to main content
eScholarship
Open Access Publications from the University of California

UC Irvine

UC Irvine Electronic Theses and Dissertations bannerUC Irvine

COVIDLIES: Detecting COVID-19 Misinformation on Social Media

Abstract

The ongoing pandemic has heightened the need for developing tools to flag COVID-19-related misinformation on the internet, specifically on social media such as Twitter. However, due to novel language and the rapid change of information, existing misinformation detection datasets are not effective for evaluating systems designed to detect misinformation on this topic. Misinformation detection can be divided into two sub-tasks: (i) retrieval of misconceptions relevant to posts being checked for veracity, and (ii) stance detection to identify whether the posts Agree, Disagree, or express No Stance towards the retrieved misconceptions. To facilitate research on this task, we release COVIDLIES, a dataset of 6591 expert-annotated tweets to evaluate the performance of misinformation detection systems on 62 different pieces of COVID-19 related misinformation. We evaluate existing NLP systems on this dataset, providing initial benchmarks and identifying key challenges for future models to improve upon. For the stance detection sub-task we provide benchmark models in zero-shot and few-shot settings. In addition to evaluation of the models using standard metrics, we also provide behaviour testing of the best models of each setting.

Main Content
For improved accessibility of PDF content, download the file to your device.
Current View