User Guidance for Efficient Fact Checking
File version
Accepted Manuscript (AM)
Author(s)
Weidlich, Matthias
Yin, Hongzhi
Zheng, Bolong
Nguyen, Quoc Viet Hung
Stantic, Bela
Abstract
The Web constitutes a valuable source of information. In recent years, it has fostered the construction of large-scale knowledge bases, such as Freebase, YAGO, and DBpedia. The open nature of the Web, with content potentially being generated by everyone, however, leads to inaccuracies and misinformation. Construction and maintenance of a knowledge base thus have to rely on fact checking, an assessment of the credibility of facts. Due to an inherent lack of ground truth information, such fact checking cannot be done in a purely automated manner, but requires human involvement. In this paper, we propose a comprehensive framework to guide users in the validation of facts, striving for a minimisation of the invested effort. Our framework is grounded in a novel probabilistic model that combines user input with automated credibility inference. Based thereon, we show how to guide users in fact checking by identifying the facts for which validation is most beneficial. Moreover, our framework includes techniques to reduce the manual effort invested in fact checking by determining when to stop the validation and by supporting efficient batching strategies. We further show how to handle fact checking in a streaming setting. Our experiments with three real-world datasets demonstrate the efficiency and effectiveness of our framework: a knowledge base of high quality, with a precision above 90%, is constructed with only half of the validation effort required by baseline techniques.
Journal Title
Proceedings of the VLDB Endowment
Volume
12
Issue
8
Rights Statement
© VLDB Endowment, 2019. Published by ACM. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Volume 12, Issue 8, https://doi.org/10.14778/3324301.3324303
Subject
Theory of computation
Information systems
Library and information studies
Data management and data science
Science & Technology
Technology
Computer Science, Information Systems
Computer Science, Theory & Methods
Computer Science
Citation
Thanh, TN; Weidlich, M; Yin, H; Zheng, B; Quoc, VHN; Stantic, B, User Guidance for Efficient Fact Checking, Proceedings of the VLDB Endowment, 2019, 12 (8), pp. 850-863