User Guidance for Efficient Fact Checking

Loading...
Thumbnail Image
File version

Accepted Manuscript (AM)

Author(s)
Thanh, Tam Nguyen
Weidlich, Matthias
Yin, Hongzhi
Zheng, Bolong
Nguyen, Quoc Viet Hung
Stantic, Bela
Primary Supervisor
Other Supervisors
Editor(s)
Date
2019
Size
File type(s)
Location
License
Abstract

The Web constitutes a valuable source of information. In recent years, it fostered the construction of large-scale knowledge bases, such as Freebase, YAGO, and DBpedia. The open nature of the Web, with content potentially being generated by everyone, however, leads to inaccuracies and misinformation. Construction and maintenance of a knowledge base thus has to rely on fact checking, an assessment of the credibility of facts. Due to an inherent lack of ground truth information, such fact checking cannot be done in a purely automated manner, but requires human involvement. In this paper, we propose a comprehensive framework to guide users in the validation of facts, striving for a minimisation of the invested effort. Our framework is grounded in a novel probabilistic model that combines user input with automated credibility inference. Based thereon, we show how to guide users in fact checking by identifying the facts for which validation is most beneficial. Moreover, our framework includes techniques to reduce the manual effort invested in fact checking by determining when to stop the validation and by supporting efficient batching strategies. We further show how to handle fact checking in a streaming setting. Our experiments with three real-world datasets demonstrate the efficiency and effectiveness of our framework: A knowledge base of high quality, with a precision of above 90%, is constructed with only a half of the validation effort required by baseline techniques.

Journal Title

Proceedings of the VLDB Endowment

Conference Title
Book Title
Edition
Volume

12

Issue

8

Thesis Type
Degree Program
School
Publisher link
Patent number
Funder(s)
Grant identifier(s)
Rights Statement
Rights Statement

© VLDB Endowment, 2019. Published by ACM. This is the author's version of the work. It is posted here by permission of ACM for your personal use. Not for redistribution. The definitive version was published in Volume 12, Issue 8, https://doi.org/10.14778/3324301.3324303

Item Access Status
Note
Access the data
Related item(s)
Subject

Theory of computation

Information systems

Library and information studies

Data management and data science

Science & Technology

Technology

Computer Science, Information Systems

Computer Science, Theory & Methods

Computer Science

Persistent link to this record
Citation

Thanh, TN; Weidlich, M; Yin, H; Zheng, B; Quoc, VHN; Stantic, B, User Guidance for Efficient Fact Checking, Proceedings of the VLDB Endowment, 2019, 12 (8), pp. 850-863

Collections