dc.contributor.author | Nguyen, Thanh Tam | |
dc.contributor.author | Nguyen, Quae Viet Hung | |
dc.contributor.author | Weidlich, Matthias | |
dc.contributor.author | Aberer, Karl | |
dc.contributor.editor | Jeong-Hyon Hwang, Yang-Sae Moon | |
dc.date.accessioned | 2017-10-16T04:00:34Z | |
dc.date.available | 2017-10-16T04:00:34Z | |
dc.date.issued | 2015 | |
dc.identifier.issn | 1084-4627 | |
dc.identifier.doi | 10.1109/ICDE.2015.7113287 | |
dc.identifier.uri | http://hdl.handle.net/10072/348006 | |
dc.description.abstract | The amount of information available on the Web has been growing dramatically, raising the importance of techniques for searching the Web. Recently, Web Tables emerged as a model, which enables users to search for information in a structured way. However, effective presentation of results for Web Table search requires (1) selecting a ranking of tables that acknowledges the diversity within the search result; and (2) summarizing the information content of the selected tables concisely but meaningful. In this paper, we formalize these requirements as the diversified table selection problem and the structured table summarization problem. We show that both problems are computationally intractable and, thus, present heuristic algorithms to solve them. For these algorithms, we prove salient performance guarantees, such as near-optimality, stability, and fairness. Our experiments with real-world collections of thousands of Web Tables highlight the scalability of our techniques. We achieve improvements up to 50% in diversity and 10% in relevance over baselines for Web Table selection, and reduce the information loss induced by table summarization by up to 50%. In a user study, we observed that our techniques are preferred over alternative solutions. | |
dc.description.peerreviewed | Yes | |
dc.language | English | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.publisher.place | United States | |
dc.relation.ispartofconferencename | 31st IEEE International Conference on Data Engineering | |
dc.relation.ispartofconferencetitle | 2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE) | |
dc.relation.ispartofdatefrom | 2015-04-13 | |
dc.relation.ispartofdateto | 2015-04-17 | |
dc.relation.ispartoflocation | Seoul, SOUTH KOREA | |
dc.relation.ispartofpagefrom | 231 | |
dc.relation.ispartofpageto | 242 | |
dc.subject.fieldofresearch | Database systems | |
dc.subject.fieldofresearchcode | 460505 | |
dc.title | Result selection and summarization for Web Table search | |
dc.type | Conference output | |
dc.type.description | E1 - Conferences | |
dc.type.code | E - Conference Publications | |
dc.description.version | Accepted Manuscript (AM) | |
gro.rights.copyright | © 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | |
gro.hasfulltext | Full Text | |
gro.griffith.author | Nguyen, Henry | |
gro.griffith.author | Nguyen, Thanh Tam | |