Show simple item record

dc.contributor.authorNguyen, Thanh Tam
dc.contributor.authorNguyen, Quae Viet Hung
dc.contributor.authorWeidlich, Matthias
dc.contributor.authorAberer, Karl
dc.contributor.editorJeong-Hyon Hwang, Yang-Sae Moon
dc.date.accessioned2017-10-16T04:00:34Z
dc.date.available2017-10-16T04:00:34Z
dc.date.issued2015
dc.identifier.issn1084-4627
dc.identifier.doi10.1109/ICDE.2015.7113287
dc.identifier.urihttp://hdl.handle.net/10072/348006
dc.description.abstractThe amount of information available on the Web has been growing dramatically, raising the importance of techniques for searching the Web. Recently, Web Tables emerged as a model, which enables users to search for information in a structured way. However, effective presentation of results for Web Table search requires (1) selecting a ranking of tables that acknowledges the diversity within the search result; and (2) summarizing the information content of the selected tables concisely but meaningful. In this paper, we formalize these requirements as the diversified table selection problem and the structured table summarization problem. We show that both problems are computationally intractable and, thus, present heuristic algorithms to solve them. For these algorithms, we prove salient performance guarantees, such as near-optimality, stability, and fairness. Our experiments with real-world collections of thousands of Web Tables highlight the scalability of our techniques. We achieve improvements up to 50% in diversity and 10% in relevance over baselines for Web Table selection, and reduce the information loss induced by table summarization by up to 50%. In a user study, we observed that our techniques are preferred over alternative solutions.
dc.description.peerreviewedYes
dc.languageEnglish
dc.publisherInstitute of Electrical and Electronics Engineers (IEEE)
dc.publisher.placeUnited States
dc.relation.ispartofconferencename31st IEEE International Conference on Data Engineering
dc.relation.ispartofconferencetitle2015 IEEE 31ST INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE)
dc.relation.ispartofdatefrom2015-04-13
dc.relation.ispartofdateto2015-04-17
dc.relation.ispartoflocationSeoul, SOUTH KOREA
dc.relation.ispartofpagefrom231
dc.relation.ispartofpageto242
dc.subject.fieldofresearchDatabase systems
dc.subject.fieldofresearchcode460505
dc.titleResult selection and summarization for Web Table search
dc.typeConference output
dc.type.descriptionE1 - Conferences
dc.type.codeE - Conference Publications
dc.description.versionAccepted Manuscript (AM)
gro.rights.copyright© 2015 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
gro.hasfulltextFull Text
gro.griffith.authorNguyen, Henry
gro.griffith.authorNguyen, Thanh Tam


Files in this item

This item appears in the following Collection(s)

  • Conference outputs
    Contains papers delivered by Griffith authors at national and international conferences.

Show simple item record