dc.contributor.author | Nguyen, Quoc Viet Hung | |
dc.contributor.author | Zheng, Kai | |
dc.contributor.author | Weidlich, Matthias | |
dc.contributor.author | Zheng, Bolong | |
dc.contributor.author | Yin, Hongzhi | |
dc.contributor.author | Nguyen, Thanh Tam | |
dc.contributor.author | Stantic, Bela | |
dc.date.accessioned | 2019-05-29T13:09:38Z | |
dc.date.available | 2019-05-29T13:09:38Z | |
dc.date.issued | 2018 | |
dc.identifier.isbn | 9781538655207 | |
dc.identifier.issn | 1084-4627 | |
dc.identifier.doi | 10.1109/ICDE.2018.00018 | |
dc.identifier.uri | http://hdl.handle.net/10072/382451 | |
dc.description.abstract | What-if analysis is a data-intensive exploration to inspect how changes in a set of input parameters of a model influence some outcomes. It is motivated by a user trying to understand the sensitivity of a model to a certain parameter in order to reach a set of goals that are defined over the outcomes. To avoid an exploration of all possible combinations of parameter values, efficient what-if analysis calls for a partitioning of parameter values into data ranges and a unified representation of the obtained outcomes per range. Traditional techniques to capture data ranges, such as histograms, are limited to one outcome dimension. Yet, in practice, what-if analysis often involves conflicting goals that are defined over different dimensions of the outcome. Working on each of those goals independently cannot capture the inherent trade-off between them. In this paper, we propose techniques to recommend data ranges for what-if analysis, which capture not only data regularities, but also the trade-off between conflicting goals. Specifically, we formulate a parametric data partitioning problem and propose a method to find an optimal solution for it. Targeting scalability to large datasets, we further provide a heuristic solution to this problem. By theoretical and empirical analyses, we establish performance guarantees in terms of runtime and result quality. | |
dc.description.peerreviewed | Yes | |
dc.language | English | |
dc.publisher | IEEE Computer Society | |
dc.publisher.place | United States | |
dc.relation.ispartofchapter | 43307 | |
dc.relation.ispartofconferencename | 34th IEEE International Conference on Data Engineering Workshops (ICDEW) | |
dc.relation.ispartofconferencetitle | 2018 IEEE 34TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE) | |
dc.relation.ispartofdatefrom | 2018-04-16 | |
dc.relation.ispartofdateto | 2018-04-19 | |
dc.relation.ispartoflocation | Paris, FRANCE | |
dc.relation.ispartofpagefrom | 89 | |
dc.relation.ispartofpagefrom | 12 pages | |
dc.relation.ispartofpageto | 100 | |
dc.relation.ispartofpageto | 12 pages | |
dc.subject.fieldofresearch | Database systems | |
dc.subject.fieldofresearchcode | 460505 | |
dc.title | What-If Analysis with Conflicting Goals: Recommending Data Ranges for Exploration | |
dc.type | Conference output | |
dc.type.description | E1 - Conferences | |
dc.type.code | E - Conference Publications | |
dc.description.version | Accepted Manuscript (AM) | |
gro.faculty | Griffith Sciences, School of Information and Communication Technology | |
gro.rights.copyright | © 2018 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | |
gro.hasfulltext | Full Text | |
gro.griffith.author | Stantic, Bela | |
gro.griffith.author | Nguyen, Henry | |
gro.griffith.author | Nguyen, Thanh Tam | |