dc.contributor.author | Nguyen, Quoc Viet Hung | |
dc.contributor.author | Nguyen, Thanh Tam | |
dc.contributor.author | Miklos, Zoltan | |
dc.contributor.author | Aberer, Karl | |
dc.contributor.author | Gal, Avigdor | |
dc.contributor.author | Weidlich, Matthias | |
dc.contributor.editor | Gabriel Ghinita, Ali Inan | |
dc.date.accessioned | 2021-07-15T04:26:07Z | |
dc.date.available | 2021-07-15T04:26:07Z | |
dc.date.issued | 2014 | |
dc.identifier.doi | 10.1109/ICDE.2014.6816653 | |
dc.identifier.uri | http://hdl.handle.net/10072/370179 | |
dc.description.abstract | Schema matching is the process of establishing correspondences between the attributes of database schemas for data integration purposes. Although several automatic schema matching tools have been developed, their results are often incomplete or erroneous. To obtain a correct set of correspondences, a human expert is usually required to validate the generated correspondences. We analyze this reconciliation process in a setting where a number of schemas need to be matched, in the presence of consistency expectations about the network of attribute correspondences. We develop a probabilistic model that helps to identify the most uncertain correspondences, thus allowing us to guide the expert's work and collect the expert's input about the most problematic cases. As the availability of such experts is often limited, we develop techniques that can construct a set of good-quality correspondences with a high probability, even if the expert does not validate all the necessary correspondences. We demonstrate the efficiency of our techniques through extensive experimentation using real-world datasets. | |
dc.description.peerreviewed | Yes | |
dc.language | English | |
dc.publisher | Institute of Electrical and Electronics Engineers (IEEE) | |
dc.publisher.place | United States | |
dc.relation.ispartofconferencename | ICDE 2014 | |
dc.relation.ispartofconferencetitle | Proceedings of the 2014 IEEE 30th International Conference on Data Engineering | |
dc.relation.ispartofdatefrom | 2014-03-31 | |
dc.relation.ispartofdateto | 2014-04-04 | |
dc.relation.ispartoflocation | Chicago, IL, United States | |
dc.subject.fieldofresearch | Database systems | |
dc.subject.fieldofresearchcode | 460505 | |
dc.title | Pay-as-you-go reconciliation in schema matching networks | |
dc.type | Conference output | |
dc.type.description | E1 - Conferences | |
dc.type.code | E - Conference Publications | |
dc.description.version | Accepted Manuscript (AM) | |
gro.rights.copyright | © 2014 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. | |
gro.hasfulltext | Full Text | |
gro.griffith.author | Nguyen, Henry | |
gro.griffith.author | Nguyen, Thanh Tam | |