Home>Schools

  • Complex
  • Title
  • Author
  • Keyword
  • Abstract
  • Scholars
Search

[期刊]

Approximate top-K answering under uncertain schema mappings

Share
Edit Delete Claim

Author:

Li, Longzhuang (Li, Longzhuang.) | Tian, Feng (Tian, Feng.) | Liu, Yonghuai (Liu, Yonghuai.) | Unfold

Indexed by:

Abstract:

Data integration techniques provide a communication bridge between isolated sources and offer a platform for information exchange. When the schemas of heterogeneous data sources map to the centralized schema in a mediated data integration system or a source schema maps to a target schema in a peer-to-peer system, multiple schema mappings may exist due to the ambiguities in the attribute matching. The obscure schema mappings lead to the uncertainty in query answering, and frequently people are only interested in retrieving the best k answers (top-k) with the biggest probabilities. Retrieving the top-k answers efficiently has become a research issue. For uncertain queries, two semantics, by-table and by-tuple, have been developed to capture top-k answers based on the schema mapping probabilities. However, although the existing algorithms support certain features to capture the accurate top-k answers and avoid accessing all data from sources, they cannot effectively reduce the number of processed tuples in most cases. In this paper, new algorithms based on the histogram approximation and heuristic are proposed to efficiently identify the top-k answers for the data integration systems under uncertain schema mappings. In the experiments, the Histogram algorithm in the by-table semantics and the expected approach in the by-tuple semantics are shown to significantly reduce the number of processed tuples while maintaining high accuracy with the estimated probabilistic confidence. © 2018

Keyword:

Database integration Data integration system Heterogeneous data sources Histogram-based approximation Information exchanges Integration techniques Schema mappings Top-k query

Author Community:

  • [ 1 ] [Li, Longzhuang]Department Of Computing Sciences, Texas A&M University-Corpus Christi, TX, United States
  • [ 2 ] [Tian, Feng]System Engineering Institute, Xi'an Jiaotong University, Xi'an, Shaanxi Province, China
  • [ 3 ] [Liu, Yonghuai]Department of Computer Science, Edge Hill University, United Kingdom
  • [ 4 ] [Mao, Shanxian]Office of Institutional Research, Texas A&M University-Kingsville, TX, United States

Reprint Author's Address:

  • Texas A&M Univ, Dept Comp Sci, Corpus Christi, TX 78412 USA.

Show more details

Related Article:

Source :

Data and Knowledge Engineering

ISSN: 0169-023X

Year: 2018

Volume: 118

Page: 71-91

1 . 5 8 3

JCR@2018

1 . 9 9 2

JCR@2020

ESI Discipline: COMPUTER SCIENCE;

ESI HC Threshold:114

JCR Journal Grade:3

CAS Journal Grade:4

Cited Count:

WoS CC Cited Count: 0

30 Days PV: 0

FAQ| About| Online/Total:341/216800837
Address:XI'AN JIAOTONG UNIVERSITY LIBRARY(No.28, Xianning West Road, Xi'an, Shaanxi Post Code:710049) Contact Us:029-82667865
Copyright:XI'AN JIAOTONG UNIVERSITY LIBRARY Technical Support:Beijing Aegean Software Co., Ltd.