Efficient profiling for estimation of query result quality

Naiem K. Yeganeh, Shazia Sadiq, Mohamed A. Sharaf, Ke Deng

Research output: Contribution to conferencePaperpeer-review

Abstract

The issue of Data Quality (DQ) is of increasing importance as individuals as well as corporations are relying on multiple, often external sources of data to make decisions. Data quality profiles consist of statistical measurements about the quality of data sets. Query systems can use DQ profiles as a form of metadata to estimate the quality of a query result set. Traditional DQ profiling provides an estimate on the overall quality of a data set or data source, but quality of a query result can be remarkably different from the overall quality of the data set because conditions within the query typically select a subset of the data. In this paper we propose an efficient conditional DQ profiling method which can estimate the quality of a result set for a given query with guaranteed user definable level of accuracy.

Original languageEnglish
Pages415-426
Number of pages12
Publication statusPublished - 2011
Externally publishedYes
Event16th International Conference on Information Quality, ICIQ 2011 - Adelaide, SA, Australia
Duration: Nov 18 2011Nov 20 2011

Conference

Conference16th International Conference on Information Quality, ICIQ 2011
Country/TerritoryAustralia
CityAdelaide, SA
Period11/18/1111/20/11

Keywords

  • Conditional data quality
  • Data quality profiling
  • Query result

ASJC Scopus subject areas

  • Information Systems
  • Safety, Risk, Reliability and Quality

Fingerprint

Dive into the research topics of 'Efficient profiling for estimation of query result quality'. Together they form a unique fingerprint.

Cite this