DivIDE: Efficient diversification for interactive data exploration

Hina A. Khan, Mohamed A. Sharaf, Abdullah Albarrak

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)


Today, Interactive Data Exploration (IDE) has become a main constituent of many discovery-oriented applications, in which users repeatedly submit exploratory queries to identify interesting subspaces in large data sets. Returning relevant yet diverse results to such queries provides users with quick insights into a rather large data space. Meanwhile, search results diversification adds additional cost to an already computationally expensive exploration process. To address this challenge, in this paper, we propose a novel diversification scheme called DivIDE, which targets the problem of efficiently diversifying the results of queries posed during data exploration sessions. In particular, our scheme exploits the properties of data diversification functions while leveraging the natural overlap occurring between the results of different queries so that to provide significant reductions in processing costs. Our extensive experimental evaluation on both synthetic and real data sets shows the significant benefits provided by our scheme as compared to existing methods.

Original languageEnglish
Title of host publicationSSDBM 2014 - Proceedings of the 26th International Conference on Scientific and Statistical Database Management
PublisherAssociation for Computing Machinery
ISBN (Print)9781450327220
Publication statusPublished - 2014
Externally publishedYes
Event26th International Conference on Scientific and Statistical Database Management, SSDBM 2014 - Aalborg, Denmark
Duration: Jun 30 2014Jul 2 2014

Publication series

NameACM International Conference Proceeding Series


Conference26th International Conference on Scientific and Statistical Database Management, SSDBM 2014

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications


Dive into the research topics of 'DivIDE: Efficient diversification for interactive data exploration'. Together they form a unique fingerprint.

Cite this