Skip to main navigation Skip to search Skip to main content

CoDAQ: Congressional-Based Data Imputation for Aggregate Queries

Research output: Contribution to journalConference articlepeer-review

Abstract

Visual analytics are widely used across various domains, including business intelligence, healthcare, and e-commerce, to provide data-driven insights that enhance decision-making processes. These systems analyze large datasets to generate visualizations that help users identify trends, patterns, and actionable insights. These insights rely heavily on accurate and complete data. However, effectively addressing the challenges posed by missing data is crucial for ensuring the accuracy and reliability of data-driven insights and visualizations. Incomplete datasets often lead to reduced accuracy in visualizations, prompting users to adopt costly imputation strategies to clean the data. This paper presents an innovative approach, 'CoDAQ'-Congressional-Based Data Imputation for Aggregate Queries that explores the use of a stratified-based selection technique for imputation to handle missing values within a specified imputation budget effectively. CoDAQ strategically selects cells for imputation by analyzing missing data patterns across different groups, ensuring a structured and representative approach. This technique offers enhanced accuracy and consistency over baseline methods. Our findings illustrate how the proposed technique enhances the utilization of available data, improves the accuracy of visualizations, and reduces bias in analysis, ultimately increasing the performance of visual analytics.

Original languageEnglish
Pages (from-to)287-294
Number of pages8
JournalProceedings of the IEEE International Conference on Big Data and Smart Computing, BIGCOMP
Issue number2025
DOIs
Publication statusPublished - 2025
Event2025 IEEE International Conference on Big Data and Smart Computing, BigComp 2025 - Kota Kinabalu, Malaysia
Duration: Feb 9 2025Feb 12 2025

Keywords

  • aggregate queries
  • approximate query processing
  • big data
  • data imputation
  • data quality
  • data sampling
  • visual analytics

ASJC Scopus subject areas

  • Information Systems
  • Computer Vision and Pattern Recognition
  • Computer Science Applications
  • Computer Networks and Communications
  • Computational Theory and Mathematics
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'CoDAQ: Congressional-Based Data Imputation for Aggregate Queries'. Together they form a unique fingerprint.

Cite this