Abstract
Visual analytics are widely used across various domains, including business intelligence, healthcare, and e-commerce, to provide data-driven insights that enhance decision-making processes. These systems analyze large datasets to generate visualizations that help users identify trends, patterns, and actionable insights. These insights rely heavily on accurate and complete data. However, effectively addressing the challenges posed by missing data is crucial for ensuring the accuracy and reliability of data-driven insights and visualizations. Incomplete datasets often lead to reduced accuracy in visualizations, prompting users to adopt costly imputation strategies to clean the data. This paper presents an innovative approach, 'CoDAQ'-Congressional-Based Data Imputation for Aggregate Queries that explores the use of a stratified-based selection technique for imputation to handle missing values within a specified imputation budget effectively. CoDAQ strategically selects cells for imputation by analyzing missing data patterns across different groups, ensuring a structured and representative approach. This technique offers enhanced accuracy and consistency over baseline methods. Our findings illustrate how the proposed technique enhances the utilization of available data, improves the accuracy of visualizations, and reduces bias in analysis, ultimately increasing the performance of visual analytics.
| Original language | English |
|---|---|
| Pages (from-to) | 287-294 |
| Number of pages | 8 |
| Journal | Proceedings of the IEEE International Conference on Big Data and Smart Computing, BIGCOMP |
| Issue number | 2025 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 2025 IEEE International Conference on Big Data and Smart Computing, BigComp 2025 - Kota Kinabalu, Malaysia Duration: Feb 9 2025 → Feb 12 2025 |
Keywords
- aggregate queries
- approximate query processing
- big data
- data imputation
- data quality
- data sampling
- visual analytics
ASJC Scopus subject areas
- Information Systems
- Computer Vision and Pattern Recognition
- Computer Science Applications
- Computer Networks and Communications
- Computational Theory and Mathematics
- Artificial Intelligence
Fingerprint
Dive into the research topics of 'CoDAQ: Congressional-Based Data Imputation for Aggregate Queries'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS