Toward optimal streaming feature selection

Noura Al Nuaimi, Mohammad M. Masud

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

Recently, real-time data brings explosion of big data that is challenged traditional data mining techniques. Analyzing data in real-time would allow making better decisions on real-time. Usually, big data contains many irrelevant and redundant data. Therefore, removing and discarding these data is essential. Streaming feature selection involving big data has generally been viewed as a solution for selecting informative features that lead to accurate learning models. In this paper we introduce an efficient algorithm for selection of features from a feature stream by online feature grouping. This technique will be useful in big data analytics due to its efficiency and scalability. The main contribution of this work is to solve the challenge of extremely high dimensional of big data by delivering the streaming feature grouping and selection algorithm. In our approach the algorithm is designed with the idea of grouping similar features to reduce the redundancy and to handle the stream of features in an online fashion. Experimental results have demonstrated that our proposed algorithm shown superior performance in terms of prediction accuracy and running time.

Original languageEnglish
Title of host publicationProceedings - 2017 International Conference on Data Science and Advanced Analytics, DSAA 2017
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages775-782
Number of pages8
ISBN (Electronic)9781509050048
DOIs
Publication statusPublished - Jul 2 2017
Event4th International Conference on Data Science and Advanced Analytics, DSAA 2017 - Tokyo, Japan
Duration: Oct 19 2017Oct 21 2017

Publication series

NameProceedings - 2017 International Conference on Data Science and Advanced Analytics, DSAA 2017
Volume2018-January

Other

Other4th International Conference on Data Science and Advanced Analytics, DSAA 2017
Country/TerritoryJapan
CityTokyo
Period10/19/1710/21/17

Keywords

  • Big data
  • Feature grouping
  • Stream of features
  • Streaming feature
  • Streaming feature grouping
  • Streaming feature selection

ASJC Scopus subject areas

  • Signal Processing
  • Information Systems and Management
  • Statistics, Probability and Uncertainty
  • Computer Networks and Communications

Fingerprint

Dive into the research topics of 'Toward optimal streaming feature selection'. Together they form a unique fingerprint.

Cite this