TY - JOUR
T1 - Online streaming feature selection with incremental feature grouping
AU - Al Nuaimi, Noura
AU - Masud, Mohammad M.
N1 - Publisher Copyright:
© 2020 Wiley Periodicals, Inc.
PY - 2020/7/1
Y1 - 2020/7/1
N2 - Today, the dimensionality of data is increasing in a massive way. Thus, traditional feature selection techniques are not directly applicable. Consequently, recent research has led to the development of a more efficient approach to the selection of features from a feature stream, known as streaming feature selection. Another active research area, related to feature selection, is feature grouping. Feature grouping selects relevant features by evaluating the hidden information of selected features. However, although feature grouping is a promising technique, it is not directly applicable to feature streams. In this paper, we propose a novel and efficient algorithm that uses online feature grouping, embedded within a new incremental technique, to select features from a feature stream. This technique groups similar features together; it assigns new incoming features to an existing group or creates a new group. To the best of our knowledge, this is the first approach that proposes the use of incremental feature grouping to perform feature selection from features. We have implemented this algorithm and evaluated it, using benchmark datasets, against state-of-the-art streaming feature selection algorithms that use feature grouping or incremental selection techniques. The results show superior performance by the proposed technique through combining the online selection and grouping, in terms of prediction accuracy and running time. This article is categorized under: Algorithmic Development > Spatial and Temporal Data Mining Technologies > Data Preprocessing Technologies > Classification Technologies > Machine Learning.
AB - Today, the dimensionality of data is increasing in a massive way. Thus, traditional feature selection techniques are not directly applicable. Consequently, recent research has led to the development of a more efficient approach to the selection of features from a feature stream, known as streaming feature selection. Another active research area, related to feature selection, is feature grouping. Feature grouping selects relevant features by evaluating the hidden information of selected features. However, although feature grouping is a promising technique, it is not directly applicable to feature streams. In this paper, we propose a novel and efficient algorithm that uses online feature grouping, embedded within a new incremental technique, to select features from a feature stream. This technique groups similar features together; it assigns new incoming features to an existing group or creates a new group. To the best of our knowledge, this is the first approach that proposes the use of incremental feature grouping to perform feature selection from features. We have implemented this algorithm and evaluated it, using benchmark datasets, against state-of-the-art streaming feature selection algorithms that use feature grouping or incremental selection techniques. The results show superior performance by the proposed technique through combining the online selection and grouping, in terms of prediction accuracy and running time. This article is categorized under: Algorithmic Development > Spatial and Temporal Data Mining Technologies > Data Preprocessing Technologies > Classification Technologies > Machine Learning.
KW - feature selection
KW - features grouping
KW - redundancy analysis
KW - stream of features
KW - streaming data
UR - http://www.scopus.com/inward/record.url?scp=85081922079&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85081922079&partnerID=8YFLogxK
U2 - 10.1002/widm.1364
DO - 10.1002/widm.1364
M3 - Article
AN - SCOPUS:85081922079
SN - 1942-4787
VL - 10
JO - Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
JF - Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery
IS - 4
M1 - e1364
ER -