Random Subclasses Ensembles by Using 1-Nearest Neighbor Framework

Amir Ahmad, Hamza Abujabal, C. Aswani Kumar

Research output: Contribution to journalArticlepeer-review

Abstract

A classifier ensemble is a combination of diverse and accurate classifiers. Generally, a classifier ensemble performs better than any single classifier in the ensemble. Naive Bayes classifiers are simple but popular classifiers for many applications. As it is difficult to create diverse naive Bayes classifiers, naive Bayes ensembles are not very successful. In this paper, we propose Random Subclasses (RS) ensembles for Naive Bayes classifiers. In the proposed method, new subclasses for each class are created by using 1-Nearest Neighbor (1-NN) framework that uses randomly selected points from the training data. A classifier considers each subclass as a class of its own. As the method to create subclasses is random, diverse datasets are generated. Each classifier in an ensemble learns on one dataset from the pool of diverse datasets. Diverse training datasets ensure diverse classifiers in the ensemble. New subclasses create easy to learn decision boundaries that in turn create accurate naive Bayes classifiers. We developed two variants of RS, in the first variant RS(2), two subclasses per class were created whereas in the second variant RS(4), four subclasses per class were created. We studied the performance of these methods against other popular ensemble methods by using naive Bayes as the base classifier. RS(4) outperformed other popular ensemble methods. A detailed study was carried out to understand the behavior of RS ensembles.

Original languageEnglish
Article number1750031
JournalInternational Journal of Pattern Recognition and Artificial Intelligence
Volume31
Issue number10
DOIs
Publication statusPublished - Oct 1 2017

Keywords

  • Classifier ensembles
  • adaBoost.M1
  • bagging
  • clusters
  • naive bayes
  • subclasses

ASJC Scopus subject areas

  • Software
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

Fingerprint

Dive into the research topics of 'Random Subclasses Ensembles by Using 1-Nearest Neighbor Framework'. Together they form a unique fingerprint.

Cite this