Research Article Open Access

A Comparative Study of Data Mining approaches for Bag of Visual Words Based Image Classification

Sahar Abd El Hameed Mokhtar and Heba Ahmed Elnemr
Electronics Research Institute, Egypt

Abstract

Image classification is one of the most significant and challenging tasks in computer vision. The goal of this task is to build a system that can assign an image its correct label from a collection of different image categories. This paper presents and discusses the application of various data mining techniques to image classification based on the Bag of Visual Words (BoVW) feature extraction algorithm. The BoVW model is constructed using grey-level features, namely the Speeded Up Robust Features (SURF) and Maximally Stable Extremal Regions (MSER) descriptors, along with color features, namely color correlograms and the Improved Color Coherence Vector (ICCV). Five data mining techniques, Neural Networks (NN), Decision Trees (DT), Bayesian Network (BN), Discriminant Analysis (DA) and K Nearest Neighbor (KNN), are explored and evaluated on two large, distinct datasets: Corel-1000 and COIL-100. The experimental results show that BN and DA outperform the other data mining methods considered in this comparative study. For the Corel-1000 dataset, BN and DA achieved an average accuracy and specificity of about 99.9% and an average sensitivity of about 99.5% and 99.4%, respectively. For the COIL-100 dataset, BN and DA achieved an average accuracy and sensitivity of about 100% and an average specificity of about 98.5% and 98.9%, respectively.

Journal of Computer Science
Volume 14 No. 1, 2018, 53-66

DOI: https://doi.org/10.3844/jcssp.2018.53.66

Submitted On: 8 October 2017
Published On: 12 January 2018

How to Cite: Mokhtar, S. A. E. H. & Elnemr, H. A. (2018). A Comparative Study of Data Mining approaches for Bag of Visual Words Based Image Classification. Journal of Computer Science, 14(1), 53-66. https://doi.org/10.3844/jcssp.2018.53.66

Keywords

  • Discriminant Analysis
  • Bayesian Network
  • Decision Trees
  • Neural Networks
  • Machine Learning
  • Data Mining
  • Image Classification
  • K Nearest Neighbor