1. Image classification based on sparse-coded features using sparse coding technique for aerial imagery: a hybrid dictionary approach
- Author
-
Abdul Qayyum, Naufal M. Saad, Tuan Ab Rashid Bin Tuan Abdullah, Mohd Faris Abdullah, Mahboob Iqbal, Mohd Yaqoob Bin Jafaar, Aamir Saeed Malik, and Waqas Rasheed
- Subjects
Discrete wavelet transform ,0209 industrial biotechnology ,Contextual image classification ,Local binary patterns ,Computer science ,business.industry ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,Scale-invariant feature transform ,Image processing ,Pattern recognition ,02 engineering and technology ,020901 industrial engineering & automation ,Wavelet ,Histogram of oriented gradients ,Artificial Intelligence ,Feature (computer vision) ,Computer Science::Computer Vision and Pattern Recognition ,0202 electrical engineering, electronic engineering, information engineering ,Discrete cosine transform ,020201 artificial intelligence & image processing ,Artificial intelligence ,business ,Software ,Aerial image - Abstract
This work offers an approach to aerial image classification for use in remote sensing object recognition, image processing and computer vision. Sparse coding (SC) is used to classify unmanned-aerial-vehicle (UAV) and satellite images because SC representation can generalize a large dataset and improve the detection of distinctive features by reducing calculation time for feature matching and classification. Features from images are extracted based on the following descriptors: (a) Scale Invariant Feature Transform; (b) Histogram of Oriented Gradients; and (c) Local Binary Patterns. SC representation and local image features are combined to represent global features for classification. Features are deployed in a sparse model to store descriptor features using extant dictionaries such as (a) the Discrete Cosine Transform and (b) the Discrete Wavelet Transform. An additional two dictionaries are proposed as developed for the present work: (c) the Discrete Ridgelet Transform (DRT) and (d) the Discrete Tchebichef Transform. The DRT dictionary is constructed by using the Ricker wavelet function to generate finite Ridgelet transforms as basis elements for a hybrid dictionary. Different pooling methods have also been employed to convert sparse-coded features into a feature matrix. Various machine learning algorithms are then applied to the feature matrix to classify objects contained in UAV and satellite imagery data. Experimental results show that the SC model secured better accuracy rates for extracted discriminative features contained in remote sensing images. The authors concluded that the proposed SC technique and proposed dictionaries provided feasible solutions for image classification and object recognition.
- Published
- 2017
- Full Text
- View/download PDF