
A frequency-domain approach with learnable filters for image classification.

Authors :
Stuchi, José Augusto
Canto, Natalia Gil
Attux, Romis Ribeiro de Faissol
Boccato, Levy
Source :
Applied Soft Computing; Apr2024, Vol. 155, pN.PAG-N.PAG, 1p
Publication Year :
2024

Abstract

Machine learning applied to computer vision and signal processing is achieving results comparable to those of the human brain, owing to the great improvements brought by deep neural networks (DNN). The majority of state-of-the-art architectures are DNN-related, but only a few explicitly explore the frequency domain to extract useful information and improve the results. This paper presents a new approach for exploring the Fourier transform of the input images, which is composed of trainable frequency filters that boost discriminative components in the spectrum. Additionally, we propose a cropping procedure to allow the network to learn both global and local spectral features of the image blocks. The proposed method proved to be competitive with well-known DNN architectures in the selected experiments, which involved texture classification, cataract detection, and retina image analysis, tasks where the frequency domain has a noticeable appeal, with the added advantage of being a lightweight model.

• A new architecture for neural networks exploring the frequency domain is proposed.
• Trainable frequency filters retrieve image discriminative features.
• A block division scheme allows extracting local and global spectral features.
• A frequency pooling technique reduces the model parameters and training time.
• The proposed model reaches competitive results when compared to modern ConvNets.

[ABSTRACT FROM AUTHOR]
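
The general idea of block-wise spectral filtering described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `split_into_blocks` and `filtered_spectrum_features`, the non-overlapping block layout, the use of the magnitude spectrum, and the reduction of each block to a single summed value are all assumptions made for illustration. In a trainable model the `filters` array would be updated by gradient descent; here it is a fixed array.

```python
import numpy as np


def split_into_blocks(image, block_size):
    """Crop a 2-D image into non-overlapping square blocks.

    Rows and columns that do not fill a whole block are dropped.
    Returns an array of shape (n_blocks, block_size, block_size).
    """
    h, w = image.shape
    rows, cols = h // block_size, w // block_size
    trimmed = image[: rows * block_size, : cols * block_size]
    blocks = trimmed.reshape(rows, block_size, cols, block_size).swapaxes(1, 2)
    return blocks.reshape(-1, block_size, block_size)


def filtered_spectrum_features(image, filters, block_size):
    """Weight each block's magnitude spectrum by a per-block filter.

    `filters` has shape (n_blocks, block_size, block_size) and stands in
    for the learnable frequency filters (hypothetical layout).
    Returns one scalar feature per block.
    """
    blocks = split_into_blocks(image, block_size)
    # 2-D FFT over the last two axes, with the zero frequency shifted to the centre.
    spectrum = np.abs(np.fft.fftshift(np.fft.fft2(blocks), axes=(-2, -1)))
    weighted = spectrum * filters  # element-wise filtering in the frequency domain
    return weighted.sum(axis=(-2, -1))


# Usage: an 8x8 image split into four 4x4 blocks, with all-pass filters.
image = np.ones((8, 8))
filters = np.ones((4, 4, 4))
features = filtered_spectrum_features(image, filters, block_size=4)
```

Using a block size equal to the image size would give a single global spectrum, while smaller blocks give local spectra; this is one simple way to read the "global and local spectral features" mentioned in the abstract.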

Details

Language :
English
ISSN :
15684946
Volume :
155
Database :
Supplemental Index
Journal :
Applied Soft Computing
Publication Type :
Academic Journal
Accession number :
176224909
Full Text :
https://doi.org/10.1016/j.asoc.2024.111443