Start Over

End-to-End Deep Policy Feedback-Based Reinforcement Learning Method for Quantization in DNNs.

Authors :: Logesh Babu, R.
Gurumoorthy, Sasikumar
Parameshachari, B. D.
Christalin Nelson, S.
Hua, Qiaozhi
Source :: Journal of Circuits, Systems & Computers. 9/15/2022, Vol. 31 Issue 13, p1-25. 25p.
Publication Year :: 2022
Abstract: In the resource-constrained embedded systems, the designing of efficient deep neural networks is a challenging process, due to diversity in the artificial intelligence applications. The quantization in deep neural networks superiorly diminishes the storage and computational time by reducing the bit-width of networks encoding. In order to highlight the problem of accuracy loss, the quantization levels are automatically discovered using Policy Feedback-based Reinforcement Learning Method (PF-RELEQ). In this paper, the Proximal Policy Optimization with Policy Feedback (PPO-PF) technique is proposed to determine the best design decisions by choosing the optimum hyper-parameters. In order to enhance the sensitivity of the value function to the change of policy and to improve the accuracy of value estimation at the early learning stage, a policy update method is devised based on the clipped discount factor. In addition, specifically the loss functions of policy satisfy the unbiased estimation of the trust region. The proposed PF-RELEQ effectively balances quality and speed compared to other deep learning methods like ResNet-1202, ResNet-32, ResNet-110, GoogLeNet and AlexNet. The experimental analysis showed that PF-RELEQ achieved 20% computational work-load reduction compared to the existing deep learning methods on ImageNet, CIFAR-10, CIFAR-100 and tomato leaf disease datasets and achieved approximately 2% of improvisation in the validation accuracy. Additionally, the PF-RELEQ needs only 0.55 Graphics Processing Unit on an NVIDIA GTX-1080Ti to develop DNNs that delivers better accuracy improvement with fewer cycle counts for image classification. [ABSTRACT FROM AUTHOR]

Subjects :: *ARTIFICIAL neural networks
*REINFORCEMENT learning
*ARTIFICIAL intelligence
*DEEP learning
*GRAPHICS processing units

Details

Language :: English
ISSN :: 02181266
Volume :: 31
Issue :: 13
Database :: Academic Search Index
Journal :: Journal of Circuits, Systems & Computers
Publication Type :: Academic Journal
Accession number :: 158756301
Full Text :: https://doi.org/10.1142/S0218126622502322

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

End-to-End Deep Policy Feedback-Based Reinforcement Learning Method for Quantization in DNNs.

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

End-to-End Deep Policy Feedback-Based Reinforcement Learning Method for Quantization in DNNs.

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources