Open Access

ARTICLE

Reinforcement Learning Based Quantization Strategy Optimal Assignment Algorithm for Mixed Precision

Yuejiao Wang, Zhong Ma*, Chaojie Yang, Yu Yang, Lu Wei

Xi’an Microelectronics Technology Institute, Xi’an, 710065, China

* Corresponding Author: Zhong Ma.

(This article belongs to the Special Issue: Development and Industrial Application of AI Technologies)

Computers, Materials & Continua 2024, 79(1), 819-836. https://doi.org/10.32604/cmc.2024.047108

Abstract

Quantization compresses a network by reducing the numerical bit width of the model, which speeds up computation. However, reducing the bit width causes a loss of accuracy, and different layers differ in their redundancy and in their sensitivity to bit width. It is therefore difficult to determine the optimal bit width for each part of the network while guaranteeing accuracy. Mixed precision quantization can effectively reduce the amount of computation while keeping model accuracy essentially unchanged. This paper proposes a hardware-aware algorithm for the optimal assignment of mixed precision quantization strategies, adapted to low bit widths, in which reinforcement learning automatically predicts a mixed precision configuration that meets hardware resource constraints. In the state-space design, the standard deviation of the weights measures differences in data distribution; execution speed feedback from simulated neural network accelerator inference serves as the environment and limits the agent's action space; and the accuracy of the quantized model after retraining serves as the reward function that guides deep reinforcement learning training of the agent. Experimental results show that the proposed method obtains a suitable layer-by-layer quantization strategy while satisfying the computational resource constraints, and that model accuracy is effectively improved. The method is highly automated and reasonably general, and it has strong application potential in mixed precision quantization and in the deployment of neural network models on embedded devices.
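To make the ingredients named in the abstract concrete, the following is a minimal Python sketch, not the authors' implementation. It shows symmetric uniform quantization at a chosen bit width, the standard deviation of a layer's weights as a state feature, and a resource constraint that restricts the proposed per-layer bit widths. All function names are hypothetical. The cost model (total weight bits) and the greedy reduction are assumptions that stand in for the simulated accelerator speed feedback and the reinforcement learning agent described above; the accuracy-after-retraining reward is not modelled.

```python
import random
import statistics


def quantize(weights, bits):
    """Symmetric uniform quantization of a list of floats to `bits` bits (bits >= 2)."""
    levels = 2 ** (bits - 1) - 1                   # e.g. 127 levels per sign for 8 bits
    max_abs = max(abs(w) for w in weights) or 1.0  # avoid a zero scale for all-zero layers
    scale = max_abs / levels
    return [round(w / scale) * scale for w in weights]


def mse(a, b):
    """Mean squared error between two equally long lists."""
    return sum((x - y) ** 2 for x, y in zip(a, b)) / len(a)


def layer_state(weights):
    """State feature used in the abstract: standard deviation of the layer's weights."""
    return statistics.pstdev(weights)


def cost(layer_sizes, bit_widths):
    """ASSUMPTION: total weight bits as a crude proxy for hardware cost."""
    return sum(n * b for n, b in zip(layer_sizes, bit_widths))


def constrain(layer_sizes, bit_widths, budget, min_bits=2):
    """ASSUMPTION: greedily lower the widest layer until the cost fits the budget.

    This stands in for hardware feedback limiting the agent's action space.
    """
    bits = list(bit_widths)
    while cost(layer_sizes, bits) > budget:
        candidates = [i for i, b in enumerate(bits) if b > min_bits]
        if not candidates:
            break                                  # budget cannot be met at min_bits
        widest = max(candidates, key=lambda k: bits[k])
        bits[widest] -= 1
    return bits


# Toy "network": three layers whose weights have different spreads.
random.seed(0)
layers = [[random.gauss(0.0, s) for _ in range(256)] for s in (0.05, 0.2, 0.5)]
sizes = [len(w) for w in layers]
states = [layer_state(w) for w in layers]

proposed = [8, 8, 8]                               # bit widths an agent might propose
budget = 256 * 18                                  # allows 6 bits per weight on average
assigned = constrain(sizes, proposed, budget)
errors = [mse(w, quantize(w, b)) for w, b in zip(layers, assigned)]
```

In a full system along the lines of the abstract, `constrain` would be replaced by feedback from the accelerator simulation, and the per-layer bit widths would be chosen by a trained agent rather than by a greedy rule.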

Copyright © 2024 The Author(s). Published by Tech Science Press.
This work is licensed under a Creative Commons Attribution 4.0 International License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.