Open Access iconOpen Access

ARTICLE

Activation Redistribution Based Hybrid Asymmetric Quantization Method of Neural Networks

Lu Wei, Zhong Ma*, Chaojie Yang

R&D Innovation Center, Xi’an Microelectronics Technology Institute, Xi’an, 710065, China

* Corresponding Author: Zhong Ma. Email: email

(This article belongs to the Special Issue: Recent Advances in Virtual Reality)

Computer Modeling in Engineering & Sciences 2024, 138(1), 981-1000. https://doi.org/10.32604/cmes.2023.027085

Abstract

The demand for adopting neural networks in resource-constrained embedded devices is continuously increasing. Quantization is one of the most promising solutions to reduce computational cost and memory storage on embedded devices. In order to reduce the complexity and overhead of deploying neural networks on Integer-only hardware, most current quantization methods use a symmetric quantization mapping strategy to quantize a floating-point neural network into an integer network. However, although symmetric quantization has the advantage of easier implementation, it is sub-optimal for cases where the range could be skewed and not symmetric. This often comes at the cost of lower accuracy. This paper proposed an activation redistribution-based hybrid asymmetric quantization method for neural networks. The proposed method takes data distribution into consideration and can resolve the contradiction between the quantization accuracy and the ease of implementation, balance the trade-off between clipping range and quantization resolution, and thus improve the accuracy of the quantized neural network. The experimental results indicate that the accuracy of the proposed method is 2.02% and 5.52% higher than the traditional symmetric quantization method for classification and detection tasks, respectively. The proposed method paves the way for computationally intensive neural network models to be deployed on devices with limited computing resources. Codes will be available on .

Graphical Abstract

Activation Redistribution Based Hybrid Asymmetric Quantization Method of Neural Networks

Keywords


Cite This Article

APA Style
Wei, L., Ma, Z., Yang, C. (2024). Activation redistribution based hybrid asymmetric quantization method of neural networks. Computer Modeling in Engineering & Sciences, 138(1), 981-1000. https://doi.org/10.32604/cmes.2023.027085
Vancouver Style
Wei L, Ma Z, Yang C. Activation redistribution based hybrid asymmetric quantization method of neural networks. Comput Model Eng Sci. 2024;138(1):981-1000 https://doi.org/10.32604/cmes.2023.027085
IEEE Style
L. Wei, Z. Ma, and C. Yang "Activation Redistribution Based Hybrid Asymmetric Quantization Method of Neural Networks," Comput. Model. Eng. Sci., vol. 138, no. 1, pp. 981-1000. 2024. https://doi.org/10.32604/cmes.2023.027085



cc This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
  • 335

    View

  • 258

    Download

  • 0

    Like

Share Link