SQNR-based Layer-wise Mixed-Precision Schemes with Computational Complexity Consideration

Ha Na Kim, Hyun Eun, Jung Hwan Choi, Ji Hoon Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recently, AI acceleration is critical for hardware systems from mobile/edge devices to high-performance data servers. For on-device AI, there have been many studies on hardware numerical precision reduction for low hardware complexity considering limited hardware resources of mobile/edge devices [1-2]. However, if taken too far, aggressive quantization with low precision can degrade model accuracy which is the fundamental measure of deep learning quality. With the layer-wise mixed-precision networks where the different precision scheme is applied to each layer, we can reduce off-chip memory bandwidth requirements as well as hardware complexity while retaining model accuracy [3]. However, it takes a long time to determine the optimal precision scheme for each layer due to the repetitive trainings with the different precision configurations. Also, if only the accuracy of the target neural network is considered in the process of the mixed-precision determination, it is difficult to sufficiently reduce the computational complexity, which is the most important in mobile/edge devices [4].

Original languageEnglish
Title of host publicationIEEE International Symposium on Circuits and Systems, ISCAS 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages234-235
Number of pages2
ISBN (Electronic)9781665484855
DOIs
StatePublished - 2022
Event2022 IEEE International Symposium on Circuits and Systems, ISCAS 2022 - Austin, United States
Duration: 27 May 20221 Jun 2022

Publication series

NameProceedings - IEEE International Symposium on Circuits and Systems
Volume2022-May
ISSN (Print)0271-4310

Conference

Conference2022 IEEE International Symposium on Circuits and Systems, ISCAS 2022
Country/TerritoryUnited States
CityAustin
Period27/05/221/06/22

Bibliographical note

Publisher Copyright:
© 2022 IEEE.

Fingerprint

Dive into the research topics of 'SQNR-based Layer-wise Mixed-Precision Schemes with Computational Complexity Consideration'. Together they form a unique fingerprint.

Cite this