Abstract
The binary-weight CNN is one of the most efficient solutions for mobile CNNs. However, a large number of operations are required to process each image. To reduce such a huge operation count, we propose an energy-efficient kernel decomposition architecture, based on the observation that a large number of operations are redundant. In this scheme, all kernels are decomposed into sub-kernels to expose the common parts. By skipping the redundant computations, the operation count for each image was consequently reduced by 47.7%. Furthermore, a low cost bit-width quantization technique was implemented by exploiting the relative scales of the feature data. Experimental results showed that the proposed architecture achieves a 22% energy reduction.
Original language | English |
---|---|
Title of host publication | Proceedings of the 54th Annual Design Automation Conference 2017, DAC 2017 |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781450349277 |
DOIs | |
State | Published - 18 Jun 2017 |
Event | 54th Annual Design Automation Conference, DAC 2017 - Austin, United States Duration: 18 Jun 2017 → 22 Jun 2017 |
Publication series
Name | Proceedings - Design Automation Conference |
---|---|
Volume | Part 128280 |
ISSN (Print) | 0738-100X |
Conference
Conference | 54th Annual Design Automation Conference, DAC 2017 |
---|---|
Country/Territory | United States |
City | Austin |
Period | 18/06/17 → 22/06/17 |
Bibliographical note
Publisher Copyright:© 2017 ACM.