Abstract
The binary-weight CNN is one of the most efficient solutions for mobile CNNs. However, a large number of operations are required to process each image. To reduce such a huge operation count, we propose an energy-efficient kernel decomposition architecture, based on the observation that a large number of operations are redundant. In this scheme, all kernels are decomposed into sub-kernels to expose the common parts. By skipping the redundant computations, the operation count for each image was consequently reduced by 47.7%. Furthermore, a low cost bit-width quantization technique was implemented by exploiting the relative scales of the feature data. Experimental results showed that the proposed architecture achieves a 22% energy reduction.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the 54th Annual Design Automation Conference 2017, DAC 2017 |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| ISBN (Electronic) | 9781450349277 |
| DOIs | |
| State | Published - 18 Jun 2017 |
| Event | 54th Annual Design Automation Conference, DAC 2017 - Austin, United States Duration: 18 Jun 2017 → 22 Jun 2017 |
Publication series
| Name | Proceedings - Design Automation Conference |
|---|---|
| Volume | Part 128280 |
| ISSN (Print) | 0738-100X |
Conference
| Conference | 54th Annual Design Automation Conference, DAC 2017 |
|---|---|
| Country/Territory | United States |
| City | Austin |
| Period | 18/06/17 → 22/06/17 |
Bibliographical note
Publisher Copyright:© 2017 ACM.