Abstract
Training convolutional neural network on device has become essential where it allows applications to consider user's individual environment. Meanwhile, the weight update operation from the training process is the primary factor of high energy consumption due to its substantial memory accesses. We propose a dedicated weight update architecture with two key features: (1) a specialized local buffer for the DRAM access deduction (2) a novel dataflow and its suitable processing element array structure for weight gradient computation to optimize the energy consumed by internal memories. Our scheme achieves 14.3%-30.2% total energy reduction by drastically eliminating the memory accesses.
Original language | English |
---|---|
Title of host publication | ISLPED 2018 - Proceedings of the 2018 International Symposium on Low Power Electronics and Design |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Print) | 9781450357043 |
DOIs | |
State | Published - 23 Jul 2018 |
Event | 23rd IEEE/ACM International Symposium on Low Power Electronics and Design, ISLPED 2018 - Bellevue, United States Duration: 23 Jul 2018 → 25 Jul 2018 |
Publication series
Name | Proceedings of the International Symposium on Low Power Electronics and Design |
---|---|
ISSN (Print) | 1533-4678 |
Conference
Conference | 23rd IEEE/ACM International Symposium on Low Power Electronics and Design, ISLPED 2018 |
---|---|
Country/Territory | United States |
City | Bellevue |
Period | 23/07/18 → 25/07/18 |
Bibliographical note
Publisher Copyright:© 2018 Association for Computing Machinery.
Keywords
- CNN
- Dataflow
- On-device
- Training architecture
- Weight update