Abstract
Capsule networks (CapsNets) offer distinct advantages over conventional convolutional neural networks (CNNs) by introducing the concept of a capsule. Specifically, this innovation achieves both rotational invariance and spatial awareness, making CapsNets a powerful tool in the field of machine learning. However, this breakthrough comes with an increased level of computational complexity. In our comprehensive experimental analysis of CapsNets, we meticulously inspected its various components and identified the squash function as the main computational bottleneck. To address this challenge, In this paper, we adapts the principles of neural architecture search (NAS) and introduces AutoCaps-Zero, a framework that automatically searches the hardware-efficient squash function to reduce model inference time. Meanwhile, CapsNet models incorporating the searched squash function have exhibited excellent performance across datasets of various sizes, while retaining robust features that make them resistant to adversarial attacks. Besides, these models maintain high performance even on challenging datasets like multiMNIST. Particularly, our experimental results demonstrate that the squash function searched by AutoCaps-Zero reduces the execution time of the squash function itself by approximately 68 %. Consequently, deploying the searched squash function on our benchmark models can reduce the end-To-end graphic processing unit (GPU) inference time by up to 34%. Overall, with the searched function, the CapsNet code will be released.
Original language | English |
---|---|
Title of host publication | Proceedings of the 2024 IEEE International Conference on Communications, Computing, Cybersecurity and Informatics, CCCI 2024 |
Editors | Mohammad S. Obaidat, Lin Zhang, Xiaokun Wang, Chao Yao, Kuei-Fang Hsiao, Petros Nicopolitidis, Yu Guo |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9798350349832 |
DOIs | |
State | Published - 2024 |
Event | 2024 IEEE International Conference on Communications, Computing, Cybersecurity and Informatics, CCCI 2024 - Beijing, China Duration: 16 Oct 2024 → 18 Oct 2024 |
Publication series
Name | Proceedings of the 2024 IEEE International Conference on Communications, Computing, Cybersecurity and Informatics, CCCI 2024 |
---|
Conference
Conference | 2024 IEEE International Conference on Communications, Computing, Cybersecurity and Informatics, CCCI 2024 |
---|---|
Country/Territory | China |
City | Beijing |
Period | 16/10/24 → 18/10/24 |
Bibliographical note
Publisher Copyright:© 2024 IEEE.
Keywords
- Artificial Intelligence
- Capsule Network
- Deep Learning
- Evolutionary Algorithm
- Model Optimization
- Neural Architecture Search