TY - JOUR
T1 - Stereo Confidence Estimation via Locally Adaptive Fusion and Knowledge Distillation
AU - Kim, Sunok
AU - Kim, Seungryong
AU - Min, Dongbo
AU - Frossard, Pascal
AU - Sohn, Kwanghoon
N1 - Publisher Copyright:
© 1979-2012 IEEE.
PY - 2023/5/1
Y1 - 2023/5/1
N2 - Stereo confidence estimation aims to estimate the reliability of the estimated disparity by stereo matching. Different from the previous methods that exploit the limited input modality, we present a novel method that estimates confidence map of an initial disparity by making full use of tri-modal input, including matching cost, disparity, and color image through deep networks. The proposed network, termed as Locally Adaptive Fusion Networks (LAF-Net), learns locally-varying attention and scale maps to fuse the tri-modal confidence features. Moreover, we propose a knowledge distillation framework to learn more compact confidence estimation networks as student networks. By transferring the knowledge from LAF-Net as teacher networks, the student networks that solely take as input a disparity can achieve comparable performance. To transfer more informative knowledge, we also propose a module to learn the locally-varying temperature in a softmax function. We further extend this framework to a multiview scenario. Experimental results show that LAF-Net and its variations outperform the state-of-the-art stereo confidence methods on various benchmarks.
AB - Stereo confidence estimation aims to estimate the reliability of the estimated disparity by stereo matching. Different from the previous methods that exploit the limited input modality, we present a novel method that estimates confidence map of an initial disparity by making full use of tri-modal input, including matching cost, disparity, and color image through deep networks. The proposed network, termed as Locally Adaptive Fusion Networks (LAF-Net), learns locally-varying attention and scale maps to fuse the tri-modal confidence features. Moreover, we propose a knowledge distillation framework to learn more compact confidence estimation networks as student networks. By transferring the knowledge from LAF-Net as teacher networks, the student networks that solely take as input a disparity can achieve comparable performance. To transfer more informative knowledge, we also propose a module to learn the locally-varying temperature in a softmax function. We further extend this framework to a multiview scenario. Experimental results show that LAF-Net and its variations outperform the state-of-the-art stereo confidence methods on various benchmarks.
KW - Stereo matching
KW - deep learning
KW - knowledge distillation
KW - stereo confidence estimation
UR - http://www.scopus.com/inward/record.url?scp=85139453361&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2022.3207286
DO - 10.1109/TPAMI.2022.3207286
M3 - Article
C2 - 36112555
AN - SCOPUS:85139453361
SN - 0162-8828
VL - 45
SP - 6372
EP - 6385
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 5
ER -