TY - JOUR
T1 - Cross-Scale Cost Aggregation for Stereo Matching
AU - Zhang, Kang
AU - Fang, Yuqiang
AU - Min, Dongbo
AU - Sun, Lifeng
AU - Yang, Shiqiang
AU - Yan, Shuicheng
N1 - Funding Information:
The work of K. Zhang, L. Sun, and S. Yang was supported in part by the National Natural Science Foundation of China under Grant 61272231, Grant 61472204, and Grant 61210008, in part by the Beijing Key Laboratory of Networked Multimedia, and in part by the Tsinghua Samsung Joint Laboratory. The work of Y. Fang and S. Yan was supported by the Singapore National Research Foundation under its International Research Centre at Singapore Funding Initiative and administered by the IDM Programme Office. The work of D. Min was supported by the Institute for Information and Communications Technology Promotion within the Ministry of Science, ICT and Future Planning through the Korean Government under Grant R0115-15-1007. This paper was recommended by Associate Editor S. Ci.
Publisher Copyright:
© 1991-2012 IEEE.
PY - 2017/5
Y1 - 2017/5
N2 - This paper proposes a generic framework that enables a multiscale interaction in the cost aggregation step of stereo matching algorithms. Inspired by the formulation of image filters, we first reformulate cost aggregation from a weighted least-squares (WLS) optimization perspective and show that different cost aggregation methods essentially differ in the choices of similarity kernels. Our key motivation is that while the human stereo vision system processes information at both coarse and fine scales interactively for the correspondence search, state-of-the-art approaches aggregate costs at the finest scale of the input stereo images only, ignoring inter-consistency across multiple scales. This motivation leads us to introduce an inter-scale regularizer into the WLS optimization objective to enforce the consistency of the cost volume among the neighboring scales. The new optimization objective with the inter-scale regularization is convex, and thus, it is easily and analytically solved. Minimizing this new objective leads to the proposed framework. Since the regularization term is independent of the similarity kernel, various cost aggregation approaches, including discrete and continuous parameterization methods, can be easily integrated into the proposed framework. We show that the cross-scale framework is important as it effectively and efficiently expands state-of-the-art cost aggregation methods and leads to significant improvements, when evaluated on Middlebury, Middlebury Third, KITTI, and New Tsukuba data sets.
AB - This paper proposes a generic framework that enables a multiscale interaction in the cost aggregation step of stereo matching algorithms. Inspired by the formulation of image filters, we first reformulate cost aggregation from a weighted least-squares (WLS) optimization perspective and show that different cost aggregation methods essentially differ in the choices of similarity kernels. Our key motivation is that while the human stereo vision system processes information at both coarse and fine scales interactively for the correspondence search, state-of-the-art approaches aggregate costs at the finest scale of the input stereo images only, ignoring inter-consistency across multiple scales. This motivation leads us to introduce an inter-scale regularizer into the WLS optimization objective to enforce the consistency of the cost volume among the neighboring scales. The new optimization objective with the inter-scale regularization is convex, and thus, it is easily and analytically solved. Minimizing this new objective leads to the proposed framework. Since the regularization term is independent of the similarity kernel, various cost aggregation approaches, including discrete and continuous parameterization methods, can be easily integrated into the proposed framework. We show that the cross-scale framework is important as it effectively and efficiently expands state-of-the-art cost aggregation methods and leads to significant improvements, when evaluated on Middlebury, Middlebury Third, KITTI, and New Tsukuba data sets.
KW - Cost aggregation
KW - local stereo matching
KW - multiscale
UR - http://www.scopus.com/inward/record.url?scp=85018932733&partnerID=8YFLogxK
U2 - 10.1109/TCSVT.2015.2513663
DO - 10.1109/TCSVT.2015.2513663
M3 - Article
AN - SCOPUS:85018932733
SN - 1051-8215
VL - 27
SP - 965
EP - 976
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 5
M1 - 7368908
ER -