TY - GEN
T1 - Revitalizing Optimization for 3D Human Pose and Shape Estimation
T2 - 18th IEEE/CVF International Conference on Computer Vision, ICCV 2021
AU - Fan, Taosha
AU - Alwala, Kalyan Vasudev
AU - Xiang, Donglai
AU - Xu, Weipeng
AU - Murphey, Todd
AU - Mukadam, Mustafa
N1 - Funding Information:
Acknowledgments. For this work authors affiliated with Northwestern University were partially supported by the National Science Foundation under award DCSD-1662233.
Publisher Copyright:
© 2021 IEEE
PY - 2021
Y1 - 2021
N2 - We propose a novel sparse constrained formulation and from it derive a real-time optimization method for 3D human pose and shape estimation. Our optimization method, SCOPE (Sparse Constrained Optimization for 3D human Pose and shapE estimation), is orders of magnitude faster (avg. 4ms convergence) than existing optimization methods, while being mathematically equivalent to their dense unconstrained formulation under mild assumptions. We achieve this by exploiting the underlying sparsity and constraints of our formulation to efficiently compute the Gauss-Newton direction. We show that this computation scales linearly with the number of joints and measurements of a complex 3D human model, in contrast to prior work where it scales cubically due to their dense unconstrained formulation. Based on our optimization method, we present a real-time motion capture framework that estimates 3D human poses and shapes from a single image at over 30 FPS. In benchmarks against state-of-the-art methods on multiple public datasets, our framework outperforms other optimization methods and achieves competitive accuracy against regression methods. Project page with code and videos: https://sites.google.com/view/scope-human/.
AB - We propose a novel sparse constrained formulation and from it derive a real-time optimization method for 3D human pose and shape estimation. Our optimization method, SCOPE (Sparse Constrained Optimization for 3D human Pose and shapE estimation), is orders of magnitude faster (avg. 4ms convergence) than existing optimization methods, while being mathematically equivalent to their dense unconstrained formulation under mild assumptions. We achieve this by exploiting the underlying sparsity and constraints of our formulation to efficiently compute the Gauss-Newton direction. We show that this computation scales linearly with the number of joints and measurements of a complex 3D human model, in contrast to prior work where it scales cubically due to their dense unconstrained formulation. Based on our optimization method, we present a real-time motion capture framework that estimates 3D human poses and shapes from a single image at over 30 FPS. In benchmarks against state-of-the-art methods on multiple public datasets, our framework outperforms other optimization methods and achieves competitive accuracy against regression methods. Project page with code and videos: https://sites.google.com/view/scope-human/.
UR - http://www.scopus.com/inward/record.url?scp=85120602425&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85120602425&partnerID=8YFLogxK
U2 - 10.1109/ICCV48922.2021.01126
DO - 10.1109/ICCV48922.2021.01126
M3 - Conference contribution
AN - SCOPUS:85120602425
T3 - Proceedings of the IEEE International Conference on Computer Vision
SP - 11437
EP - 11446
BT - Proceedings - 2021 IEEE/CVF International Conference on Computer Vision, ICCV 2021
PB - Institute of Electrical and Electronics Engineers Inc.
Y2 - 11 October 2021 through 17 October 2021
ER -