TY - GEN
T1 - OF-NET
T2 - ASME 2020 15th International Manufacturing Science and Engineering Conference, MSEC 2020
AU - Yang, Ru
AU - Guo, Ping
N1 - Funding Information:
This research was supported by the start-up fund from McCormick School of Engineering, Northwestern University, Evanston, IL, USA.
Publisher Copyright:
Copyright © 2020 ASME
PY - 2020
Y1 - 2020
N2 - Object motion trajecting using computer vision is a technology enabler for various smart manufacturing systems. Sub-pixel displacement estimation is still unsatisfactory with the existing tracking algorithms. In this paper, we extend the popular computer vision task, optical flow, to solve the small displacement detection problem. Since conventional optical flow methods have weakness in robustness and poor performance at the boundary region, convolutional neural networks (CNNs) based approach has been developed to solve optical flow problems. We construct a new multi-scale CNN, OF-NET, for sub-pixel optical flow estimation. In the model, we adopt an inverse-pyramid structure to enlarge the small displacement to larger-scale feature maps for motion detection. A novel data structure with multi-level ground truth is adopted to synthesize the dataset for training. The results have demonstrated competitive performance and efficiency compared with the existing state-of-the-art, FlowNetC, and the conventional optical flow method. Validation results from our model reach an end point error (EPE) at the level of 0.01 pixels. Our model excels in identifying the boundaries of moving objects compared with the other reference methods. The efficiency of the model has been optimized by using anisotropic upscaling and independent learning in two directions. Real-time tracking of 25-35 fps is achievable with the proposed model. The model is also verified with experimental results with good performance.
AB - Object motion trajecting using computer vision is a technology enabler for various smart manufacturing systems. Sub-pixel displacement estimation is still unsatisfactory with the existing tracking algorithms. In this paper, we extend the popular computer vision task, optical flow, to solve the small displacement detection problem. Since conventional optical flow methods have weakness in robustness and poor performance at the boundary region, convolutional neural networks (CNNs) based approach has been developed to solve optical flow problems. We construct a new multi-scale CNN, OF-NET, for sub-pixel optical flow estimation. In the model, we adopt an inverse-pyramid structure to enlarge the small displacement to larger-scale feature maps for motion detection. A novel data structure with multi-level ground truth is adopted to synthesize the dataset for training. The results have demonstrated competitive performance and efficiency compared with the existing state-of-the-art, FlowNetC, and the conventional optical flow method. Validation results from our model reach an end point error (EPE) at the level of 0.01 pixels. Our model excels in identifying the boundaries of moving objects compared with the other reference methods. The efficiency of the model has been optimized by using anisotropic upscaling and independent learning in two directions. Real-time tracking of 25-35 fps is achievable with the proposed model. The model is also verified with experimental results with good performance.
KW - Convolutional Neural Network
KW - Optical Flow
KW - Sub-pixel Displacement
UR - http://www.scopus.com/inward/record.url?scp=85101470223&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85101470223&partnerID=8YFLogxK
U2 - 10.1115/MSEC2020-8517
DO - 10.1115/MSEC2020-8517
M3 - Conference contribution
AN - SCOPUS:85101470223
T3 - ASME 2020 15th International Manufacturing Science and Engineering Conference, MSEC 2020
BT - Manufacturing Processes; Manufacturing Systems; Nano/Micro/Meso Manufacturing; Quality and Reliability
PB - American Society of Mechanical Engineers
Y2 - 3 September 2020
ER -