TY - GEN
T1 - Speeding up spatio-temporal sliding-window search for efficient event detection in crowded videos
AU - Yuan, Junsong
AU - Liu, Zicheng
AU - Wu, Ying
AU - Zhang, Zhengyou
PY - 2009
Y1 - 2009
N2 - Despite previous successes of sliding window-based object detection in images, searching desired events in the volumetric video space is still a challenging problem, partially because the pattern search in spatio-temporal video space is much more complicated than that in spatial image space. Without knowing the location, temporal duration, and the spatial scale of the event, the search space for video events is prohibitively large for exhaustive search. To reduce the search complexity, we propose a heuristic branch-and-bound solution for event detection in videos. Unlike existing branch-and-bound method which searches for an optimal subvolume before comparing its detection score against the threshold, we aim at directly finding subvolumes whose scores are higher than the threshold. In doing so, many unnecessary branches are terminated much earlier, thus the search speed can be much faster. To validate this approach, we select three human action classes from the KTH dataset for training while testing with our own action dataset which has clutter and moving backgrounds as well as large variations in lighting, scale, and performing speed of actions. The experiment results show that our technique dramatically reduces computational cost without significantly degrading the quality of the detection results.
AB - Despite previous successes of sliding window-based object detection in images, searching desired events in the volumetric video space is still a challenging problem, partially because the pattern search in spatio-temporal video space is much more complicated than that in spatial image space. Without knowing the location, temporal duration, and the spatial scale of the event, the search space for video events is prohibitively large for exhaustive search. To reduce the search complexity, we propose a heuristic branch-and-bound solution for event detection in videos. Unlike existing branch-and-bound method which searches for an optimal subvolume before comparing its detection score against the threshold, we aim at directly finding subvolumes whose scores are higher than the threshold. In doing so, many unnecessary branches are terminated much earlier, thus the search speed can be much faster. To validate this approach, we select three human action classes from the KTH dataset for training while testing with our own action dataset which has clutter and moving backgrounds as well as large variations in lighting, scale, and performing speed of actions. The experiment results show that our technique dramatically reduces computational cost without significantly degrading the quality of the detection results.
KW - Event detection
KW - Sliding window
KW - Spatio-temporal pattern
KW - Video pattern search
UR - http://www.scopus.com/inward/record.url?scp=72049102437&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=72049102437&partnerID=8YFLogxK
U2 - 10.1145/1631024.1631028
DO - 10.1145/1631024.1631028
M3 - Conference contribution
AN - SCOPUS:72049102437
SN - 9781605587547
T3 - 1st ACM International Workshop on Events in Multimedia - EiMM'09, Co-located with the 2009 ACM International Conference on Multimedia, MM'09
SP - 3
EP - 8
BT - 1st ACM International Workshop on Events in Multimedia - EiMM'09, Co-located with the 2009 ACM International Conference on Multimedia, MM'09
T2 - 1st ACM International Workshop on Events in Multimedia - EiMM'09, Co-located with the 2009 ACM International Conference on Multimedia, MM'09
Y2 - 19 October 2009 through 24 October 2009
ER -