Abstract
Given a set A of activities expressed via temporal stochastic automata, and a set O of observations (detections of low level events), we study the problem of identifying instances of activities from A in O. While past work has developed algorithms to solve this problem, in this paper, we develop methods to significantly scale these algorithms. Our PASS architecture consists of three parts: (i) leveraging past work to represent all activities in A via a single 'merged' graph, (ii) partitioning the graph into a set of C subgraphs, where (C+1) is the number of compute nodes in a cluster, and (iii) developing a parallel activity detection algorithm that uses a different compute node in the cluster to intensively process each subgraph. We propose three possible partitioning methods and a parallel activity-search detection ( PASSDetect) algorithm that coordinates computations across nodes in the cluster. We report on experiments showing that our algorithms enable us to handle both large numbers of observations per second as well as large merged graphs. In particular, on a cluster with 9 compute nodes, PASS can reliably handle between 400K and 569K observations per second and merged graphs with as many as 50K vertices.
Original language | English (US) |
---|---|
Article number | 6654141 |
Pages (from-to) | 1989-2001 |
Number of pages | 13 |
Journal | IEEE Transactions on Knowledge and Data Engineering |
Volume | 26 |
Issue number | 8 |
DOIs | |
State | Published - Aug 2014 |
Keywords
- Activity detection
- parallel computation
- temporal stochastic automata
ASJC Scopus subject areas
- Information Systems
- Computer Science Applications
- Computational Theory and Mathematics