Warehousing and mining massive RFID data sets

Jiawei Han*, Hector Gonzalez, Xiaolei Li, Diego Klabjan

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

28 Scopus citations

Abstract

Radio Frequency Identification (RFID) applications are set to play an essential role in object tracking and supply chain management systems. In the near future, it is expected that every major retailer will use RFID systems to track the movement of products from suppliers to warehouses, store backrooms and eventually to points of sale. The volume of information generated by such systems can be enormous as each individual item (a pallet, a case, or an SKU) will leave a trail of data as it moves through different locations. We propose two data models for the management of this data. The first is a path cube that preserves object transition information while allowing muti-dimensional analysis of path dependent aggregates. The second is a workflow cube that summarizes the major patterns and significant exceptions in the flow of items through the system. The design of our models is based on the following observations: (1) items usually move together in large groups through early stages in the system (e.g., distribution centers) and only in later stages (e.g., stores) do they move in smaller groups, (2) although RFID data is registered at the primitive level, data analysis usually takes place at a higher abstraction level, (3) many items have similar flow patterns and only a relatively small number of them truly deviate from the general trend, and (4) only non-redundant flow deviations with respect to previously recorded deviations are interesting. These observations facilitate the construction of highly compressed RFID data warehouses and the exploration of such data warehouses by scalable data mining. In this study we give a general overview of the principles driving the design of our framework. We believe warehousing and mining RFID data presents an interesting application for advanced data mining.

Original languageEnglish (US)
Title of host publicationAdvanced Data Mining and Applications - Second International Conference, ADMA 2006, Proceedings
EditorsXue Li, Osmar R. Zaïane, Zhanhuai Li
PublisherSpringer Verlag
Pages1-18
Number of pages18
ISBN (Print)3540370250, 9783540370253
DOIs
StatePublished - 2006
Event2nd International Conference on Advanced Data Mining and Applications, ADMA 2006 - Xi'an, China
Duration: Aug 14 2006Aug 16 2006

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume4093 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Other

Other2nd International Conference on Advanced Data Mining and Applications, ADMA 2006
CountryChina
CityXi'an
Period8/14/068/16/06

ASJC Scopus subject areas

  • Theoretical Computer Science
  • Computer Science(all)

Fingerprint Dive into the research topics of 'Warehousing and mining massive RFID data sets'. Together they form a unique fingerprint.

  • Cite this

    Han, J., Gonzalez, H., Li, X., & Klabjan, D. (2006). Warehousing and mining massive RFID data sets. In X. Li, O. R. Zaïane, & Z. Li (Eds.), Advanced Data Mining and Applications - Second International Conference, ADMA 2006, Proceedings (pp. 1-18). (Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics); Vol. 4093 LNAI). Springer Verlag. https://doi.org/10.1007/11811305_1