OBJECTSTORE

Handling RFID data accurately and at speed

You could water your lawn pretty quickly if you could use the fire hydrant on your street. But if you hooked up your garden hose directly to the hydrant, you’d wind up with a hose blown apart, a big pool of water and a very angry fire chief. If you deploy an RFID system by directly hooking RFID readers to your backend systems, the results will be similarly disastrous. What’s needed is the right architecture not only to maintain accuracy and authenticity, but to make meaning of the vast volumes of data.

RFID readers will deliver a gush of data. Our estimates are that pallet-, case- and item-level tracking, combined with data generated by RFID readers as items move within the enterprise, will increase the volume of data by 100 to 1,000 times today’s levels.

Effective RFID implementations must borrow from the architectural principles developed for financial trading systems, process control and large-scale network management. Like RFID systems, these systems process terabytes of data, correct errors in real time, correlate events, detect patterns, re-organise and cleanse data and recover from faults – all in real time. RFID architectures should embrace three central principles of these systems.

1. Develop an operational data management architecture
An operational data management architecture is a software system that captures events at the “edge” of the enterprise, where operational activity occurs, rather than in the centre, where business-oriented transaction processing occurs. These architectures act as a cache for transactional, enterprise systems, and then communicate via middleware once actionable information is identified.

By digesting your RFID event traffic close to the source – at the “edge” of the enterprise – only the meaningful and validated events are passed on to central IT systems. This digestion process is more than basic filtering – it’s data cleansing, consolidation, and summarisation; it’s exception handling of many types – automated and human-made; it’s compensation for unreliable technical infrastructure – application, hardware, network failure; it’s adjustment for unreliable RFID tag-reader environments. Digestion of data close to the source allows this complex processing to occur, and exceptions to be handled locally thus protecting central IT systems from the flood of data.

Operational data management architectures are common in the financial industry – for example, trading systems collect market data in real time, execute trading strategies heuristically, identify trading opportunities, and forward the results to traders for action. These systems eventually generate the trades tracked by back-office systems, but billions of dollars are invested every year on the operational stages needed that precede the actual trade. This kind of data management and applied intelligence will become more prevalent as RFID adoption encourages companies to deploy intelligence at the point of operational activity.

2. Employ RFID data “concentrators”
You could water your lawn using water from a hydrant if you could step down the pressure and flow. One way to deal with the gush of RFID data is to develop RFID data “concentrators” that help control the flow and provide filtering and aggregation of EPC event streams (see figure below). An RFID data concentrator is software that collects and processes raw RFID event flows close to the source of the data (principle 1). There are three primary elements: RFID middleware, event processing, and an in-memory data cache – combined they achieve the reliable speed you need.

RFID middleware provides the interface for applications to receive events from RFID readers. There are many commercial implementations of RFID middleware available today. Additionally, the EPCglobal standards organisation (EPCglobal is organised by EAN International and the Uniform Code Council (UCC) to drive the global adoption and implementation of the EPCglobal Network) is in the process of defining a standard for RFID middleware, the ALE (Application Level Event) standard, which defines a reader-neutral interface for receiving events from RFID readers.



Event processing handles high-volume, high-performance flows of events by organising raw events into pipelines. Pipelining is a concept found in hardware and software systems of many types, including the central processing units (CPUs) in your computer, in software designed for handling stock market feeds in real time, and transaction processing systems that your credit card company uses. Pipelines allow the events to be categorised, then process those categories with a set of simple tasks, as each thread gets a slice of processing time from the CPU. By performing a large number of operations on data in small “bursts”, overall throughput is increased, and the average speed that any individual event can be processed is increased, as well.



Finally, an in-memory database, or data cache, makes the concentrator work in real time. In-memory data management techniques are crucial to accommodate the real-time nature of RFID. It’s a basic law of physics: memory is 1,000 times faster than disk. Physics is the reason why MIT’s (Massachusetts Institute of Technology) Auto-ID centre included a “Real Time In-Memory Event Database,” or REID, as part of its first RFID reference architecture. Similarly, Stanford’s CEP research has employed an “in-core-memory” database. While this cache provides much the same reliability, availability, and fault tolerance of a database, the distinction of calling this element a “cache” is to distinguish it from the enterprise application database, shown at right in the data concentrator diagram.

3. Create pipelines for distribution
Developers often start small projects that are successful. But as the project scope expands, data volume grows far more rapidly than expected. The system can’t scale and the project winds up becoming a disaster (think exploded garden hoses and pools of water). RFID deployments are particularly susceptible to this “success disaster” syndrome. Increasing numbers of RFID readers will lead to more streams of data, which can quickly overwhelm the system. The way to handle the load is through pipeline processing.

Once you have an RFID data concentrator architecture in place, set up pipelines to handle the streams of data. Pipelines separate streams of EPC data to handle load and coordinate or process the data streams after they have been captured. To achieve additional scalability, you may need more than one concentrator distributed on a set of distributed machines, each one controlling a domain of RFID activity, distributed on multiple machines. For instance, let’s say you have deployed EPC readers at a distribution center with 30 dock doors. Half the readers are tied into one RFID data concentrator and half into another. You should create several pipelines from each concentrator. One will feed data into a local inventory database, one into a regional database, and one into financial applications.

As rollouts of major RFID projects unfold, companies will quickly find they are going to be overwhelmed by data, unless they follow these three basic principles and build an RFID architecture that will scale, not fail.

ObjectStore® is a global provider of real-time data management products. Its products enable corporate data caching and complex event processing, and its leading object database is renowned for performance and scalability. In the UK ObjectStore can be reached on 01753 216350. Or at info-emea@objectstore.net
By Mark Palmer RFID technical evangelist with ObjectStore. www.objectstore.com


Register to receive the RFID Today magazine and emails for free.  

RFID Today, 3 Todmore, Greatham, Liss, Hants, GU33 6AR

Tel: +44 (0)1420 538196

© Copyright 2001 - 2006 by ADA Communications Ltd. All rights reserved. Statements of opinion and fact are made on the responsibility of the authors alone and do not imply an opinion on the part of ADA Communications Ltd or the editorial staff. Registered in England No. 04843018