OBJECTSTORE Seven principles of effective RFID data management Though the recent publicity might suggest differently, Radio-Frequency Identification technology – RFID – is not new. RFID first appeared during World War II when Allied aircraft carried transponders that would acknowledge radar interrogations from friendly aircraft. Since then, both the size and cost of RFID tags have followed the progression of Moore’s Law to where it’s now feasible to attach RFID tags to a package of razor blades. But in fact, RFID has been used in commercial applications for years, from automated toll systems like E-Z Pass to automated manufacturing facilities. But in 2003, triggered by mandates from Wal-Mart and others, a media feeding frenzy ensued. Such mandates have validated that RFID just might enable a new era of business optimization where a Proctor and Gamble can instantly know its inventory stock levels, a Gillette can eliminate razor blade theft, and a Wal-Mart can squeeze even more cost from its supply chain by reducing the labor associated with bar-code scanning. This value, accumulated millions of times daily, will add up to billions. And that’s just the beginning: visionaries see RFID impacting anything that involves physical items where keeping track has value: baggage tracked by airlines to improve security; Alzheimer’s patients monitored in real time; pharmaceutical shipments to curtail counterfeiting. But every opportunity carries its challenge, and there are many challenges posed by RFID. One of the biggest hills to climb is dealing with the flood of data RFID generates: in-store RFID implementation at Wal-Mart has the potential to generate as much data in three days as is contained in the entire U.S. Library of Congress (based on estimates from analyst Jim Crawford from Retail Forward). And it’s not just a problem for companies the size of Wal-Mart – even modest RFID deployments will generate gigabytes of rapidly changing data a day. The volume and velocity of RFID data will exceed the capacity of existing technology infrastructure. Capturing large volumes of data at high velocity is just the first step. Woody Allen once said: “I took a course in speed reading and was able to read War and Peace in 20 minutes. It's about Russia.” If all you learn from massive volumes of RFID data is the most general conclusions then the value of that data is lost. It is as if one looked at a 250 gigabyte disk drive filled with useful information and concluded simply that it contained 10,424 files. If all you do is capture RFID events, then most of the value is lost because the advantage of RFID is real time knowledge, not data collection. Instant decision-making on high-volume, high-velocity data streams is not a new challenge. It is the same problem found in the program trading systems of large investment banks, the command and control applications used by the military, and the network management applications in telecommunications. Based on experience in those industries, as well as RFID system development, I pose seven principles to help you effectively manage your RFID data. Principle
#1: Digest the data close to the source A better approach is to digest your RFID event traffic close to the source – at the “edge” of the enterprise – and forward only the meaningful events to central IT systems. This digestion process is more than basic filtering – it’s data cleansing, consolidation, and summarization; it’s exception handling of many types – automated and human-made; it’s compensation for unreliable technical infrastructure – application, hardware, network failure; it’s adjustment for unreliable RFID tag-reader environments. Digestion of data close to the source allows this complex processing to occur, and exceptions to be handled locally thus protecting central IT systems from the flood of data. Digest RFID event data close to the source of RFID activity to ensure greater reliability and protect your IT infrastructure. Principle
#2: Turns simple events into meaningful events According to Gartner, CEP will become a common computing model within five to ten years. But developers aren’t sitting still – you can build CEP systems today in languages like Java or C++. Whether you use a CEP tool or build your own, principle #2 is: turn simple events into meaningful events to derive actionable knowledge from discreet events. Principle
#3: Buffer event streams An RFID data concentrator is software that collects and processes raw RFID event flows close to the source of the data (principle 1). There are three primary elements: RFID middleware, event processing, and an in-memory data cache. RFID middleware provides the interface for applications to receive events from RFID readers. There are many commercial implementations of RFID middleware available today. Additionally, the EPCglobal standards organization is in the process of defining a standard for RFID middleware, the ALE (Application Level Event) standard, which defines a reader-neutral interface for receiving events from RFID readers. Event processing
handles high-volume, high-performance flows of events by organizing raw
events into pipelines. Pipelining is a concept found in hardware and software
systems of many types, including the central processing units (CPUs) in
your computer, in software designed for handling stock market feeds in
real time, and transaction processing systems that your credit card company
uses. Pipelines allow the events to be categorized, then processes those
categories with a set of simple tasks, as each thread gets a slice of
processing time from the CPU. By performing a large number of operations
on data in small “bursts”, overall throughput is increased,
and the average speed that any individual event can be processed is increased,
as well. Employ data concentrators that buffer event stream flows by combining RFID middleware, event processing, and an in-memory data cache to achieve the reliable speed you need. Principle
#4: Cache context For example, context can come from information in an advanced shipping notice (ASN). As provided by a manufacturing plant, the ASN can be used to confirm that tagged items sent by the manufacturer were actually received. Context may also come from an EPC Information Services directory (EPC IS). EPCglobal is defining standards to allow the EPC IS to serve as the framework for trading partners to exchange detailed product information. The EPC IS allows anyone with proper authorization in Kellogg’s supply chain to learn whether the EPC tag 01.0000A89.00016F.000169DC0 is a 24 ounce box of Kellogg’s Corn Flakes or a Gillette Mach3 Turbo razor. In addition the EPC IS will allow you to determine where it came from, where it’s going, when it was produced, and so on. Reference data may also come from internal enterprise systems. For example, RFID-enabled baggage handling systems use data from passenger information systems to ensure that RFID-tagged bags get on the same plane you do. Just as principle 3 leverages in-memory data caching for event data, context data needs the same approach. EPCGlobal’s next generation (Gen2) standard of RFID readers specifies read rates of 1,800 RFID tags a second, which means a distribution center with 20 readers could generate 36,000 events a second at peak rates. Adding 36,000 SQL requests to your existing warehouse database probably isn’t feasible, so it’s best to replicate, and cache this data in your data concentrator such that is available in real-time. By caching reference data, your data concentrator can process RFID event data, in context. Principle
#5: Federate data distribution in near real time The definition of federated, from Merriam-Webster, is: “united in an alliance.” Federating data is hardly new – trading systems federate data daily among different financial centers and do it in near real-time. For RFID, federated data distribution unites the RFID data concentrators in an alliance – whereby meaningful RFID events and context data is shared among members. A reliable, distributed middleware fabric facilitates data federation, integrated with the data caches at each concentrator, thus enabling meaningful events to be distributed among concentrators. The implementation
details of this near real time distribution fabric is beyond the scope
of this paper, but the principles are the same as we’ve described:
process data close to the source, utilize data concentrators, cache data
in concentrators, filter and handle exceptions. Ultimately, one must forward
meaningful events to members of a distributed “alliance” to
enable as close to a real-time view of the system as possible. Principle
#6: Age RFID data gracefully An airline baggage handling system must track events from the gate to the plane, but they don’t all have to be stored forever. Yet RFID-enabled systems must permit operators to see the end-to-end baggage flow. The value is in understanding the baggage origin, how long it was on the conveyor belt, where it was handled, and how long it was handled. Furthermore, the basic flow of information will need augmentation – for example, at what time was the bag loaded into the aircraft and which cargo bay was it placed in? Finally, performance and scalability typically requires that the storage of these events be optimized as the system runs. The RFID concentrator can delete events that are superfluous (e.g., redundant reads of baggage), supplement events that require context (e.g., associate a cargo bay number to a “bag loaded” event), siphon data off to other systems (e.g., save all baggage events for security subsequent security audits), and prepare event data to be distributed to other airports. At every step of aging process, the data concentrator’s cache (principle 3) is kept transactionally consistent to provide system recoverability. Age RFID data to keep your working set of data manageable, enrich raw data with required context, and reduce the load on down-stream systems Principle
#7: Automate exception handling Though detection of events is done by complex event processing, exception handling requires knowledge of what happened leading up to this event. That operation requires an algorithmic approach to corrective action. The combination of CEP and event replay is what makes exception handling possible. Event replay requires the storage of RFID events in a data cache such as ObjectStore RFID Accelerator, that operates much like a TiVo for your RFID data. Just as TiVo has revolutionized the way we watch television, with flexible archival and random playback capability, RFID event replay can help automate exception handling. When an exceptional event occurs (e.g., bag must be reloaded), the data cache can replay the events for a particular item to figure out where it was last seen, which operator is physically closest to the item now, and then send a message, for example, to that person’s pager. Without event replay, the RFID system has no ability to go back in time to discover when a bag was loaded, which section of the plane it was loaded into, and what to do to correct the exception. Automate exception handling to improve overall business efficiency – our final principle of effective RFID data management. Pulling
it together Mark Palmer is RFID Technical evangelist with ObjectStore, an operating company of Progress Software. ObjectStore® RFID Accelerator transforms raw RFID event streams into meaningful business data. RFID Accelerator's proven, scalable architecture facilitates the real-time query and analysis of RFID sensor data as it is captured, the historical time-series analysis of such data, and the integration RFID into key business applications that can leverage RFID information to better understand and manage their assets. For more information about ObjectStore please visit our website at www.objectstore.com or mail us at info-emea@objectstore.net. |
|
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|