The 31st IPP Symposium

Towards a Content and Load Adaptive

Archan Misra, IBM Watson

Pervasive computing applications, such as Internet-based control systems and large-scale mobile asset management, require the monitoring and processing of data streams from highly distributed, mobile data sources. Middleware to support data stream processing must scale to potentially millions of streams. In this talk, we present recent work in developing a scalable, decentralized architecture for processing data streams based on distributed hash table (DHT) overlay networks. The primary focus will be the content and load aware scalable hashing (CLASH) protocol, which provides a flexible, adaptive mechanism for scaling DHTs to skewed workloads. CLASH is part of the IBM ContextSphere project, an effort to provide programmable and distributed middleware for data composition from pervasive data sources. We will also provide an overview of ContextSphere and present initial results that show the benefits of our approaches.