Why This Matters

Resilience in distributed systems requires robust mechanisms for service discovery despite node failures and network partitions. This work is innovative because it presents practical design patterns for implementing distributed discovery services that maintain consistency and enable self-healing in decentralized systems.

What We Did

This paper describes RIAPS' resilient discovery service for distributed smart systems, including mechanisms for service registration, heartbeat-based failure detection, and distributed hash table implementation for service lookups. The work presents design choices for handling node failures and ensuring scalability across network changes.

Key Results

The paper demonstrates the discovery service handling ingress and egress of nodes in clusters, maintaining consistent service registries across the network. Results show successful service lookup under node failures and validation of key-based service distribution using OpenDHT.

Full Abstract

Cite This Paper

@inproceedings{Eisele2017b,
  author = {Eisele, Scott and Madari, Istv{\'{a}}n and Dubey, Abhishek and Karsai, Gabor},
  booktitle = {20th {IEEE} International Symposium on Real-Time Distributed Computing, {ISORC} 2017, Toronto, ON, Canada, May 16-18, 2017},
  title = {RIAPS:} Resilient Information Architecture Platform for Decentralized Smart Systems},
  year = {2017},
  pages = {125--132},
  abstract = {The emerging Fog Computing paradigm provides an additional computational layer that enables new capabilities in real-time data-driven applications. This is especially interesting in the domain of Smart Grid as the boundaries between traditional generation, distribution, and consumer roles are blurring. This is a reflection of the ongoing trend of intelligence distribution in Smart Systems. In this paper, we briefly describe a component-based decentralized software platform called Resilient Information Architecture Platform for Smart Systems (RIAPS) which provides an infrastructure for such systems. We briefly describe some initial applications built using this platform. Then, we focus on the design and integration choices for a resilient Discovery Manager service that is a critical component of this infrastructure. The service allows applications to discover each other, work collaboratively, and ensure the stability of the Smart System.},
  bibsource = {dblp computer science bibliography, https://dblp.org},
  biburl = {https://dblp.org/rec/bib/conf/isorc/EiseleMDK17},
  category = {selectiveconference},
  contribution = {lead},
  doi = {10.1109/ISORC.2017.22},
  file = {:Eisele2017b-RIAPS_Resilient_Information_Architecture_Platform_for_Decentralized_Smart_Systems.pdf:PDF},
  keywords = {service discovery, distributed hash table, resilience, decentralized systems, fault tolerance},
  project = {smart-transit,smart-cities},
  tag = {platform,decentralization,power},
  timestamp = {Wed, 16 Oct 2019 14:14:53 +0200},
  url = {https://doi.org/10.1109/ISORC.2017.22}
}
Quick Info
Year 2017
Keywords
service discovery distributed hash table resilience decentralized systems fault tolerance
Research Areas
CPS middleware scalable AI
Search Tags

RIAPS, Resilient, Information, Architecture, Platform, Decentralized, Smart, Systems, service discovery, distributed hash table, resilience, decentralized systems, fault tolerance, CPS, middleware, scalable AI, 2017, Eisele, Madari, Dubey, Karsai