Why This Matters

As commercial aircraft increasingly rely on integrated modular avionics where software implements safety-critical functions, the ability to diagnose and mitigate software faults in real-time becomes essential for maintaining aircraft safety. This work innovates by developing practical techniques to model complex software systems and automatically generate health managers that can detect subtle faults missed by traditional testing. The model-based approach enables diagnosis of latent software defects that may escape rigorous verification and testing during development.

What We Did

This work presents a comprehensive case study applying software health management techniques to the Boeing 777 Air Data Inertial Reference Unit using timed failure propagation graphs for real-time fault diagnosis. The authors develop models of the ADIRU software architecture using the ARINC Component Model and apply component-level health management to detect anomalies in individual processing modules. The framework integrates formal methods for specifying monitoring conditions and uses event-based anomaly detection with timed state machines to diagnose faults in real-time.

Key Results

The case study demonstrates successful detection and diagnosis of injected faults in an emulated ADIRU system using the component framework and monitoring-based approach. Results show that the system can identify specific component failures and characterize their effects on system output through timed fault propagation analysis. The code generation tools successfully produce runtime code that performs health monitoring with acceptable performance overhead while maintaining strict timing properties required for real-time avionics.

Full Abstract

Cite This Paper

@techreport{Mahadevan2011a,
  author = {Mahadevan, Nagabhushan and Dubey, Abhishek and Karsai, Gabor},
  institution = {Institute For Software Integrated Systems, Vanderbilt University},
  title = {A Case Study On The Application of Software Health Management Techniques},
  year = {2011},
  address = {Nashville},
  month = {01/2011},
  number = {ISIS-11-101},
  abstract = {Ever increasing complexity of software used in large-scale, safety critical cyber-physical systems makes it increasingly difficult to expose and thence correct all potential bugs. There is a need to augment the existing fault tolerance methodologies with new approaches that address latent software bugs exposed at runtime. This paper describes an approach that borrows and adapts traditional {\textquoteleft}Systems Health Management{\textquoteright} techniques to improve software dependability through simple formal specification of  runtime monitoring, diagnosis  and mitigation strategies. The two-level approach of Health Management at Component and System level  is demonstrated on a simulated case study of an Air Data Inertial Reference Unit (ADIRU).  That subsystem was categorized as the primary failure source for the in-flight upset caused in the Malaysian Air flight 124 over Perth, Australia in August 2005.},
  attachments = {http://www.isis.vanderbilt.edu/sites/default/files/ADIRUTechReport.pdf},
  contribution = {colab},
  file = {:Mahadevan2011a-A_case_study_on_the_application_of_software_health_management_techniques.pdf:PDF},
  tag = {platform},
  keywords = {software health management, avionics, fault diagnosis, timed failure propagation, real-time systems, component models}
}
Quick Info
Year 2011
Keywords
software health management avionics fault diagnosis timed failure propagation real-time systems component models
Research Areas
CPS middleware Explainable AI
Search Tags

Case, Study, Application, Software, Health, Management, Techniques, software health management, avionics, fault diagnosis, timed failure propagation, real-time systems, component models, CPS, middleware, Explainable AI, 2011, Mahadevan, Dubey, Karsai