|
|
|
- Adaptive Fault Tolerance: SoHaR is developing high-performance middleware based
on CORBA technology to handle fault detection and recovery of large scale dispersed
computer networks. The detection and recovery strategies change dynamically, based
on the environment; hence the term adaptive is used to describe these approaches.
- Practicable Fault Tolerant Software: SoHaR developed techniques to improve the reliability
of Army tactical command and control systems by incorporating error detection, circumvention
and recovery provisions aimed at specific frequent and critical software faults.
This approach can be implemented much more readily than classical software fault
tolerance techniques.
- Fault Tolerant Software for Battle Management: In a two-year contract for RADC/COEE
SoHaR developed techniques for the design and verification of fault tolerant software
for real-time battle management tasks. The design method, based on fault trees,
allowed the incorporation of n-version programming, recovery blocks, and rollback/retry
and is the first attempt to integrate these techniques. The verification method,
based on fault trees and condition tables, can also be applied to critical non-fault
tolerant software.
- Software Reliability and Estimation: SoHaR developed a prediction methodology and
models for software and systems failures in a battle management environment. The
methodology is particularly targeted at highly stressed computing environments in
which resources may be lost due to normal attrition or due to enemy action.
- Survey of Techniques for High Reliability in Battle Management Computers: SoHaR
developed guidelines for evaluation of reliability attributes of computer architectures
during the concept definition phase for the U. S. Army BMD organization.
- Fault Tolerance in Large Distributed Databases: SoHaR was awarded a contract from
the U.S. Air Force to develop fault tolerance techniques for large distributed databases
(i.e., those containing image and related data) in real time systems. The project
includes both development of the techniques and demonstration on a distributed testbed
located at UCLA.
- Fault Tolerance and Testability in C3I Distributed Systems: SoHaR conducted a multi-year
study to determine design features that enhance the reliability of U.S. Air Force
command, control, communications, and intelligence (C3I) systems. The study included
a survey of the reliability and availability of major military, civilian governmental,
and commercial computer networks and the development of guidelines for the development
of reliable, testable, and maintainable distributed computing systems.
- Support for Advanced Fault Tolerant Computing Systems: SoHaR supported the development
of a high throughput fault tolerant computing system for battle management applications.
Specific tasks included overall architecture definition, fault tolerance requirements
definition, and conceptual design of the operating system.
|
|
|