- Adaptive Fault Tolerance: SoHaR is developing high-performance
middleware based on CORBA technology to handle fault detection
and recovery in large-scale dispersed computer networks.
The detection and recovery strategies change dynamically,
based on the environment; hence the term adaptive is used
to describe these approaches.
- Practicable Fault Tolerant Software: SoHaR developed techniques
to improve the reliability of Army tactical command and
control systems by incorporating error detection, circumvention
and recovery provisions aimed at specific frequent and critical
software faults. This approach can be implemented much more
readily than classical software fault tolerance techniques.
- Fault Tolerant Software for Battle Management: In a two-year
contract for RADC/COEE, SoHaR developed techniques for the
design and verification of fault tolerant software for real-time
battle management tasks. The design method, based on fault
trees, allowed the incorporation of n-version programming,
recovery blocks, and rollback/retry, and was the first attempt
to integrate these techniques. The verification method,
based on fault trees and condition tables, can also be applied
to critical non-fault tolerant software.
- Software Reliability and Estimation: SoHaR developed a
prediction methodology and models for software and system
failures in a battle management environment. The methodology
is particularly targeted at highly stressed computing environments
in which resources may be lost due to normal attrition or
due to enemy action.
- Survey of Techniques for High Reliability in Battle Management
Computers: SoHaR developed guidelines for evaluation of
reliability attributes of computer architectures during
the concept definition phase for the U.S. Army BMD organization.
- Fault Tolerance in Large Distributed Databases: SoHaR
was awarded a contract from the U.S. Air Force to develop
fault tolerance techniques for large distributed databases
(i.e., those containing image and related data) in
real-time systems. The project includes both development of the
techniques and demonstration on a distributed testbed located
at UCLA.
- Fault Tolerance and Testability in C3I Distributed Systems:
SoHaR conducted a multi-year study to determine design features
that enhance the reliability of U.S. Air Force command,
control, communications, and intelligence (C3I) systems.
The study included a survey of the reliability and availability
of major military, civilian governmental, and commercial
computer networks and the preparation of guidelines for
building reliable, testable, and maintainable
distributed computing systems.
- Support for Advanced Fault Tolerant Computing Systems:
SoHaR supported the development of a high-throughput fault
tolerant computing system for battle management applications.
Specific tasks included overall architecture definition,
fault tolerance requirements definition, and conceptual
design of the operating system.
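
For readers unfamiliar with the techniques named in the list above (recovery blocks, rollback/retry, and n-version programming), the following is a minimal Python sketch of the general ideas. It is illustrative only and does not represent SoHaR's design method or any delivered software; every name in it (`recovery_block`, `n_version_vote`, `make_acceptance_test`, `faulty_sort`, `backup_sort`) is hypothetical.

```python
import copy
from collections import Counter


def recovery_block(state, alternates, acceptance_test):
    """Recovery block with rollback/retry (illustrative sketch).

    Each alternate runs on a fresh copy of the saved state, so a failed
    attempt is rolled back simply by discarding that copy. The first
    result that passes the acceptance test is returned.
    """
    for alternate in alternates:
        checkpoint = copy.deepcopy(state)  # checkpoint: the caller's state stays intact
        try:
            result = alternate(checkpoint)
        except Exception:
            continue  # an exception counts as a detected error; retry with the next alternate
        if acceptance_test(result):
            return result
    raise RuntimeError("all alternates failed the acceptance test")


def n_version_vote(versions, value):
    """N-version programming: run every version and return the majority result.

    Results must be hashable so they can be counted.
    """
    results = []
    for version in versions:
        try:
            results.append(version(value))
        except Exception:
            pass  # a crashed version casts no vote
    if not results:
        raise RuntimeError("no version produced a result")
    winner, votes = Counter(results).most_common(1)[0]
    if votes * 2 <= len(versions):
        raise RuntimeError("no majority among versions")
    return winner


def make_acceptance_test(original):
    """Build an acceptance test: same length as the input and in ascending order."""
    def test(result):
        in_order = all(a <= b for a, b in zip(result, result[1:]))
        return len(result) == len(original) and in_order
    return test


def faulty_sort(data):
    """Hypothetical primary routine with a bug: it drops one element."""
    data.pop()
    return sorted(data)


def backup_sort(data):
    """Hypothetical simpler alternate used when the primary is rejected."""
    return sorted(data)
```

Calling `recovery_block([3, 1, 2], [faulty_sort, backup_sort], make_acceptance_test([3, 1, 2]))` rejects the primary's output because an element is missing, retries with the alternate on an untouched copy of the input, and returns the alternate's result. `n_version_vote` differs in that all versions always run and disagreement is resolved by voting rather than by an acceptance test.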