Motivation
As machines grow in size:
- Mean time between failures (MTBF) decreases
- Applications have to tolerate faults
Applications need fast, low-cost, and scalable fault tolerance support
Fault-tolerant runtime for:
- Charm++ (Parallel C++ language and runtime)
- Adaptive MPI