Reproducible Fault Injection in Distributed Systems
FaultSee is a language to describe distributed systems experiments subject to faults and a platform to execute and reproduce these experiments.
FaultSee source code is available at GitHub.
You can also check the EDCC paper, and a video recording of the talk we did at the conference.