datadog Nov 20, 2024 How we use formal modeling, lightweight simulations, and chaos testing to design reliable distributed systems (opens in new tab) distributed-systemsgrpchigh-availabilityfoundationdbchaos-testingformal-modelingmessage-queuinghorizontal-scalability