A company is looking for a Distributed Systems & Reliability Engineer.
Key Responsibilities
Own the reliability, availability, and failover behavior of the centralized planning system in production
Design and implement leader election, health checks, and controlled failover mechanisms
Engineer restart-safe workflows and extend observability for key service indicators
Required Qualifications
Strong experience building distributed, real-time backend systems in C++ and Go
Deep understanding of networked, message-driven architectures and distributed databases
Proven track record of designing high-availability and failover patterns
Expertise in idempotent operations and APIs that tolerate retries and duplicates
Hands-on experience with automated testing for distributed systems
System Engineer • Washington, District of Columbia, United States