Ensure high availability and reliability of cloud-based systems and applications. Design and implement monitoring, alerting, and incident response processes. Work on performance optimization, capacity planning, and disaster recovery strategies. Build and maintain infrastructure automation and tooling.
Competitive salary, comprehensive health benefits, flexible work arrangements, professional development opportunities, on-call compensation, and the chance to work on critical production systems.