Zscaler
Staff Escalation Engineer
At a Glance
- Location: United States
- Posted: 2026-02-19T08:49:11-05:00
Key Requirements
Domain Knowledge
- Education
- Engineering
- Medical
Requirements
True ownership involves leveraging dynamic range: the ability to navigate seamlessly between high-level strategy and hands-on execution.
Expert troubleshooting, debugging, and root-cause analysis for complex, high-priority incidents, with experience using CPU/memory profilers to diagnose resource exhaustion
Strong hands-on skills in Python, Bash, and Java; cloud platforms (GCP, AWS, Azure); and IaC/configuration tools (Terraform, Ansible)
Ability to write complex MySQL queries and generate business reports
Experience with authentication protocols such as SAML and OAuth
Solid networking fundamentals (TCP/IP, UDP, ICMP), experience debugging with Postman and packet captures, and proficiency with monitoring tools such as Grafana
Responsibilities
Own and resolve escalated cloud incidents end-to-end, including impact analysis, debugging, implementing solutions, and communicating with stakeholders
Collaborate with development, security, and operations to design and implement code/configuration fixes for complex system issues
Monitor system health, performance, and security via PagerDuty and enhance alerting to meet SLOs
Build diagnostic tools, dashboards, and documentation to enable faster, more effective incident resolution across the team
Lead production service ownership and supportability by deploying critical fixes, making key deployment decisions, and responding to high-pressure off-hours events