zscaler

Principal DevOps Engineer - Federal

Apply Now

At a Glance

Location
United States
Work Regime
remote
Experience
12+ years
Posted
2026-03-02T18:40:03-05:00

Key Requirements

Required Skills

AWSDevOpsLinuxPythonSQLTerraform

Domain Knowledge

  • Automation
  • Education
  • Engineering
  • IoT
  • Medical
  • Regulatory

Requirements

You thrive in ambiguity. You're comfortable building the path as you walk it. You thrive in a dynamic environment, seeing ambiguity not as a hindrance, but as the raw material to build something meaningful.

You act like an owner. Your passion for the mission fuels your bias for action, and you operate with integrity because you genuinely care about the outcome. You adapt to what’s needed, navigating seamlessly between high-level strategy and hands-on execution.

You are a problem-solver. You seek out challenges because you are energized by finding solutions, knowing that solving the hard problems delivers the biggest impact.

You are customer-obsessed. You build deep empathy for the customer—both internal and external—and anchor your decisions in solving their real-world problems. You champion their needs from start to finish, knowing their success is our success.

You operate with urgency. You understand that in a high-growth environment, speed and quality are not mutually exclusive. You have a relentless focus on execution and a bias for action, delivering high-impact results quickly to win for the customer and the team.

Responsibilities

Design and implement a multi-region AWS architecture and lead the development of modular Terraform libraries to automate provisioning across diverse geographies

Architect self-healing infrastructure using advanced cloud load balancing, auto-scaling patterns, and Multi-AZ database topologies to ensure high availability

Modernize CI/CD pipelines and implement Blue/Green and Canary deployment strategies to ensure zero-downtime upgrades for a continuously running global network service

Build comprehensive SRE dashboards and implement intelligent alerting frameworks to detect regional outages or capacity exhaustion before they impact customers

Monitor cloud resource utilization and implement scaling policies that perfectly balance performance requirements with cost-efficiency