proton
Site Reliability Engineer - Observability
At a Glance
- Location
- Geneva; Paris
- Work Regime
- onsite
- Posted
- 2026-06-15T03:47:37-04:00
Key Requirements
Required Skills
Domain Knowledge
- Automation
- Engineering
Benefits & Perks
ht. We offer strong health coverage, solid retirement options, generous lea
Requirements
Extensive experience in an SRE, DevOps, or Platform Engineering role.
Comfortable writing Python and/or Go for tooling and automation.
Hands-on experience operating open-source observability stacks (logs, metrics, traces, alerting).
Working knowledge of Kubernetes and GitOps workflows.
Practical experience with infrastructure-as-code (Terraform, Ansible, Puppet, or similar) and solid Linux system administration skills.
Experience running ClickHouse for log and metric storage at scale.
Compensation & Benefits
Work that Matters:
millions of people trust Proton with their privacy. We answer only to our users — not advertisers, not investors with conflicting agendas, not governments. The work you do here is real, and the impact is measurable. (read more about our impact
here
)
Stock Options:
at Proton, we all have the opportunity to be owners of the company. From day one, you have a real stake in what we're building. When Proton wins, you win.
Responsibilities
We're a small, tool-agnostic team that owns the observability infrastructure behind Proton's services — the logs, metrics, traces, and alerts that keep systems running smoothly for the millions of users who trust us with their privacy.
We run on open-source stacks across Proton's on-premise data centers, and we dogfood heavily: we're our own first customers.
We favor simple, solid solutions over large engineering efforts, and we believe good systems emerge iteratively.
Design, deploy, and operate observability pipelines for logs, metrics, traces, and alerts across Proton's services using open-source technologies.
Partner with development and platform teams to ship practical alerting, dashboarding, and integration solutions that engineers actually rely on.
Build reusable templates and tooling that streamline onboarding, incident response, and analysis.