Cerebras Systems
Inference Core Platform Benchmarking Engineer
At a Glance
- Location: Toronto, Ontario, Canada
- Posted: 2026-03-18T10:00:25-04:00
Key Requirements
Required Skills
Domain Knowledge
- Engineering
Requirements
- Proficiency in Python and/or C++ programming.
- Proven experience in building and scaling automated infrastructure.
- Strong background in throughput and performance optimization techniques, especially in complex, large-scale systems.
- Familiarity with problem-solving at the intersection of hardware and software.
- Hands-on experience with AI workloads and architectures is a plus.
Responsibilities
The Inference Core Platform group is at the heart of Cerebras' mission to deliver the world’s fastest AI inference.
Our team builds the foundational software and hardware infrastructure that powers low-latency, high-speed, high-throughput deployment on the Cerebras Wafer-Scale Engine (WSE).
We are responsible for the full stack—from model compilation and scheduling down to custom hardware kernels and driver development.
The Platform Benchmarking team plays a pivotal role in shaping the performance and scalability of AI inference on one of the most advanced computing systems ever built.