xai

Member of Technical Staff - Voice Model

Apply Now

At a Glance

Location
Palo Alto, California, United States
Compensation
s. COMPENSATION AND BENEFITS: $150,000 - $450,000 USD Base salary is just one p
Posted
2026-03-16T18:15:00-04:00

Key Requirements

Required Skills

KubernetesPyTorchPython

Requirements

Python expert with deep proficiency in writing clean, efficient code for AI/ML systems.

Hands-on experience processing large-scale datasets using tools like Spark and Ray for cleaning, augmentation, and feature extraction.

Ability to set up and run rigorous evaluation pipelines: objective metrics, human preference studies, content factuality checks, and iterative A/B testing to drive model improvements.

Experience building or working with large-scale distributed training and inference systems on Kubernetes.

Compensation & Benefits

$150,000 - $450,000 USD

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

xAI is an equal opportunity employer. For details on data processing, view our

Recruitment Privacy Notice

.

Responsibilities

You will join the Grok Voice Model team to help build the world’s best voice AI.

Our goal: make talking to AI feel like conversing with the most charming, kind, and knowledgeable person imaginable.

We’re seeking exceptionally smart, execution-oriented engineers to help us get there.

Build and iterate a comprehensive evaluation framework covering objective metrics (accuracy, quality, latency, expressiveness), human preference studies, content factuality assessments, real-time interaction quality, and experimentation infrastructure to measure and improve performance.

Work closely with product teams to integrate voice models into applications and real-time environments, define spoken interaction specifications, and handle the full lifecycle from prototype to global-scale deployment for stable, low-latency, delightful voice experiences.