xai

Member of Technical Staff - Voice Model

Preview — apply on company site for full detailsApply Now

At a Glance

Location: Palo Alto, California, United States
Compensation: s. COMPENSATION AND BENEFITS: $150,000 - $450,000 USD Base salary is just one p
Posted: 2026-03-16T18:15:00-04:00

Key Requirements

Required Skills

KubernetesPyTorchPython

Requirements

Python expert with deep proficiency in writing clean, efficient code for AI/ML systems.

Hands-on experience processing large-scale datasets using tools like Spark and Ray for cleaning, augmentation, and feature extraction.

Ability to set up and run rigorous evaluation pipelines: objective metrics, human preference studies, content factuality checks, and iterative A/B testing to drive model improvements.

Experience building or working with large-scale distributed training and inference systems on Kubernetes.

Compensation & Benefits

$150,000 - $450,000 USD

Base salary is just one part of our total rewards package at xAI, which also includes equity, comprehensive medical, vision, and dental coverage, access to a 401(k) retirement plan, short & long-term disability insurance, life insurance, and various other discounts and perks.

xAI is an equal opportunity employer. For details on data processing, view our

Recruitment Privacy Notice

Responsibilities

You will join the Grok Voice Model team to help build the world’s best voice AI.

Our goal: make talking to AI feel like conversing with the most charming, kind, and knowledgeable person imaginable.

We’re seeking exceptionally smart, execution-oriented engineers to help us get there.

Build and iterate a comprehensive evaluation framework covering objective metrics (accuracy, quality, latency, expressiveness), human preference studies, content factuality assessments, real-time interaction quality, and experimentation infrastructure to measure and improve performance.

Work closely with product teams to integrate voice models into applications and real-time environments, define spoken interaction specifications, and handle the full lifecycle from prototype to global-scale deployment for stable, low-latency, delightful voice experiences.