xai

AI Tutor - Italian

Apply Now

At a Glance

Location
Remote
Work Regime
remote
Posted
2026-04-01T00:38:09-04:00

Benefits & Perks

Health Insurance

d positions include health insurance, 401(k) plan, and paid sick leave. Spec

Requirements

Demonstrated ability to transcribe audio with high accuracy across accents and varying audio quality.

Comfort providing high-quality voice recordings and feedback on audio samples in multiple languages.

Commitment to developing AI that masters sophisticated multilingual audio capabilities.

Deep understanding and taste of what good/useful Audio data is.

Experience working with speech/audio datasets, annotation workflows, or AI training data, including knowledge/experience with training voice models, and an understanding of how data quality impacts model performance.

Compensation & Benefits

US-based candidates: $35/hour - $45/hour

depending on factors including relevant experience, skills, education, geographic location, and qualifications. International candidates: Information will be provided to you during the recruitment process.

Benefits vary based on employment type, location, and jurisdiction. Benefits for eligible U.S.-based positions include health insurance, 401(k) plan, and paid sick leave. Specific details and role-specific information will be provided to you during the interview process.

xAI is an equal opportunity employer. For details on data processing, view our

Recruitment Privacy Notice

.

Responsibilities

As an AI Tutor specialized in multilingual audio capabilities, you will contribute to xAI's mission by training and refining Grok to excel in voice interactions, speech recognition, and auditory experiences across diverse languages, accents, and cultural contexts.

Your work will focus on curating and annotating high-quality audio data to enhance Grok's global accessibility, enabling natural spoken interactions for users worldwide, bridging language barriers through accurate speech processing, and improving the AI's handling of multilingual audio nuances.

Use proprietary software to provide labels, annotations, recordings, and inputs on projects involving multilingual audio clips, voice recordings, speech samples, and auditory elements in various languages.

Support the delivery of high-quality curated audio data that ensures clear, natural spoken output, accurate representation of linguistic and prosodic details (such as intonation, rhythm, and accent), and professional audio standards.

Collaborate with technical staff to develop tasks that improve AI's ability to handle speech modulation, accent variation, noise in real-world recordings, and multilingual audio processing.