Technical Staff Member - Voice Model Development

xAI, Palo Alto, CA
On-site, full-time, $150K/yr - $450K/yr

Experience Level

Mid to Senior

Qualifications

Design and implement large-scale speech data curation and processing pipelines, including the collection of diverse real-world audio, synthetic data generation, and automated annotation workflows to enable high-quality model training and evaluation.

Engage in the pre-training and post-training of speech-language models, applying targeted enhancements through supervised fine-tuning, reinforcement learning, and other techniques to ensure Grok Voice responses are accurate, factually grounded, idiomatic in spoken style, conversational in tone, and fluent across languages.

Develop and refine a comprehensive evaluation framework that includes objective metrics (accuracy, quality, latency, expressiveness), human preference studies, content factuality assessments, real-time interaction quality, and experimentation infrastructure to measure and enhance performance.

About the job

About xAI

At xAI, our vision is to develop AI systems that deeply comprehend the universe and assist humanity in its quest for understanding. Our team is a close-knit, highly driven group committed to engineering excellence. We welcome individuals who relish challenges and thrive on curiosity. Operating within a flat organizational structure, we expect all employees to be hands-on contributors to our mission. Proactive leadership is recognized, and a strong work ethic combined with exceptional prioritization skills is essential. Effective communication is crucial, as employees must be able to share knowledge clearly and precisely with colleagues.

ROLE OVERVIEW:

Join our Grok Voice Model team to engineer the leading voice AI technology. We aim to facilitate seamless, natural, low-latency spoken interactions that are expressive, multilingual, and reliable across devices and real-time applications. We manage the entire training pipeline, encompassing extensive data curation, high-quality audio processing, cutting-edge speech-language pre-training, and rigorous post-training to maximize quality, speed, and stability.

Our aspiration is to make conversing with AI feel like engaging with the most charming, knowledgeable, and kind individual imaginable. We are in search of exceptionally intelligent, execution-focused engineers to help us achieve this goal.

About xAI

xAI is dedicated to creating advanced AI technologies that enhance human understanding of the universe. Our culture fosters innovation, collaboration, and a commitment to excellence, making us an ideal environment for those eager to push boundaries and drive progress in AI development.
