About xAI
xAI is focused on building advanced AI systems capable of understanding complex problems and supporting humanity’s search for knowledge. The team values curiosity, hands-on problem solving, and strong communication. Leadership comes from initiative and results, not hierarchy. Team members share insights openly and work closely together.
Role Overview: Technical Staff Member - Multimodal Intelligence
This position sits within the multimodal team at xAI in Palo Alto, CA. The goal is to push the boundaries of multimodal intelligence by building systems that understand and generate image, video, audio, and text data.
What You Will Do
- Work on every stage of the multimodal pipeline, including data acquisition, tokenizer training, large-scale pre-training, infrastructure scaling, and tooling.
- Develop and deliver end-to-end product experiences that showcase advanced multimodal capabilities.
- Collaborate with teams across xAI to advance multimodal reasoning, world modeling, tool use, and interactive human-AI collaboration.
- Help build models that perceive, understand, and interact with the world in real time.
Team Culture
- Flat structure: leadership is earned by initiative and performance.
- Open communication and collaboration are essential.
- Curiosity and a drive to tackle tough challenges are highly valued.
