About the job
About xAI
At xAI, our mission is to develop advanced AI technologies capable of comprehending the complexities of the universe while assisting humanity in its quest for knowledge. Our dedicated team thrives in a dynamic and collaborative environment, focused on engineering excellence. We seek individuals who are driven by curiosity and relish challenges. Our flat organizational structure fosters hands-on contributions, where leadership is earned through initiative and outstanding performance. Strong communication skills are essential, enabling team members to effectively share insights.
ABOUT THE ROLE:
You will be an integral part of our multimodal team, working towards achieving superhuman multimodal intelligence. Your responsibilities will encompass the understanding and generation of diverse modalities—image, video, audio, and text—across the entire spectrum, from data acquisition and tokenizer training to large-scale pre-training, infrastructure scaling, tooling, and delivering end-to-end product experiences.
Collaboration is key, as you will work cross-functionally with various teams to advance capabilities in multimodal reasoning, world modeling, tool utilization, and interactive human-AI collaboration. Your contributions will help build models that can perceive, comprehend, and engage with the world in real-time, achieving unprecedented levels of performance.

