About the job
Unity Technologies is advancing mobile gaming with AI-driven features. The Principal Machine Learning Engineer will focus on deploying advanced AI models, such as transformers and diffusion networks, directly onto mobile devices. This position shapes how Unity brings state-of-the-art multi-modal models from research into real-world mobile applications.
What You Will Do
- Technical Leadership: Set the vision for deploying multi-modal AI models on iOS and Android, drawing on deep experience with transformers, diffusion models, and generative architectures.
- Make key decisions on model optimization strategies, including compression, quantization, and knowledge distillation, to address mobile device constraints.
- Assess and select inference runtimes (such as Core ML, ONNX Runtime Mobile, and TFLite) to improve team capabilities and deployment outcomes.
- Oversee the entire optimization pipeline, from model export through hardware-specific kernel tuning across different processing units.
- Architecture & Research Translation: Work closely with research scientists to convert innovative model architectures into operational, mobile-optimized systems.
- Design scalable systems capable of processing varied inputs (images, text, and metadata) while ensuring real-time output performance.
- Develop new approaches for dynamic resolution and token reduction tailored for mobile environments.
- Monitor and incorporate advancements in efficient AI technologies to keep Unity’s mobile AI stack current.
- Team Leadership & Mentorship: Guide and mentor machine learning engineers, establishing best practices for on-device performance evaluation.
- Collaborate with cross-functional teams to ensure AI capabilities align with product roadmaps and device requirements.
- Promote a culture centered on performance measurement by defining and tracking key metrics for efficiency and accuracy.
Location
Mountain View, CA, USA

