companyUnity Technologies logo

Principal Machine Learning Engineer - Mobile AI Inference Optimization

Unity TechnologiesMountain View, CA, USA
On-site Full-time $278.1K/yr - $347.6K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

QualificationsWe are looking for candidates with a robust background in machine learning, particularly in deploying models for mobile applications. You should have extensive experience in:Deep learning frameworks and multi-modal model architectures. Mobile hardware constraints and optimization techniques. Leading teams and mentoring junior engineers. Collaborating in cross-functional environments with product and engineering teams.

About the job

Unity Technologies is advancing mobile gaming with AI-driven features. The Principal Machine Learning Engineer will focus on deploying advanced AI models, such as transformers and diffusion networks, directly onto mobile devices. This position shapes how Unity brings state-of-the-art multi-modal models from research into real-world mobile applications.

What You Will Do

  • Technical Leadership: Set the vision for deploying multi-modal AI models on iOS and Android, drawing on deep experience with transformers, diffusion models, and generative architectures.
  • Make key decisions on model optimization strategies, including compression, quantization, and knowledge distillation to address mobile device constraints.
  • Assess and select inference runtimes (such as CoreML, ONNX Runtime Mobile, TFLite) to improve team capabilities and deployment outcomes.
  • Oversee the entire optimization pipeline, from model export through hardware-specific kernel tuning across different processing units.
  • Architecture & Research Translation: Work closely with research scientists to convert innovative model architectures into operational, mobile-optimized systems.
  • Design scalable systems capable of processing varied inputs, images, text, metadata, while ensuring real-time output performance.
  • Develop new approaches for dynamic resolution and token reduction tailored for mobile environments.
  • Monitor and incorporate advancements in efficient AI technologies to keep Unity’s mobile AI stack current.
  • Team Leadership & Mentorship: Guide and mentor machine learning engineers, establishing best practices for on-device performance evaluation.
  • Collaborate with cross-functional teams to ensure AI capabilities align with product roadmaps and device requirements.
  • Promote a culture centered on performance measurement, defining and tracking key metrics for efficiency and accuracy.

Location

Mountain View, CA, USA

About Unity Technologies

Unity Technologies is a leading platform for creating and operating interactive, real-time 3D content. We empower developers and creators worldwide to build remarkable experiences across a variety of platforms, including mobile devices.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.