companyApplied Intuition, Inc. logo

Machine Learning Runtime Optimization Engineer

Applied Intuition, Inc.Sunnyvale, California, United States
On-site Full-time $159.1K/yr - $199.3K/yr

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

The ideal candidate should possess a strong background in software engineering, particularly in optimizing machine learning models for embedded systems. Proficient knowledge of ML frameworks such as PyTorch, JAX, and TensorRT is essential. Experience with model pruning, quantization, and deployment in memory-constrained environments is highly beneficial.

About the job

About Applied Intuition

Applied Intuition, Inc. is at the forefront of advancing physical AI technology. Established in 2017 and currently valued at $15 billion, this Silicon Valley-based company is building the essential digital infrastructure to infuse intelligence into every moving machine worldwide. We cater to industries such as automotive, defense, trucking, construction, mining, and agriculture through three primary sectors: tools and infrastructure, operating systems, and autonomy. Our solutions are trusted by 18 of the top 20 global automakers, along with the United States military and its allies, to deliver exceptional physical intelligence. Our headquarters is located in Sunnyvale, California, with additional offices across Washington, D. C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo. Discover more at applied.co.

We are an in-office company, expecting our employees to primarily work from their Applied Intuition office five days a week. We understand the importance of flexibility and trust our employees to manage their schedules responsibly. This may include occasional remote work, starting the day with morning meetings from home before heading to the office, or leaving earlier when needed to accommodate family commitments.

About the Role

We are in search of a skilled software engineer with extensive experience in optimizing machine learning models and deploying them in production-grade embedded runtime environments. Your expertise will span the entire ML framework stack, including PyTorch, JAX, ONNX, TensorRT, CUDA, XLA, and Triton.

At Applied Intuition, You Will:

  • Lead ML performance optimization across various technologies for both on-road and off-road ADAS/AD stacks aimed at deployment on a range of embedded computing platforms.
  • Devise compute usage strategies to enhance efficiency and minimize latency of model inference for compute boards chosen by our customers.
  • Engage in model pruning and quantization, ensuring successful deployment on memory-constrained platforms.
  • Collaborate closely with ML engineers and software developers to identify and optimize efficient model architecture solutions.
  • Establish methodologies to...

About Applied Intuition, Inc.

Applied Intuition is a pioneering technology firm focused on enhancing physical AI capabilities. With a valuation of $15 billion, we are dedicated to providing the digital infrastructure necessary for intelligent machinery across various industries, ensuring our solutions are reliable and impactful.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.