About the job
About Applied Intuition
We are an in-office company and expect our employees to work primarily from their Applied Intuition office five days a week. That said, we understand the importance of flexibility and trust our employees to manage their schedules responsibly. This may include occasional remote work, starting the day with morning meetings from home before heading to the office, or leaving earlier when needed to accommodate family commitments.
About the Role
We are in search of a skilled software engineer with extensive experience in optimizing machine learning models and deploying them in production-grade embedded runtime environments. Your expertise will span the entire ML framework stack, including PyTorch, JAX, ONNX, TensorRT, CUDA, XLA, and Triton.
At Applied Intuition, You Will:
- Lead ML performance optimization across various technologies for both on-road and off-road ADAS/AD stacks aimed at deployment on a range of embedded computing platforms.
- Devise compute usage strategies that improve efficiency and minimize model inference latency on the compute boards chosen by our customers.
- Engage in model pruning and quantization, ensuring successful deployment on memory-constrained platforms.
- Collaborate closely with ML engineers and software developers to identify efficient model architectures and optimize them.
- Establish methodologies to...

