Position has been filled

Technical Staff Member - Advanced Machine Learning Optimization

Moonlake, San Mateo

On-site Full-time

Qualifications

Ideal candidates should possess strong expertise in machine learning frameworks, experience with high-performance computing, and proficiency in optimizing algorithms for large-scale data processing. A solid understanding of GPU architectures and programming is crucial.

About the role

Join Moonlake, a pioneering company harnessing AI to develop immersive world simulations.

Role Overview

Enhancing Training Efficiency

Implement efficient data loaders, operator fusion, and activation rematerialization (gradient checkpointing).
Scale training with FSDP/ZeRO sharding, tensor and pipeline parallelism, and NCCL tuning.

Improving GPU and Kernel Performance

Conduct Nsight profiling, develop Triton/CUDA kernels, and create fused operations.
Implement FlashAttention-style attention kernels, sequence packing, and KV-cache optimizations.

Optimizing Inference

Focus on low-latency serving, continuous batching, and speculative decoding strategies.
Apply quantization methods (GPTQ/AWQ), distillation, and pruning techniques.

Infrastructure and Reliability

Manage SLURM/Kubernetes multi-node jobs and ensure checkpoint hygiene.
Maintain determinism and environment pinning, and handle GPU failures gracefully.

Our dedicated team thrives on collaboration in our San Mateo office.

About Moonlake

Moonlake is at the forefront of AI technology, specializing in creating captivating world simulations that push the boundaries of imagination and interactivity.
