About the job
Join the innovative team at Moonlake, where we harness the power of AI to create real-time interactive content.
Mission: Improve throughput, reduce latency, and cut serving costs — shipping our models 2–10× faster and cheaper without compromising quality.
Scope of Work:
- GPU Performance: Expertise in CUDA/Triton kernels, FlashAttention family, paged attention, and CUDA Graphs.
- Serving Stack: Proficiency with TensorRT-LLM/Triton Inference Server, vLLM/TGI; continuous batching; on-GPU KV-cache reuse; speculative decoding (e.g., Medusa); and mixture-of-agents routing.
- Parallelism: Experience with FSDP/ZeRO, tensor/pipeline/expert parallelism; NCCL tuning.
- Quantization/PEFT: Familiarity with AWQ/GPTQ/FP8; LoRA/DoRA serving.
- Systems: Knowledge of Ray/Kubernetes/Argo; observability tooling (Prometheus/Grafana/OpenTelemetry); autoscaling; A/B infrastructure; canary deployments with rollback.
Tech Signals:
Ideal candidates will have previous experience at infrastructure-heavy startups such as Databricks or Roblox.
This is an on-site, in-person role; the team is based in San Mateo.

