About the job
Welcome to Moonlake, where we leverage AI to craft immersive world simulations.
Mission: Join us as an Applied AI Research Engineer focused on designing and coding intelligent agents (post-training and systems).
Scope of Work:
- Design agentic systems: Develop tool catalogs, function calling, program synthesis, repair loops, and control patterns and frameworks such as ReAct, Reflexion, Tree of Thoughts (ToT), and LangGraph, along with self-verification and sandboxed execution.
- Evaluation mindset: Create comprehensive task suites for multi-step coding that exercise the full LLM engineering stack: prompt libraries, routing, retrieval, KV-cache management, streaming, and telemetry.
- Security and isolation: Sandbox execution with Docker/firejail, enforce network egress controls, maintain secrets hygiene, and pin dependencies for supply-chain integrity.
- Post-training: Conduct supervised fine-tuning and preference-based and trace-based reinforcement learning (DPO/RLAIF/RLHF), along with dataset curation, reward shaping, and safety filtering.
Technical Signals:
- Shipped agents that pass real repository test suites end to end.
- Published research on agentic systems and code generation, or contributed to frameworks or open-source evaluations such as LangGraph, AutoGen, Guidance, LEAP, and SWE-bench variants.
- Built datasets from execution traces and demonstrated significant gains from better data rather than from more parameters.
We are committed to maintaining an on-site, collaborative team environment based in San Mateo.

