companyInflection AI logo

Model Training Engineer at Inflection AI | Palo Alto, CA

Inflection AIPalo Alto, CA
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

This role is ideal for you if you:Possess hands-on experience in training and fine-tuning large transformer models on multi-GPU/multi-node clusters. Are proficient in PyTorch and its ecosystem tools (Torchtune, FSDP, DeepSpeed) and enjoy exploring distributed training internals, mixed precision, and memory efficiency techniques. Have delivered or published work involving RLHF, DPO, GRPO, or RLAIF, with a solid understanding of their practical trade-offs. Value training tools, pipelines, and reproducibility; you automate mundane tasks to focus on innovative aspects. Strike a balance between research curiosity and product pragmatism, knowing when to conduct an ablation study versus when to ship. Communicate effectively with both technical and non-technical team members. Hold a bachelor’s degree or equivalent in a relevant field.

About the job

At Inflection AI, we are dedicated to leveraging the transformative capabilities of artificial intelligence to enhance human well-being and productivity.

The future of AI will be characterized by agents we can trust to act on our behalf.

We are at the forefront of this evolution with our human-centric AI models that integrate emotional intelligence (EQ) with cognitive intelligence (IQ), shifting interactions from mere transactions to meaningful relationships, thereby generating lasting value for individuals and organizations alike.

Our initiatives manifest in two primary forms:

Pi, your personal AI, designed to be a compassionate companion that enriches everyday life through practical support and insights.

Platform — large language models (LLMs) and APIs that empower developers, agents, and enterprises to infuse Pi-level emotional intelligence into experiences where empathy and understanding are crucial.

We are building towards a future of AI agents that foster trust, enhance understanding, and create aligned, long-term value for everyone.

About the Role

As a Model Training Engineer, you will be responsible for designing, building, and scaling post-training pipelines that transform general LLMs into brand-fluent, production-ready assistants. Your innovations in fine-tuning and preference optimization techniques (RLHF, DPO, GRPO, RLAIF) will significantly enhance reliability, alignment, and cost-effectiveness.

About Inflection AI

Inflection AI is committed to advancing the public benefit by harnessing AI to enhance human well-being and productivity. We are developing innovative AI models that integrate emotional and cognitive intelligence, creating transformative experiences for users.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.