Scale AI, Inc.San Francisco, CA; Seattle, WA; New York, NY
On-site Full-time $218.4K/yr - $273K/yr
Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Experience
Qualifications
Key Responsibilities:Design, profile, and enhance our training and inference framework. Work collaboratively with ML teams to expedite their research and development processes, empowering them to create next-generation models and data curation strategies. Investigate and incorporate cutting-edge technologies to refine our ML system. Preferred Qualifications:A strong enthusiasm for system optimization. Hands-on experience with multi-node LLM training and inference. Proven experience in developing large-scale distributed ML systems. Robust software engineering capabilities, with proficiency in frameworks and tools like CUDA, PyTorch, Transformers, Flash Attention, etc. Excellent written and verbal communication skills with the ability to thrive in a cross-functional team environment. Desirable Skills:Demonstrated expertise in post-training methodologies and/or innovative use cases for large language models, including instruction tuning, RLHF (Reinforcement Learning from Human Feedback), tool usage, reasoning, agents, and multimodal applications.
About the job
Join Scale AI's ML platform team (RLXF) as a Machine Learning Research Engineer, where you will play a pivotal role in developing our advanced distributed framework for training and inference of large language models. This platform is vital for enabling machine learning engineers, researchers, data scientists, and operators to conduct rapid and automated training, as well as evaluation of LLMs and data quality.
At Scale, we occupy a unique position in the AI landscape, serving as an essential provider of training and evaluation data along with comprehensive solutions for the entire ML lifecycle. You will collaborate closely with Scale's ML teams and researchers to enhance the foundational platform that underpins our ML research and development initiatives. Your contributions will be crucial in optimizing the platform to support the next generation of LLM training, inference, and data curation.
If you are passionate about driving the future of AI through groundbreaking innovations, we want to hear from you!
About Scale AI, Inc.
Scale AI is a leader in the AI sector, providing indispensable training and evaluation data as well as comprehensive end-to-end solutions for the machine learning lifecycle. Our platform empowers researchers and engineers to push the boundaries of AI technology.
Similar jobs
1 - 20 of 7,266 Jobs
Search for Senior Or Staff Machine Learning Systems Engineer Llms
Full-time|$200K/yr - $240K/yr|On-site|San Francisco, CA
Join Us in Building a Safer World.At TRM Labs, we specialize in blockchain analytics and AI solutions aimed at assisting law enforcement, national security agencies, financial institutions, and cryptocurrency businesses in identifying, investigating, and preventing crypto-related fraud and financial crime. Our innovative platforms leverage blockchain intelligence and AI technology to trace funds, detect illicit activity, and construct comprehensive threat profiles. Trusted by leading organizations worldwide, TRM Labs is committed to enabling a safer and more secure environment for all.Our AI Engineering Team is dedicated to pioneering next-generation AI applications, particularly in the realm of Large Language Models (LLMs) and agentic systems. Our goal is to develop resilient pipelines and high-performance infrastructure that facilitate the swift, safe, and scalable deployment of AI systems.We manage extensive petabyte-scale pipelines, ensuring model serving with millisecond latency while providing the necessary observability and governance to make AI production-ready. Our team actively evaluates and integrates leading-edge tools in the LLM and agent space, including open-source stacks, vector databases, evaluation frameworks, and orchestration tools to accelerate TRM’s innovation pace.As a Senior or Staff ML Systems Engineer – LLM, you will play a pivotal role in constructing and scaling our technical infrastructure for AI/ML systems. Your responsibilities will include:Creating reusable CI/CD workflows for model training, evaluation, and deployment, integrating tools such as Langfuse, GitHub Actions, and experiment tracking.Automating model versioning, approval processes, and compliance checks across various environments.Developing a modular and scalable AI infrastructure stack that encompasses vector databases, feature stores, model registries, and observability tools.Collaborating with engineering and data science teams to embed AI models and agents into real-time applications and workflows.Continuously assessing and incorporating state-of-the-art AI tools (e.g., LangChain, LlamaIndex, vLLM, MLflow, BentoML).Promoting AI reliability and governance while enabling experimentation, ensuring compliance, security, and continuous uptime.Enhancing AI/ML Model Performance and ensuring data accuracy and consistency, leading to improved model training and inference.Implementing infrastructure to facilitate both offline and online evaluation of LLMs and agents.
Join gleanwork as a Machine Learning Engineer specializing in LLM evaluations and observability. In this role, you will be instrumental in developing cutting-edge machine learning systems that enhance our understanding and effectiveness of language learning models. You will collaborate with cross-functional teams to drive the integration of advanced analytics and machine learning solutions.
Bland Inc. seeks a Machine Learning Researcher specializing in Multimodal Large Language Models (LLMs) to join the team in San Francisco. The focus is on advancing AI systems that integrate language with other types of data. Role overview This position centers on research and development aimed at improving how AI models process and understand information from multiple sources, such as text combined with images or other modalities. What you will do Investigate how language interacts with additional data types within multimodal LLMs Create and evaluate new methods to enhance AI model performance Work closely with colleagues on projects designed to push the boundaries of machine learning Location This role is based in San Francisco.
Full-time|Hybrid|Mountain View, CA USA; San Francisco, CA USA;
Join Waymo as a Senior Machine Learning Engineer focusing on Perception LLM/VLM. In this role, you will leverage cutting-edge machine learning techniques to enhance our autonomous driving technology. You will collaborate with a talented team of engineers and researchers to develop algorithms that improve our perception systems, ensuring safety and efficiency on the road.
Full-time|$280K/yr - $380K/yr|On-site|San Francisco, CA; Seattle, WA; New York, NY
At Scale AI, we are the premier partner for data and evaluation in the rapidly evolving field of artificial intelligence. Our commitment to advancing the assessment and benchmarking of large language models (LLMs) positions us at the forefront of AI innovation. We are dedicated to creating leading-edge LLM evaluation methodologies that set new benchmarks for model performance. Our research teams collaborate with the top AI laboratories in the industry to provide high-quality data, accelerate progress in generative AI research, and inform what excellence looks like in this domain. As a Staff Machine Learning Research Scientist on our LLM Evals team, you will spearhead the creation of novel evaluation methodologies, metrics, and benchmarks to assess the strengths and weaknesses of cutting-edge LLMs. Your work will shape our internal strategies and influence the broader AI research community, making this role essential for establishing best practices in data-driven AI development.
Full-time|$275K/yr - $350K/yr|On-site|San Francisco, CA; Seattle, WA; New York, NY
About Scale AI At Scale AI, we are dedicated to propelling the advancement of AI applications. Over the past eight years, we have established ourselves as the premier AI data foundry, supporting groundbreaking innovations in fields such as generative AI, defense technologies, and autonomous vehicles. Following our recent Series F funding round, we are intensifying our efforts to harness frontier data, paving the way toward achieving Artificial General Intelligence (AGI). Our work with enterprise clients and governments has enhanced our model evaluation capabilities, allowing us to expand our offerings for both public and private evaluations. About the ACE Team The Agent Capabilities & Environments (ACE) team, a vital part of Scale’s Research organization, unites customer-focused Researchers and Applied AI Engineers. Our primary mission is to conduct research on agent environments and reinforcement learning reward signals, benchmark autonomous agent performance in real-world contexts, and develop robust data programs aimed at enhancing the capabilities of Large Language Models (LLMs). We are committed to creating foundational tools and frameworks for evaluating models as agents, focusing on autonomous agents that interact dynamically with a wide range of external environments, including code repositories and GUI interfaces. About This Role This position sits at the cutting edge of AI research and its practical applications, concentrating on the data types necessary for the development of state-of-the-art agents, including browser and software engineering agents. The ideal candidate will investigate the data landscape required to propel intelligent and adaptable AI agents, steering the data strategy at Scale to foster innovation. This role demands not only expertise in LLM agents and planning algorithms but also creative problem-solving skills to tackle novel challenges pertaining to data, interaction, and evaluation. You will contribute to influential research publications on agents, collaborate with customer researchers, and partner with the engineering team to transform these advancements into scalable real-world solutions.
Airbnb began in 2007 with two hosts and three guests in San Francisco. Since then, the platform has grown to over 5 million hosts and more than 2 billion guests worldwide. Airbnb connects people with unique places to stay and experiences, building authentic community connections across nearly every country. The team: Growth Platform Engineering The Growth Platform team focuses on driving sustainable, long-term growth for Airbnb. The team’s mission centers on building an agentic system and supporting capabilities to help all Airbnb offerings grow, both now and in the future. Efforts include delivering personalized and relevant content and product experiences to users, both on and off the Airbnb platform. The team is working toward a future where AI identifies opportunities, creates campaigns, personalizes experiences, and optimizes outcomes with minimal human input. This journey moves through a maturity curve: AI-assisted, agentic, and ultimately autonomous systems, always with human oversight to ensure brand safety, quality, and compliance. Growth Platform Engineering is tightly integrated with the Airbnb product, enhancing the customer journey and enabling new ways for users to engage. The platform supports a range of digital marketing channels, landing pages, email, push notifications, SMS, and digital advertising, as well as the machine learning and data infrastructure that powers these efforts. What you will do Develop AI-driven solutions to shape the future of Airbnb’s agentic growth platform, using the latest AI methodologies. Lead and mentor engineers through brainstorming, design, and implementation of AI products and features, from initial concept to deployment. Work at the intersection of technical depth, architectural innovation, and mentorship as a Senior Staff Engineer. Collaborate with cross-functional teams to build scalable systems that operate globally. Help evolve the foundational elements of Airbnb’s AI-powered growth systems.
At Gimlet Labs, we are pioneering the development of the first heterogeneous neocloud designed specifically for AI workloads. As the demand for AI systems surges, traditional homogeneous infrastructures face critical limits in power, capacity, and cost. Our innovative platform effectively decouples AI workloads from their hardware foundations, intelligently partitioning tasks and orchestrating them to the most suitable hardware for optimal performance and efficiency. This strategy fosters heterogeneous systems that span multiple vendors and generations, including cutting-edge accelerators, enabling significant enhancements in performance and cost-effectiveness at scale.In addition to this foundational work, Gimlet is establishing a robust neocloud for agentic workloads. Our clients benefit from deploying and managing their workloads via stable, production-ready APIs, without the need to navigate hardware selection or performance optimization intricacies.We collaborate with foundation labs, hyperscalers, and AI-native companies to drive real production workloads capable of scaling to gigawatt-class AI datacenters.We are currently seeking a Member of Technical Staff specializing in ML systems and inference. In this pivotal role, you will be responsible for designing and constructing inference systems that facilitate the execution of complete models in real production environments. You will operate at the intersection of model architecture and system performance to ensure that inference processes are swift, predictable, and scalable.This position is perfect for engineers with a deep understanding of modern model execution and a passion for optimizing latency, throughput, and memory utilization across the entire inference lifecycle.
Full-time|Remote|San Francisco, CA or remote within the U.S.
At Philo, we are a dedicated team of technology and product enthusiasts committed to reshaping the television landscape. We blend cutting-edge technology with the captivating medium of television to create the ultimate viewing experience. Our mission is to enhance streaming capabilities through innovative cloud delivery and sophisticated machine learning algorithms that personalize content discovery. As a Senior Machine Learning Engineer specializing in Recommendation Systems, you will be at the forefront of our content personalization initiatives, significantly enhancing user engagement and satisfaction. Your expertise will help ensure that every time users open the Philo app, they find something they want to watch. In this pivotal role, you will spearhead the development of advanced algorithms and large-scale systems that drive Philo's recommendation engine. Collaborating closely with data science, product, infrastructure, and backend engineering teams, you will tackle complex machine learning challenges and develop innovative, data-driven solutions that enhance content discovery and foster user retention.
About KreaKrea is at the forefront of developing advanced AI creative tools designed to enhance and empower human creativity. Our mission is to create intuitive and controllable AI solutions that allow creatives to express themselves across various formats including text, images, video, sound, and 3D.About the PositionWe are seeking a talented Machine Learning Engineer to lead the design and implementation of Krea’s personalization and recommendation systems from the ground up. You will take full ownership of how we comprehend user preferences, curate engaging content, and customize generative models to reflect individual aesthetics.This role sits at the exciting intersection of recommendation systems, representation learning, and generative imaging and video technologies.Your ResponsibilitiesLead the architecture and development of Krea’s personalization and recommendation framework, overseeing the technical direction from inception to deployment.Craft algorithms that effectively model user preferences and tastes, enabling our systems to adapt to individual styles and aesthetics.Develop high-quality, curated feeds that strike a balance between exploration, personalization, and aesthetic coherence.Collaborate closely with our model and research teams to co-create personalization mechanisms that shape how our generative models learn, adapt, and express creative styles.Contribute to research in personalized image generation, with a focus on style, taste, and subjective quality.Work in tandem with product, design, and research teams to define what “good personalization” means in a creative context.Take systems from initial research and prototyping stages through to production, ongoing iteration, and enhancement.
Full-time|$210K/yr - $260K/yr|Hybrid|San Francisco, CA, Washington, D.C., New York City, N.Y., Denver, CO
We are looking for a talented individual who is local to any of our offices (Silver Spring, NYC, SF, Miami, Denver) and is eager to work at least 1-2 times per week from one of these locations.ABOUT ROCKET MONEY At Rocket Money, our mission is to empower individuals to take control of their financial lives. We provide our members with unparalleled insights into their finances and a suite of services that save them both time and money, enabling them to achieve their financial goals.ABOUT THE TEAM As Machine Learning Engineers at Rocket Money, we play a vital role in enhancing customer engagement with our diverse range of financial products. Our responsibilities include transaction enrichment, personalization, and creating cross-functional tools that bolster various AI initiatives. Collaborating closely with product teams, we develop features that aid customers in understanding, tracking, and improving their personal finances. We value team players who excel in cross-team collaboration, can align strategy with ML and AI-driven user experiences, deliver scalable and high-quality user experiences, and are mindful of the impact our products have on end users. At the Staff level, you will be expected to cultivate broad expertise in our products and the ML solutions that enhance them, while driving technical advancements within the team.ABOUT THE ROLE As a Staff Machine Learning Engineer, you will spearhead our ML and AI product development efforts, utilizing your expertise to design, implement, and maintain sophisticated ML systems that elevate our product experiences. Your responsibilities will include:Leading the architecture and development of advanced AI and ML features across Rocket Money's product suite, proactively identifying and addressing technical challenges.Designing and maintaining robust evaluation frameworks to ensure continual improvement of ML/AI systems and facilitating similar initiatives among others.Creating innovative product experiences that leverage our unique dataset and scalability, guiding others in delivering impactful results through effective technical leadership and collaboration with product teams.Overseeing the end-to-end development and implementation of ML and AI product features in partnership with cross-functional product teams, emphasizing thorough technical critique and clear communication of business impacts.Providing technical mentorship to foster an environment of high-impact contributions from all team members.
Full-time|$137.1K/yr - $246.8K/yr|Hybrid|San Francisco, CA; Sunnyvale, CA
Join us in creating the most dependable on-demand logistics engine for last-mile retail delivery! We are on the lookout for a seasoned machine learning engineer to aid in the development of cutting-edge growth and personalization models that will elevate DoorDash's expanding retail and grocery services.About the RoleWe are seeking a dedicated Applied Machine Learning expert to become part of our innovative team. As a Staff Machine Learning Engineer, you will conceptualize, design, implement, and validate algorithmic enhancements that enrich the growth and personalization experiences central to our rapidly evolving grocery and retail delivery business. Leveraging our advanced data and machine learning infrastructure, you will implement novel ML solutions to enhance the consumer search experience, making it more relevant, seamless, and enjoyable across grocery, convenience, and various retail sectors. A strong command of production-level machine learning and proven experience in addressing end-user challenges while collaborating effectively with multidisciplinary teams is essential.This position will report to the engineering manager on our Personalization team and is expected to be hybrid, combining both in-office and remote work (#LI-Hybrid).
Full-time|$250.8K/yr - $318.6K/yr|Hybrid|Mountain View, CA, USA; San Francisco, CA, USA; New York City, NY, USA
About Waymo Waymo develops autonomous driving technology with the goal of making transportation safer and more accessible. Originally launched in 2009 as the Google Self-Driving Car Project, Waymo has logged over 100 million autonomous miles and completed more than ten million rider-only trips. The company focuses on building the Waymo Driver to support safe, efficient travel across a range of vehicle platforms. Team Overview: Prediction & Planning - System Architecture The Predictive Planning (PrePlan) team builds machine learning systems that help the Waymo Driver anticipate possible scenarios and plan actions accordingly. By transforming large-scale driving data into robust deep neural networks, the team enables vehicles to navigate complex, real-world environments with confidence. The system architecture group ensures these models work smoothly within Waymo's technology stack. This includes attention to kinematics, interfaces, and data representations. The team works across projects involving data pipelines, model integration, evaluation metrics, and the planner systems that guide autonomous vehicles. Role Details Title: Staff Machine Learning Engineer, Prediction & Planning - System Architecture Location: Mountain View, CA; San Francisco, CA; New York City, NY (hybrid) Reports to: Technical Lead Manager
Full-time|$268K/yr - $368.5K/yr|On-site|San Francisco, CA
About FaireFaire is a transformative online wholesale marketplace, driven by the conviction that local businesses are the future. Independent retailers around the globe generate more revenue than massive corporations like Walmart and Amazon combined, yet individually, they remain small. At Faire, we harness technology, data, and machine learning to connect this vibrant community of entrepreneurs. Think of your favorite local boutique — we empower them to discover and sell the best products from around the world. With our innovative tools and insights, we aim to level the playing field, enabling small businesses to thrive against larger competitors.By championing the growth of independent businesses, Faire positively impacts local economies on a global scale. We’re in search of intelligent, resourceful, and passionate individuals to join us in fueling the shop local movement. If you value community, we invite you to be part of ours.About this RoleAs the Senior Staff Machine Learning Platform Engineer, you will spearhead the technical vision and evolution of Faire's ML platform. You will establish standards, influence organization-wide architecture, and lead intricate, cross-functional initiatives that enhance data science velocity at scale. This position is crucial for adapting ML workflows to leverage modern AI productivity tools. You will not only develop models but also design the systems that enable those models to empower tens of thousands of small retailers in competing and growing their local businesses.
Join Decagon as a Staff Software Engineer specializing in Machine Learning Infrastructure. In this role, you will play a crucial part in enhancing and optimizing our machine learning systems. You will collaborate with a talented team of engineers to build scalable and efficient infrastructure that supports our AI-driven initiatives.As a key contributor, you will leverage your expertise in software engineering and machine learning to solve complex challenges and drive innovation. Your work will impact various projects and help shape the future of our technology.
Full-time|$218.4K/yr - $273K/yr|On-site|San Francisco, CA; Seattle, WA; New York, NY
Join Scale AI's ML platform team (RLXF) as a Machine Learning Research Engineer, where you will play a pivotal role in developing our advanced distributed framework for training and inference of large language models. This platform is vital for enabling machine learning engineers, researchers, data scientists, and operators to conduct rapid and automated training, as well as evaluation of LLMs and data quality.At Scale, we occupy a unique position in the AI landscape, serving as an essential provider of training and evaluation data along with comprehensive solutions for the entire ML lifecycle. You will collaborate closely with Scale's ML teams and researchers to enhance the foundational platform that underpins our ML research and development initiatives. Your contributions will be crucial in optimizing the platform to support the next generation of LLM training, inference, and data curation.If you are passionate about driving the future of AI through groundbreaking innovations, we want to hear from you!
Full-time|$224K/yr - $308K/yr|On-site|San Francisco, CA
About FaireAt Faire, we are revolutionizing the wholesale marketplace with an unwavering commitment to local communities. Our platform empowers independent retailers globally, enabling them to thrive against larger competitors like Walmart and Amazon. By leveraging cutting-edge technology, data insights, and machine learning, we connect these vibrant entrepreneurs with the best products from around the world. We believe that with the right tools, small businesses can elevate their potential and compete on a grand scale.By nurturing independent businesses, Faire is making a significant positive impact on local economies worldwide. We are in search of intelligent, resourceful, and passionate individuals to join our mission of championing local commerce. If you resonate with our community-driven values, we'd love to welcome you to our team.About this roleAs a Staff Machine Learning Platform Engineer, you will play a pivotal role in shaping, enhancing, and managing a scalable machine learning platform designed to expedite model training, deployment, and governance. You will serve as the vital technical link between our data science and production engineering teams. Joining a small but integral team, you will amplify Faire’s capabilities to support tens of thousands of local businesses in an increasingly competitive retail landscape.
On-site|On-site|San Francisco, CA | New York City, NY | Seattle, WA
Join Anthropic as a Machine Learning Systems Engineer within our Encodings and Tokenization team, where you'll play a pivotal role in refining and optimizing our tokenization systems across Pretraining and Finetuning workflows. By bridging the gap between our Pretraining and Finetuning teams, you will help shape the essential infrastructure that enhances how our AI models learn from diverse data. Your contributions will be crucial in ensuring our AI systems remain reliable, interpretable, and steerable, driving forward our mission of developing beneficial AI technologies.
Join Hive as a Senior Machine Learning Engineer and help shape the future of AI! We are seeking passionate individuals who excel at developing and deploying cutting-edge deep learning models. In this role, you will work with large-scale datasets to create innovative machine learning solutions, collaborating closely with a talented team of engineers to push the boundaries of artificial intelligence. Ideal candidates will have a proven track record of building and scaling machine learning projects from conception to production, along with a strong commitment to continuous learning and personal ownership in their work.
About UsAt Applied Compute, we specialize in creating Specific Intelligence solutions for enterprises, developing agents that learn continuously from an organization’s processes, data, expertise, and objectives. We recognize a significant gap between the capabilities of AI models in isolation and their practical applications in real-world business contexts. Our systems often fall short because they lack adaptability to feedback. To address this, we are building a continual learning infrastructure that captures context, memory, and decision-making processes throughout the enterprise, enabling specialized agents to effectively execute real tasks.What Excites Us: We operate at a unique intersection where our product team constructs the platform that fuels a new generation of digital coworkers. Our research team pushes the boundaries of post-training and reinforcement learning, creating innovative product experiences. Our applied research engineers collaborate closely with clients to deploy models into production. This blend of strong product focus, deep research, and hands-on customer engagement is crucial for integrating AI into the enterprise. We are product-driven, research-informed, and actively engaged with our clients.Our Team: Our diverse team consists of engineers, researchers, and operators, many of whom are former founders. We have built RL infrastructure at leading organizations like OpenAI and Scale AI, and developed systems at Together, Two Sigma, and Watershed. We proudly serve Fortune 50 clients alongside companies like DoorDash, Mercor, and Cognition. Our work is supported by renowned investors, including Benchmark, Sequoia, and Lux.Who Thrives in Our Environment: We seek individuals eager to apply cutting-edge research and complex systems to tackle real-world challenges. You should be adept at quickly adapting to new environments, whether it’s a fresh codebase, a client’s data architecture, or an unfamiliar problem domain. A genuine enjoyment of customer interactions—listening, empathizing, and understanding how tasks are accomplished within their organizations—is essential. Those with entrepreneurial backgrounds, extensive side projects, or demonstrated end-to-end ownership typically excel in our company.
Oct 29, 2025
Sign in to browse more jobs
Create account — see all 7,266 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.