
Staff Software Engineer, Foundation Model Serving

Databricks · San Francisco, California
On-site · Full-time · $192K/yr - $260K/yr




Experience Level

Senior

Qualifications

What we look for:

  • 10+ years of experience building and operating large-scale distributed systems.
  • Experience with customer-facing APIs, edge gateways, ML inference, or similar services.
  • Strong interest in developing LLM APIs and runtimes at scale.

About the job

At Databricks, we are driven by our commitment to empower data teams in tackling the world's most challenging problems — from transforming transportation solutions to accelerating medical advancements. Our mission revolves around constructing and maintaining the world's premier data and AI infrastructure platform, enabling our clients to harness deep data insights for enhanced business outcomes.

Foundation Model Serving is the API product for hosting and serving advanced AI model inference, covering both open-source models (Llama, Qwen, GPT OSS) and proprietary models (Claude, OpenAI GPT). We welcome engineers who have experience operating high-scale systems — customer-facing APIs, edge gateways, or ML inference services — even without a background in ML or AI. A passion for developing LLM APIs and runtimes at scale is essential.

As a Staff Engineer, you will play a pivotal role in defining both the product experience and the underlying infrastructure. You will be tasked with designing and building systems that facilitate high-throughput, low-latency inference on GPU workloads with cutting-edge models. Your influence will extend to architectural direction, working closely with platform, product, infrastructure, and research teams to deliver an exceptional foundation model API product.

The impact you will have:

  • Design and implement core systems and APIs that drive Databricks Foundation Model Serving, ensuring scalability, reliability, and operational excellence.
  • Collaborate with product and engineering leaders to outline the technical roadmap and long-term architecture for workload serving.
  • Make architectural decisions to enhance performance, throughput, autoscaling, and operational efficiency for GPU serving workloads.
  • Contribute directly to critical components within the serving infrastructure, from systems like vLLM and SGLang to developing token-based rate limiters and optimizers, ensuring seamless and efficient operations at scale.
  • Work cross-functionally with product, platform, and research teams to transform customer requirements into dependable and high-performing systems.
  • Establish best practices for code quality, testing, and operational readiness while mentoring fellow engineers through design reviews and technical support.
  • Represent the team in inter-departmental technical discussions, influencing Databricks’ wider AI platform strategy.
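One of the components above, a token-based rate limiter, can be illustrated with a short sketch. This is a hypothetical token-bucket limiter metered in LLM tokens rather than requests, so a single large prompt consumes proportionally more budget than a small one; the class name and parameters are illustrative, not Databricks' actual implementation.

```python
import time


class TokenBucketLimiter:
    """Rate limiter whose unit of admission is LLM tokens, not requests.

    Hypothetical sketch: a classic token-bucket where the bucket refills
    at `tokens_per_second` and holds at most `burst_capacity` tokens.
    """

    def __init__(self, tokens_per_second: float, burst_capacity: float):
        self.rate = tokens_per_second
        self.capacity = burst_capacity
        self.available = burst_capacity  # start with a full bucket
        self.last_refill = time.monotonic()

    def _refill(self) -> None:
        # Credit tokens accrued since the last check, capped at capacity.
        now = time.monotonic()
        self.available = min(
            self.capacity,
            self.available + (now - self.last_refill) * self.rate,
        )
        self.last_refill = now

    def try_acquire(self, token_count: int) -> bool:
        """Admit a request consuming `token_count` tokens, or reject it."""
        self._refill()
        if token_count <= self.available:
            self.available -= token_count
            return True
        return False
```

In a serving gateway this check would typically run per tenant before a request is dispatched to the inference backend, with the token count estimated from the prompt length plus the requested completion budget.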

About Databricks

Databricks is at the forefront of data and AI innovation, dedicated to creating solutions that empower organizations to tackle complex challenges through advanced technology. Our team thrives on collaboration and is committed to delivering groundbreaking insights that drive success for our clients.
