Senior ML Performance Engineer jobs in San Francisco – Browse 6,805 openings on RoboApply Jobs

Open roles matching “Senior ML Performance Engineer” with location signals for San Francisco. 6,805 active listings on RoboApply Jobs.

1 - 20 of 6,805 Jobs
Full-time|On-site|SF Bay Area

About Us: At Lemurian Labs, we are dedicated to democratizing AI technology while prioritizing sustainability. Our mission is to create solutions that minimize environmental impact, ensuring that artificial intelligence serves humanity positively. We are committed to responsible innovation and the sustainable growth of AI. We are in the process of developing a …

Oct 31, 2025
fal
Full-time|On-site|San Francisco

Join fal as we revolutionize the generative-media infrastructure landscape. Our mission is to enhance model inference performance, enabling creative experiences on an unprecedented scale. We are seeking a Staff Technical Lead for Inference & ML Performance, an individual who possesses a unique blend of deep technical knowledge and strategic foresight. In this pivotal role, you will lead a talented team dedicated to building and optimizing cutting-edge inference systems. If you're ready to influence the future of inference performance in a fast-paced and rapidly growing environment, we want to hear from you. Why This Role Matters: In this role, you will play a crucial part in shaping the future of fal’s inference engine, ensuring that our generative models consistently deliver outstanding performance. Your contributions will directly affect our capacity to swiftly provide innovative creative solutions to a diverse clientele, from individual creators to global brands. Your Responsibilities: Define and steer the technical direction, guiding your team across various domains including kernels, applied performance, ML compilers, and distributed inference to develop high-performance solutions.

Oct 29, 2025
tvScientific
Full-time|$155.6K/yr - $320.3K/yr|Remote|San Francisco, CA, US; Remote, US

About tvScientific tvScientific is the premier CTV advertising platform exclusively tailored for performance marketers. Our innovative approach harnesses vast data and state-of-the-art science to automate and enhance TV advertising, ultimately driving impactful business results. Our platform seamlessly integrates media buying, optimization, measurement, and attribution into one powerful, efficient solution. Developed by industry veterans with extensive backgrounds in programmatic advertising, digital media, and ad verification, our CTV performance platform is designed to help advertisers confidently scale their business. We are currently seeking a Senior MLOps Engineer to join our dynamic, distributed engineering team focused on our Connected TV ad-buying platform, as we expand our Machine Learning capabilities. Having successfully optimized TV ad campaigns, we are poised for massive growth, and we need your expertise to ensure our scalability is both sustainable and effective. As a proud member of Idealab, tvScientific was co-founded by leaders deeply rooted in programmatic advertising and digital media. We empower our clients to purchase ads across the expansive CTV landscape, including platforms such as Hulu, PlutoTV, and the ad-supported tiers of Disney+ and HBO Max. Following our acquisition by Pinterest, we are intensifying our focus on CTV to enhance the performance of search and social advertising.

Apr 7, 2026
Crusoe
Full-time|On-site|San Francisco, CA - US

Join Crusoe as a Senior Systems Performance Engineer, where you will play a crucial role in optimizing and enhancing our systems for superior performance. You will be responsible for diagnosing performance bottlenecks, implementing solutions, and ensuring that our infrastructure can scale efficiently. Work in a dynamic environment that encourages innovation and professional growth.

Mar 18, 2026
Parafin
Full-time|On-site|San Francisco, CA

About Us: At Parafin, our mission is to empower small businesses to thrive in today's competitive landscape. We understand that small businesses form the backbone of our economy, yet they often face challenges in accessing essential financial resources. Our innovative technology streamlines access to vital financial tools directly on the platforms they already utilize for sales. Partnering with industry leaders such as DoorDash, Amazon, Worldpay, and Mindbody, we provide small businesses with fast, flexible funding, efficient spend management, and effective savings solutions through simple integrations. Parafin manages the complexities of capital markets, underwriting, servicing, compliance, and customer support to ensure seamless experiences for our partners and their small business clients. We are composed of a dynamic team of innovators with backgrounds from top firms like Stripe, Square, Plaid, Coinbase, Robinhood, and CERN, all driven by a passion for developing tools that facilitate small business success. Backed by esteemed venture capitalists including GIC, Notable Capital, Redpoint Ventures, Ribbit Capital, and Thrive Capital, Parafin stands as a Series C company with over $194M raised in equity and $340M in debt facilities. Join us in shaping a future where every small business has access to the financial tools they need. About The Position: We are on the lookout for a skilled Software Engineer to join our Infrastructure team and spearhead the advancement of our Machine Learning (ML) Platform. This pivotal role is essential for constructing reliable, scalable, and developer-centric systems for model experimentation, training, evaluation, inference, and retraining that drive underwriting and other ML-powered products for small businesses. As a Software Engineer, you will design, build, and maintain the core frameworks and platforms that empower data scientists to deploy high-quality models into production efficiently and safely. You'll work closely with Data Science and Platform Engineering, taking ownership of the ML platform from end-to-end, and develop both batch and real-time underwriting infrastructure. What You'll Do: Transform notebooks into reliable software. Break down data scientist training and inference notebooks into reusable, well-tested components (libraries, pipelines, templates) with clear interfaces and documentation. Develop user-friendly ML abstractions. Create SDKs, CLIs, and templates that simplify the definition of features, model training and evaluation, and deployment to batch or real-time targets with minimal boilerplate. Construct our real-time ML inference platform. Establish and scale low-latency model serving capabilities. Enhance batch ML inference processes. Optimize scheduling, parallelism, cost controls, and observability to improve efficiencies.

Jan 5, 2026
fal
Full-time|$180K/yr - $250K/yr|On-site|San Francisco

Join fal in our pursuit to maintain a leading edge in model performance for generative media models. You'll be instrumental in designing and implementing innovative solutions for model serving architecture, built on our proprietary inference engine. Your focus will be on maximizing throughput while minimizing latency and resource consumption. In addition, you will create performance monitoring and profiling tools to identify bottlenecks and optimization opportunities. Collaborate closely with our Applied ML team and clients in the media sector to ensure their workloads leverage our accelerator effectively.

Dec 16, 2025
ClickUp
Full-time|On-site|United States of America

At ClickUp, we're not just developing software; we're shaping the future of work! In an era dominated by work sprawl, we identified a more efficient way. This led us to create the first truly integrated AI workspace, consolidating tasks, documents, chat, calendar, and enterprise search, all enhanced by context-driven AI. Our mission is to empower millions of teams to escape silos, reclaim their time, and reach unprecedented levels of productivity. At ClickUp, you'll have the chance to learn, innovate, and leverage AI in transformative ways that will not only influence our product but also the broader landscape of work itself. Join a daring, pioneering team that's challenging the limits of what's possible! We are on the lookout for a technical leader in SaaS client performance who is passionate about enhancing the customer experience through top-tier performance solutions. As a Senior Performance Engineer, you will spearhead comprehensive strategies to optimize application speed, memory utilization, and reliability across our entire platform. You will be empowered to analyze, diagnose, and address performance bottlenecks wherever they arise (front-end, back-end, or infrastructure), ensuring ClickUp remains the fastest and most reliable productivity platform available. The ideal candidate is a hands-on authority in browser and NodeJS performance, with a thorough understanding of how code influences rendering, memory management, and overall user experience. You excel in solving intricate challenges, collaborating across teams, and establishing new benchmarks for performance excellence. If you're driven to make a significant impact for millions of users, this is your chance to lead at scale. Your Responsibilities: Conduct root cause analysis on client performance issues and perform post-mortems. Profile application code to identify inefficient algorithms, memory leaks, and other issues; propose and implement effective solutions. Establish performance monitoring, alerting, and dashboards to proactively detect and resolve client performance challenges. Examine client traffic patterns, load testing outcomes, and other metrics to set benchmarks and drive enhancements. Champion performance best practices and set performance standards across the engineering organization. Identify infrastructure upgrades (caching, CDNs, database optimization) to elevate the client experience. Collaborate with development teams to incorporate performance as a core requirement in the development of new features.

Dec 22, 2025
Databricks
Full-time|$166K/yr - $225K/yr|On-site|San Francisco, California

At Databricks, we are dedicated to empowering data teams to tackle some of the most challenging problems in the world. We achieve this by creating and managing a leading data and AI infrastructure platform that enables our clients to leverage deep data insights for business enhancement. Our commitment to pushing the limits of data and AI technology is matched by our focus on resilience, security, and scalability, which are essential for our customers' success on our platform. Databricks operates one of the largest-scale software platforms, comprising millions of virtual machines that generate terabytes of logs and process exabytes of data daily. Given our scale, we frequently encounter cloud hardware, network, and operating system faults, and our software must adeptly protect our customers from these issues. As a Senior Performance Engineer, you will collaborate with various teams throughout the organization to assess product and feature performance, pinpoint performance bottlenecks, and partner with engineers to address performance and scalability challenges. This includes setting performance goals for different software releases, guiding teams in developing performance benchmarks, conducting competitive benchmark analyses for various Databricks products, and performing in-depth analyses to identify and resolve performance issues.

Jan 30, 2026
Full-time|Remote|San Francisco

At Runway ML, we are revolutionizing the intersection of art and science through innovative AI technology. Our mission is to build sophisticated world models that transcend traditional artificial intelligence limitations. We believe that to tackle the most pressing challenges, such as robotics, disease, and scientific breakthroughs, we need systems that can learn from experiences just like humans do. By simulating these experiences, we can expedite progress in ways that were previously unimaginable. Our diverse and driven team consists of creative thinkers who are passionate about pushing boundaries and achieving the extraordinary. If you share this ambition and are eager to contribute to our groundbreaking work, we invite you to join us. About the Role: We are open to hiring remotely across North America. We also have offices in NYC, San Francisco, and Seattle. We are on the lookout for a highly skilled and intellectually inquisitive Technical Accounting Manager to be our go-to authority on intricate accounting issues. This position offers significant visibility and is ideal for a professional adept at interpreting complex accounting guidelines, formulating sound conclusions, and translating technical insights into practical accounting practices.

Mar 17, 2026
Air Apps
Full-time|On-site|San Francisco

Join Our Team at Air Apps: At Air Apps, we are on a mission to revolutionize resource management through innovative technology. Founded in 2018 in Lisbon, Portugal, we have expanded our reach with offices in both Lisbon and San Francisco, boasting over 100 million downloads globally. Our vision is to create the world’s first AI-powered Personal & Entrepreneurial Resource Planner (PRP), and we are looking for passionate individuals to help us achieve this goal. Our commitment to challenging the status quo drives us to push the boundaries of AI-driven solutions that make a real impact. Here, you will have the opportunity to be a creative force, developing products that empower individuals worldwide. Join us as we embark on this journey to redefine how people plan, work, and live.

Feb 25, 2025
Anthropic
On-site|San Francisco, CA | New York City, NY | Seattle, WA

About Anthropic: At Anthropic, we are on a mission to develop AI systems that are reliable, interpretable, and steerable, ensuring they are safe and beneficial for users and society. Our dynamic team consists of dedicated researchers, engineers, policy experts, and business leaders who collaborate to advance the field of beneficial AI. About the Role: As the Engineering Manager in our performance and scaling teams, you will lead efforts to optimize our computing resources for both inference and training. Your role will involve identifying and eliminating bottlenecks, creating robust solutions, and enhancing system efficiency. In this fast-paced environment, you will provide clarity, focus, and context to your team, driving impactful results.

Feb 10, 2026
Canva
Full-time|On-site|San Francisco

Join our talented team at Canva as a Senior Software Engineer specializing in Video Performance. We are looking for an innovative and solutions-oriented engineer who is passionate about optimizing video experiences for our users. In this role, you will collaborate with cross-functional teams to enhance performance, develop new features, and implement best practices in video engineering.

Mar 16, 2026
ML Infrastructure Engineer

Sygaldry Technologies

Full-time|On-site|San Francisco

About Sygaldry Technologies: Sygaldry Technologies develops quantum-accelerated AI servers in San Francisco, focusing on faster AI training and inference. By combining quantum technology with artificial intelligence, the team addresses challenges in computing costs and energy efficiency. Their AI servers integrate multiple qubit types within a fault-tolerant system, aiming for a balance of cost, scalability, and speed. The company values optimism, rigor, and a drive to solve complex problems in physics, engineering, and AI. Role Overview: The ML Infrastructure Engineer joins the AI & Algorithms team, which includes research scientists, applied mathematicians, and quantum algorithm specialists. This role centers on building and maintaining the compute infrastructure that powers advanced research. The systems you build will support reliable GPU access, reproducible experiments, and scalable workloads, so researchers can focus on their core work without needing deep cloud expertise. Expect to design and manage compute platforms for a range of tasks, including quantum circuit simulation, large-scale numerical optimization, model training, tensor network contractions, and high-throughput data generation. These workloads span multiple cloud providers and on-premises GPU servers. Key Responsibilities: Develop compute abstractions for diverse workloads, such as GPU-accelerated simulations, distributed training, high-throughput CPU jobs, and interactive analyses using frameworks like PyTorch and JAX. Set up infrastructure to support experiment tracking and reproducibility. Create developer tools that make cloud computing feel local, streamlining environment setup, job submission, monitoring, and artifact management. Scale experiments from single-GPU prototypes to large, multi-node production runs. Multi-Cloud GPU Orchestration: Design orchestration strategies for workloads across multiple cloud providers, optimizing job routing for cost, availability, and capability. Monitor and improve cloud spending, keeping track of credit balances, burn rates, and expiration dates.

Apr 14, 2026
Sciforium
Full-time|On-site|San Francisco

At Sciforium, we are at the forefront of AI infrastructure, dedicated to the development of advanced multimodal AI models and an innovative serving platform that emphasizes high efficiency. With substantial funding and direct collaboration from AMD, our team is rapidly expanding to create the complete stack for pioneering AI models and dynamic real-time applications. Role Overview: This position provides a distinct opportunity to engage with the fundamental systems that drive Sciforium's multimodal AI models. You will play a crucial role in constructing the model serving platform, working with C++, Python, runtime execution, and distributed infrastructure to design a swift, dependable engine for real-time AI applications. You will acquire practical experience in performance engineering, discover how large AI models are optimized and deployed at scale, and collaborate closely with ML researchers and seasoned systems engineers. If you thrive in low-level programming and are passionate about performance, this role offers both impactful contributions and significant growth opportunities.

Nov 15, 2025
Pinterest
Full-time|$208.6K/yr - $429.5K/yr|Remote|San Francisco, CA, US; Remote, US

About Pinterest: At Pinterest, our platform inspires millions of people around the globe to explore creative ideas, envision new possibilities, and create lasting memories. We are dedicated to providing the inspiration needed to build a fulfilling life, starting with the talented individuals who drive our product development. Join us in a career that sparks innovation for millions, transforms passion into opportunities for growth, and celebrates the diverse experiences of our team members, all while enjoying the flexibility to perform at your best. Building a career you love is within reach. Position Overview: We are looking for a Senior Engineering Manager to spearhead our AI/ML Serving Platform team, which develops the core tools and infrastructure utilized by numerous AI/ML engineers across Pinterest. This includes systems for recommendations, advertisements, visual search, notifications, and trust and safety. Our goal is to enhance the efficiency, quality, and speed of AI/ML systems, ensuring they are production-ready and reliable for iterative model development. Key Responsibilities: Lead the team in driving continuous improvements in advanced model architectures, optimizing resource usage, and boosting AI/ML developer productivity. Establish the technical vision for the team aligned with company and organizational priorities. Mentor and cultivate talent within the team. Qualifications: Proven experience in managing engineering teams with diverse cross-organizational clients. Expertise in developing large-scale distributed serving systems. Familiarity with AI/ML inference technologies (e.g., PyTorch, TensorFlow) for web-scale online serving. Bachelor's degree in Computer Science or a related field, or equivalent professional experience.

Feb 11, 2026
Whatnot
Full-time|On-site|San Francisco, CA

Embrace the Future of Commerce with Whatnot! Whatnot stands as North America and Europe’s premier live shopping platform, dedicated to transforming the way you buy, sell, and discover your favorite items. We are on a mission to redefine e-commerce by seamlessly merging community engagement, shopping, and entertainment into a unique experience tailored just for you. As part of a remote, co-located team, we thrive on innovation while being firmly rooted in our core values. With operational hubs across the US, UK, Germany, Ireland, and Poland, we are collaboratively shaping the future of online marketplaces. Our live auctions span a diverse range of categories from fashion and beauty to electronics and collectibles, including trading cards, comic books, and even live plants. There’s truly something for everyone! And this is just the beginning! As one of the fastest-growing marketplaces, we are in search of bold, innovative problem solvers across all functional areas. Stay updated with the latest Whatnot news through our news and engineering blogs, and join us in empowering individuals to transform their passions into thriving businesses, fostering connections through commerce. Your Role: We are seeking hands-on leaders: intellectually curious and technically proficient individuals ready to influence the future of AI and ML at Whatnot. In this pivotal role, you will spearhead the development and scaling of the foundational infrastructure that supports machine learning and self-hosted large language model applications across our organization. Collaborating closely with machine learning scientists, you will drive the implementation of innovative models powered by near-real-time features, enhancing product experiences. This entails building robust systems that ensure advanced ML is both reliable and efficient at scale, from low-latency deep learning model serving and streaming feature ingestion to distributed training and high-throughput GPU inference. Although this is a managerial role, a strong technical foundation is essential, and candidates should be enthusiastic about diving deep into the details. You will elevate architectural discussions, provide insightful technical feedback, and dedicate at least one day a week to coding. Your Responsibilities: Lead the infrastructure supporting AI and ML models across critical business areas, enhancing growth, recommendations, trust and safety, fraud detection, seller tooling, and more. Oversee the prototyping, deployment, and productionization of innovative ML architectures, ensuring they align with our strategic objectives.

Jan 15, 2026
Sigma Computing
Full-time|$240K/yr - $270K/yr|On-site|San Francisco, CA

Role Overview: Sigma Computing is building the next generation of data interaction. The platform lets users explore and analyze billions of data rows in seconds, all within a familiar spreadsheet-like interface. Sigma aims to make it simple to analyze, present, and build data-driven applications at scale. AI is central to Sigma's vision for the future. The company is expanding its use of artificial intelligence to help users build in Sigma, surface insights, and make decisions faster. What You Will Do: As a Senior AI/ML Engineer, join a team focused on shaping the AI architecture behind Sigma's platform. This work directly impacts thousands of enterprises that depend on Sigma for their data workflows. The team is responsible for designing and implementing the systems that will power Sigma's AI-driven features for years to come. Location: This position is based in San Francisco, CA.

Apr 25, 2026
Nash
Full-time|On-site|San Francisco

Senior Infrastructure & Performance Engineer: As a Senior Infrastructure & Performance Engineer, you will take charge of enhancing the performance, reliability, and scalability of Nash's foundational infrastructure. Collaborating closely with the Engineering Leadership and both platform and product engineering teams, you will design and manage low-latency, mission-critical systems that facilitate real-time logistics for some of the world's largest retailers. This is a key senior role focused on elastic capacity, high availability, cloud-native architectures, Postgres performance, and enterprise-grade CI/CD for multi-region deployments. You will define the technical roadmap, establish best practices, and implement systems that support the essential workflows of major retailers. Key Responsibilities: Oversee infrastructure performance and reliability for Nash's production environments, ensuring low latency, high throughput, and consistent performance under load. Design, develop, and enhance AWS infrastructure, utilizing managed services with a focus on ECS/Fargate. Lead initiatives in Postgres performance engineering, including query optimization, indexing strategies, connection management, replication, cluster design, and failover. Architect and maintain multi-region, highly available systems with robust resiliency and guaranteed disaster recovery. Design and refine enterprise-grade CI/CD pipelines that enable safe, repeatable, and rapid deployments across environments and regions. Establish observability standards (metrics, logs, tracing, SLOs) to proactively identify and resolve performance bottlenecks. Collaborate with application engineers to inform system design choices that influence scalability, latency, and reliability. Lead incident response efforts and postmortems, emphasizing root cause analysis, systemic improvements, and long-term resilience. Set best practices for infrastructure and performance while mentoring engineers throughout the organization. Qualifications: 6+ years of experience in building and managing high-scale production infrastructure for mission-critical systems. Proficiency with AWS, particularly with ECS/Fargate, and experience with cloud-native architecture. Strong background in Postgres performance tuning and optimization. Deep understanding of CI/CD practices and experience in multi-region deployments. Exceptional analytical and problem-solving skills, with a proactive approach to performance management.

Jan 6, 2026
Full-time|Remote|San Francisco

At Runway ML, we are pioneering artificial intelligence that creatively simulates the world by blending art with science. We envision a future where world models are at the cutting edge of AI advancements. Relying solely on language models will not address the most challenging issues we face, such as robotics, disease management, and scientific breakthroughs. Genuine progress necessitates models that can interact with the world and learn through trial and error, much like humans do. This process can be significantly expedited in a simulated environment rather than the real world. Our world models provide the clearest pathway to general-purpose simulation, revolutionizing storytelling, scientific advancement, and the exploration of humanity's next frontiers. Our team is comprised of innovative, open-minded, caring, and ambitious individuals dedicated to making a difference. We strive to create the extraordinary, and our success is built on assembling an exceptional team. If you share our passion and vision, we would be thrilled to hear from you. About the Role: We are open to hiring remotely across the US, with offices located in NYC, San Francisco, and Seattle. We are looking for a meticulous and self-motivated Senior Accountant to become a vital part of our Finance team. This position demands a foundational accounting professional who takes ownership of the monthly close process, excels in a dynamic environment, and maintains precision in every task. What You'll Do: Conduct a prompt and accurate monthly financial close, including preparing and posting journal entries, maintaining account reconciliations, and identifying and resolving discrepancies. Assist in accounts receivable functions, including invoicing, collection follow-ups, cash application, and aging reviews. Support accounts payable processes by reviewing and reconciling invoices and managing cash flow. Identify and implement enhancements to accounting workflows, such as documenting procedures, strengthening controls, and promoting consistency as the team scales. Act as the accounting representative in cross-functional discussions with Legal, Sales, and Engineering, ensuring that accounting aspects are considered early and integrated into decision-making. Assist in preparing supporting schedules and documentation for external audits and financial reporting. What You’ll Need: Bachelor's degree in Accounting or Finance; CPA or CPA candidate is preferred. A minimum of 4 years of experience in a similar role.

Mar 17, 2026
Full-time|On-site|San Francisco Bay Area

Merge Labs is a pioneering research laboratory dedicated to uniting biological intelligence with artificial intelligence, aiming to enhance human potential, autonomy, and overall experience. We are innovating groundbreaking methods for brain-computer interfaces that facilitate high-bandwidth interactions with the brain, seamlessly integrate advanced AI, and ensure safety and accessibility for all users. About the Team: At Merge Labs, we are developing the future of brain-computer interfaces through the integration of cutting-edge advancements in synthetic biology, neuroscience, AI, and non-invasive imaging. Our cross-functional data and software engineering team collaborates closely with wet-lab scientists, automation engineers, and data scientists to construct a digital infrastructure that expedites molecular discoveries and optimizes device performance. About the Role: We are seeking a Senior / Principal ML Engineer to lead the development and ownership of the digital infrastructure supporting Merge's extensive computational operations. In this role, you will design distributed training and inference systems, experiment tracking, and deployment frameworks, empowering data scientists to swiftly iterate on models encompassing de-novo molecular design, biophysical modeling, signal processing, and computer vision. Your architectural contributions will transform research prototypes into production-ready systems, enhancing the speed, rigor, and fluidity of every computational scientist's workflow. Key Responsibilities: Develop the scientific and engineering framework for active learning and closed-loop optimization, including data ETL, machine learning modeling, and library architecture. Work alongside computational scientists to establish achievable optimization goals and encode domain-specific knowledge and constraints. Create model registries, evaluation frameworks, and automated reporting systems for benchmarking and experimental comparisons. Implement CI/CD pipelines and resource orchestration using tools like Kubernetes, Ray, or Slurm. Define and manage the ML engineering roadmap, providing mentorship to other computational scientists while establishing best practices for code quality, testing, and reproducibility.

Dec 11, 2025
