Software Engineer, Distributed Data Systems

Exa

Full-time|On-site|San Francisco, California

Qualifications

Required Qualifications

- In-depth knowledge of lakehouse architectures (Delta Lake, Iceberg, Hudi) and their appropriate applications
- Proven experience in building and managing large-scale distributed data processing pipelines
- Hands-on expertise with streaming data systems such as Kafka or Flink
- Familiarity with production-scale tools like Ray, Spark, or ClickHouse
- A relentless commitment to reliability and crafting systems that do not require 3 AM wake-up calls

Preferred Qualifications

- Experience with Lance or similar vector-native storage formats
- Background in GPU-accelerated data processing technologies (RAPIDS, cuDF)

Example Projects

- Design a lakehouse architecture capable of managing over 100 PB of web crawl data
- Develop streaming pipelines to process billions of documents daily for real-time indexing
- Architect the data layer for our embedding training infrastructure utilizing Ray
- Expand our ClickHouse deployment to efficiently handle analytical queries across vast amounts of search logs

About the job

As a Software Engineer specializing in Distributed Data Systems, you will be responsible for designing and implementing the data infrastructure that drives our operations—from crawling billions of web pages to training sophisticated embedding models and delivering real-time search functionalities. You will enjoy significant autonomy in creating systems capable of scaling to hundreds of petabytes. This is your opportunity to work on data pipelines at an unprecedented scale.

About Exa

Exa is pioneering a revolutionary search engine designed specifically for AI applications. Our commitment to leveraging advanced technology enables us to build a robust infrastructure capable of handling vast amounts of data efficiently.

Similar jobs

Senior Software Engineer - Foundational Data Systems for AI

Granica

Full-time|On-site|Bay Area Office

About Granica

Granica is an innovative AI research and infrastructure firm dedicated to creating reliable and steerable representations of enterprise data.

We build trust through our product Crunch, a policy-driven health layer that ensures large tabular datasets remain efficient, reliable, and reversible. On this solid foundation, we are developing Large Tabular Models: systems designed to learn cross-column and relational structures in order to provide trustworthy answers and automation with inherent provenance and governance.

Our Mission

AI is currently hampered not only by the design of models but also by the inefficiencies of the data that supports them. Every redundant byte, poorly organized dataset, and inefficient data pathway contributes to significant costs, latency, and energy waste as we scale.

Granica aims to eliminate these inefficiencies. We merge cutting-edge research in information theory, probabilistic modeling, and distributed systems to craft self-optimizing data infrastructures: systems that consistently enhance the representation and utilization of information by AI.

Our engineering team collaborates closely with the Granica Research group led by Prof. Andrea Montanari of Stanford University, bridging advancements in information theory and learning efficiency with large-scale distributed systems. Together, we firmly believe that the next major advancement in AI will stem from breakthroughs in efficient systems rather than merely larger models.

Your Contributions

- Global Metadata Substrate: Design a transactional and metadata substrate that facilitates time-travel, schema evolution, and atomic consistency across massive petabyte-scale tabular datasets.
- Adaptive Engines: Develop systems that autonomously reorganize data, learning from access patterns and workloads to maintain peak efficiency without the need for manual tuning.
- Intelligent Data Layouts: Optimize bit-level organization (including encoding, compression, and layout) to maximize signal extraction per byte read.
- Autonomous Compute Pipelines: Create distributed compute systems that scale predictably, adapt to dynamic loads, and ensure reliability under failure conditions.
- Research to Production: Apply new algorithms in compression, representation, and optimization that emerge from ongoing research. We encourage opportunities to publish and open-source your work.
- Latency as Intelligence: Design systems that inherently minimize latency as a measure of intelligence.

Nov 7, 2025

Senior Software Engineer, Data Foundations

SoFi

Full-time|On-site|CA - San Francisco; WA - Seattle; UT - Cottonwood Heights

Join SoFi as a Senior Software Engineer in our Data Foundations team, where you will play a pivotal role in shaping our data architecture and enhancing our data-driven capabilities. You will work closely with cross-functional teams to develop robust data solutions that empower our business decisions and improve customer experiences.

As a Senior Software Engineer, you will leverage your expertise in data engineering, software development, and cloud technologies to build scalable data pipelines and maintain high-quality data infrastructure. Your contributions will directly impact our ability to deliver innovative financial solutions.

Apr 13, 2026

Staff Software Engineer – Foundational Data Systems for AI

Granica

Full-time|On-site|Bay Area Office

About Granica

Granica is a pioneering AI research and infrastructure company dedicated to creating reliable and steerable representations of enterprise data.

We build trust through Crunch, a policy-driven health layer designed to keep extensive tabular datasets efficient, reliable, and reversible. From this foundation, we are developing Large Tabular Models: systems that learn cross-column and relational structures to provide trustworthy answers and automation, complete with built-in provenance and governance.

Our Mission

The current limitations of AI are not solely due to model design but also to the inefficiencies of the data that supports it. At scale, every redundant byte, poorly organized dataset, and inefficient data path contributes to significant costs, latency, and energy waste.

Granica’s mission is to eliminate these inefficiencies. We leverage cutting-edge research in information theory, probabilistic modeling, and distributed systems to create self-optimizing data infrastructures that continuously enhance how information is represented and utilized by AI.

Our engineering team collaborates closely with the Granica Research group led by Prof. Andrea Montanari from Stanford University, merging advancements in information theory and learning efficiency with large-scale distributed systems. We believe that the next major breakthrough in AI will stem from innovations in efficient systems, rather than simply larger models.

What You Will Create

- Global Metadata Substrate. Design and refine the global metadata and transactional substrate that enables atomic consistency and schema evolution across exabyte-scale data systems.
- Adaptive Engines. Architect systems that self-optimize, reorganizing and compressing data according to access patterns, achieving unprecedented efficiency improvements.
- Intelligent Data Layouts. Innovate new encoding and layout strategies that challenge the theoretical limits of signal per byte read.
- Autonomous Compute Pipelines. Spearhead the development of distributed compute platforms that scale predictively and maintain reliability even under extreme load and failure conditions.
- Research to Production. Partner with Granica Research to transform advances in compression and probabilistic modeling into production-ready, industry-leading systems.
- Latency as Intelligence. Propel systems forward by optimizing for latency as a key aspect of intelligence.

Nov 7, 2025

Senior Software Engineer - Distributed Data Systems

Databricks

Full-time|$166K/yr - $225K/yr|On-site|San Francisco, California

At Databricks, we are driven by a passion for empowering data teams to tackle the world’s most challenging problems, from transforming transportation to accelerating medical innovations. We achieve this by creating and maintaining the leading data and AI infrastructure platform, enabling our clients to leverage profound data insights for business enhancement. Founded by engineers with a customer-first mentality, we eagerly embrace every opportunity to tackle complex technical challenges, ranging from the design of next-generation UI/UX for data interactions to scaling our services across millions of virtual machines. Our journey has just begun.

As a member of the Runtime team at Databricks, you will be instrumental in developing the next generation of distributed data storage and processing systems. These systems will surpass specialized SQL query engines in relational query performance while offering the programming abstractions necessary to support a variety of workloads, from ETL to data science.

Example projects include:

- Apache Spark™: Contribute to the de facto open-source standard framework for big data.
- Data Plane Storage: Develop reliable and high-performance services and client libraries for managing vast amounts of data within cloud storage backends like AWS S3 and Azure Blob Store.
- Delta Lake: Design a storage management system that merges the scalability and cost-effectiveness of data lakes with the performance and reliability of data warehouses, providing features like ACID transactions and time travel.
- Delta Pipelines: Simplify the orchestration and operation of numerous data pipelines, enabling clients to deploy, test, and upgrade pipelines effortlessly.
- Performance Engineering: Create the next-generation query optimizer and execution engine that is fast, scalable, and robust.

Jan 30, 2026

Software Engineer - Transforming Homebuilding Experience

Foundation

Full-time|On-site|San Francisco, Boulder, or Austin

Join Our Mission

Foundation is on the lookout for exceptional engineers to propel our mission of modernizing the homebuilding industry and enhancing the journey of purchasing, selling, and owning homes.

About Us

With an impressive backing of $6.8 million from top venture capitalists, including Y Combinator, Foundation is formed by a team of former Opendoor professionals dedicated to revolutionizing residential real estate.

Our flagship product serves as a pioneering customer experience platform tailored for homebuilders, akin to “Shopify for Homebuilders.” We collaborate with large-scale homebuilders to provide a cutting-edge digital customer experience, significantly boosting customer satisfaction and team productivity. In just two years, we’ve achieved notable product-market fit and rapid growth, supported solely by contract design.

Our Growth Phases

We are currently navigating the first of three interconnected growth phases:

- Innovative, AI-driven vertical SaaS for homebuilding - a public-scale potential in its own right.
- The enterprise ecosystem for real estate - enabling collaboration among adjacent trillion-dollar industries including lending, title, home insurance, and home services.
- The AI-native home operating system and interface - leveraging the network effects of the enterprise ecosystem to deliver transformative AI experiences for homebuyers and homeowners. Discover, purchase, and manage your home seamlessly with Foundation.

Your Role

In this pivotal role, you will contribute to creating the most reliable and modern customer experience platform within the homebuilding sector, transitioning traditional workflows into the AI era with insight, pragmatism, and empathy.

You will work closely with product and customer teams, engaging in comprehensive stack development, collaborating with design and go-to-market teams, and owning significant portions of the platform from initial discovery to production refinement. Our focus is on real businesses, real revenue, and real users who require robust and reliable tools.

Feb 10, 2026

Software Engineer - AI Foundation (Mid-Senior Level)

Zip

Full-time|On-site|San Francisco

Join Zip as a talented Software Engineer in our AI Foundation team, where you'll play a crucial role in developing innovative AI solutions. This position offers the opportunity to work on cutting-edge technologies and contribute to projects that impact millions. If you have a passion for artificial intelligence and software development, we want to hear from you!

Mar 16, 2026

Foundations Engineer - Deep Infrastructure (On-site in San Francisco)

Rox Data Corp

Full-time|On-site|San Francisco

About Rox

At Rox, we are pioneering the development of an AI-native revenue operating system that transforms how enterprises interact with technology. Unlike traditional software designed for human dashboard operators, Rox is engineered for agents managing complex systems.

We eliminate static workflows, enabling continuous decision-making processes powered by real-time insights from across the enterprise landscape. Our agents are equipped to analyze signals, reason through them, and autonomously execute actions.

To support this innovation, we are constructing a robust infrastructure that integrates:

- Distributed data platforms
- Real-time decision-making systems
- Agent execution frameworks
- Low-latency context retrieval

Backed by prominent investors like Sequoia, GV, and General Catalyst, we are assembling a talented team of engineers eager to tackle deep technical challenges that have a tangible impact on the world.

About the Foundations Team

The Foundations team is responsible for developing the core infrastructure that powers Rox agents.

Our work focuses on:

- Real-time context ingestion
- Agent execution and orchestration
- Ensuring reliability for long-term AI tasks
- Low-latency decision-making across distributed systems

If you have experience with:

- Streaming compute platforms
- Distributed query engines
- Real-time OLAP systems
- Matching engines
- Large-scale data infrastructure

Many challenges you encounter here will resonate with your past work but will be applied to a novel category of software. At Rox, agents continuously:

- Retrieve context
- Make decisions
- Trigger actions
- Update state

The Foundations team builds the infrastructure that ensures these feedback loops are reliable, swift, and observable.

The Role

We are on the lookout for a Foundations Engineer (Deep Infrastructure) to design and oversee the systems that power Rox's agent runtime.

Mar 24, 2026

Senior Machine Learning Engineer (Research Scientist) - Data Foundation & AI

Plaid Inc.

Full-time|On-site|San Francisco

About Plaid

Plaid builds tools that help developers create new financial products and experiences. Since 2013, Plaid has connected millions of users to over 12,000 financial institutions across the US, Canada, the UK, and Europe. The company partners with organizations like Venmo, SoFi, Fortune 500 firms, and major banks to make linking financial accounts to apps and services easier. Headquarters are in San Francisco, with offices in New York, Washington D.C., London, and Amsterdam.

Team: Data Foundation & AI

The Data Foundation and AI team designs and maintains the machine learning and AI infrastructure that supports Plaid’s products. This group transforms Plaid’s financial network data into flexible formats used by teams across the company. Responsibilities span the entire system lifecycle: data curation for pretraining, model development, deployment, serving, and monitoring in production.

Role Overview: Senior Machine Learning Engineer (Research Scientist)

This position focuses on applied research for Plaid’s foundation model. The Senior Research Scientist leads efforts to design model architectures, set pretraining objectives, and implement fine-tuning strategies that work across a range of product needs. The role also involves building and maintaining production machine learning systems, including training pipelines, model serving, feature engineering, and performance monitoring.

Key Responsibilities

- Design model architectures and define pretraining objectives for Plaid’s foundation model
- Develop and apply fine-tuning methods for diverse product use cases
- Build and maintain end-to-end machine learning systems, from data pipelines to model serving
- Engineer features and monitor system performance in production
- Create evaluation frameworks to measure model quality across multiple tasks and metrics

Location

This role is based in San Francisco.

Apr 15, 2026

Software Engineer - Foundations Retrieval at OpenAI | San Francisco

OpenAI

Full-time|On-site|San Francisco

About the Foundations Retrieval Team

The Foundations Research group at OpenAI explores new approaches that could shape artificial intelligence for years to come. The team focuses on improving the science and data behind model training and scaling, especially for future advanced models. Areas of focus include data utilization, scaling laws, optimization strategies, model architectures, and efficiency improvements.

Within Foundations, the Search team builds agentic search solutions. This group works closely with others to design interfaces between models and the core search stack, serving, indexing, and retrieval, so model intent leads to reliable, real-world results. The team develops large-scale systems to transform and index massive information sources, enabling models to reason over global knowledge. Close collaboration with researchers helps move new modeling ideas into production quickly, changing how intelligent systems discover and synthesize information at scale.

Role Overview

OpenAI is hiring a Software Engineer with expertise in retrieval system development and scalability for its San Francisco office. This role involves working with researchers and engineers to build infrastructure that lets models access the right information when needed. Responsibilities include designing and operating indexing systems, retrieval pipelines, and serving layers. Work in this role will directly improve retrieval capabilities across OpenAI’s research and products, with a strong influence on system performance, reliability, and scalability.

What You’ll Do

- Develop and scale retrieval infrastructure, including indexing, serving, and query execution.
- Build low-latency, high-throughput systems for real-time model interactions.
- Work with research teams to bring embedding and retrieval methods into production.
- Support dense, sparse, and hybrid retrieval pipelines.
- Maintain system performance, reliability, and observability at scale.
- Collaborate with Pretraining, Inference, and Product teams to deliver end-to-end retrieval solutions.
- Help develop model-system interfaces for agentic workflows.

Who We’re Looking For

- Experience building and scaling distributed systems.
- Background in developing high-performance, low-latency systems.
- Hands-on work with indexing and retrieval techniques.
- Familiarity with hybrid retrieval systems.
- Comfort working collaboratively across multiple teams.

Apr 14, 2026

Software Engineer, Integrity Foundations

OpenAI

Full-time|On-site|San Francisco

About Our Team

The Applied Foundations team at OpenAI is at the forefront of safeguarding our innovative technology against diverse adversarial threats. Our primary mission is to ensure the integrity and security of our platforms as they grow.

We are dedicated to defending against financial misuse, large-scale attacks, and other forms of exploitation that could compromise user experience or destabilize our operations. The Integrity Foundations team lays the groundwork and infrastructure to support this critical mission.

About the Role

At OpenAI, we aim to advance artificial intelligence in a manner that is safe, reliable, and aligned with broader societal values. The role of Software Engineer in Applied Foundations is vital for maintaining the dependability of our platforms. You will play a key role in developing strong defenses against a variety of adversarial behaviors that threaten our ecosystem.

In this position, you will collaborate with our entire engineering team to design and implement systems that detect and prevent abuse, promote user safety, and mitigate risks across our platform. You will be at the forefront of our initiatives to responsibly and sustainably harness the vast potential of AI.

Key Responsibilities:

- Design and enhance systems to identify and prevent various forms of abuse, including financial fraud, botting, and scripting.
- Work collaboratively with cross-functional teams to create solutions that defend against adversarial attacks while preserving an optimal user experience.
- Assist in responding to active incidents on the platform and develop new tools and infrastructure to address fundamental challenges.

You will excel in this role if you:

- Possess a minimum of 3 years of professional experience in software engineering.
- Have experience in setting up and maintaining production backend services and data pipelines.
- Exhibit a humble attitude, a willingness to support colleagues, and a commitment to team success.
- Demonstrate self-direction and enjoy innovating solutions to complex problems.
- Take ownership of issues from start to finish and are eager to acquire any necessary knowledge to achieve your goals.
- Have a passion for AI safety in production environments and the skills to build effective software systems.

Jun 11, 2024

Senior Software Engineer, App Foundation (Backend)

Airbnb, Inc.

Full-time|$191K/yr - $223K/yr|On-site|United States

Founded in 2007, Airbnb has transformed the way people travel and connect. From humble beginnings with just three guests in a San Francisco home, we have grown to a global community of over 5 million hosts who have welcomed more than 2 billion guest arrivals across nearly every country. Our hosts provide unique accommodations and experiences that foster authentic connections with local communities.

Join Our Talented Community:

As a member of Airbnb's App Foundation team, you will collaborate across platforms to create high-quality, performant capabilities that enhance nearly all features within the Guest and Host ecosystems.

Our primary focus is on developing App Product Frameworks, Insights & Logging, Performance & App Health, and Feature Architecture. We work hand in hand with Product, Design, Platform (iOS/Android/Web), Analytics Infrastructure, Data Platform, and other Product Foundation teams to establish cohesive paved paths and standards at scale. We pride ourselves on a culture that values technical excellence, pragmatic decision-making, robust ownership, and a steadfast commitment to enhancing both developer and user experiences through foundational work.

Your Impact:

- Collaborate with cross-functional partners (design and product) to explore, shape, and implement new product experiences from ideation to large-scale execution.
- Develop efficient and reusable backend capabilities that prioritize quality while ensuring performance and scalability.
- Lead initiatives that significantly enhance Guest and Host experiences by improving app responsiveness and ensuring reliable performance across critical backend systems impacting millions.
- Establish a performance roadmap by identifying bottlenecks, prioritizing impactful work, and delivering enhancements across services, data access patterns, and infrastructure.
- Elevate performance engineering standards by creating tools, benchmarks, and guardrails that prevent regressions and integrate performance considerations into team workflows.
- Influence architecture and standards across Airbnb's backend ecosystem to improve observability, efficiency, and adaptability of systems.

A Day in Your Life:

Join us in making a difference for millions of users worldwide by building the backend foundations that power the Airbnb experience.

Mar 13, 2026

Marketing Lead at Foundation | San Francisco

Foundation

Full-time|On-site|San Francisco

Join Us as Our First Marketing Lead

Foundation is on the lookout for our inaugural marketing leader to propel our vision of revolutionizing homebuilding and enhancing the journey of buying, selling, and owning a home.

About Foundation

With approximately $6.8M in backing from top-tier venture capitalists, including Y Combinator, Foundation is composed of a dynamic team formerly from Opendoor, dedicated to reshaping the future of residential real estate.

Our flagship product is a cutting-edge customer experience platform designed specifically for homebuilders: think of it as the "Shopify for Homebuilders." We collaborate with large-scale homebuilders to deliver a modern digital experience, significantly boosting customer satisfaction and team productivity. In just two years, we've achieved remarkable product-market fit and impressive growth, all without a dedicated marketing team.

Our Growth Journey

We are currently navigating the first of three interconnected growth phases:

- AI-Driven SaaS for Homebuilding: A transformative opportunity with public-scale potential.
- Real Estate Enterprise Ecosystem: Homebuilders drive this ecosystem, which fosters collaboration among adjacent trillion-dollar sectors such as lending, title, home insurance, and retail.
- AI Native Home Operating System: This will enable seamless home buying and ownership through our platform.

Your Role as Our First Marketer

We seek a hands-on, results-driven marketer passionate about transforming a key sector of the U.S. economy and redefining marketing in the age of AI.

Key Responsibilities

You will be pivotal in steering Foundation's next growth phases by integrating AI with marketing and real estate innovation. Your primary objectives will include:

- Accelerating Growth: Drive rapid expansion of our core AI-driven product line for homebuilders.

Feb 9, 2026

Software Engineer, Distributed Data Systems

Exa

Full-time|On-site|San Francisco, California

At Exa, we are on a mission to create a cutting-edge search engine from the ground up, tailored specifically for AI applications. Our team is dedicated to developing large-scale infrastructure that efficiently crawls the internet, trains advanced embedding models for indexing, and constructs high-performance vector databases in Rust for optimized searching. We also manage a state-of-the-art $5M H200 GPU cluster that activates thousands of machines simultaneously.

As a Software Engineer specializing in Distributed Data Systems, you will be responsible for designing and implementing the data infrastructure that drives our operations, from crawling billions of web pages to training sophisticated embedding models and delivering real-time search functionalities. You will enjoy significant autonomy in creating systems capable of scaling to hundreds of petabytes. This is your opportunity to work on data pipelines at an unprecedented scale.

Dec 19, 2025

Senior Manager of Data Engineering and AI Automation, Business Systems

Stitch Fix, Inc.

Full-time|$138K/yr - $230K/yr|Remote|Remote, USA

About Stitch Fix, Inc.

Stitch Fix (NASDAQ: SFIX) is a premier online personal styling service that empowers individuals to discover and embrace their unique styles. By expertly blending skilled stylists with cutting-edge AI and recommendation algorithms, we curate an exceptional selection of both exclusive and national brands, tailored to meet each client's distinct tastes and preferences. Founded in 2011 and headquartered in San Francisco, Stitch Fix revolutionizes the way people shop, making it effortless for clients to express their personal style without the hassle of navigating through endless options in stores or online.

About the Team

The Business Systems team serves as the strategic technology and data partner for our core operations. We design and maintain the technological framework that supports our Finance, Procurement, Merchandising, and HR/People and Culture functions. By collaborating directly with business leaders, we create, implement, and enhance scalable systems while transforming our business data into a strategic asset. Our team is responsible for building and managing data engineering pipelines, analytics dashboards, and next-generation automation and Gen AI solutions that provide critical insights and empower leaders to make informed, data-driven decisions.

About the Role

We are on the lookout for a strategic Senior Engineering Manager to lead our Business Systems Data & Insights team, focusing on pivotal domains including Finance (Accounting, FP&A), Merchandising, Procurement, and HR/People & Culture. This high-impact role will allow you to shape how Stitch Fix utilizes data and AI to drive essential business decisions and influence company strategy. You will spearhead our data and AI transformation by constructing scalable data infrastructure, enhancing analytics capabilities, implementing intelligent automation, and accelerating the adoption of Gen AI across these critical business functions.

Feb 12, 2026

Senior Software Engineer II - Agentic AI Systems

Moveworks

Full-time|On-site|San Francisco, CA

The Role

Are you a skilled software engineer with a proven track record in building and refining production systems? Are you eager to apply your expertise at the forefront of AI technology? If so, this opportunity may be perfect for you. As a Senior Software Engineer on our Natural Language Understanding team within the “agent lab,” you will be pivotal in our mission to enhance the capabilities of AI agents for reliable, scalable performance. You will have the chance to influence the evolution of the Moveworks AI Assistant platform in several key areas: agent orchestration, sandboxed file systems, latency optimization, and multimodal I/O, among others. You will leverage the best tools in enterprise AI, including cutting-edge LLMs from top providers like OpenAI. Our team prioritizes rapid innovation on scalable infrastructure while tackling challenging product and engineering obstacles to deliver exceptional value to our clients. If you are looking to achieve the pinnacle of your career alongside a passionate, dedicated team focused on making an impact, we invite you to connect with us.

Mar 20, 2026

Senior Software Engineer, AI Evals

Sentry

Full-time|$240K/yr - $280K/yr|Hybrid|San Francisco, California

About Sentry

At Sentry, we are committed to transforming the way developers build software. With a mission to eradicate poor software experiences, we empower developers to create better applications more efficiently, ensuring a seamless encounter with technology.

Backed by over $217 million in funding and trusted by more than 100,000 organizations, including industry giants like Disney, Microsoft, and Atlassian, we are at the forefront of performance monitoring and error tracking solutions. Our innovative tools enable companies to focus on product development rather than bug fixes.

We embrace a hybrid work environment across our global offices, designating Mondays, Tuesdays, and Thursdays as in-office collaboration days to foster meaningful team interactions. If you are passionate about creating solutions that enhance the digital experience, join us in developing the next wave of software monitoring tools.

About the Role

As a Senior Software Engineer on Sentry’s AI/ML team, you will play a pivotal role in constructing the evaluation infrastructure that assesses the accuracy, reliability, and performance of our AI systems in real-world scenarios. This position is essential for ensuring that our debugging agents and AI-driven features operate correctly, safely, and predictably as they scale. You will design datasets, benchmarks, and test harnesses that convert vague AI behavior into quantifiable metrics, enabling the team to deploy AI solutions with confidence.

In This Role You Will

- Develop and implement robust evaluation frameworks to assess accuracy, reliability, regressions, and edge cases within AI systems.
- Generate and manage high-quality datasets, golden test cases, and benchmarks based on real production data.
- Create automated test harnesses and metrics pipelines to continuously evaluate models, prompts, and workflows.
- Collaborate closely with applied AI engineers and product leaders to establish clear definitions of success and translate them into measurable criteria.
- Oversee the evaluation lifecycle for significant AI projects, from initial experimentation to ongoing production monitoring.

You'll Love This Job If You

- Have a strong commitment to accuracy, rigor, and measurement in AI systems.
- Enjoy transforming ambiguous product objectives and model behaviors into precise tests and metrics.
- Take pleasure in building foundational infrastructure that facilitates rapid iteration and boosts team confidence.
- Thrive in collaborative environments and relish the opportunity to influence model design through effective evaluation.

Jan 28, 2026

Senior Software Engineer, AI Automations at Retell AI

Retell AI

Full-time|On-site|San Francisco Bay Area

About Retell AI
At Retell AI, we are pioneering the future of call centers through innovative voice AI technology. Our cutting-edge solutions are transforming how companies engage with customers.

In just 18 months since our inception, we've empowered thousands of businesses with our AI voice agents that efficiently manage sales, support, and logistics calls, significantly reducing the need for large teams of human agents. Supported by industry-leading investors including Y Combinator and Alt Capital, we've grown our annual recurring revenue from $5M to an impressive $36M while expanding our team from 5 to 20 talented individuals since 2025.

Our ambitious vision for 2026 is to develop a state-of-the-art customer experience platform where entire contact centers are driven by AI. Unlike basic automation requiring constant human oversight, we’re engineering intelligent AI “workers” capable of serving as frontline agents, quality assurance analysts, and managerial roles, all while optimizing customer interactions continuously.

We are rapidly expanding and seeking driven builders who thrive on solving complex technical challenges, act decisively, and wish to make a tangible impact in one of the fastest-growing voice AI startups.

Join us in shaping the future!

Recognized as a top 50 AI application in the a16z list: https://tinyurl.com/5853dt2x
Ranked #4 in Brex's Fast-Growing Software Vendors of 2025: https://www.brex.com/journal/brex-benchmark-december-2025
Featured among the top startups on: https://leanaileaderboard.com/

Jan 26, 2026

Software Engineer, Distributed Data Systems (Sora)

OpenAI

Full-time|Hybrid|San Francisco

About Our Team
Join the innovative Sora team at OpenAI, where we are at the forefront of developing multimodal capabilities for our foundation models. As a dynamic hybrid of research and product development, we focus on seamlessly integrating advanced multimodal functionalities into our AI offerings, ensuring they are not only reliable and user-friendly but also aligned with our mission to foster broad societal benefits.

About the Position
We are seeking a dedicated Software Engineer specializing in Distributed Data Systems to architect and enhance the infrastructure that supports large-scale multimodal training and evaluation at OpenAI. In this role, you will oversee distributed data pipelines and collaborate closely with our researchers to translate their requirements into robust, high-performance systems. You will play a crucial role in fortifying the pipelines that underpin Sora’s rapid innovation cycles.

We are looking for engineers with a keen eye for detail, substantial experience with distributed systems, and a proven track record of building reliable infrastructures in high-stakes environments.

This position is based in San Francisco, CA, and follows a hybrid work model requiring three days in the office each week. We also provide relocation assistance to new team members.

Key Responsibilities:
Design, build, and maintain data infrastructure systems including distributed computing, data orchestration, distributed storage, streaming infrastructure, and machine learning infrastructure, ensuring they are scalable, reliable, and secure.
Ensure our data platform can scale dramatically while maintaining high levels of reliability and efficiency.
Collaborate with researchers to deeply understand their needs and translate them into production-ready systems.
Harden, optimize, and maintain vital data infrastructure systems that drive multimodal training and evaluation.

Ideal Candidates Will Have:
Extensive experience with distributed systems and large-scale infrastructure, coupled with a strong passion for data.
A detail-oriented mindset and a commitment to building and maintaining dependable systems.
Solid software engineering fundamentals and exceptional organizational skills.
Comfort with ambiguity and rapid changes in a fast-paced environment.

About OpenAI
OpenAI is a pioneering AI research and deployment organization dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity. We strive to advance digital intelligence in a way that is safe and beneficial, pushing the boundaries of innovation and technology.

Nov 14, 2025

Staff Software Engineer, Foundation Model Serving

Databricks

Full-time|$192K/yr - $260K/yr|On-site|San Francisco, California

At Databricks, we are driven by our commitment to empower data teams in tackling the world's most challenging problems, from transforming transportation solutions to accelerating medical advancements. Our mission revolves around constructing and maintaining the world's premier data and AI infrastructure platform, enabling our clients to harness deep data insights for enhanced business outcomes.

Foundation Model Serving represents the API product designed for hosting and serving advanced AI model inference, catering to both open-source models like Llama, Qwen, and GPT OSS, as well as proprietary models such as Claude and OpenAI GPT. We welcome engineers who have experience managing high-scale operational systems, including customer-facing APIs, Edge Gateways, or ML Inference services, even if they do not have a background in ML or AI. A passion for developing LLM APIs and runtimes at scale is essential.

As a Staff Engineer, you will play a pivotal role in defining both the product experience and the underlying infrastructure. You will be tasked with designing and building systems that facilitate high-throughput, low-latency inference on GPU workloads with cutting-edge models. Your influence will extend to architectural direction, working closely with platform, product, infrastructure, and research teams to deliver an exceptional foundation model API product.

The impact you will have:
Design and implement core systems and APIs that drive Databricks Foundation Model Serving, ensuring scalability, reliability, and operational excellence.
Collaborate with product and engineering leaders to outline the technical roadmap and long-term architecture for workload serving.
Make architectural decisions to enhance performance, throughput, autoscaling, and operational efficiency for GPU serving workloads.
Contribute directly to critical components within the serving infrastructure, from systems like vLLM and SGLang to developing token-based rate limiters and optimizers, ensuring seamless and efficient operations at scale.
Work cross-functionally with product, platform, and research teams to transform customer requirements into dependable and high-performing systems.
Establish best practices for code quality, testing, and operational readiness while mentoring fellow engineers through design reviews and technical support.
Represent the team in inter-departmental technical discussions, influencing Databricks’ wider AI platform strategy.

Jan 30, 2026

Full Stack Software Engineer - Integrity Foundations

OpenAI

Full-time|On-site|San Francisco

About the Applied Foundations Team
The Applied Foundations group at OpenAI focuses on protecting our technology from adversarial threats as our platforms grow. This team builds the core infrastructure that helps prevent financial abuse, large-scale attacks, and other forms of misuse that could impact user experience or disrupt operations.

Role Overview
OpenAI aims to advance AI in a way that is safe, reliable, and aligned with society’s values. As a Full Stack Software Engineer on the Integrity Foundations team in San Francisco, you will help maintain the trustworthiness of our platforms. Your work will involve building defenses against adversarial behaviors and ensuring our ecosystem remains secure. This role collaborates closely with engineering and cross-functional partners to design and implement systems that detect and prevent abuse, enhance user safety, and reduce risks. You will help lead efforts to ensure AI is used responsibly and sustainably.

What You Will Do
Design APIs for internal workflows that interact with all layers of the stack, supporting informed data model decisions.
Build and integrate tools that enable internal teams to investigate safety, abuse, and risk issues, with a focus on strong access controls, auditability, and operational rigor.
Work with teams across the company to develop solutions that guard against adversarial attacks while maintaining a positive user experience.
Help respond to live incidents on the platform and create new infrastructure to address underlying problems.

What We Look For
Interest in building high-quality internal tools and a genuine concern for end-users.
At least 3 years of professional software engineering experience.
Ability to collaborate effectively, including with partners outside engineering.
Humility, sound judgment, and a drive for continuous improvement.

Apr 14, 2026
