Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Senior
Qualifications
Your Responsibilities
Define and guide the architecture and design of large-scale distributed systems, ensuring scalability, fault tolerance, cost efficiency, and long-term maintainability across multi-region deployments.
Take complete ownership of critical infrastructure domains, managing production services, participating in on-call rotations, and leading in-depth root cause analysis of complex cross-system incidents.
Drive operational excellence by establishing SLO frameworks, enhancing observability and resilience patterns, and automating reliability and capacity management across the platform.
Raise engineering standards through thorough design reviews, architectural alignment, performance benchmarking, and mentorship of senior and staff engineers.
Collaborate with leadership on long-term infrastructure strategy, including capacity planning, cost optimization, and providing technical input for cloud vendor negotiations.
What We Are Looking For
A minimum of 10 years of experience in designing, building, and operating large-scale distributed systems in high-availability production environments.
Strong programming skills with a proven track record in delivering production-grade backend services and infrastructure systems.
Extensive hands-on experience with Kubernetes and cloud-native architectures in GCP, AWS, or Azure, including multi-cluster and multi-region patterns.
Advanced knowledge in distributed infrastructure, including traffic management, load balancing, networking, and messaging systems such as Kafka.
About the job
The Opportunity
Join the Ads Infrastructure team at Unity, where we design and manage the foundational distributed systems that drive one of the largest real-time advertising platforms globally. Our infrastructure is integral to every aspect of Unity Ads, enabling segmentation, optimization, bidding, traffic routing, experimentation, and analytics on a worldwide scale.
We are committed to developing resilient, scalable, and cost-effective systems capable of handling immense traffic volumes across multiple regions while adhering to strict latency and availability standards. Utilizing advanced technologies including Kubernetes, Kafka, Flink, Starrocks, Valkey, and other cloud-native components, our platform supports engineers, data scientists, and product teams in advancing Unity Ads.
This senior individual contributor role will have a significant technical impact across the organization. You will be responsible for making essential architectural decisions, guiding the long-term evolution of our platform, and collaborating with senior managers and directors to shape the technical vision of Ads Infrastructure while remaining actively involved in hands-on development.
About Unity Technologies
Unity Technologies is a leading platform that enables developers to create and operate interactive, real-time 3D content. Our Ads Infrastructure team is at the forefront of technological innovation, providing robust solutions to enhance the advertising experience on a global scale.
Full-time|$250K/yr - $340K/yr|On-site|San Francisco, CA, USA
The Opportunity Join the Ads Infrastructure team at Unity, where we design and manage the foundational distributed systems that drive one of the largest real-time advertising platforms globally. Our infrastructure is integral to every aspect of Unity Ads, enabling segmentation, optimization, bidding, traffic routing, experimentation, and analytics on a worldwide scale. We are committed to developing resilient, scalable, and cost-effective systems capable of handling immense traffic volumes across multiple regions while adhering to strict latency and availability standards. Utilizing advanced technologies including Kubernetes, Kafka, Flink, Starrocks, Valkey, and other cloud-native components, our platform supports engineers, data scientists, and product teams in advancing Unity Ads. This senior individual contributor role will have a significant technical impact across the organization. You will be responsible for making essential architectural decisions, guiding the long-term evolution of our platform, and collaborating with senior managers and directors to shape the technical vision of Ads Infrastructure while remaining actively involved in hands-on development.
Full-time|$233.3K/yr - $291.6K/yr|On-site|San Francisco, CA, USA
Exciting OpportunityUnity is embarking on a transformative journey to enhance its capacity to directly assess the effectiveness of its advertising solutions—specifically measuring whether a particular ad led to an installation—without solely depending on third-party mobile measurement partners (MMPs). We are seeking a Principal Engineer for Ads Measurement to spearhead this initiative. This position entails owning the vision and implementation of Unity's self-attribution capabilities, facilitating independent ad performance measurement, cultivating reliable sources of truth for AI/ML optimization, and establishing a clear, deliberate relationship with the MMP ecosystem. This key leader will work at the nexus of advertising, data science, platform strategy, and developer trust, influencing how Unity measures, interprets, and enhances advertising outcomes in an intricate and dynamic measurement environment.Key ResponsibilitiesArchitect Unity’s Measurement Infrastructure: Design and execute a scalable technical framework essential for independent measurement of install attribution and ad effectiveness.Enhance Self-Attribution Capabilities: Lead the engineering initiatives to create internal attribution signals and ensure reliable, privacy-conscious install measurement across various platforms.Drive AI/ML Optimization: Work closely with data science and machine learning teams to guarantee that measurement data is of high quality and optimally structured for model training.Establish Technical Strategy for MMP Ecosystem: Define technical integration standards for Unity as a self-attributing network, ensuring secure and consistent data exchanges within the broader ecosystem.Foster Cross-Functional Technical Leadership: Ensure alignment across Ads, Data, Engineering, and Privacy teams, making measurement a robust strategic asset.
Full-time|$400K/yr - $450K/yr|On-site|San Francisco Bay Area
Join Discord, a platform that connects over 200 million users every month primarily through gaming. With over 90% of our users engaged in gaming activities, we facilitate over 1.5 billion hours of gaming conversations, enhancing the experience before, during, and after gameplay.The Infrastructure organization at Discord is fundamental to our user experience. We handle the real-time delivery of over 40 million events per second and manage the storage of trillions of messages, ensuring robust connections among our vast user base. As a Principal Engineer, you will play a pivotal role in guiding our infrastructure teams, shaping our technical vision, and maintaining the reliability of Discord at a massive scale.This position is ideal for a professional who excels at the intersection of advanced technical skills and organizational leadership. You will contribute to our infrastructure roadmap, address our most challenging technical dilemmas, and ensure our systems can efficiently scale to accommodate the next wave of users.
Full-time|$2K/yr - $2K/yr|On-site|San Francisco, CA
Role Overview Nextdata is hiring a Lead Principal Infrastructure Engineer in San Francisco, CA. This position focuses on building the foundation for a decentralized data mesh platform, supporting data ownership and enabling AI, machine learning, and analytics at scale. What You Will Do Develop automation solutions for provisioning and managing the Nextdata OS across multiple cloud platforms. Work closely with the founding engineering team to design and implement a secure, self-service infrastructure for future data product developers. Own the architecture and deployment of the OS, using infrastructure-as-code to ensure high code quality and scalability. Engage directly with customers to understand their requirements and translate feedback into technical improvements. Collaborate with product teams to align infrastructure capabilities with business needs. What You Bring Expertise in large-scale distributed systems and data infrastructure. Experience designing, deploying, and maintaining cloud-based platforms. Strong background in infrastructure-as-code and automation. Ability to work collaboratively with engineering and product teams. Comfort engaging with customers to gather feedback and requirements.
Full-time|$190.8K/yr - $267.1K/yr|On-site|San Francisco, CA
Join Reddit's dynamic Brand Ad Formats team as an Android Software Engineer, where you'll be instrumental in shaping innovative ad experiences for millions of users. Your role will involve developing impactful ad formats, enhancing the performance of Reddit’s Android app, and collaborating with cross-functional teams to tackle complex challenges. We are seeking a product-focused engineer who is eager to push the boundaries of advertising technology and elevate user engagement through creativity and technical expertise.
About UsAt Sierra, we are revolutionizing the way businesses engage with their customers by building a cutting-edge platform that harnesses the power of AI. Our headquarters is located in the vibrant city of San Francisco, with additional offices expanding in Atlanta, New York, London, France, Singapore, and Japan.Our company culture is deeply rooted in our core values: Trust, Customer Obsession, Craftsmanship, Intensity, and Family. These principles guide our actions and foster an environment where innovation thrives.Sierra was co-founded by visionary leaders Bret Taylor, who currently serves as the Board Chair of OpenAI and has a rich history with Salesforce and Facebook, and Clay Bavor, who previously led Google Labs and spearheaded initiatives like Google Lens and Project Starline.Your RoleAs a Software Engineer focusing on Infrastructure at Sierra, you will play a pivotal role in designing, constructing, and maintaining the foundational systems that empower our AI platform. Your expertise will ensure that our infrastructure is not only secure and reliable but also scalable, allowing product teams to execute their work with agility and confidence.Guarantee the reliability, scalability, and performance of our platform and LLM inference serving in response to increasing traffic demands.Develop and oversee cloud infrastructure using Terraform to create secure, scalable, and reproducible environments.Establish and manage a self-service infrastructure platform to empower engineering teams in deploying and operating services independently.Take ownership of and improve CI/CD pipelines and release management processes, facilitating rapid and reliable deployments across Sierra’s platform.Design and manage distributed systems utilizing distributed databases, retrieval systems, and machine learning models.Develop and sustain core data serving abstractions along with essential authentication and security features (SSO, RBAC, authentication controls).Effectively navigate and integrate our technology stack with enterprise customer environments in a scalable and maintainable manner.
At Exa, we are on a mission to create a cutting-edge search engine from the ground up, designed to cater to the diverse needs of AI applications. Our team is building a robust infrastructure that enables us to crawl the internet, train advanced embedding models for indexing, and develop high-performance vector databases using Rust. Additionally, we manage a significant $5M H200 GPU cluster that powers tens of thousands of machines.The Infrastructure Team at Exa is responsible for developing the essential tools and infrastructure that support our entire system. We are looking for talented infrastructure engineers to help us scale our capabilities rapidly. Your work could involve orchestrating GPU clusters with Kubernetes, implementing map-reduce batch jobs on Ray, or creating top-tier observability tools that set industry standards.
Full-time|Remote|San Francisco, CA, US; Remote, US
Join Pinterest as a Staff Software Engineer focused on Ads Measurement Products. In this role, you will be at the forefront of developing innovative solutions that enhance our advertising strategies and improve our measurement capabilities. Your expertise will directly impact how we analyze and optimize campaigns, ensuring that our advertisers can achieve their goals effectively.
Join Cloudflare as a Principal Software Engineer specializing in Resiliency, where you will play a pivotal role in enhancing our systems' robustness and availability. Your expertise will contribute to building and maintaining resilient infrastructure that supports our global network, ensuring our customers receive uninterrupted service.In this role, you will work alongside a talented team of engineers to identify vulnerabilities, implement solutions, and innovate new strategies that enhance system performance and reliability. If you are passionate about software engineering and system resiliency, we invite you to apply!
Full-time|$245K/yr - $290K/yr|On-site|San Francisco, CA
Redpanda Data is building the Agentic Data Plane (ADP), a platform that connects AI agents with enterprise data and systems. The ADP supports real-time, autonomous reasoning and action by agentic applications, powered by Redpanda's multi-modal data streaming engine. Major organizations across industries, including Activision Blizzard, Cisco, Moody's, Texas Instruments, Vodafone, and two of the top five U.S. banks, rely on Redpanda to process hundreds of terabytes of data every day. Backed by investors such as Lightspeed, GV, and Haystack VC, Redpanda operates as a globally distributed, people-first company. Role overview The Principal Software Engineer will architect and develop the Agentic Data Plane, which serves as the control and execution layer for AI agents interacting with enterprise data. This system enables agents to access, analyze, and act on data in real time, while providing human operators with oversight and control for secure operations. The ADP brings together Redpanda's low-latency streaming technology, a distributed query engine for real-time context, a library of over 300 data connectors, and a global policy and observability framework. This framework enforces access controls, records agent actions, and supports replayable audits. What you will do Design and build the core architecture of the Agentic Data Plane, focusing on secure and efficient data interaction for AI agents. Integrate streaming, query, and policy enforcement components to support real-time, autonomous agent operations. Monitor developments in the agentic AI field and translate research into engineering proposals and product strategies. Work closely with Engineering, Product, and Go-To-Market teams, as well as key customers, to shape the direction of the ADP.
Who We AreServal is an innovative AI-driven automation platform redefining operational efficiency for enterprises. Our intelligent agents seamlessly comprehend and execute real-world workflows, replacing outdated manual processes with adaptive, self-learning software. Since our inception in early 2024, we have garnered the trust of industry leaders such as General Motors, Notion, Perplexity, Vercel, Mercor, LangChain, and Verkada, streamlining high-volume operational tasks across their organizations.At the heart of Serval is a cutting-edge agentic AI platform that transforms natural language into actionable workflows. Our agents not only respond to queries but also reason, act across various systems, and continuously enhance their performance. What started as a solution for operational tasks has rapidly expanded into a versatile AI automation layer utilized across IT, HR, Finance, Security, Legal, and Engineering sectors.Our mission is to eradicate repetitive, manual tasks within enterprises, empowering teams through intelligent automation. In the long run, we aim to establish a universal AI operations layer—a system of agents that integrates across business functions, maintaining the momentum of modern companies.We are proud to be backed by renowned investors including Sequoia Capital, Redpoint Ventures, Meritech, First Round, General Catalyst, and Elad Gil, and founded by seasoned product and engineering leaders from Verkada.Role OverviewAs a Senior Software Engineer in Infrastructure at Serval, you will be pivotal in developing and scaling the core systems that empower our AI agents and workflow automation platform. A crucial aspect of this role involves enabling and supporting self-hosted deployments for enterprise clients needing on-premises or private cloud environments. We are looking for engineers with profound expertise in distributed systems, infrastructure-as-code, production operations, and customer-facing support, who aspire to influence the technical architecture of a rapidly evolving platform.What You'll DoDesign, implement, and operate large-scale distributed systems that power Serval's AI agents, workflow orchestration, and data pipelines.Create and maintain Terraform modules to provision and manage cloud infrastructure across AWS, GCP, or Azure environments.Develop and sustain deployment packages, installation scripts, and infrastructure templates, enabling customers to self-host Serval in their own environments.Provide technical support and guidance to enterprise customers during installation and deployment phases.
Join Cloudflare as a Principal Software Engineer specializing in billing systems, where you will play a pivotal role in shaping our payment and invoicing solutions. You will collaborate with cross-functional teams to implement innovative solutions that enhance user experiences and streamline processes. If you're passionate about building scalable software and want to contribute to a fast-paced, innovative environment, we want to hear from you!
About UsAt Imprint, we are revolutionizing the world of co-branded credit cards and innovative financial solutions, focusing on smarter, more rewarding, and brand-first experiences. We collaborate with renowned brands such as Crate & Barrel, Rakuten, Booking.com, H-E-B, Fetch, and Brooks Brothers to establish modern credit programs that enhance customer loyalty, unlock savings, and stimulate growth. Our robust platform integrates advanced payment technologies, intelligent underwriting, and a seamless user experience, enabling brands to offer impactful financial products without the complexities of becoming a bank.Co-branded credit cards represent over $300 billion in U.S. annual spending, yet many are still managed by outdated banking systems. Imprint stands as the modern alternative—flexible, technology-driven, and tailored for today’s consumers. Supported by notable investors like Kleiner Perkins, Thrive Capital, and Khosla Ventures, we are assembling a world-class team dedicated to reshaping payment methods and driving brand growth. If you thrive in fast-paced environments, enjoy tackling complex challenges, and aspire to make a significant impact, we would be delighted to meet you.Discover more about us on Imprint's Technology Blog.The TeamThe Tech Platform Engineering Team at Imprint is pioneering the democratization of access to advanced technologies, empowering teams across our organization to innovate and excel. Our commitment to redefining the Fintech landscape drives us to build secure, highly available infrastructures while equipping our engineers with comprehensive development tools, allowing them to rapidly create world-class products.Your RoleDesign, build, and manage cloud and web infrastructure with a strong emphasis on security, reliability, and scalability.Implement and maintain infrastructure components across computing, networking, and data platforms.Adhere to security best practices in cloud infrastructure, ensuring proper access control, network isolation, and secure communication between services.Monitor system health and engage in incident response, root cause analysis, and reliability enhancements.Collaborate with platform, security, and product engineers to deliver safe and efficient infrastructure solutions.
About the RoleJoin our pioneering team at vooma as a Backend & Infrastructure Software Engineer, where you will play a critical role in shaping the technical infrastructure of a transformative company.If you are passionate about creating not only resilient systems but also the foundational architecture of a groundbreaking enterprise from the outset, this position is ideal for you.We are looking for someone who excels at crafting infrastructure that is elegant, dependable, and secure, even under high-demand scenarios. You thrive on the challenge of scaling systems that enable intelligent agents and take pride in establishing reliable foundations that others can rely on.Your Key Responsibilities Include:Design and maintain secure, scalable infrastructure tailored for AI-powered agents in production environments.Deploy and optimize AI-driven services to meet high availability and performance standards.Manage infrastructure as code, alongside cloud environments and CI/CD pipelines.Implement monitoring, observability, and alerting systems to ensure the reliability of our infrastructure.Contribute to infrastructure security and adhere to best practices.You Should Have:Experience in deploying and productionizing machine learning or AI-centric workloads.Proficiency in developing secure, scalable infrastructures on platforms such as AWS, Azure, or GCP.In-depth knowledge of backend systems, networking, and container orchestration technologies (e.g., Kubernetes).Understanding of infrastructure security principles and compliance standards (e.g., SOC2).A proactive and hands-on mindset, with a strong drive to solve challenges from start to finish.
Full-time|$300K/yr - $300K/yr|On-site|San Francisco
ABOUT BASETENJoin Baseten, where we drive mission-critical AI inference for leading companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. Our unique blend of applied AI research, robust infrastructure, and intuitive developer tools empowers organizations at the forefront of AI innovation to deploy state-of-the-art models into production. Recently, we secured a $300M Series E funding round, backed by esteemed investors such as BOND, IVP, Spark Capital, Greylock, and Conviction. Be a part of our rapid growth and help shape the platform that engineers trust for launching AI products.THE ROLEAs an Infrastructure Software Engineer at Baseten, you will play a pivotal role in developing and maintaining our ML inference platform that powers AI applications in production. Your contributions will enhance the core infrastructure, enabling developers to deploy, scale, and monitor machine learning models with exceptional performance.EXAMPLE INITIATIVESYou will engage in innovative projects within our Infrastructure team, including:Multi-cloud capacity managementInference on B200 GPUsMulti-node inferenceFractional H100 GPUs for efficient model servingRESPONSIBILITIESDesign and develop infrastructure components for our ML inference platform, primarily using Python and Go.Implement and maintain Kubernetes deployments for optimal model serving.Contribute to the orchestration layer for model deployments.Build and enhance monitoring systems to track model performance metrics effectively.Develop efficient resource management solutions to optimize performance.
Full-time|$150K/yr - $200K/yr|On-site|San Francisco, CA
At Sift, we are revolutionizing the way cutting-edge machines are constructed, tested, and managed. Our innovative platform provides engineers with real-time visibility into high-frequency telemetry, effectively removing bottlenecks and facilitating quicker, more dependable development.Sift originated from our experience at SpaceX, contributing to projects like Dragon, Falcon, Starlink, and Starship, where the demands of scaling telemetry, debugging flight systems, and ensuring mission reliability necessitated a new kind of infrastructure. Founded by a talented team from SpaceX, Google, and Palantir, Sift is tailored for mission-critical systems where precision and scalability are imperative.As one of the pioneering engineers at Sift, your role will extend beyond just coding—you will play a crucial part in defining the architecture, shaping the product, and influencing the culture of a company dedicated to addressing real engineering challenges. If you're eager to take on intricate technical obstacles and build foundational systems that support complex machines from the ground up, we would love to connect with you.
Principal Software Engineer Saviynt offers an AI-driven identity platform that effectively manages and governs access permissions for both human and non-human entities across all organizational applications, data, and processes. Our clients rely on Saviynt to protect their digital assets, enhance operational efficiency, and lower compliance expenses. Designed for the age of AI, Saviynt is at the forefront of helping organizations safely advance their AI deployments and utilization. As a recognized leader in identity security, we provide solutions that empower and protect some of the world’s leading brands, Fortune 500 companies, and government institutions. For more details, please visit www.saviynt.com. Role Summary In this pivotal role, you will provide technical leadership and extensive knowledge in complex engineering domains, guiding architectural decisions while ensuring scalability, reliability, and quality across key platform components. As a Principal Engineer, you will act as a technical authority, mentor senior engineers, and tackle the most intricate technical challenges. The Connectors team plays a crucial role in facilitating seamless integrations between Saviynt's Identity Governance platform and a multitude of enterprise applications by developing and maintaining robust, scalable connector frameworks. We are committed to ensuring reliable data synchronization, provisioning, and lifecycle management across diverse external systems, forming a vital foundation for the entire platform. What You Will Be Doing ● Design and architect scalable, high-performance connector frameworks for enterprise application integrations.● Define technical standards, best practices, and design patterns for connector development.● Drive architectural decisions for complex integration scenarios involving over 200 enterprise applications.● Evaluate and recommend new technologies, tools, and frameworks to enhance connector reliability and performance.● Lead technical design reviews and provide guidance on system architecture and design trade-offs.
Join Ivo's Engineering Team!At Ivo, we are pioneers in the tech industry. Our engineers are innovators who have created groundbreaking solutions such as:• An AI agent that seamlessly integrates with MS Word to enhance document editing [2023]• Revolutionizing embedding models with agentic RAG technology [2023]• Advanced LLM-based legal fact extraction capabilities [2024]• A legal assistant designed to search extensive contract databases without compromising accuracy [2024]• Clustering legal documents from the same lineage [2025]• Automatic deviation analysis to uncover hidden risks in vast contract databases [2025]• Merging contracts with their amendments to create a “composite” contract timeline that has moved our clients to tears [2025]Role OverviewAs an Infrastructure Engineer at Ivo, you will lay the groundwork for our platform's future. Your responsibilities will include:• Designing and owning the future of our infrastructure, allowing you the freedom to innovate.• Managing multiple customer deployments, ensuring each receives tailored containers, databases, and VPCs.• Instrumenting our systems to identify performance bottlenecks and errors.• Aggregating metrics and logs into visually appealing dashboards and setting up pager alerts.• Leading infrastructure-related incidents and being on-call as necessary.• Enhancing our CI/CD system to reduce deployment time from ~12 minutes.If you're passionate about LLMs, you'll thrive in our engineering team, where you’ll have the opportunity to:• Develop real-time LLM evaluations to monitor the accuracy of our responses.• Collaborate with talented engineers to push the boundaries of DevOps.
Astranis is seeking a talented and motivated Software Engineer to join our Infrastructure team. In this role, you will be at the forefront of developing and maintaining critical software systems that support our innovative satellite technology. You'll collaborate with cross-functional teams to design, implement, and optimize our infrastructure solutions, ensuring high reliability and performance.
Join Cloudflare as a Principal Software Engineer, where you will play a pivotal role in designing and implementing innovative software solutions. You will collaborate with cross-functional teams to enhance our platform's scalability, security, and performance, making a significant impact on our global user base.
Full-time|$250K/yr - $340K/yr|On-site|San Francisco, CA, USA
The Opportunity Join the Ads Infrastructure team at Unity, where we design and manage the foundational distributed systems that drive one of the largest real-time advertising platforms globally. Our infrastructure is integral to every aspect of Unity Ads, enabling segmentation, optimization, bidding, traffic routing, experimentation, and analytics on a worldwide scale. We are committed to developing resilient, scalable, and cost-effective systems capable of handling immense traffic volumes across multiple regions while adhering to strict latency and availability standards. Utilizing advanced technologies including Kubernetes, Kafka, Flink, Starrocks, Valkey, and other cloud-native components, our platform supports engineers, data scientists, and product teams in advancing Unity Ads. This senior individual contributor role will have a significant technical impact across the organization. You will be responsible for making essential architectural decisions, guiding the long-term evolution of our platform, and collaborating with senior managers and directors to shape the technical vision of Ads Infrastructure while remaining actively involved in hands-on development.
Full-time|$233.3K/yr - $291.6K/yr|On-site|San Francisco, CA, USA
Exciting OpportunityUnity is embarking on a transformative journey to enhance its capacity to directly assess the effectiveness of its advertising solutions—specifically measuring whether a particular ad led to an installation—without solely depending on third-party mobile measurement partners (MMPs). We are seeking a Principal Engineer for Ads Measurement to spearhead this initiative. This position entails owning the vision and implementation of Unity's self-attribution capabilities, facilitating independent ad performance measurement, cultivating reliable sources of truth for AI/ML optimization, and establishing a clear, deliberate relationship with the MMP ecosystem. This key leader will work at the nexus of advertising, data science, platform strategy, and developer trust, influencing how Unity measures, interprets, and enhances advertising outcomes in an intricate and dynamic measurement environment.Key ResponsibilitiesArchitect Unity’s Measurement Infrastructure: Design and execute a scalable technical framework essential for independent measurement of install attribution and ad effectiveness.Enhance Self-Attribution Capabilities: Lead the engineering initiatives to create internal attribution signals and ensure reliable, privacy-conscious install measurement across various platforms.Drive AI/ML Optimization: Work closely with data science and machine learning teams to guarantee that measurement data is of high quality and optimally structured for model training.Establish Technical Strategy for MMP Ecosystem: Define technical integration standards for Unity as a self-attributing network, ensuring secure and consistent data exchanges within the broader ecosystem.Foster Cross-Functional Technical Leadership: Ensure alignment across Ads, Data, Engineering, and Privacy teams, making measurement a robust strategic asset.
Full-time|$400K/yr - $450K/yr|On-site|San Francisco Bay Area
Join Discord, a platform that connects over 200 million users every month primarily through gaming. With over 90% of our users engaged in gaming activities, we facilitate over 1.5 billion hours of gaming conversations, enhancing the experience before, during, and after gameplay.The Infrastructure organization at Discord is fundamental to our user experience. We handle the real-time delivery of over 40 million events per second and manage the storage of trillions of messages, ensuring robust connections among our vast user base. As a Principal Engineer, you will play a pivotal role in guiding our infrastructure teams, shaping our technical vision, and maintaining the reliability of Discord at a massive scale.This position is ideal for a professional who excels at the intersection of advanced technical skills and organizational leadership. You will contribute to our infrastructure roadmap, address our most challenging technical dilemmas, and ensure our systems can efficiently scale to accommodate the next wave of users.
Full-time|$2K/yr - $2K/yr|On-site|San Francisco, CA
Role Overview Nextdata is hiring a Lead Principal Infrastructure Engineer in San Francisco, CA. This position focuses on building the foundation for a decentralized data mesh platform, supporting data ownership and enabling AI, machine learning, and analytics at scale. What You Will Do Develop automation solutions for provisioning and managing the Nextdata OS across multiple cloud platforms. Work closely with the founding engineering team to design and implement a secure, self-service infrastructure for future data product developers. Own the architecture and deployment of the OS, using infrastructure-as-code to ensure high code quality and scalability. Engage directly with customers to understand their requirements and translate feedback into technical improvements. Collaborate with product teams to align infrastructure capabilities with business needs. What You Bring Expertise in large-scale distributed systems and data infrastructure. Experience designing, deploying, and maintaining cloud-based platforms. Strong background in infrastructure-as-code and automation. Ability to work collaboratively with engineering and product teams. Comfort engaging with customers to gather feedback and requirements.
Full-time|$190.8K/yr - $267.1K/yr|On-site|San Francisco, CA
Join Reddit's dynamic Brand Ad Formats team as an Android Software Engineer, where you'll be instrumental in shaping innovative ad experiences for millions of users. Your role will involve developing impactful ad formats, enhancing the performance of Reddit’s Android app, and collaborating with cross-functional teams to tackle complex challenges. We are seeking a product-focused engineer who is eager to push the boundaries of advertising technology and elevate user engagement through creativity and technical expertise.
About UsAt Sierra, we are revolutionizing the way businesses engage with their customers by building a cutting-edge platform that harnesses the power of AI. Our headquarters is located in the vibrant city of San Francisco, with additional offices expanding in Atlanta, New York, London, France, Singapore, and Japan.Our company culture is deeply rooted in our core values: Trust, Customer Obsession, Craftsmanship, Intensity, and Family. These principles guide our actions and foster an environment where innovation thrives.Sierra was co-founded by visionary leaders Bret Taylor, who currently serves as the Board Chair of OpenAI and has a rich history with Salesforce and Facebook, and Clay Bavor, who previously led Google Labs and spearheaded initiatives like Google Lens and Project Starline.Your RoleAs a Software Engineer focusing on Infrastructure at Sierra, you will play a pivotal role in designing, constructing, and maintaining the foundational systems that empower our AI platform. Your expertise will ensure that our infrastructure is not only secure and reliable but also scalable, allowing product teams to execute their work with agility and confidence.Guarantee the reliability, scalability, and performance of our platform and LLM inference serving in response to increasing traffic demands.Develop and oversee cloud infrastructure using Terraform to create secure, scalable, and reproducible environments.Establish and manage a self-service infrastructure platform to empower engineering teams in deploying and operating services independently.Take ownership of and improve CI/CD pipelines and release management processes, facilitating rapid and reliable deployments across Sierra’s platform.Design and manage distributed systems utilizing distributed databases, retrieval systems, and machine learning models.Develop and sustain core data serving abstractions along with essential authentication and security features (SSO, RBAC, authentication controls).Effectively navigate and integrate our technology stack with enterprise customer environments in a scalable and maintainable manner.
At Exa, we are on a mission to create a cutting-edge search engine from the ground up, designed to cater to the diverse needs of AI applications. Our team is building a robust infrastructure that enables us to crawl the internet, train advanced embedding models for indexing, and develop high-performance vector databases using Rust. Additionally, we manage a significant $5M H200 GPU cluster that powers tens of thousands of machines.The Infrastructure Team at Exa is responsible for developing the essential tools and infrastructure that support our entire system. We are looking for talented infrastructure engineers to help us scale our capabilities rapidly. Your work could involve orchestrating GPU clusters with Kubernetes, implementing map-reduce batch jobs on Ray, or creating top-tier observability tools that set industry standards.
Full-time|Remote|San Francisco, CA, US; Remote, US
Join Pinterest as a Staff Software Engineer focused on Ads Measurement Products. In this role, you will be at the forefront of developing innovative solutions that enhance our advertising strategies and improve our measurement capabilities. Your expertise will directly impact how we analyze and optimize campaigns, ensuring that our advertisers can achieve their goals effectively.
Join Cloudflare as a Principal Software Engineer specializing in Resiliency, where you will play a pivotal role in enhancing our systems' robustness and availability. Your expertise will contribute to building and maintaining resilient infrastructure that supports our global network, ensuring our customers receive uninterrupted service.In this role, you will work alongside a talented team of engineers to identify vulnerabilities, implement solutions, and innovate new strategies that enhance system performance and reliability. If you are passionate about software engineering and system resiliency, we invite you to apply!
Full-time|$245K/yr - $290K/yr|On-site|San Francisco, CA
Redpanda Data is building the Agentic Data Plane (ADP), a platform that connects AI agents with enterprise data and systems. The ADP supports real-time, autonomous reasoning and action by agentic applications, powered by Redpanda's multi-modal data streaming engine. Major organizations across industries, including Activision Blizzard, Cisco, Moody's, Texas Instruments, Vodafone, and two of the top five U.S. banks, rely on Redpanda to process hundreds of terabytes of data every day. Backed by investors such as Lightspeed, GV, and Haystack VC, Redpanda operates as a globally distributed, people-first company. Role overview The Principal Software Engineer will architect and develop the Agentic Data Plane, which serves as the control and execution layer for AI agents interacting with enterprise data. This system enables agents to access, analyze, and act on data in real time, while providing human operators with oversight and control for secure operations. The ADP brings together Redpanda's low-latency streaming technology, a distributed query engine for real-time context, a library of over 300 data connectors, and a global policy and observability framework. This framework enforces access controls, records agent actions, and supports replayable audits. What you will do Design and build the core architecture of the Agentic Data Plane, focusing on secure and efficient data interaction for AI agents. Integrate streaming, query, and policy enforcement components to support real-time, autonomous agent operations. Monitor developments in the agentic AI field and translate research into engineering proposals and product strategies. Work closely with Engineering, Product, and Go-To-Market teams, as well as key customers, to shape the direction of the ADP.
Who We AreServal is an innovative AI-driven automation platform redefining operational efficiency for enterprises. Our intelligent agents seamlessly comprehend and execute real-world workflows, replacing outdated manual processes with adaptive, self-learning software. Since our inception in early 2024, we have garnered the trust of industry leaders such as General Motors, Notion, Perplexity, Vercel, Mercor, LangChain, and Verkada, streamlining high-volume operational tasks across their organizations.At the heart of Serval is a cutting-edge agentic AI platform that transforms natural language into actionable workflows. Our agents not only respond to queries but also reason, act across various systems, and continuously enhance their performance. What started as a solution for operational tasks has rapidly expanded into a versatile AI automation layer utilized across IT, HR, Finance, Security, Legal, and Engineering sectors.Our mission is to eradicate repetitive, manual tasks within enterprises, empowering teams through intelligent automation. In the long run, we aim to establish a universal AI operations layer—a system of agents that integrates across business functions, maintaining the momentum of modern companies.We are proud to be backed by renowned investors including Sequoia Capital, Redpoint Ventures, Meritech, First Round, General Catalyst, and Elad Gil, and founded by seasoned product and engineering leaders from Verkada.Role OverviewAs a Senior Software Engineer in Infrastructure at Serval, you will be pivotal in developing and scaling the core systems that empower our AI agents and workflow automation platform. A crucial aspect of this role involves enabling and supporting self-hosted deployments for enterprise clients needing on-premises or private cloud environments. We are looking for engineers with profound expertise in distributed systems, infrastructure-as-code, production operations, and customer-facing support, who aspire to influence the technical architecture of a rapidly evolving platform.What You'll DoDesign, implement, and operate large-scale distributed systems that power Serval's AI agents, workflow orchestration, and data pipelines.Create and maintain Terraform modules to provision and manage cloud infrastructure across AWS, GCP, or Azure environments.Develop and sustain deployment packages, installation scripts, and infrastructure templates, enabling customers to self-host Serval in their own environments.Provide technical support and guidance to enterprise customers during installation and deployment phases.
Join Cloudflare as a Principal Software Engineer specializing in billing systems, where you will play a pivotal role in shaping our payment and invoicing solutions. You will collaborate with cross-functional teams to implement innovative solutions that enhance user experiences and streamline processes. If you're passionate about building scalable software and want to contribute to a fast-paced, innovative environment, we want to hear from you!
About UsAt Imprint, we are revolutionizing the world of co-branded credit cards and innovative financial solutions, focusing on smarter, more rewarding, and brand-first experiences. We collaborate with renowned brands such as Crate & Barrel, Rakuten, Booking.com, H-E-B, Fetch, and Brooks Brothers to establish modern credit programs that enhance customer loyalty, unlock savings, and stimulate growth. Our robust platform integrates advanced payment technologies, intelligent underwriting, and a seamless user experience, enabling brands to offer impactful financial products without the complexities of becoming a bank.Co-branded credit cards represent over $300 billion in U.S. annual spending, yet many are still managed by outdated banking systems. Imprint stands as the modern alternative—flexible, technology-driven, and tailored for today’s consumers. Supported by notable investors like Kleiner Perkins, Thrive Capital, and Khosla Ventures, we are assembling a world-class team dedicated to reshaping payment methods and driving brand growth. If you thrive in fast-paced environments, enjoy tackling complex challenges, and aspire to make a significant impact, we would be delighted to meet you.Discover more about us on Imprint's Technology Blog.The TeamThe Tech Platform Engineering Team at Imprint is pioneering the democratization of access to advanced technologies, empowering teams across our organization to innovate and excel. Our commitment to redefining the Fintech landscape drives us to build secure, highly available infrastructures while equipping our engineers with comprehensive development tools, allowing them to rapidly create world-class products.Your RoleDesign, build, and manage cloud and web infrastructure with a strong emphasis on security, reliability, and scalability.Implement and maintain infrastructure components across computing, networking, and data platforms.Adhere to security best practices in cloud infrastructure, ensuring proper access control, network isolation, and secure communication between services.Monitor system health and engage in incident response, root cause analysis, and reliability enhancements.Collaborate with platform, security, and product engineers to deliver safe and efficient infrastructure solutions.
About the RoleJoin our pioneering team at vooma as a Backend & Infrastructure Software Engineer, where you will play a critical role in shaping the technical infrastructure of a transformative company.If you are passionate about creating not only resilient systems but also the foundational architecture of a groundbreaking enterprise from the outset, this position is ideal for you.We are looking for someone who excels at crafting infrastructure that is elegant, dependable, and secure, even under high-demand scenarios. You thrive on the challenge of scaling systems that enable intelligent agents and take pride in establishing reliable foundations that others can rely on.Your Key Responsibilities Include:Design and maintain secure, scalable infrastructure tailored for AI-powered agents in production environments.Deploy and optimize AI-driven services to meet high availability and performance standards.Manage infrastructure as code, alongside cloud environments and CI/CD pipelines.Implement monitoring, observability, and alerting systems to ensure the reliability of our infrastructure.Contribute to infrastructure security and adhere to best practices.You Should Have:Experience in deploying and productionizing machine learning or AI-centric workloads.Proficiency in developing secure, scalable infrastructures on platforms such as AWS, Azure, or GCP.In-depth knowledge of backend systems, networking, and container orchestration technologies (e.g., Kubernetes).Understanding of infrastructure security principles and compliance standards (e.g., SOC2).A proactive and hands-on mindset, with a strong drive to solve challenges from start to finish.
Full-time|$300K/yr - $300K/yr|On-site|San Francisco
ABOUT BASETENJoin Baseten, where we drive mission-critical AI inference for leading companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. Our unique blend of applied AI research, robust infrastructure, and intuitive developer tools empowers organizations at the forefront of AI innovation to deploy state-of-the-art models into production. Recently, we secured a $300M Series E funding round, backed by esteemed investors such as BOND, IVP, Spark Capital, Greylock, and Conviction. Be a part of our rapid growth and help shape the platform that engineers trust for launching AI products.THE ROLEAs an Infrastructure Software Engineer at Baseten, you will play a pivotal role in developing and maintaining our ML inference platform that powers AI applications in production. Your contributions will enhance the core infrastructure, enabling developers to deploy, scale, and monitor machine learning models with exceptional performance.EXAMPLE INITIATIVESYou will engage in innovative projects within our Infrastructure team, including:Multi-cloud capacity managementInference on B200 GPUsMulti-node inferenceFractional H100 GPUs for efficient model servingRESPONSIBILITIESDesign and develop infrastructure components for our ML inference platform, primarily using Python and Go.Implement and maintain Kubernetes deployments for optimal model serving.Contribute to the orchestration layer for model deployments.Build and enhance monitoring systems to track model performance metrics effectively.Develop efficient resource management solutions to optimize performance.
Full-time|$150K/yr - $200K/yr|On-site|San Francisco, CA
At Sift, we are revolutionizing the way cutting-edge machines are constructed, tested, and managed. Our innovative platform provides engineers with real-time visibility into high-frequency telemetry, effectively removing bottlenecks and facilitating quicker, more dependable development.Sift originated from our experience at SpaceX, contributing to projects like Dragon, Falcon, Starlink, and Starship, where the demands of scaling telemetry, debugging flight systems, and ensuring mission reliability necessitated a new kind of infrastructure. Founded by a talented team from SpaceX, Google, and Palantir, Sift is tailored for mission-critical systems where precision and scalability are imperative.As one of the pioneering engineers at Sift, your role will extend beyond just coding—you will play a crucial part in defining the architecture, shaping the product, and influencing the culture of a company dedicated to addressing real engineering challenges. If you're eager to take on intricate technical obstacles and build foundational systems that support complex machines from the ground up, we would love to connect with you.
Principal Software Engineer Saviynt offers an AI-driven identity platform that effectively manages and governs access permissions for both human and non-human entities across all organizational applications, data, and processes. Our clients rely on Saviynt to protect their digital assets, enhance operational efficiency, and lower compliance expenses. Designed for the age of AI, Saviynt is at the forefront of helping organizations safely advance their AI deployments and utilization. As a recognized leader in identity security, we provide solutions that empower and protect some of the world’s leading brands, Fortune 500 companies, and government institutions. For more details, please visit www.saviynt.com. Role Summary In this pivotal role, you will provide technical leadership and extensive knowledge in complex engineering domains, guiding architectural decisions while ensuring scalability, reliability, and quality across key platform components. As a Principal Engineer, you will act as a technical authority, mentor senior engineers, and tackle the most intricate technical challenges. The Connectors team plays a crucial role in facilitating seamless integrations between Saviynt's Identity Governance platform and a multitude of enterprise applications by developing and maintaining robust, scalable connector frameworks. We are committed to ensuring reliable data synchronization, provisioning, and lifecycle management across diverse external systems, forming a vital foundation for the entire platform. What You Will Be Doing ● Design and architect scalable, high-performance connector frameworks for enterprise application integrations.● Define technical standards, best practices, and design patterns for connector development.● Drive architectural decisions for complex integration scenarios involving over 200 enterprise applications.● Evaluate and recommend new technologies, tools, and frameworks to enhance connector reliability and performance.● Lead technical design reviews and provide guidance on system architecture and design trade-offs.
Join Ivo's Engineering Team!At Ivo, we are pioneers in the tech industry. Our engineers are innovators who have created groundbreaking solutions such as:• An AI agent that seamlessly integrates with MS Word to enhance document editing [2023]• Revolutionizing embedding models with agentic RAG technology [2023]• Advanced LLM-based legal fact extraction capabilities [2024]• A legal assistant designed to search extensive contract databases without compromising accuracy [2024]• Clustering legal documents from the same lineage [2025]• Automatic deviation analysis to uncover hidden risks in vast contract databases [2025]• Merging contracts with their amendments to create a “composite” contract timeline that has moved our clients to tears [2025]Role OverviewAs an Infrastructure Engineer at Ivo, you will lay the groundwork for our platform's future. Your responsibilities will include:• Designing and owning the future of our infrastructure, allowing you the freedom to innovate.• Managing multiple customer deployments, ensuring each receives tailored containers, databases, and VPCs.• Instrumenting our systems to identify performance bottlenecks and errors.• Aggregating metrics and logs into visually appealing dashboards and setting up pager alerts.• Leading infrastructure-related incidents and being on-call as necessary.• Enhancing our CI/CD system to reduce deployment time from ~12 minutes.If you're passionate about LLMs, you'll thrive in our engineering team, where you’ll have the opportunity to:• Develop real-time LLM evaluations to monitor the accuracy of our responses.• Collaborate with talented engineers to push the boundaries of DevOps.
Astranis is seeking a talented and motivated Software Engineer to join our Infrastructure team. In this role, you will be at the forefront of developing and maintaining critical software systems that support our innovative satellite technology. You'll collaborate with cross-functional teams to design, implement, and optimize our infrastructure solutions, ensuring high reliability and performance.
Join Cloudflare as a Principal Software Engineer, where you will play a pivotal role in designing and implementing innovative software solutions. You will collaborate with cross-functional teams to enhance our platform's scalability, security, and performance, making a significant impact on our global user base.
Feb 6, 2026
Sign in to browse more jobs
Create account — see all 5,853 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.