Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Entry Level
Qualifications
Proven experience in infrastructure operations and management. Familiarity with cloud platforms and deployment processes. Strong problem-solving skills and the ability to work under pressure. Excellent communication and teamwork abilities. Experience with automation tools and scripting languages is a plus.
About the job
Join Baseten as an Infrastructure Operations Engineer and become an integral part of our innovative team. In this role, you will be responsible for maintaining and enhancing our infrastructure, ensuring optimal performance and reliability. You will work collaboratively with cross-functional teams to develop and implement solutions that drive efficiency and scalability.
If you are passionate about infrastructure management and seek to make a significant impact in a fast-paced environment, we want to hear from you!
About Baseten
Baseten is a forward-thinking technology company based in San Francisco, dedicated to revolutionizing the way businesses leverage data. Our mission is to empower organizations with robust infrastructure solutions that enhance their operational efficiency and drive growth.
Join Baseten as an Infrastructure Operations Engineer and become an integral part of our innovative team. In this role, you will be responsible for maintaining and enhancing our infrastructure, ensuring optimal performance and reliability. You will work collaboratively with cross-functional teams to develop and implement solutions that drive efficiency and scalability.If you are passionate about infrastructure management and seek to make a significant impact in a fast-paced environment, we want to hear from you!
About HappyRobotHappyRobot is pioneering the AI-native operating system for the real economy, bridging the gap between intelligence and action. By harnessing real-time truths, specialized AI workers, and orchestrating intelligence, we empower enterprises to manage complex, mission-critical operations with unprecedented autonomy.Our AI OS accumulates knowledge, optimizes processes at every level, and evolves continually. Our initial focus is on supply chain and industrial-scale operations, where resilience, speed, and ongoing improvement are paramount—liberating humans to engage in strategy, creativity, and other high-value endeavors.To explore our vision further, check out our Manifesto. To date, HappyRobot has successfully raised $62 million, including a recent $44 million in Series B funding in September 2025, with support from esteemed investors like Y Combinator (YC), Andreessen Horowitz (a16z), and Base10—partners dedicated to our mission of redefining enterprise operations. We are using this investment to build a world-class team of individuals with relentless drive, exceptional problem-solving skills, and a passion for pushing boundaries in a dynamic, high-intensity environment. If this resonates with you, we invite you to join us at HappyRobot.About the RoleWe are in search of an Infrastructure Engineer to spearhead the enhancement of our operational resilience as we scale. You will be responsible for the stability, observability, and debugging processes that ensure our systems operate seamlessly. As the primary troubleshooter for complex failures in real-time, you will design tools that transform chaos into clarity and assist in transitioning our operations from reactive to proactive.This role carries significant impact and trust, as you will influence how we approach reliability—reducing incident frequency, creating internal tools, and directly enhancing developer focus and system uptime. If you thrive on uncovering the root causes of challenging issues and fortifying systems (and teams), this is your opportunity.
About Our TeamThe Infrastructure Engineering team operates within the IT department, dedicated to the reliable construction, deployment, and management of critical on-premises and hybrid environments that empower our internal services and vital research and development projects.This newly established team is committed to implementing rigorous Site Reliability Engineering (SRE) practices in environments where uptime, safety, recoverability, and security are paramount. We aim to replace unique, one-off infrastructure with standardized infrastructure-as-code components that enhance reliability and operational efficiency as OpenAI continues to grow.About This RoleWe are in search of an Infrastructure Engineering Lead who will architect, build, and maintain reliable, secure, and scalable infrastructure that supports identity, access, endpoint, and shared platform services throughout the organization.You will take full ownership of infrastructure and identity systems from conceptual design and provisioning to policy enforcement, upgrades, recovery, and ongoing operations. Your goal will be to develop robust, production-grade platforms that minimize operational hurdles, enforce security by default, and empower teams to work more effectively and confidently.This position is ideal for a senior engineer who excels in navigating ambiguity, relishes the challenge of overseeing complex systems from start to finish, and enhances reliability and security by transforming fragile implementations into standardized, repeatable infrastructure.This role is based at our San Francisco headquarters and requires in-office attendance.Key Responsibilities:Define and refine infrastructure patterns for on-prem and hybrid environments, including self-hosted platforms, vendor-supported systems, and lab settings.Establish standardized, production-grade deployment and operational models that replace custom-built solutions.Collaborate with IT, Security, Identity, and Network teams to ensure infrastructure is designed to meet reliability, security, and access standards.Design and enhance the production architecture for Identity and Access Management (IAM) adjacent platforms, such as Microsoft Entra, utilizing SRE principles.Develop common management protocols and shared resources within Azure subscriptions to ensure uniformity and policy compliance in operations.
About UsAt Salient, we are at the forefront of developing the AI infrastructure that will revolutionize financial operations, beginning with the automation of intricate and challenging workflows in loan servicing.Supported by prominent investors including a16z and Y Combinator, we have successfully raised $65 million in Series A funding.Achieving 8-figure ARR within less than two years, we are currently serving over 20% of the auto lending industry, managing millions of real customer calls and transactions each day.Our solutions are fully operational with major financial institutions, moving beyond mere proofs of concept.Excitingly, we are expanding into new segments of financial services!Our team thrives in an in-person office culture located in San Francisco, CA.We pride ourselves on being fully integrated with our clients, owning the entire tech stack, and rapidly advancing to implement modern AI in regulated sectors where precision, reliability, and performance are paramount.About the RoleWe are seeking a dedicated Staff Infrastructure Engineer who will architect and oversee the systems that enable Salient to scale effectively. In this influential individual contributor role, you will focus on infrastructure engineering, shaping how we build, deploy, and manage the infrastructure that handles millions of daily financial transactions and customer interactions. You will establish the technical direction for reliability, scalability, and developer velocity across the stack, ensuring that our platform is prepared for future developments.Why Join Us?We are among the fastest-growing companies in the Voice AI sector, having quadrupled our revenue last year and now managing close to one million calls daily.You will enjoy significant ownership of your work and contribute to our growth journey, both in business and engineering.As an AI-native organization, we continuously stay ahead in AI tools and engineering best practices, and you will have a direct hand in propelling this forward.Our culture is straightforward and results-driven, focused on building a successful and highly profitable business.Key ResponsibilitiesLead architectural decisions and technical evaluations for infrastructure-critical projects.Design, implement, and manage the cloud infrastructure (AWS/GCP) that powers Salient—covering everything from compute and networking to storage and observability.
About UsAt Sierra, we are revolutionizing the way businesses engage with their customers by building a cutting-edge platform that harnesses the power of AI. Our headquarters is located in the vibrant city of San Francisco, with additional offices expanding in Atlanta, New York, London, France, Singapore, and Japan.Our company culture is deeply rooted in our core values: Trust, Customer Obsession, Craftsmanship, Intensity, and Family. These principles guide our actions and foster an environment where innovation thrives.Sierra was co-founded by visionary leaders Bret Taylor, who currently serves as the Board Chair of OpenAI and has a rich history with Salesforce and Facebook, and Clay Bavor, who previously led Google Labs and spearheaded initiatives like Google Lens and Project Starline.Your RoleAs a Software Engineer focusing on Infrastructure at Sierra, you will play a pivotal role in designing, constructing, and maintaining the foundational systems that empower our AI platform. Your expertise will ensure that our infrastructure is not only secure and reliable but also scalable, allowing product teams to execute their work with agility and confidence.Guarantee the reliability, scalability, and performance of our platform and LLM inference serving in response to increasing traffic demands.Develop and oversee cloud infrastructure using Terraform to create secure, scalable, and reproducible environments.Establish and manage a self-service infrastructure platform to empower engineering teams in deploying and operating services independently.Take ownership of and improve CI/CD pipelines and release management processes, facilitating rapid and reliable deployments across Sierra’s platform.Design and manage distributed systems utilizing distributed databases, retrieval systems, and machine learning models.Develop and sustain core data serving abstractions along with essential authentication and security features (SSO, RBAC, authentication controls).Effectively navigate and integrate our technology stack with enterprise customer environments in a scalable and maintainable manner.
At Exa, we are on a mission to create a cutting-edge search engine from the ground up, designed to cater to the diverse needs of AI applications. Our team is building a robust infrastructure that enables us to crawl the internet, train advanced embedding models for indexing, and develop high-performance vector databases using Rust. Additionally, we manage a significant $5M H200 GPU cluster that powers tens of thousands of machines.The Infrastructure Team at Exa is responsible for developing the essential tools and infrastructure that support our entire system. We are looking for talented infrastructure engineers to help us scale our capabilities rapidly. Your work could involve orchestrating GPU clusters with Kubernetes, implementing map-reduce batch jobs on Ray, or creating top-tier observability tools that set industry standards.
Full-time|Remote|San Francisco, CA, New York, NY, Portland, OR, or Remote within Canada or United States
Join Mercury as a Senior Infrastructure Engineer, where you will be pivotal in shaping the infrastructure that supports our innovative financial solutions. You will work closely with cross-functional teams to design, implement, and maintain scalable and reliable infrastructure systems. This role is ideal for individuals who thrive in a fast-paced environment and are passionate about leveraging technology to drive business success.
Embark on a New Frontier! As a Software Engineer specializing in Space Infrastructure, you will play a pivotal role in enhancing our operational capabilities with a diverse array of satellites, including dedicated, rideshare, and constellation missions. Your expertise will bridge the realms of automated satellite operations, ground and flight software, and innovative solutions to on-orbit challenges.Our team is dedicated to ensuring the reliable, efficient, and standardized functioning of Loft's space infrastructure. Your contributions will be crucial in maintaining the stability of our satellite bus, the Hub, and Loft-owned payloads, providing a robust platform to facilitate customer missions.Loft Orbital's business model, as well as that of our customers, hinges on the dependability of our space infrastructure. This role offers you the opportunity to work on a variety of systems, from developing code that integrates with our Cockpit mission control system to executing operations onboard our satellites. Moreover, you may have the chance to step into the role of Flight Director, overseeing the health and safety of our satellite fleet.
About SesameAt Sesame, we envision a world where computers can interact with us in authentic, lifelike ways—seeing, hearing, and collaborating as humans do. Our mission is to create an innovative computer interface that seamlessly integrates voice agents into everyday life. Our diverse team comprises founders from Oculus and Ubiquity6 and seasoned professionals from Meta, Google, and Apple, each bringing extensive expertise in hardware and software. Join us in pioneering a future where technology feels alive.About the RoleAs a Backend Infrastructure Engineer at Sesame, you will play a pivotal role in shaping the foundational aspects of our technology stack. This position focuses on developing high-impact infrastructure, services, and tools that are broad-reaching rather than narrowly defined. You will tackle scalability and architectural challenges across various domains, including agentic workflows, speech recognition and synthesis, IoT, large-scale training, and efficient low-latency inference. If you're driven by the challenge of creating an ultra-efficient, scalable, and reliable engineering ecosystem through a blend of tooling, services, libraries, and infrastructure, this is the perfect opportunity for you.Responsibilities:Design and develop foundational infrastructure to support serving, training, and applications at Sesame.Enhance productivity for engineering teams by automating processes and creating exceptional tools.Deliver software solutions that empower product and machine learning engineers to build secure, scalable, and dependable systems from the ground up.Your responsibilities will encompass provider, service, security, and developer infrastructure, as well as the architecture and implementation of core services and libraries.
Join our dynamic team at Bland Inc. as a Senior Infrastructure Engineer, where you will play a critical role in designing and implementing robust infrastructure solutions. You will work alongside a talented group of professionals, using cutting-edge technology to drive innovation and efficiency.
About MercorMercor operates at the cutting edge of labor markets and artificial intelligence research. Collaborating with top AI laboratories and corporations, we supply the essential human intelligence that drives AI advancement.Our extensive talent network educates state-of-the-art AI models much like teachers impart knowledge to students: by sharing insights, experiences, and contextual understanding that cannot be encoded. Currently, over 30,000 experts in our network generate more than $2 million daily.At Mercor, we are pioneering a new realm of work where expertise fuels AI progress. Achieving this ambitious vision demands a dynamic, fast-paced, and deeply dedicated team. Here, you will collaborate with researchers, operators, and AI firms at the forefront of transforming societal systems.As a profitable Series C company with a valuation of $10 billion, Mercor operates five days a week from our new headquarters in San Francisco.About the RoleIn your role as an Infrastructure Engineer at Mercor, you will be instrumental in constructing and scaling the systems that support our rapid expansion. You will ensure that our infrastructure is highly reliable, cost-efficient, and capable of accommodating surges in traffic and computational demands. Your collaboration with product, research, and operations engineers will be vital in designing scalable architectures, optimizing deployments, and enhancing observability.We are broadening our search across Infrastructure roles, including Developer Productivity Engineer, Database Engineer, and Platform Engineer. Candidates will be matched to teams after the initial screening, so we encourage applications even if your expertise is predominantly in one area.What You'll Work OnDesigning and maintaining core infrastructure across cloud environments.Creating Infrastructure-as-Code workflows to automate deployments and scaling.Enhancing monitoring, logging, and alerting systems to ensure reliability.Managing CI/CD pipelines (Github, Spacelift) for seamless deployments.Assisting in disaster recovery planning and ensuring system availability.Collaborating with product and research teams to design architectures that meet workload demands.Identifying and resolving performance bottlenecks in compute, storage, and networking.
About the RoleJoin our pioneering team at vooma as a Backend & Infrastructure Software Engineer, where you will play a critical role in shaping the technical infrastructure of a transformative company.If you are passionate about creating not only resilient systems but also the foundational architecture of a groundbreaking enterprise from the outset, this position is ideal for you.We are looking for someone who excels at crafting infrastructure that is elegant, dependable, and secure, even under high-demand scenarios. You thrive on the challenge of scaling systems that enable intelligent agents and take pride in establishing reliable foundations that others can rely on.Your Key Responsibilities Include:Design and maintain secure, scalable infrastructure tailored for AI-powered agents in production environments.Deploy and optimize AI-driven services to meet high availability and performance standards.Manage infrastructure as code, alongside cloud environments and CI/CD pipelines.Implement monitoring, observability, and alerting systems to ensure the reliability of our infrastructure.Contribute to infrastructure security and adhere to best practices.You Should Have:Experience in deploying and productionizing machine learning or AI-centric workloads.Proficiency in developing secure, scalable infrastructures on platforms such as AWS, Azure, or GCP.In-depth knowledge of backend systems, networking, and container orchestration technologies (e.g., Kubernetes).Understanding of infrastructure security principles and compliance standards (e.g., SOC2).A proactive and hands-on mindset, with a strong drive to solve challenges from start to finish.
Be part of our mission to redefine AI by shaping the narrative surrounding document understanding.Role OverviewAt LlamaIndex, our Infrastructure team lays the groundwork for our product and provides essential tools that facilitate the development, deployment, and monitoring of our code. We are tasked with designing, constructing, and scaling the core infrastructure that drives a high-capacity data platform for AI applications. We seek individuals who are passionate about creating supportive systems that enhance our engineering capabilities and contribute to our rapidly expanding product suite.Ideal candidates will have a strong background in cloud infrastructure management, navigating various scalability challenges, and enhancing the productivity of the broader Engineering team. Key traits we value in our culture include a customer-centric mindset, collaboration, diligence, and optimism. We are looking for proactive team players who are eager to help us evolve our culture as we grow.Key ResponsibilitiesCollaborate with engineering teams to develop and maintain foundational systems that empower developers and support our rapid growth.Design and execute scalable infrastructure solutions suitable for various deployment models, including SaaS, single-tenant, and private environments.Oversee and optimize cloud resources and Kubernetes clusters to ensure cost-effectiveness and high performance.Facilitate successful external customer deployments by establishing clear infrastructure guidelines and principles.Enhance the release and deployment processes to improve efficiency and reliability.Ensure compliance with applicable regulations and implement comprehensive security measures across all deployment environments.QualificationsMinimum of 5 years of engineering experience.Experience working on Platform or Infrastructure teams on substantial projects involving infrastructure components like Terraform/CDKTF, Kubernetes, Helm, testing infrastructure, release management, and observability.Proficient in optimizing cloud resource utilization.Skilled in tuning Kubernetes clusters and cloud resources for optimal performance and cost efficiency.Dedicated to cultivating LlamaIndex’s engineering culture as we expand.Ability to balance speed and pragmatism in delivering solutions.
About the TeamAt OpenAI, we are revolutionizing the future of artificial intelligence. Together with our trusted capital and technology partners, we are constructing a state-of-the-art network of advanced datacenters tailored to meet the rigorous demands of AI workloads. Our Industrial Compute team is dedicated to ensuring that all datacenter systems are manufactured, delivered, and commissioned to meet world-class standards of quality, reliability, and performance.We collaborate closely with manufacturing vendors, general contractors, engineering teams, and operations organizations to guarantee that every component is primed for installation, startup, and long-term service. Our comprehensive approach spans vendor qualification through commissioning, ensuring operational readiness across our expansive global portfolio.About the RoleWe are looking for a skilled Quality Engineer (QE) to establish and oversee a scalable Product and Site Quality function within our industrial compute supply chain. You will be instrumental in driving end-to-end manufacturing quality, from supplier qualification and initial builds to factory acceptance, logistics, and installation readiness.This role involves cross-functional collaboration with Design (NPI), Test Engineering, Manufacturing Engineering, and Operations to achieve key metrics such as First Pass Yield (FPY), reliability, yield, and validation objectives. You will lead proactive and corrective actions to eliminate systemic risks, while also evaluating and developing future suppliers to ensure that production capacity and quality processes evolve alongside OpenAI’s datacenter infrastructure growth.Key ResponsibilitiesConduct supplier capability assessments and perform quality audits across production, testing, and delivery processes.Define and monitor quality metrics to identify trends and proactively address potential issues prior to shipment.Develop and maintain a datacenter-focused manufacturing quality program that aligns with global deployment needs.Integrate manufacturing quality requirements into sourcing, design, commissioning, and operational workflows.Ensure vendor teams adhere to OpenAI’s quality standards and specifications through hands-on support and structured feedback.Work closely with engineering, sourcing, construction, and operations teams to align quality priorities with project goals.Lead root cause analysis, implement corrective and preventive action plans, and resolve quality non-conformances.Audit quality activities at vendor and project sites to confirm compliance with requirements and ensure operational readiness.
Full-time|Remote|San Francisco, CA or Remote (USA)
Join Fieldguide as a Senior Infrastructure Engineer and be at the forefront of our innovative infrastructure solutions. In this role, you will lead the design, implementation, and maintenance of our infrastructure systems while ensuring optimal performance, security, and scalability. Your expertise will help shape our technology strategy and drive impactful projects.
Join Example Org, a pioneering software company revolutionizing real-time collaboration on essential workflows. Established in 2012, we proudly serve over 10,000 customers globally and have the backing of esteemed investors like Example Capital. With our Series C funding, we are valued at $750 million.As an Infrastructure Engineer, you will be an integral part of our dynamic team, reporting directly to the team manager. Your contributions will be vital in enhancing our workflows and driving our projects to success.Your ResponsibilitiesEngage in collaborative meetings to align on project deliverables.Lead innovative initiatives that push our technological boundaries.Assist in recruiting and building a strong team.Mentor and support the professional development of team members.
Join the Space Exploration Journey!As a Senior Software Engineer specializing in Space Infrastructure, you will play a pivotal role in enhancing our capabilities to manage a diverse fleet of satellites, including dedicated, rideshare, and constellation missions. Your work will involve the integration of automated satellite operations, both ground and flight software, while tackling challenges encountered in orbit.Our team is dedicated to ensuring the dependable, efficient, and standardized performance of Loft’s space infrastructure. You will oversee the operational stability of Loft satellites, focusing on the satellite bus, the Hub, and Loft's payloads, which serve as platforms for executing customer missions.Reliability is the cornerstone of Loft's business model and that of our clients. This role offers you the flexibility to engage with various systems, from coding for Cockpit, our mission control system, to writing software that runs onboard our satellites. Additionally, you may have the opportunity to serve as a Flight Director, overseeing the health and safety of our satellite fleet.
Are you a passionate engineer with a knack for building robust infrastructure? Join our dynamic team at fal as a Senior/Staff Infrastructure Engineer. In this pivotal role, you will design and implement innovative solutions that enhance our infrastructure's efficiency and reliability.As a key member of our engineering team, your responsibilities will include:Architecting scalable infrastructure solutions to meet our growing needs.Collaborating with cross-functional teams to identify and resolve infrastructure challenges.Implementing automation tools and frameworks to streamline operations.Monitoring performance and ensuring the security of our systems.Providing mentorship and guidance to junior engineers.We are looking for individuals who thrive in a fast-paced environment and have a deep understanding of infrastructure technologies.
Join Cloudflare as a Data Center Infrastructure Management (DCIM) Administrator within our Infrastructure Operations team. In this role, you will be responsible for overseeing the effective management of our data center infrastructure, ensuring optimal performance and reliability.As part of a dynamic team, you will engage in monitoring, reporting, and optimizing data center operations, collaborating with cross-functional teams to enhance operational efficiency.
About UsAt Imprint, we are revolutionizing the world of co-branded credit cards and innovative financial solutions, focusing on smarter, more rewarding, and brand-first experiences. We collaborate with renowned brands such as Crate & Barrel, Rakuten, Booking.com, H-E-B, Fetch, and Brooks Brothers to establish modern credit programs that enhance customer loyalty, unlock savings, and stimulate growth. Our robust platform integrates advanced payment technologies, intelligent underwriting, and a seamless user experience, enabling brands to offer impactful financial products without the complexities of becoming a bank.Co-branded credit cards represent over $300 billion in U.S. annual spending, yet many are still managed by outdated banking systems. Imprint stands as the modern alternative—flexible, technology-driven, and tailored for today’s consumers. Supported by notable investors like Kleiner Perkins, Thrive Capital, and Khosla Ventures, we are assembling a world-class team dedicated to reshaping payment methods and driving brand growth. If you thrive in fast-paced environments, enjoy tackling complex challenges, and aspire to make a significant impact, we would be delighted to meet you.Discover more about us on Imprint's Technology Blog.The TeamThe Tech Platform Engineering Team at Imprint is pioneering the democratization of access to advanced technologies, empowering teams across our organization to innovate and excel. Our commitment to redefining the Fintech landscape drives us to build secure, highly available infrastructures while equipping our engineers with comprehensive development tools, allowing them to rapidly create world-class products.Your RoleDesign, build, and manage cloud and web infrastructure with a strong emphasis on security, reliability, and scalability.Implement and maintain infrastructure components across computing, networking, and data platforms.Adhere to security best practices in cloud infrastructure, ensuring proper access control, network isolation, and secure communication between services.Monitor system health and engage in incident response, root cause analysis, and reliability enhancements.Collaborate with platform, security, and product engineers to deliver safe and efficient infrastructure solutions.
Jan 16, 2026
Sign in to browse more jobs
Create account — see all 6,120 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.