About xAI
At xAI, we are driven by our mission to develop AI systems that profoundly understand the universe and assist humanity in its quest for knowledge. Our team is composed of passionate, curious individuals who thrive on challenges and prize engineering excellence. We maintain a flat organizational structure in which every member is expected to contribute actively to our mission. Leadership is earned through initiative and the consistent delivery of excellent work, which demands a strong work ethic and sharp prioritization. Clear, effective communication is essential so that team members can share insights and knowledge.
About the Role
The Compute Infrastructure team at xAI designs, builds, and operates the large-scale clusters and orchestration platforms that power cutting-edge AI training, inference, and agent workloads at unprecedented scale. In this role, you will push container orchestration beyond current systems such as Kubernetes, manage exascale computing resources, optimize for high-performance training runs and production services, and work closely with research and systems teams to deliver reliable, ultra-scalable infrastructure for xAI's next-generation models and applications.
Responsibilities
- Construct and oversee large-scale clusters to host, persist, train, and serve AI workloads with exceptional reliability and performance.
- Design, develop, and enhance an in-house container orchestration platform that surpasses off-the-shelf solutions in scalability, isolation, resource efficiency, and fault-tolerance.
- Collaborate with research teams to architect and optimize compute clusters tailored for extensive training runs, inference services, and real-time applications.
- Profile, debug, and resolve intricate system-level performance bottlenecks, resource contention, scheduling problems, and reliability issues across the entire stack.
- Take ownership of end-to-end infrastructure initiatives employing first-principles design, rigorous testing, automation, and continuous optimization to meet the demands of frontier AI compute.

