Senior Backend Tooling Software Engineer jobs in Sunnyvale – Browse 690 openings on RoboApply Jobs

Senior Backend Tooling Software Engineer jobs in Sunnyvale

Open roles matching “Senior Backend Tooling Software Engineer” with location signals for Sunnyvale. 690 active listings on RoboApply Jobs.

1 - 20 of 690 Jobs
Crusoe Energy
Full-time|On-site|Sunnyvale, CA - US

Join Crusoe Energy as a Senior Backend Tooling Software Engineer and play a pivotal role in enhancing our backend systems that support our innovative energy solutions. You will be responsible for designing, implementing, and maintaining scalable backend tooling solutions that cater to our growing infrastructure needs.

Your expertise will help us streamline our operations, optimize workflows, and improve performance across our engineering teams. If you are passionate about backend engineering and enjoy tackling complex challenges, we invite you to apply and contribute to our mission of transforming energy use.

Apr 1, 2026
Applied Intuition, Inc.
Full-time|$189.7K/yr - $232.9K/yr|On-site|Sunnyvale, California, United States

About Applied Intuition

Applied Intuition, Inc. is at the forefront of shaping the future of physical AI. Founded in 2017 and currently valued at $15 billion, this Silicon Valley-based company is pioneering the digital infrastructure necessary to embed intelligence into every moving machine worldwide. We serve the automotive, defense, trucking, construction, mining, and agriculture sectors across three primary domains: tools and infrastructure, operating systems, and autonomy. Our solutions are trusted by eighteen of the top twenty global automakers, as well as by the United States military and its allies. Our headquarters is located in Sunnyvale, California, with additional offices in Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo. Discover more at applied.co.

As an in-office company, we expect our employees to primarily work from their Applied Intuition office five days a week. However, we understand the importance of flexibility and trust our employees to manage their schedules responsibly, which may include occasional remote work, starting the day with morning meetings from home, or leaving early to accommodate family commitments.

Mar 7, 2026
Illumio
Full-time|On-site|Sunnyvale, California - HQ

Join Us in the Fight Against Cyber Threats!

At Illumio, we are pioneers in ransomware and breach containment, transforming the way organizations tackle cyberattacks while fostering operational resilience. Our advanced Illumio AI Security Graph enables our breach containment platform to swiftly identify and neutralize threats across hybrid multi-cloud environments, preventing potential disasters before they escalate.

As a recognized Leader in the Forrester Wave™ for Microsegmentation, we empower Zero Trust principles, enhancing the cyber resilience of the infrastructure, systems, and organizations that keep the world functioning.

Our Vision

Our Engineering team thrives on innovative leadership, autonomy, and ownership, cultivating a vibrant synergy that propels us forward in the dynamic cybersecurity landscape.

By joining our team, you will contribute to the leader in Zero Trust Segmentation. You will work with an advanced technology stack that encompasses diverse operating systems, distributed applications, and sophisticated UI/visualization tools.

Together, we are shaping the future of cybersecurity, continuously developing world-class products driven by a diverse team committed to innovation amidst unprecedented cybersecurity threats.

Your Contributions:
Develop and maintain containerized microservices and their components for a distributed multi-tenant system that processes data and real-time telemetry from various public clouds, providing customers with insights and security recommendations to mitigate cloud risks.
Mentor junior engineers, new graduates, and interns, guiding them in their professional growth and integration into the team.
Primarily write code in Java utilizing the Spring Boot framework, while engaging with data pipelines through Kafka, SQL, or similar interfaces.
Leverage Kubernetes for service infrastructure; we welcome individuals from diverse technological backgrounds eager to learn.
Take ownership of critical features and subsystems, managing the complete software development lifecycle from requirement clarification to successful deployment and customer utilization.
Oversee operational aspects of the system, actively addressing challenges to ensure optimal performance.

Aug 12, 2025
Applied Intuition, Inc.
Full-time|$125K/yr - $185K/yr|On-site|Sunnyvale, California, United States

About Applied Intuition

Applied Intuition, Inc., established in 2017 and currently valued at $15 billion, is at the forefront of revolutionizing physical AI. Based in Silicon Valley, we are building the digital infrastructure necessary to infuse intelligence into every moving machine globally. We cater to industries such as automotive, defense, trucking, construction, mining, and agriculture through three core offerings: tools and infrastructure, operating systems, and autonomy. Trusted by 18 of the world's top 20 automakers and the U.S. military alongside its allies, our solutions are pivotal in delivering physical intelligence. Our headquarters are located in Sunnyvale, California, with additional offices in Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo. Discover more at applied.co.

We are an in-office organization, with the expectation that employees primarily work from their Applied Intuition office five days a week. However, we value flexibility and trust our employees to manage their schedules responsibly, which may include occasional remote work, beginning the day with morning meetings from home, or leaving earlier to accommodate family obligations.

About the Role

Your Responsibilities at Applied Intuition:
Design, develop, and productionize scalable internal tools and AI workflows that facilitate system engineering and validation for autonomous vehicle initiatives.
Integrate data across requirements management, modeling, and validation tools to ensure comprehensive traceability from system requirements to test outcomes.
Build backend services and APIs to consolidate distributed engineering artifacts into a reliable, cohesive platform.
Create dashboards and KPIs to evaluate requirement coverage, trace completeness, and validation progress.
Take ownership of and enhance the core traceability data model, enabling bidirectional traceability, versioning, baselining, and change impact analysis.
Refactor internal prototypes into production-grade, certifiable systems with robust reliability, access control, and auditability.

Mar 7, 2026
Ceribell
Full-time|$141K/yr - $190K/yr|On-site|Sunnyvale, CA

About Ceribell

Ceribell is at the forefront of medical technology, dedicated to revolutionizing the diagnosis and management of patients with serious neurological conditions. Our innovative Ceribell System is a cutting-edge, point-of-care electroencephalography (EEG) platform that meets the critical needs of patients in acute care settings. Already in use at hundreds of community hospitals, large academic institutions, and major integrated delivery networks across the nation, our team shares a collective mission to enhance critical care with our rapid seizure detection technology. Join us in making a difference!

Position Overview:
We are seeking a talented Senior Software Engineer with a strong backend focus to join our dynamic team in developing the next generation of EEG web applications that cater to vital medical use cases. In this role, you will be instrumental in designing, maintaining, and enhancing the backend systems for our EEG Portal web application, which is essential for healthcare providers, researchers, and clinical teams to access, monitor, and analyze EEG data. You will collaborate closely with fellow engineers, product managers, and stakeholders to ensure that our backend systems are robust, secure, and scalable within a medical environment.

Key Responsibilities:

Backend Development & Maintenance:
Design, develop, and maintain backend systems to support the EEG Portal application, ensuring dependable performance and adherence to healthcare standards.
Implement new features and enhancements to meet clinical and research demands, prioritizing efficiency and scalability.
Troubleshoot, debug, and optimize backend systems to guarantee maximum uptime and reliability for users.

Database Management:
Write optimized database queries and execute data migration strategies.
Monitor and fine-tune database performance, including indexing, replication, and backup processes.

API Development & Integration:
Develop and maintain RESTful APIs that interact with the frontend and other systems.
Ensure APIs are secure, well-documented, and capable of handling large volumes of sensitive medical data.
Integrate third-party services and platforms as needed to enhance functionality.
Ensure backend services comply with regulatory standards, including data encryption, authentication, and auditing.

Mar 2, 2026
Walmart Inc.
Full-time|On-site|Sunnyvale

Are you an innovative software engineer with a passion for backend development? Join our dynamic team at Walmart Inc. as a Senior Software Engineer specializing in Backend Java. In this role, you will leverage your expertise to design, develop, and maintain scalable applications that drive our corporate technology initiatives.

Your responsibilities will include collaborating with cross-functional teams to enhance system performance, implementing best practices in software development, and mentoring junior engineers. If you are looking to make a significant impact in a fast-paced environment, this is the opportunity for you!

Sep 24, 2025
Applied Intuition, Inc.
Full-time|$153K/yr - $222K/yr|On-site|Sunnyvale, California, United States

About Applied Intuition

Applied Intuition, Inc. is at the forefront of advancing physical AI technology. Established in 2017, the company has rapidly grown to a valuation of $15 billion, leading the way in developing the digital infrastructure essential for integrating intelligence into every moving machine globally. Our services cater to diverse sectors including automotive, defense, trucking, construction, mining, and agriculture, focusing on three pivotal areas: tools and infrastructure, operating systems, and autonomy. With eighteen of the top twenty global automakers and the U.S. military among our clients, our solutions are trusted to deliver reliable physical intelligence. Headquartered in Sunnyvale, California, we also have offices in Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo. Discover more at applied.co.

We operate primarily in-office, expecting our team to work from our Applied Intuition office five days a week. However, we value flexibility and trust our employees to manage their schedules responsibly, which may include occasional remote work, starting the day with morning meetings from home, or leaving early for family commitments.

About the Role

As a Senior Backend Software Engineer on the Remote Assistance team, you will play a pivotal role in architecting, developing, and maintaining the foundational services and infrastructure that empower our remote assistance platform. Your responsibilities will encompass designing APIs, data pipelines, and backend systems adept at managing real-time data streaming, command processing, and facilitating communication between autonomous vehicles and human operator workstations. This impactful position will be instrumental in realizing the future of autonomous trucking by ensuring our backend systems are both reliable and scalable.

Your Responsibilities Include:
Designing, developing, and deploying scalable, low-latency backend services for remote assistance session management and real-time data streaming.
Architecting and implementing the cloud infrastructure for remote assistance, utilizing technologies such as protobuf and gRPC across multiple AWS regions.

Mar 5, 2026
Taara
Full-time|$160K/yr - $210K/yr|On-site|Sunnyvale, CA

About the Team

At Taara, born from X, Google's Moonshot Factory, we are dedicated to connecting billions of individuals who currently lack access to affordable and reliable internet. Our innovative approach utilizes light to deliver faster and more economical connectivity solutions. Join us in our mission to bridge the digital divide and illuminate the future through groundbreaking wireless optical communication and photonics chip technologies.

About the Role

As a Senior Backend Software Engineer, Cloud & Infrastructure, you will serve as the architect of our global network's core operations. While our hardware establishes the connections, your software will oversee and optimize them. You will be responsible for designing and scaling distributed systems, APIs, and cloud-native infrastructures that monitor and control our wireless optical terminals deployed in the field.

We are looking for a versatile candidate who excels in building dependable and scalable backend systems. You should be comfortable developing high-performance Go services, architecting extensive data pipelines for telemetry, and automating cloud infrastructures.

Your Impact:
Scale the Control Plane: Design and implement a cloud-native backend that manages thousands of optical terminals across the globe.
Architect Telemetry Pipelines: Create robust data ingestion and processing systems to handle real-time performance metrics from our optical terminals.
Bridge Edge and Cloud: Collaborate with hardware engineers to establish secure and efficient communication between our devices and the cloud.
Automate Everything: Spearhead our Infrastructure as Code (IaC) strategy to ensure resilient and reproducible global deployments.
Drive Observability: Develop monitoring tools and dashboards to empower our Network Operations Center (NOC) to troubleshoot complex optical links swiftly.

Feb 5, 2026
DigiCert Inc.
Full-time|$150K/yr - $175K/yr|On-site|Sunnyvale, CA

Who We Are

DigiCert is a global leader in intelligent trust, dedicated to safeguarding the digital realm by ensuring security, privacy, and authenticity in every interaction. Our innovative AI-powered DigiCert ONE platform integrates PKI, DNS, and certificate lifecycle management, protecting infrastructure, software, devices, messages, and AI entities. Join over 100,000 organizations, including 90% of the Fortune 500, that rely on DigiCert to counteract today's threats and prepare for a quantum-safe future at www.digicert.com.

Job Summary

At DigiCert, we are constructing the trust foundation for an agentic future. As AI agents evolve from basic chatbots to complex autonomous systems with access to sensitive enterprise data, we recognize the urgent need for standardized Identity, Authentication, and Governance protocols. You will play a crucial role in developing the core security infrastructure that defines agent identification and control. Our vision is to establish a "Zero Trust" architecture for AI, ensuring every action an agent performs is cryptographically verifiable, authorized, and secure. Your contributions will help shape our technical culture, influence our technology stack, and scale a platform that will fortify the next generation of enterprise AI.

What You Will Do

Collaborate with a team of industry-leading cryptographers, AI/ML innovators, cloud infrastructure specialists, and security engineers to redefine the future of secure digital interactions.
Design and build the Identity Foundation: Create and manage a cryptographically secure identity system that acts as the trusted source of truth for agent interactions.

Apr 8, 2026
Crusoe
Full-time|$209K/yr - $253K/yr|On-site|Sunnyvale, CA - US

At Crusoe, we are driven by our mission to accelerate the abundance of energy and intelligence. As the only vertically integrated AI infrastructure company, we manage each layer of our technology stack, from electrons to tokens, enabling the world's most ambitious AI workloads. Joining Crusoe means becoming part of a team that is actively building the future.

We are at the forefront of a transformative industrial revolution, where the insatiable demand for AI compute meets the challenge of energy availability. Our energy-first approach not only enhances AI infrastructure but also ensures it's beneficial for the world and accelerates innovation.

We seek passionate, problem-solving teammates who thrive in a fast-paced environment and share our ambitious vision. If you're eager to advance your career alongside experts in energy, manufacturing, data center construction, and cloud services, we want you on our team.

If you're ready to engage in impactful work, assist our clients and partners in enhancing their AI strategies, and contribute to a high-performing collaborative team, we invite you to build with us at Crusoe.

About This Role:
We are looking for a Staff Streaming Software Engineer to become a vital part of our Observability team within the Cloud Infrastructure division. This team is responsible for building and managing real-time data platforms that deliver metrics, logs, traces, and event streams, empowering engineers across the organization to reliably operate Crusoe's AI cloud at scale.

In this position, you will lead the technical direction for our high-throughput streaming systems, influencing architectural decisions and long-term investments throughout the observability stack. You will operate at the nexus of deep technical execution and organizational influence: identifying potential issues before they arise, shaping team approaches to building and managing streaming infrastructure, and collaborating with engineering leaders to align platform strategies with corporate objectives.

This role offers a unique opportunity to define how observability data flows at scale within our rapidly expanding AI cloud and to make a lasting architectural impact on systems that the entire engineering organization relies on.

Apr 8, 2026
Full-time|$150K/yr - $250K/yr|On-site|Sunnyvale

Your Contribution

Become an integral part of a dynamic team focused on developing cutting-edge cybersecurity solutions from inception. With guidance from seasoned industry experts, you will have the unique opportunity to architect, build, and deliver highly impactful products that will shape the future of cybersecurity. This is a chance to elevate your career and skill set alongside a company poised for growth.

Position Summary

As a Backend Engineer, your primary responsibility will be to design and implement the management layer for a distributed embedded system. This involves creating the policy structure, ensuring reliable configuration storage, managing secrets, and developing engines that facilitate these functions. Your work will play a crucial role in building a fast, resilient system that directly contributes to the product line's success.

Mar 5, 2026
Mindlance
Full-time|On-site|Sunnyvale

We are seeking a highly skilled Senior Backend/Java Engineer to join our dynamic team at Mindlance in Sunnyvale. The ideal candidate will possess robust Java/J2EE expertise and a thorough understanding of both Oracle and NoSQL databases. You should have 5 to 10 years of experience working in a scalable, multi-threaded server-side environment, along with 2 to 5 years of experience specifically with databases. A self-starter mentality is essential, as is the ability to thrive in a fast-paced, ever-evolving environment. You will be expected to take ownership of your projects and deliver high-quality results. Familiarity with Enterprise Linux and hands-on experience with Java are both required. Experience in the AppleCare domain is a plus.

Jun 29, 2017
Illumio
Full-time|On-site|Sunnyvale, California - HQ

Join Our Visionary Team!

Illumio stands at the forefront of ransomware and breach containment, revolutionizing the way organizations defend against cyberattacks while fostering operational resilience. Our innovative breach containment platform, powered by the Illumio AI Security Graph, is adept at identifying and mitigating threats across hybrid multi-cloud environments, effectively stopping the escalation of attacks before they can inflict significant damage.

As a recognized leader in the Forrester Wave™ for Microsegmentation, Illumio empowers organizations to adopt Zero Trust principles, enhancing cyber resilience across infrastructures, systems, and organizations that are essential to global operations.

Work Arrangement:
This role requires 5 days of on-site presence at our Sunnyvale, CA Headquarters.

Our Engineering Vision:
Our Engineering team is fueled by a culture of visionary leadership, autonomy, and ownership, creating a collaborative environment that propels us forward in the dynamic realm of cybersecurity.

By joining our team, you will be part of the leader in Zero Trust Segmentation, working with a cutting-edge technology stack that encompasses various operating systems, distributed applications, and advanced UI/visualization tools.

Together, we are shaping the future of cybersecurity, building world-class products driven by diverse perspectives, backgrounds, and a shared commitment to innovation during a time of unprecedented cybersecurity threats.

Your Contributions:
You will create containerized microservices for a distributed multi-tenant system that processes data, real-time events, and network telemetry from multiple public clouds, delivering actionable insights, visibility, and security recommendations to enhance our customers’ cloud security posture.
You will design your services, meticulously develop the details, defend your design choices among peers, and implement robust solutions.
You will mentor junior engineers, recent graduates, and interns, fostering their growth and integration into the team.
Your primary programming focus will be in Go, working with data pipelines utilizing SQL or similar interfaces. We welcome candidates from diverse programming backgrounds eager to learn.
You will take ownership of critical features and subsystems, managing the software development lifecycle from requirement clarification to ensuring successful deployment and user adoption.

Mar 23, 2026
Intuitive Surgical, Inc.
Full-time|On-site|Sunnyvale

Intuitive Surgical, Inc. seeks a Senior Software Engineer to join the Platform Engineering team in Sunnyvale. This role centers on developing and maintaining the foundational software that powers advanced surgical technologies.

Key responsibilities
Design and build core platform software for surgical systems
Collaborate with other engineering teams to create reliable and scalable solutions
Drive ongoing enhancements that support improvements in surgical procedures and patient care

Role focus
This position emphasizes both architecture and hands-on development for the software platform. Work will directly impact the reliability and capabilities of surgical technologies used in healthcare settings.

Apr 24, 2026
Senior Software Engineer in Test

Intuitive Surgical, Inc.

Full-time|On-site|Sunnyvale

Join our innovative team at Intuitive Surgical as a Senior Software Engineer in Test, where you will play a critical role in ensuring the quality and performance of our cutting-edge robotic systems. We are looking for a talented individual who is passionate about technology and thrives in a collaborative environment. As a senior member of our team, you will design, develop, and implement automated testing frameworks and strategies to enhance our software products and services.

Apr 13, 2026
Senior Software Engineer in Test

Intuitive Surgical, Inc.

Full-time|On-site|Sunnyvale

Join our dynamic team at Intuitive Surgical, a leader in minimally invasive robotic surgery. We are seeking a talented and detail-oriented Senior Software Engineer in Test to enhance our quality assurance processes and ensure the reliability of our cutting-edge surgical systems. In this role, you will develop and execute automated tests, contribute to the design of testing frameworks, and collaborate closely with software engineers to drive quality improvements across our products.

Apr 4, 2026
Applied Intuition, Inc.
Full-time|$153K/yr - $222K/yr|On-site|Sunnyvale, California, United States

About Applied Intuition

Applied Intuition, Inc. is at the forefront of advancing physical AI technologies. Established in 2017 and currently valued at $15 billion, this Silicon Valley powerhouse is dedicated to creating the essential digital infrastructure that empowers intelligence in every moving machine globally. Our solutions cater to key sectors including automotive, defense, trucking, construction, mining, and agriculture, with a focus on tools and infrastructure, operating systems, and autonomy. Trusted by 18 of the top 20 global automakers, along with the United States military and its allies, Applied Intuition is headquartered in Sunnyvale, California, with a global presence in cities including Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo. Discover more at applied.co.

Our company thrives on in-office collaboration, and we expect our employees to primarily work from their respective Applied Intuition offices five days a week. We understand the need for flexibility, allowing for responsible management of schedules, including occasional remote work, starting the day with morning meetings from home, or leaving early to accommodate family commitments.

About the Role

We are seeking talented infrastructure engineers with a deep understanding of scaling open-source data infrastructure to join our Data & ML Infrastructure group. This dynamic role involves engaging with the entire data lifecycle, from collection, ingestion, and storage to querying and retrieval. You will collaborate closely with various business units to design and develop both internal and external products. Managing vast amounts of data to meet the demands of Applied Intuition's platform is critical, and we need a proactive individual who can actively support our data products and verticals across the organization. At Applied Intuition, we encourage our engineers to take ownership of technical and product decisions, actively engage with both internal and external users for feedback, and contribute to a vibrant, collaborative team culture.

Jan 14, 2026
Intuitive Surgical, Inc.
Full-time|On-site|Sunnyvale

Join our innovative team at Intuitive Surgical, Inc. as a Senior User Interface Software Engineer. In this pivotal role, you will leverage your expertise to design and develop cutting-edge user interfaces that enhance the usability of our advanced robotic surgical systems. You will collaborate with cross-functional teams to deliver high-quality software solutions that meet the needs of surgeons and healthcare professionals worldwide.

Mar 11, 2026
Applied Intuition, Inc.
Full-time|$250K/yr|On-site|Sunnyvale, California, United States

About Applied Intuition

Applied Intuition, Inc. is at the forefront of advancing physical AI technology. Established in 2017 and now valued at $15 billion, this innovative Silicon Valley company is developing the critical digital infrastructure necessary to infuse intelligence into every moving machine on Earth. Applied Intuition serves various sectors, including automotive, defense, trucking, construction, mining, and agriculture, focusing on three main areas: tools and infrastructure, operating systems, and autonomous solutions. The company is trusted by 18 of the top 20 global automakers, as well as the United States military and its allies, to deliver cutting-edge physical intelligence solutions. With its headquarters in Sunnyvale, California, and additional offices in Washington, D.C.; San Diego; Ft. Walton Beach, Florida; Ann Arbor, Michigan; London; Stuttgart; Munich; Stockholm; Bangalore; Seoul; and Tokyo, Applied Intuition continues to expand its global reach. Discover more at applied.co.

We are an in-office company, expecting our team members to primarily work from the Applied Intuition office five days a week. However, we value flexibility and trust our employees to manage their schedules responsibly, which may include occasional remote work, starting the day with morning meetings from home, or leaving early for family commitments.

About the Role

The Senior Software Integration Engineer will engage in software application development and integration tasks (covering embedded applications, cloud solutions, and user interfaces) for customer projects within the VehicleOS team. The customer applications team collaborates on all internal vertical development, integrating Vehicle OS with customer-specific applications and platforms to deliver functional and efficient vehicle solutions.

Key Responsibilities

Deliver comprehensive application-level software features that span both software and hardware in C/C++, aligning with customer specifications.
Engage directly with customers to identify target use cases and oversee the project from initiation to successful integration.
Develop end-to-end software integrations in C/C++, handling applications such as Matrix headlight control and smart vehicle functionalities.

Feb 11, 2026
Cerebras Systems
Full-time|On-site|Sunnyvale CA or Toronto Canada

Cerebras Systems is at the forefront of AI technology, creating the world's largest AI chip that is 56 times the size of traditional GPUs. Our innovative wafer-scale architecture combines the compute power of dozens of GPUs into a single chip, simplifying the programming experience. This unique design enables us to achieve unparalleled training and inference speeds, allowing machine learning practitioners to run extensive ML applications seamlessly without the complexities of managing numerous GPUs or TPUs.

Our clientele includes premier model laboratories, multinational corporations, and pioneering AI-driven startups. Notably, OpenAI has recently formed a multi-year collaboration with Cerebras, aiming to harness 750 megawatts of computational scale to revolutionize key workloads through ultra-high-speed inference.

Thanks to our cutting-edge wafer-scale architecture, Cerebras Inference delivers the fastest Generative AI inference solution globally, achieving speeds over ten times faster than GPU-based hyperscale cloud inference services, thus transforming the user experience of AI applications and enabling real-time iterations and enhanced intelligence through additional agentic computation.

Responsibilities:
Lead the design and implementation of advanced system-level debugging, validation, and observability platforms.
Develop automated systems for collecting and analyzing numerical data and execution anomalies.
Create visualization and analysis tools to facilitate efficient root-cause investigations.
Build frameworks for failure classification, regression detection, and anomaly monitoring.
Enhance compilers, runtimes, and programming interfaces to support sophisticated profiling and instrumentation.
Improve workflows related to system bring-up, low-level debugging, and validation.
Collaborate cross-functionally with teams in compiler, hardware, firmware, runtime, and infrastructure domains.
Establish best practices to ensure debuggability, reliability, and operational excellence.
Lead impactful initiatives and support incident response while driving long-term corrective solutions.

Feb 20, 2026
