Backend Infrastructure Software Engineer
Experience Level
Entry Level
Qualifications
About Logz.io
Logz.io is a leading observability platform that leverages the power of open-source technologies to deliver exceptional performance and reliability. We are at the forefront of making data observability easy and accessible for organizations worldwide.
Similar jobs
About Orb:
At Orb, we are pioneering a new frontier in how AI and software enterprises effectively monetize at scale. Our cutting-edge billing infrastructure transforms intricate usage-based pricing models into strategic advantages. With a developer-first mindset, we empower industry leaders like Vercel, Pinecone, and Replit by providing real-time billing automation, rapid pricing experimentation, and detailed revenue insights.
Supported by $44.1 million in investments from premier venture capitalists, including Mayfield, Menlo Ventures, and Greylock, we are a dynamic team committed to shaping the future of monetization through innovative infrastructure solutions.
We embrace a hybrid work culture, collaborating in the office three days a week. Our core values (customer centricity, urgency, proactive engagement, and meticulous attention to detail) guide our team's growth and collaboration.
About the Role:
As a vital member of our infrastructure team, you will uphold our commitment to high reliability standards. Our clients rely on our systems for continuous operation since any downtime can result in significant revenue losses. You will be responsible for the infrastructure that supports our entire product, encompassing event ingestion, API services, alerting mechanisms, invoicing, and much more.
Your Responsibilities:
Lead initiatives for infrastructure resilience, including recovery strategies, tenant isolation, and load management
Enhance the observability and operational efficiency of our systems
Develop performance-critical, user-facing infrastructure, such as real-time event processing systems
Strategize scaling initiatives to accommodate substantial customer growth
Collaborate with cross-functional engineering teams to ensure the development of robust product features
Share knowledge and best practices with your talented peers
About You:
You possess a deep understanding of edge cases, potential failure modes, and performance bottlenecks
You have a talent for diagnosing and resolving complex errors and performance challenges
You thrive on building scalable infrastructure for high-growth products
You are adept at mentoring fellow engineers in best practices, including observability and risk management strategies
You bring 5+ years of experience in software engineering, particularly focused on infrastructure
Founded in 2007, Airbnb has evolved from a modest beginning when two hosts welcomed three guests in their San Francisco home to a global platform with over 5 million hosts and more than 2 billion guest arrivals worldwide. Our hosts provide unique stays and experiences that foster genuine connections with communities across the globe.
The Community You Will Join:
Joining the Networking team within Airbnb’s Cloud Infrastructure organization means taking part in the ownership of our entire production network infrastructure. This team is tasked with the design, development, and operation of the software and solutions that facilitate connectivity for all Airbnb users and services. This includes essential components like traffic proxy and load balancer, service mesh, VPC/backbone and cross-region connectivity, as well as network monitoring and security systems. With a global user base, our team's priorities are reliability, scalability, efficiency, and high availability.
The Difference You Will Make:
As a key member of the network infrastructure team, you will collaborate with talented engineers on innovative cloud-native network stack technologies ranging from Layer 3 to Layer 7. Your contributions will be vital in shaping the core infrastructure that links Airbnb users and services worldwide. You will have the opportunity to spearhead significant infrastructure initiatives such as global traffic load balancing, disaster recovery solutions, the next-generation service mesh, cross-region gateways, and edge security implementations.
Airbnb is proud to be a part of the Cloud Native Computing Foundation (CNCF) end user community and actively collaborates with the open-source community (including projects like Kubernetes and Istio) as well as peer companies to address cloud-native engineering challenges at scale. You will have the chance to make a meaningful impact on the industry and within open-source communities.
A Typical Day:
Collaborate with open-source communities (e.g., Istio) to develop the next-generation service mesh for all Airbnb back-end services;
Design and implement cross-region gateways and load balancers for global Airbnb services;
Engage with external partners and internal engineering and security teams to deliver edge security systems.
Join our dynamic team at dev2 as a Principal Software Engineer and take the lead in shaping innovative software solutions. In this fully remote position, you will leverage your extensive experience to design, develop, and implement cutting-edge software applications that meet the needs of our clients. Your leadership will guide a team of talented engineers, fostering an environment of collaboration and creativity.
Why choose a career at Nebius?
Nebius is at the forefront of revolutionizing cloud computing to empower the global AI economy. We provide innovative tools and resources to help our customers tackle real-world challenges and reshape industries, all while minimizing infrastructure costs and eliminating the need for extensive in-house AI/ML teams. Our workforce operates at the cutting edge of AI cloud infrastructure, collaborating with some of the most experienced and visionary leaders and engineers in the industry.
Our Work Environment
Headquartered in Amsterdam and publicly listed on Nasdaq, Nebius boasts a global presence with research and development hubs across Europe, North America, and Israel. Our diverse team of over 1400 employees includes more than 400 highly skilled engineers with profound expertise in hardware and software engineering, alongside a dedicated in-house AI research and development team.
The Role
Nebius manages large-scale, mission-critical bare-metal infrastructure. As an Infrastructure Software Engineer specializing in Python, you will architect and develop systems that provision, configure, test, and manage physical hardware at scale. Your work will be closely integrated with the hardware layer, interfacing directly with servers, networks, and management controllers, while facilitating highly automated and reliable infrastructure operations.
You will work in close collaboration with hardware, networking, and data center operations teams to ensure our platforms are resilient, scalable, and production-ready.
Your Key Responsibilities Include:
Designing and developing backend services and automation using Python
Building and maintaining systems for hardware provisioning, testing, and lifecycle management
Creating software that operates directly on bare-metal environments
Integrating with Linux systems, utilizing Bash and low-level tools as necessary
Implementing and maintaining CI/CD pipelines for infrastructure-focused software
Working with networking services, including IPv4/IPv6, DHCP, DNS, network boot, and server boot workflows
Interfacing with BMC controllers and related hardware management systems
At Render, we are at the forefront of developing a cutting-edge cloud platform tailored for developers who are creating AI-native, full-stack, multi-service applications. Our mission is to bridge the gap between the robust capabilities of hyperscalers and the user-friendly simplicity of developer-centric platforms, enabling teams to deliver products quickly, scale effectively, and concentrate on their innovations rather than infrastructure.
Unlike complex hyperscalers or transient edge/serverless solutions, Render provides a developer-first experience with persistent compute resources, dynamic autoscaling, integrated orchestration, and observability. This allows teams to launch, scale, and manage real-world applications without the need to write infrastructure code or oversee server management. Whether you're developing LLM-powered applications, scalable SaaS solutions, or asynchronous processing pipelines, Render empowers teams to accelerate their workflows and scale with confidence from MVP to millions of users.
Our platform is trusted by over 4.5 million developers globally and continues to experience rapid growth. In February 2026, we secured an additional $100M in Series C funding, bringing our total funding to $257M, to further our vision of making cloud infrastructure both powerful and intuitive, especially for the fast-paced world of modern AI development.
We pride ourselves on our diverse and talented team that emphasizes craftsmanship, speed, and exceptional user experience. If you are passionate about shaping the future of intelligent cloud solutions and empowering developers worldwide, we would love to connect with you.
About Cape
Founded in early 2022 by experts from Palantir and Anduril, Cape is on a mission to revolutionize privacy in the wireless world. Inspired by a passion for mobile privacy and national security, our CEO is determined to empower individuals to regain control over their personal data. We are not just another cellular provider; we are the pioneers of a movement that prioritizes privacy at its core.
At Cape, we believe that your location and relationships are personal and should remain confidential. Privacy is not a limitation; it’s a suite of features that enhances your freedom. With backing from Andreessen Horowitz and other top-tier investors, we are excited to expand our team and innovate further.
Join Our Team
As relentless builders, we are committed to pushing the boundaries of technology. Our team thrives on innovation and collaboration, and we trust you to deliver exceptional results while making a significant impact. You will work alongside talented engineers and colleagues in an inspiring environment.
Your Role
In this dynamic role, you will:
Engage in an exciting early-stage startup environment; be ready to embrace challenges.
Help restore the privacy that many have forfeited due to smartphone proliferation.
Apply your technical skills to solve complex issues impacting consumer privacy and national security.
Explore innovative uses of emerging technologies.
Work on new projects from inception, shaping the technology stack and methodologies.
Enjoy the flexibility of working remotely while having the option to collaborate in our DC or NY offices, fostering a vibrant and informal culture.
We offer a competitive salary, benefits, and equity with significant growth potential.
Join Our Innovative Team at Scrunch
Scrunch, a forward-thinking startup fueled by venture capital, is committed to leading brands into an AI-driven future. We empower individuals to harness the capabilities of large language models (LLMs) to discover, comprehend, and act on the information that matters most.
As AI search and conversational agents are set to redefine the marketing landscape, Scrunch partners with leading AI platforms to assist marketing teams in reimagining how their products and services are discovered and promoted. With our collaboration with platforms such as ChatGPT, Claude, and Gemini, we are at the forefront of transforming marketing, representing the most significant evolution since the internet's inception.
With a robust $26 million investment from prominent firms such as Mayfield Fund, Decibel, Homebrew, and GTM Capital, Scrunch has experienced rapid growth since our commercial launch. Currently, over 500 brands, including Fortune 500 companies like Lenovo, trend-setting brands like Skims, and dynamic startups like Clerk, rely on our platform.
Position Overview
We are on the lookout for a Senior Infrastructure Engineer to enhance our team. This role offers high ownership and responsibility, focusing on the design, construction, and maintenance of the essential systems that power Scrunch's platform. You will work on cloud infrastructure, developer tools, observability, and reliability: critical components of our operations that directly impact our customers.
Location Requirements:
This position is available to candidates located in and legally authorized to work in the U.S. While Scrunch is primarily a remote organization, we welcome hybrid or in-office arrangements for candidates based in the NYC or Salt Lake City metro areas, as we enjoy collaborative in-person interactions!
Applicants must reside in one of the following states: Arizona, California, Colorado, Florida, Illinois, Indiana, Massachusetts, Maryland, Missouri, Minnesota, New Hampshire, New Jersey, New York, Ohio, Texas, or Utah.
Currently, we are unable to hire candidates outside these states.
Qualifications
We seek candidates who:
Possess experience in a high-velocity software development environment.
Have a proven track record in designing, building, and maintaining scalable cloud services, ideally on Google Cloud Platform (GCP).
Have successfully deployed and operated edge computing functions and understand the strategic use of edge logic.
AcuityMD
Infrastructure Software Engineer
AcuityMD is at the forefront of revolutionizing access to medical technologies through our innovative software and data platform. We empower MedTech companies to gain insights into product usage, customer variability, and opportunities to enhance patient care. With approximately 6,000 new medical devices approved by the FDA each year, our platform accelerates the journey from product development to physician access, ultimately improving patient outcomes. Backed by prominent investors including Benchmark, Redpoint, ICONIQ Growth, and Ajax Health, we are a rapidly scaling SaaS organization.
As an Infrastructure Software Engineer, you will collaborate closely with various teams across Engineering and Production. Your role will involve designing, building, and maintaining core platform services (encompassing compute, networking, storage, CI/CD, and developer tooling) that drive our applications from start to finish. You will enhance reliability, security, and efficiency, while fostering an exceptional developer experience. Additionally, you will contribute to our strategic objectives as we advance our infrastructure and practices to maximize both internal and customer impact.
Team Mission
Our Platform Team serves as the backbone for the organization, ensuring that product teams can deliver swiftly and safely. We create direct customer value through our cloud capabilities and security measures while supporting internal success through partnerships with application teams, enabling a superior development experience.
Responsibilities
Steward core platform services: Implement scalable container orchestration, service mesh, ingress, and secrets management.
Cross-functional partnership: Collaborate with Product, Engineering, Data, and Security to drive external and internal value.
Harden reliability: Enhance observability through logging, metrics, and tracing, along with automated remediation to boost availability and reduce latency.
Automate everything: Utilize infrastructure-as-code and configuration management to ensure systems and processes are repeatable, auditable, and secure.
Scale cost-effectively: Optimize cluster utilization and autoscaling, balancing performance, reliability, and costs.
Level-up developer experience: Develop internal tooling, templates, and best practices that minimize cognitive load and expedite time-to-deploy for product teams.
On-call & incident response: Engage in a sustainable on-call rotation, lead post-mortems, minimize repetitive tasks, and reduce mean time to recovery through automation.
Enable fast, safe delivery: Enhance CI/CD pipelines to facilitate swift and secure software releases.
At ClickUp, we're not just developing software; we're shaping the future of work! In an era marked by work sprawl, we envisioned a better solution. That's why we developed the first genuinely integrated AI workspace, merging tasks, documents, chat, calendar, and enterprise search, all enhanced by context-aware AI, empowering millions of teams to escape silos, reclaim their time, and achieve unprecedented productivity levels. At ClickUp, you will have the chance to learn, utilize, and innovate with AI in ways that influence not only our product but the future of work as a whole. Join us and be part of a bold, innovative team that's redefining possibilities!
ClickUp's mission is to enhance global productivity, and it begins with equipping our engineering team with the right tools, frameworks, and best practices to maintain the functionality and performance of our versatile work app. We are seeking seasoned Software Engineers to assist in scaling our test infrastructure for a modern, AI-driven web and mobile platform that serves millions of users worldwide.
In this role, you will design and enhance robust testing infrastructures for frontend, backend, services, or mobile applications. You will contribute to the development of unit, integration, end-to-end, API, performance, load, and scalability testing frameworks across a hybrid monolithic and microservices architecture. Your focus will be on AI-assisted testing solutions, test isolation, prioritization and quarantining strategies, test data generation, CI quality metrics, test impact analysis, and flake detection and elimination. Our goal is to maximize reliability while sustaining velocity.
This remote Software Engineer, Infrastructure role at Tavus centers on building and maintaining the systems behind Tavus applications. The position focuses on designing infrastructure that can grow with the company, improving system performance, and ensuring security remains strong throughout all operations.
Key responsibilities
Develop and maintain the infrastructure supporting Tavus products
Collaborate with team members to design scalable systems
Monitor performance and optimize systems across platforms
Implement and uphold security measures
Role overview
This position suits engineers interested in the backbone of application delivery. Work will include both hands-on development and close collaboration with others to ensure systems remain reliable, efficient, and secure in a remote setting.
Join Affirm as a Senior Software Engineer focused on Infrastructure, where you will play a pivotal role in designing and implementing scalable systems that support our growing platform. Collaborate with cross-functional teams to enhance our infrastructure, ensuring that it is robust, efficient, and secure. Your contributions will directly impact the performance and reliability of our services, helping us to deliver exceptional value to our customers.
Join Finalis as a Senior Software Engineer and play a pivotal role in shaping our innovative technology solutions. We are looking for a skilled engineer to work collaboratively in a hybrid environment, contributing to our cutting-edge projects while mentoring junior team members. You will have the opportunity to enhance your technical expertise and drive impactful changes in our software development processes.
We are seeking a passionate Backend Infrastructure Software Engineer who embraces complex challenges and is eager to explore innovative solutions. In this role, you will take charge of the systems that facilitate the ingestion, storage, and serving of data at scale, significantly contributing to the reliability and performance of our backend infrastructure.
Our data pipeline serves as the foundation for our operations. As we strive to enhance ingestion throughput, query performance, and storage efficiency, we need a skilled engineer like you to accelerate our progress and help us achieve our goals with intelligence and reliability.
Afresh Technologies, Inc.
Join Afresh as a Senior Software Engineer in our Infrastructure team, where you will play a crucial role in designing and developing scalable infrastructure solutions. You will collaborate with a dynamic team of engineers to enhance our existing systems and ensure optimal performance and reliability.
About the Role
At Abnormal Security, we empower enterprises of all sizes to combat cybercrime with our innovative cloud products. Our Platform Infrastructure team is at the heart of this mission, building and managing the vital systems that enable our AI-driven detection and prevention capabilities, ensuring reliability, scalability, and security at cloud scale.
We are seeking a Staff Software Engineer to spearhead foundational initiatives across various aspects of Platform Infrastructure. In this position, you will lead a talented team, define the roadmap for a truly self-service infrastructure platform, and spearhead ambitious technical projects that leverage AI to enhance our system development and operations.
The ideal candidate:
Effectively addresses complex and ambiguous challenges, converting them into actionable strategies.
Demonstrates leadership by setting an example and delving into details when necessary.
Embodies our VOICE values and creates software solutions that exceed customer expectations.
Builds trust through collaborative efforts across Engineering, Product, and Design teams.
Team Mission: To construct and enhance the core infrastructure (compute, orchestration, and data platform) that supports Abnormal’s AI/ML products at scale. We prioritize our platforms as products: ensuring they are usable, reliable, secure, and cost-effective.
About Us
At Sierra, we are revolutionizing the way businesses engage with their customers by building a cutting-edge platform that harnesses the power of AI. Our headquarters is located in the vibrant city of San Francisco, with additional offices expanding in Atlanta, New York, London, France, Singapore, and Japan.
Our company culture is deeply rooted in our core values: Trust, Customer Obsession, Craftsmanship, Intensity, and Family. These principles guide our actions and foster an environment where innovation thrives.
Sierra was co-founded by visionary leaders Bret Taylor, who currently serves as the Board Chair of OpenAI and has a rich history with Salesforce and Facebook, and Clay Bavor, who previously led Google Labs and spearheaded initiatives like Google Lens and Project Starline.
Your Role
As a Software Engineer focusing on Infrastructure at Sierra, you will play a pivotal role in designing, constructing, and maintaining the foundational systems that empower our AI platform. Your expertise will ensure that our infrastructure is not only secure and reliable but also scalable, allowing product teams to execute their work with agility and confidence.
Guarantee the reliability, scalability, and performance of our platform and LLM inference serving in response to increasing traffic demands.
Develop and oversee cloud infrastructure using Terraform to create secure, scalable, and reproducible environments.
Establish and manage a self-service infrastructure platform to empower engineering teams in deploying and operating services independently.
Take ownership of and improve CI/CD pipelines and release management processes, facilitating rapid and reliable deployments across Sierra’s platform.
Design and manage distributed systems utilizing distributed databases, retrieval systems, and machine learning models.
Develop and sustain core data serving abstractions along with essential authentication and security features (SSO, RBAC, authentication controls).
Effectively navigate and integrate our technology stack with enterprise customer environments in a scalable and maintainable manner.
At Exa, we are on a mission to create a cutting-edge search engine from the ground up, designed to cater to the diverse needs of AI applications. Our team is building a robust infrastructure that enables us to crawl the internet, train advanced embedding models for indexing, and develop high-performance vector databases using Rust. Additionally, we manage a significant $5M H200 GPU cluster that powers tens of thousands of machines.
The Infrastructure Team at Exa is responsible for developing the essential tools and infrastructure that support our entire system. We are looking for talented infrastructure engineers to help us scale our capabilities rapidly. Your work could involve orchestrating GPU clusters with Kubernetes, implementing map-reduce batch jobs on Ray, or creating top-tier observability tools that set industry standards.
Join the Revolution at Check
At Check, we are transforming the payroll landscape. Our mission goes beyond just building a successful business; we collaborate with our partners to innovate payroll solutions. As pioneers of embedded payroll, we are reshaping the payment process, enabling payroll businesses to launch, expand, and succeed with ease.
Check is more than an API; we are the catalyst for developing and scaling payroll operations.
Our Team
The payroll system is in dire need of innovation. We invite you to join a passionate team dedicated to making an impactful change! At Check, you will leverage creative problem-solving and critical thinking to influence every business we partner with. We view challenges as opportunities for improvement, valuing the unique contributions of each team member in our collective mission.
If you're ready to dive in and transform payroll, let's collaborate to simplify complexity and enhance the future for businesses of all sizes.
Your Role
At Check, engineering is our foundation. We believe that payroll should resemble modern financial software; achieving this requires a comprehensive understanding of systems and reliable infrastructure that our partners can trust. Every product we deliver relies on scalable and secure systems that ensure timely payments and payroll processing.
We are seeking a Staff Software Engineer who possesses strong software design capabilities coupled with hands-on infrastructure experience. In this position, you will focus on the essential systems that drive payroll operations, enhancing our service scalability, production operations, and empowering engineers with the tools to deliver software confidently and securely.
You will collaborate across product and platform areas to enhance our cloud infrastructure, fortify our deployment and monitoring strategies, and streamline the architecture that supports embedded payroll services. The challenges you will address often intersect infrastructure, product, and operational domains.
This opportunity is perfect for someone who has managed complex systems end-to-end in a dynamic environment and takes pride in developing resilient, comprehensible infrastructure that is vital to our operations.
Siftstack
At Sift, we are transforming the way modern machines are designed, tested, and operated. Our cutting-edge platform offers engineers real-time observability over high-frequency telemetry, effectively removing bottlenecks and facilitating faster, more reliable development.
Sift originated from our efforts at SpaceX, where we worked on projects like Dragon, Falcon, Starlink, and Starship. These experiences highlighted the necessity for scalable telemetry, robust debugging of flight systems, and unwavering mission reliability, leading to the creation of innovative infrastructure. Founded by a team with backgrounds from SpaceX, Google, and Palantir, Sift is tailored for mission-critical systems where precision and scalability are paramount.
As an early engineer at Sift, your role will extend beyond mere code writing; you'll be instrumental in defining architecture, shaping product development, and influencing a culture dedicated to addressing genuine engineering challenges. If you are eager to tackle complex technical problems and contribute to foundational systems that support intricate machines from the ground up, we encourage you to apply.
In This Role, You'll:
Design, develop, and uphold scalable, resilient infrastructure solutions that cater to our expanding platform and clientele.
Collaborate closely with software engineers to enhance application performance and reliability.
Establish monitoring, alerting, and logging frameworks to proactively identify and resolve issues.
Automate deployment processes and refine infrastructure management using advanced DevOps tools and methodologies.
Advance our backend architecture and infrastructure for both cloud and on-premise deployments.
Collaborate with the team to set and prioritize our roadmap for maximum customer impact.
Lead efforts to enhance infrastructure reliability, performance, and cost efficiency.
Who We Are
Serval is an innovative AI-driven automation platform redefining operational efficiency for enterprises. Our intelligent agents seamlessly comprehend and execute real-world workflows, replacing outdated manual processes with adaptive, self-learning software. Since our inception in early 2024, we have garnered the trust of industry leaders such as General Motors, Notion, Perplexity, Vercel, Mercor, LangChain, and Verkada, streamlining high-volume operational tasks across their organizations.
At the heart of Serval is a cutting-edge agentic AI platform that transforms natural language into actionable workflows. Our agents not only respond to queries but also reason, act across various systems, and continuously enhance their performance. What started as a solution for operational tasks has rapidly expanded into a versatile AI automation layer utilized across IT, HR, Finance, Security, Legal, and Engineering sectors.
Our mission is to eradicate repetitive, manual tasks within enterprises, empowering teams through intelligent automation. In the long run, we aim to establish a universal AI operations layer: a system of agents that integrates across business functions, maintaining the momentum of modern companies.
We are proud to be backed by renowned investors including Sequoia Capital, Redpoint Ventures, Meritech, First Round, General Catalyst, and Elad Gil, and founded by seasoned product and engineering leaders from Verkada.
Role Overview
As a Senior Software Engineer in Infrastructure at Serval, you will be pivotal in developing and scaling the core systems that empower our AI agents and workflow automation platform. A crucial aspect of this role involves enabling and supporting self-hosted deployments for enterprise clients needing on-premises or private cloud environments. We are looking for engineers with profound expertise in distributed systems, infrastructure-as-code, production operations, and customer-facing support, who aspire to influence the technical architecture of a rapidly evolving platform.
What You'll Do
Design, implement, and operate large-scale distributed systems that power Serval's AI agents, workflow orchestration, and data pipelines.
Create and maintain Terraform modules to provision and manage cloud infrastructure across AWS, GCP, or Azure environments.
Develop and sustain deployment packages, installation scripts, and infrastructure templates, enabling customers to self-host Serval in their own environments.
Provide technical support and guidance to enterprise customers during installation and deployment phases.

