Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Senior
Qualifications
Key ResponsibilitiesEnhance and support our core Python platform responsible for request routing, AI workload orchestration, GPU server capacity management, observability, authentication, rate limiting, and more. Manage our infrastructure layer utilizing Terraform, Ansible, and provider APIs to oversee our fleet of GPU workers. Take ownership of technologies such as K8s, FluxCD, Nomad, Prometheus, Thanos, Grafana, Loki, distributed networking storage, and other foundational elements of our platform. Formulate a vision and strategic roadmap for our infrastructure development over the next 1, 2, and 5 years.
About the job
Join our innovative team at fal as a Staff Software Engineer specializing in large-scale computation platforms. We are seeking a seasoned software engineer with extensive experience in developing backend systems that efficiently orchestrate workloads and manage resource constraints. Your expertise in foundational cloud infrastructure and Linux provisioning will be crucial as you work towards achieving high reliability and scalability with minimal operational overhead.
About fal
fal is at the forefront of computational technology, dedicated to innovating and optimizing large-scale computation platforms. We value creativity and ambition, offering our team members the resources and opportunities to grow and excel in a dynamic work environment.
About Our TeamAt OpenAI, our Storage Infrastructure team is at the forefront of enabling data accessibility, placement, and lifecycle management through advanced APIs. We prioritize scalability, reliability, security, and usability to meet the demands of our pioneering AI research.Role OverviewWe are seeking a talented Software Engineer to join our Storage Infrastructure team, where you will architect and maintain Exascale systems designed to efficiently and reliably manage research data across multiple regions.The ideal candidate will have extensive experience in distributed systems, particularly in developing exascale data management solutions or distributed filesystems.Your ResponsibilitiesDesign and develop software solutions to manage exascale data, ensuring accessibility for researchers.Enhance the reliability, predictability, and cost efficiency of our storage systems.Collaborate with researchers to understand and address diverse data use cases.Implement robust security measures to protect our critical datasets.Ideal Candidate ProfileStrong foundation in distributed systems principles with a proven ability to design and implement scalable, reliable, and secure storage architectures.Proficiency in programming languages relevant to storage systems development.Experience with cloud platforms, particularly Azure.Familiarity with AI/ML data access patterns.A proactive approach and adaptability in a fast-paced, dynamic environment.About OpenAIOpenAI is a cutting-edge AI research and deployment organization committed to ensuring that general-purpose artificial intelligence benefits all of humanity. We strive to push the boundaries of AI capabilities while ensuring safety and human-centric development. Our mission is to encompass and appreciate diverse perspectives, voices, and experiences that reflect the full spectrum of humanity.We are proud to be an equal opportunity employer, committed to fostering an inclusive workplace where all individuals are respected and valued.
Join Crusoe as a Staff Software Engineer focusing on innovative storage solutions. In this role, you will leverage your expertise to design, develop, and optimize systems that enhance our storage capabilities. You will work alongside a talented team, pushing the boundaries of technology in a collaborative environment.
Join our innovative team at DigitalOcean as a Senior Software Engineer I specializing in Storage solutions. In this role, you will be responsible for developing and enhancing our storage products, ensuring high availability and performance. You will work collaboratively with cross-functional teams to design, implement, and optimize scalable storage systems. If you are passionate about building robust software and enjoy tackling complex challenges, we want to hear from you!
Full-time|$180K/yr - $250K/yr|On-site|San Francisco
Join our innovative team at fal as a Staff Software Engineer specializing in large-scale computation platforms. We are seeking a seasoned software engineer with extensive experience in developing backend systems that efficiently orchestrate workloads and manage resource constraints. Your expertise in foundational cloud infrastructure and Linux provisioning will be crucial as you work towards achieving high reliability and scalability with minimal operational overhead.
Full-time|$240K/yr - $310K/yr|On-site|San Francisco, CA - US
At Crusoe, we are dedicated to accelerating the abundance of energy and intelligence. As a pioneering AI infrastructure company, we control every aspect of our operations — from energy generation to the digital tokens that power the world’s most ambitious AI workloads. Joining Crusoe means being part of a team that is shaping the future at an unprecedented pace.We are amid a transformative industrial revolution. The endless demand for AI computing power poses significant challenges, particularly concerning energy supply. Our energy-first strategy not only enhances AI infrastructure but also contributes positively to the environment, empowering innovators in the AI sector.We seek proactive, problem-solving team members who recognize the scale of our mission and are eager to navigate uncharted territories. If you aspire to advance your career alongside experts in energy, manufacturing, data center construction, and cloud services, we invite you to become part of our dynamic team.If you are ready to engage in the most impactful work of your career, assist our customers and partners in elevating their AI strategies, and contribute to a high-performing, supportive team, we welcome you to build the future with us at Crusoe.About This RoleThe Cloud Storage team at Crusoe is searching for a Senior Staff Software Engineer to act as the principal architect for our storage strategy. Unlike a Staff Engineer who leads feature development, a Senior Staff Engineer will define the long-term technical roadmap essential for our AI-scale infrastructure. You will play a crucial role in establishing the architectural strategy, ensuring the integrity and global scalability of our specialized storage services. Your work will focus on the underlying physics of the stack, bridging high-performance NVMe hardware with globally distributed object storage solutions that compete with S3.Your ResponsibilitiesArchitectural Vision & Strategy: Lead the development and execution of the long-term technical strategy for Crusoe's storage engine, while identifying and integrating industry trends such as CXL and NVMe-oF into a unified roadmap.System Programming Expertise: Utilize your extensive experience in system programming with languages such as C, C++, Go, and Rust to lay the groundwork for our V2 storage re-architecture.Storage Protocols: Design and implement solutions employing industry-standard storage protocols, including NFS, SMB, iSCSI, and NVMe/TCP.
Full-time|On-site|San Francisco, Seattle, New York, Toronto
Join Stripe as a Staff Software Engineer in our Stream Compute team, where you will play a pivotal role in building scalable solutions that power the financial infrastructure of the internet. As a member of our innovative engineering team, you will leverage your expertise to design and implement robust software solutions that enhance the performance and reliability of our streaming data capabilities.
At Rylo, we are revolutionizing the way you capture and share your experiences. Our state-of-the-art camera is designed to record your surroundings with breathtaking clarity and stability, eliminating the hassle of traditional video capture. Created by a team of visionary engineers from Instagram and Apple, our innovative stabilization software and user-friendly smartphone app ensure that every shot you take is a masterpiece. With Rylo, you can focus on enjoying the moment while we handle the technicalities of creating stunning videos.Experience Rylo in actionAs a Software Engineer specializing in Computational Photography, you will play a crucial role in enhancing the core algorithms that power the Rylo camera and future products. Your work will fundamentally enhance the photography and cinematography experience, focusing on improving image quality and developing groundbreaking computational photography features. You will engage in the complete lifecycle of algorithm development, from design and implementation to quality evaluation and performance optimization, culminating in successful deployment.Your collaboration with software engineers, hardware engineers, and designers will allow you to push the boundaries of consumer camera technology.
Full-time|$150K/yr - $237.5K/yr|On-site|San Francisco, California, United States
About Us at Redwood MaterialsAt Redwood Materials, we are revolutionizing the global battery supply chain by integrating recovery, reuse, and recycling. Founded in 2017, we are committed to keeping critical minerals in circulation and driving the energy transition. Our mission is to provide low-cost, large-scale energy storage solutions while producing battery materials in the U.S. for the first time, utilizing batteries we already have.Team ObjectivesOur team develops the software platform that enables optimized control and market participation of battery energy storage systems (BESS), with solutions operating on-site and in the cloud.Our work encompasses the entire technology stack: compute infrastructure, telemetry, asset modeling, alerting, feature ingestion and storage for forecasting and optimization, simulation, operational control orchestration, and seamless integration with energy markets.As a compact team, we uphold high standards for the quality and speed of our deliverables. We emphasize collaboration, trust, continuous learning, and engineering excellence while enjoying the process of building great solutions together.Your Role and ResponsibilitiesYou will actively contribute to our platform with potential focuses including:Data engineering for BESS — managing telemetry pipelines, feature ingestion, and storage for forecasting and optimizationSimulation and training infrastructure — developing platforms for orchestrating large-scale simulationsReal-time forecasting and optimization workflows — managing pipelines that drive BESS operationsEnergy market integration — overseeing data ingestion and bid managementSome of the technologies you will work with include Kubernetes, Rust, Python, NATS, PostgreSQL/TimescaleDB, SQLite, and more as our systems evolve.What We SeekStrong instincts for software design — you think systemically, reason from first principles, and grasp the real-world challenges of building available, reliable, scalable, and secure distributed systemsExperience with AI-accelerated development — you're comfortable leveraging AI to enhance software development processes
Databricks is looking for a Senior Software Engineer focused on Compute Infrastructure in San Francisco, California. This position centers on building and improving compute architecture to support greater performance and scalability across Databricks' platform. What you will do Develop and optimize compute infrastructure to handle demanding data processing and analytics workloads. Work closely with teams from different disciplines to deliver reliable, high-quality solutions for customers. Impact Your contributions will help define how data processing and analytics evolve at Databricks. The work directly supports customers’ ability to scale and perform complex tasks in the cloud. Who we’re looking for Strong background in cloud technologies and compute systems. Enjoys tackling complex technical challenges. Collaborative approach to problem-solving with cross-functional teams.
Team and Platform Focus The Compute Infrastructure team at OpenAI designs, builds, and maintains the systems that support AI research at scale. This work brings together accelerators, CPUs, networking, storage, data centers, orchestration software, agent infrastructure, developer tools, and observability. The aim is to create a reliable, unified experience for researchers and product teams across the company. Projects span the full stack: capacity planning, cluster lifecycle management, bare-metal automation, and distributed systems. The team manages Kubernetes scheduling, system optimization, high-performance networking, storage, fleet health, reliability, workload profiling, benchmarking, and improvements to the developer experience. Even small improvements in communication, scheduling, hardware efficiency, or debugging can significantly accelerate research. OpenAI matches engineers to areas within Compute Infrastructure that align with their skills and interests. Role Overview This Software Engineer role centers on building and evolving the compute platform that supports OpenAI’s research and products. Candidates may bring expertise in low-level systems, high-performance computing, distributed infrastructure, reliability, CaaS, agent infrastructure, developer platforms, tooling, or infrastructure user experience. The most important qualities are strong analytical skills, the ability to write resilient code, and a collaborative approach that helps colleagues move faster and with more confidence. What You Will Work On Working close to hardware or at the user interaction layer Developing CaaS and agent infrastructure Managing control and data planes that connect the system Bringing new supercomputing capabilities online Optimizing training workloads through profiler traces and benchmarks Improving NCCL and collective communication Analyzing GPUs, NICs, topology, firmware, thermal dynamics, and failure modes Designing abstractions to unify diverse clusters into a single platform Areas of Expertise No one is expected to cover every area listed. Some engineers focus on system performance, kernel or runtime behavior, large-scale networking protocols, RDMA, NCCL, GPU hardware, benchmarking, scheduling, or hardware reliability. Others improve the platform’s usability through APIs, tools, workflows, and developer experience. The team values strong engineering judgment and a drive to advance the field.
Join our team as a Staff Software Engineer specializing in Online Storage at Plaid! We are seeking a talented engineer who is passionate about building reliable, scalable, and efficient online storage solutions. In this role, you will design and implement systems that handle vast amounts of data while ensuring high performance and security.As a key member of our engineering team, you will collaborate with cross-functional teams to understand requirements and deliver innovative solutions. Your expertise will help shape our data storage architecture and influence the direction of our technology stack.
Full-time|$137.5K/yr - $276K/yr|On-site|San Francisco, California, United States
About Redwood MaterialsAt Redwood Materials, we are on a mission to localize the global battery supply chain by integrating recovery, reuse, and recycling. Founded in 2017, we are pioneering the delivery of low-cost, large-scale energy storage solutions and producing battery materials in the U.S. for the first time, all sourced from existing batteries.Role Overview:As an integral member of the Redwood Energy engineering team, you will play a hands-on role in the design, development, and integration of innovative second-life battery-based energy storage systems. This position focuses on creating robust and reliable system software for the Site Controller, which acts as the central nervous system for our products. You will be responsible for designing and implementing containerized services for networked device management, orchestrating site-level controls, managing time series data, and conducting system diagnostics.We are looking for highly motivated candidates who are adaptable to a fast-paced startup environment and eager to tackle exciting technical challenges. If you thrive in dynamic settings and are excited about contributing to a new department at Redwood, we encourage you to apply!The job level may be adjusted based on the applicant's experience and responsibilities.
Join Crusoe as a Senior Staff Software Engineer specializing in innovative storage solutions that drive our cutting-edge technology forward. In this pivotal role, you will work closely with cross-functional teams to design, implement, and optimize robust storage systems that are integral to our operations.
Full-time|$204K/yr - $255K/yr|On-site|United States
Founded in 2007, Airbnb has evolved from a simple idea of welcoming guests into a home to a global community of over 5 million hosts. With more than 2 billion guest arrivals across nearly every country, we provide unique stays and experiences that foster authentic connections between guests and communities.Join Our Community:The Online Data organization at Airbnb is dedicated to enhancing customer experiences through real-time data insights. We enable our builders to craft exceptional experiences by providing user-friendly data interfaces and tools, eliminating the need for database expertise.Your Impact:As the Engineering Manager for the Control Plane team within the Distributed Transactional Database organization, you will lead a diverse group of talented software engineers. Your mission will be to develop robust and automated software solutions for database operations, ensuring seamless integration of our open-source database with Airbnb’s Compute, Networking, and Security infrastructures.Here are some key control plane services your team will manage:Orchestration logic to oversee the lifecycle of database storage, compute, and metadata management nodes. (Blog link)An operator that automates cluster provisioning, operations, and restoration.Configuration management that identifies and rectifies unexpected configuration drifts.Open-source Developer Experience: Streamlining the development and testing processes for open-source projects including image building, CICD, and performance certification.
ABOUT USAt Applied Compute, we are pioneers in developing Specific Intelligence for enterprises, creating agents that learn continuously from a company’s processes, data, expertise, and objectives. Our mission is to establish a continual learning platform that captures context, memory, and decision traces throughout the organization, enabling specialized agents to perform meaningful tasks.Why Join Us: Our team operates at a unique intersection of innovation. Our product team is responsible for crafting a platform that serves as the backbone for a new generation of digital coworkers. Meanwhile, our research team explores the cutting edge of post-training and reinforcement learning to enhance product experiences. Our applied research engineers collaborate closely with clients to deploy agents effectively in real-world scenarios. This synergy of robust product development, extensive research, and direct client engagement is essential for us to revolutionize AI in the enterprise landscape.Our Team: Comprising engineers, researchers, and operations experts, our team includes many former founders with extensive experience. We have developed RL infrastructure at OpenAI, data foundations at Scale AI, and other systems at companies like Two Sigma and Watershed. We proudly serve Fortune 50 clients and are supported by top-tier investors including Kleiner Perkins, Benchmark, Sequoia, Lux, and Greenoaks.Who Thrives Here: We seek individuals passionate about utilizing cutting-edge research and complex systems to address real-world challenges. Comfort navigating diverse environments, whether it’s a new codebase, unfamiliar customer data architecture, or unexplored problem domains, is essential. Our team values genuine client engagement — listening, empathizing, and understanding the realities of work in their organizations. Those with entrepreneurial spirits, rich project experiences, or proven capabilities to manage tasks end-to-end will excel in our environment.THE POSITIONAs a Software Engineer, you will be instrumental in building the products and interfaces utilized by customers and internal teams. You will manage the entire application platform stack, from collaborative human-AI workspace systems to backend workflows orchestrating sandboxed agent sessions, and the continual learning SDK that provides engineers with oversight of the agent development lifecycle.
About UsAt Software Apps Inc., we are pioneering the future of technology with our groundbreaking product, Sky, which utilizes natural-language computing tailored for your Mac. Join us in our mission to innovate and transform how users interact with technology.Discover more about our team, values, and vision on our careers page: www.software.inc/jobsOur ValuesCollaboration is Key: We thrive on teamwork and believe in the power of in-person collaboration. Every team member is seen as a leader, and we encourage ownership of projects to foster growth.Honest Communication: Empathetic and open communication is vital for our close-knit team. We strive to listen as much as we talk, respecting every voice in the room.Cultivating Curiosity: In the ever-evolving landscape of AI and computing, staying curious is essential. We ask questions that guide our decisions, ensuring we stay aligned with our vision.The RoleWe are seeking a talented Software Engineer to play a pivotal role in shaping our product. You will be responsible for developing new, user-facing software. Your ability to balance ambition with feasibility will be crucial as you engage in an iterative process of building, testing, gathering feedback, and refining your work.Your Daily Responsibilities Will Include:Creating Innovative Software: Utilize your skills and passion to transform visionary ideas into actionable plans. Sometimes, this requires taking bold steps without knowing the final outcome.Taking Ownership: You'll have full responsibility for the success of your projects. Your commitment to incorporating feedback and improving quality is essential. We trust you to handle significant responsibilities.Thinking Big and Small: Understanding that every choice impacts user experience, you’ll focus on details that create seamless and magical interactions.Documenting Your Work: Keeping thorough and clear documentation is key to our collaborative approach.
Full-time|$170K/yr - $240K/yr|On-site|San francisco, CA
About the RoleAt Sigma Computing, we are revolutionizing how enterprises manage their data through an advanced high-performance platform built on modern data architecture. As we expand our engineering team, we seek passionate engineers eager to tackle complex challenges and deliver significant capabilities across our technology stack. Join our talented team committed to making data effortlessly accessible for all users.What You Will Be DoingDevelop and maintain testing infrastructure and tools, including AI-driven test generation, test harnesses, API testing, hermetic test environments, and performance testing frameworks.Enhance the testability of our systems and services.Utilize modern programming languages and tools such as K6, Rust, and Go.Establish and refine best practices to ensure high-quality standards in our systems and services.Engage in test planning and quality strategy reviews.Collaborate with peers and stakeholders through design and code reviews to uphold best practices across available technologies, dedicating a majority of your time to delivering high-quality code.
Full-time|$166K/yr - $201K/yr|On-site|San Francisco, CA - US
At Crusoe, our mission is to advance the availability of energy and intelligence. We are developing the engine that propels a future where individuals can engage in ambitious AI projects without compromising on scale, speed, or sustainability.Join us in revolutionizing the AI landscape with our sustainable technology. At Crusoe, you'll not only foster meaningful innovation but also make a tangible impact while collaborating with a team that is pioneering responsible and transformative cloud infrastructure solutions.About This Role:As a Senior Software Engineer on our storage team, you will be an integral part of our core engineering unit, tasked with designing, constructing, and optimizing our next-generation cloud storage products. We seek a hands-on engineer with profound expertise in storage system development. Your role will involve creating highly performant, reliable, and scalable distributed storage systems that are vital to both our infrastructure and our clients' AI and HPC workloads.Your Responsibilities Include:Developing Our Multi-Petabyte Cloud Storage PlatformCreating core components of our foundational storage products specifically designed for high-performance AI and ML applications.Enhancing distributed file, block, and object storage solutions, with an emphasis on filesystem-based approaches.System Design & ArchitectureDesigning and implementing scalable, resilient storage architectures that are highly extensible.Proposing and prototyping innovative strategies to enhance performance and system throughput for our most demanding customer workloads.Developing observability, metrics, and tooling for our services and infrastructure.High-Velocity Problem SolvingIdentifying and resolving unique and complex problems in distributed systems at the scale we operate.Providing ongoing support for production systems and customer workloads, which includes troubleshooting, performance tuning, and incident management.Cross-Functional CollaborationCultivating strong collaboration with other engineering teams (e.g., Software Infrastructure, Product) and cross-functional departments.Taking ownership and representing the storage team in critical business initiatives.
Full-time|Remote|San Francisco, CA; New York, NY; Remote - US
Join Airtable as a Senior Software Engineer in the Compute team where you'll play a pivotal role in developing scalable and robust software solutions. You will collaborate with cross-functional teams to design and implement innovative features that enhance our platform's capabilities. Your expertise in software engineering principles and practices will be essential as you contribute to a dynamic and agile environment.
Patreon is a dynamic media and community platform empowering over 300,000 creators to connect with their most dedicated fans through exclusive work and experiences. We provide creators with diverse avenues to engage their audience and establish sustainable businesses, including paid and free memberships, community chats, live video interactions, and direct sales through one-time purchases. Our overarching mission is clear: to support and fund the creative class. We have made significant strides in this area, having:Generated over $10 billion for creators since our inceptionOffered more than 100 million free memberships to fansEstablished over 25 million paid memberships on Patreon today.As we continue to strengthen our creator platform, we are seeking a Senior Storage Platform Software Engineer to contribute to our mission.This position is based in San Francisco and requires in-office attendance two days a week as part of our hybrid work model.
Oct 20, 2025
Sign in to browse more jobs
Create account — see all 5,592 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.