Dagster LabsRemote with offices in San Francisco, CA / New York, NY / Minneapolis, MN
Remote Full-time $190K/yr - $230K/yr
Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Entry Level
Qualifications
About the Role
As a key member of our Platform Engineering Team, you will contribute to the advancement of our Dagster+ product by refining our Platform API and the underlying systems. You will face complex engineering challenges, from CI/CD systems to database performance, deep Python internals, and infrastructure primitives. This role is fundamentally a software engineering position where you will collaborate with fellow engineers to design and build scalable systems that power Dagster Cloud, focusing on high-quality code. We seek engineers who approach platform challenges with a builder's mindset rather than a purely operational focus.
This is a full-time position with a competitive salary, equity, and benefits. While we are a distributed team with offices in San Francisco, New York, and Minneapolis, we welcome fully remote candidates authorized to work in the United States. We offer flexible remote work options to maximize your productivity, whether at home or in a coworking space. Dagster Labs cultivates a collaborative, remote-first culture, ensuring you have all the support you need to thrive.
About the job
Dagster Labs develops tools that enable organizations to build scalable and efficient data platforms. The company’s core offerings include Dagster, an open-source project popular among developers, and Dagster+, a managed cloud solution. These products support thousands of teams, ranging from early-stage startups to established enterprises, in their analytics, machine learning, and AI initiatives.
With the rapid growth of AI, the need for reliable, high-quality data has never been greater. Dagster Labs is dedicated to simplifying the testing, comprehension, and usability of data platforms. Many top AI companies have adopted Dagster as a foundational part of their technology stack.
Team culture
The team operates with strong funding and a collaborative spirit. High standards, open communication, and a focus on trust and curiosity shape the work environment. The company values a workplace free from egos and unnecessary drama.
Locations
This is a remote-first company with offices in San Francisco, New York, and Minneapolis.
About Dagster Labs
Dagster Labs is at the forefront of empowering organizations to build productive data platforms. Our open-source tool, Dagster, is widely embraced by developers, while Dagster+ provides a robust managed cloud solution. We are trusted by numerous teams globally to drive their data analytics and AI advancements.
About Our TeamThe Platform Systems team at OpenAI is at the forefront of innovation, merging advanced AI technologies with large-scale distributed systems. We are tasked with creating the engineering and research infrastructure essential for training OpenAI's premier models on some of the most powerful, custom-built supercomputers globally.Our team is dedicated to developing the core software for model training, delving deep into the technological stack. This encompasses collective communication, compute efficiency, parallelism strategies, fault tolerance, failure detection, and observability. The systems we design are pivotal to enhancing OpenAI's research capabilities, facilitating reliable and efficient training at the leading edge of technology.We work in close partnership with researchers across the organization, continuously integrating insights from various OpenAI projects to advance our training platform.About the RoleAs a Software Engineer specializing in Platform Systems, you will architect and develop distributed systems that enhance visibility into large-scale training operations, ensuring their dependable operation at scale.Your responsibilities will include designing systems for failure detection, tracing, and observability that pinpoint slow or malfunctioning nodes, identify performance bottlenecks, and assist engineers in optimizing extensive distributed training tasks. This infrastructure is integral to the functionality of OpenAI's training stack and is continuously evolving to accommodate new use cases and increasingly intricate workloads.This position is central to our training infrastructure, merging systems engineering, performance analysis, and large-scale debugging.Key ResponsibilitiesDesign and develop distributed failure detection, tracing, and profiling systems tailored for large-scale AI training jobs.Create tools to identify slow, faulty, or errant nodes and deliver actionable insights into system behavior.Enhance observability, reliability, and performance across OpenAI's training platform.Troubleshoot and resolve issues within complex, high-throughput distributed systems.Collaborate effectively with systems, infrastructure, and research teams to advance platform capabilities.Adapt and expand failure detection and tracing systems to support new training paradigms and workloads.Ideal Candidate ProfilePossesses a deep passion for performance, stability, and observability in distributed systems.Demonstrates proficiency in systems engineering and performance analysis.Has experience in debugging high-throughput distributed systems.Exhibits strong collaboration skills with a track record of working with cross-functional teams.Shows adaptability and eagerness to embrace new technologies and methodologies.
At NerdWallet, we are committed to empowering individuals to make informed financial decisions. Our team comprises exceptional individuals who thrive in an inclusive, flexible, and candid environment. Whether you choose to work remotely or in the office, we prioritize your well-being, professional development, and the impact you can make. We believe that when one of us elevates our skills, the whole team benefits.As part of NerdWallet’s Platform team, you will oversee the systems that serve as the backbone of our consumer experience. This includes management of our centralized product data platform, partner ingestion pipelines, publishing and click-tracking infrastructure, GraphQL gateway operations, and our high-traffic, headless WordPress CMS. These platforms deliver precise, compliant, and high-performance product and content experiences to millions of users on both web and mobile platforms. We are searching for a Senior Engineering Manager to lead this team in modernizing legacy services into scalable and reliable systems while advancing our vision of a decoupled, adaptable platform that facilitates quicker publishing, enhanced observability, and future growth.In the role of Senior Engineering Manager for Platform Systems, you will guide and support a team of engineers in delivering high-quality, scalable, and secure software that aligns with NerdWallet’s product and business objectives. You will collaborate closely with Product Managers and other cross-functional partners to define the roadmap, prioritize tasks, and eliminate obstacles, while nurturing strong engineering practices and a culture of continuous improvement. Your responsibilities will include ensuring technical quality, team well-being, and daily operations, while mentoring engineers, making strategic technical decisions, and balancing immediate deliverables with long-term sustainability, compliance, and reliability.This position reports to the Director of Engineering.Opportunities for Impact:Lead, mentor, and develop a high-performing engineering team responsible for NerdWallet’s platform systems, including the Content Platform, CMS, and Product Data Platform.Collaborate with Product Managers and cross-functional teams to strategize, prioritize, and execute the product roadmap.Champion consistent adherence to software development best practices, including code quality, testing, documentation, and operational excellence.Influence and guide technical and architectural decisions to ensure solutions are scalable, secure, reliable, and compliant with regulatory standards.Balance immediate project needs with long-term project vision and maintainability.
Full-time|$300K/yr - $320K/yr|On-site|San Francisco, CA | New York City, NY | Seattle, WA
About AnthropicAt Anthropic, we are dedicated to creating AI systems that are not only reliable and interpretable but also steerable. Our mission is to ensure that AI is both safe and beneficial for our users and society at large. Our quickly growing team comprises passionate researchers, engineers, policy experts, and business leaders who collaborate to construct beneficial AI systems.About the RoleWe are seeking talented software engineers to join our Platform organization, where we create foundational tools that enhance product development across Anthropic. You will take ownership of the infrastructure and systems that teams rely on to deliver products reliably and at scale, whether for internal stakeholders or for hundreds of thousands of external users and companies worldwide.This position is ideal for engineers who excel in tackling complex infrastructure challenges and designing systems that are reliable, scalable, and elegant. You will play a pivotal role in defining the performance quality standards for our company, supporting the next generation of LLM-first products, and transforming the developer experience into a best-in-class model.We have several teams actively hiring, and placements will be made post-interview based on your interests and experience, along with our organizational needs. This flexible approach allows us to align talented engineers with the projects where they can have the most significant impact and potential for growth.Platform Acceleration: Your work will focus on maximizing the productivity of our product engineers. You will design and optimize essential development infrastructure that drives our AI product development, including development environments, observability tools, and CI/CD pipelines. Working closely with product teams, you'll identify and eliminate friction points in their development workflows, thus significantly enhancing productivity across our entire product organization and accelerating our mission.Service Infrastructure: We maintain the core systems that underpin Anthropic's engineering efforts, from service mesh and observability systems to deployment pipelines and shared libraries. Our contributions enable product teams to build and operate reliable services at scale, making us a vital multiplier of effectiveness across the entire company.Multicloud: We create and maintain the infrastructure that allows Anthropic to operate seamlessly across multiple cloud providers, emphasizing cloud-agnostic tools, cross-cloud networking, and multi-region deployments.Authentication & Identity: We construct and uphold the critical infrastructure that supports identity management and authentication across Anthropic's platforms.
Join Cloudflare as a Distributed Systems Engineer within our dynamic Data Platform team, focusing on Analytics and Alerts. In this position, you will play a pivotal role in building and optimizing distributed systems that power our data analytics capabilities, providing real-time insights and alerts to enhance our customer experience.
Join Condor Software as a Full-Stack Platform EngineerAt Condor, we are revolutionizing the financial infrastructure that supports clinical development. With billions invested annually in discovering and developing new therapies, we strive to connect clinical operations and finance into a cohesive system. By integrating real-time financial intelligence, we empower R&D and finance leaders with the tools they need to make informed, high-stakes decisions.We are an AI-driven, pharma-native infrastructure provider, scaling industry standards in collaboration with top-tier partners. Our platform facilitates prediction, control, and execution in the most complex R&D environments worldwide.The Importance of Your RoleHaving established ourselves as a trusted partner for enterprise teams, we are now focused on the challenging task of scaling our platform to meet increasing demands. As a rapidly growing company, backed by prominent investors like Felicis and 645 Ventures, this is a unique opportunity to contribute to the foundational infrastructure that will redefine how therapies reach patients.Your ResponsibilitiesAs a Full-Stack Platform Engineer, you will be pivotal in building and scaling the core platform that supports the financial intelligence infrastructure relied upon by leading biopharma companies. This role encompasses critical engineering tasks at the intersection of backend systems, cloud infrastructure, and intelligent automation, with a strong emphasis on reliability and scalability.Your primary focus will be on backend architecture, where you'll design and implement services that drive complex financial and operational workflows. You'll be instrumental in shaping data flow, workflow orchestration, and enabling emerging AI-driven capabilities. This role goes beyond simple integration; you'll be crafting robust primitives that support other teams as our product and customer base expand.Working as a core member of a cross-functional product team, you will closely collaborate with product managers, designers, quality engineers, and data specialists to transition features from concept to production. While backend expertise is crucial, you will also engage across the stack to ensure the platform's capabilities are effectively leveraged.
Join Our Team at ConversionAt Conversion, we are pioneering the future of marketing automation through our AI-driven platform, designed specifically for modern software companies. Traditional marketing tools are often outdated and fragmented, leading to ineffective strategies. Our mission is to repair these broken funnels and misaligned messaging.With Conversion, growth teams can manage their entire go-to-market strategy seamlessly within a single interface. From customer acquisition to activation and retention, everything is streamlined, personalized, and enhanced by AI technology.Having secured over $28 million in funding from notable investors like Abstract Ventures, True Ventures, and HOF Capital, we are experiencing rapid growth with over $5 million in ARR and more than 4000 satisfied customers.Our compact yet high-performing team based in San Francisco is looking for passionate individuals who thrive in a creative, product-led marketing environment. If you're eager to collaborate with top-tier talent and contribute to innovative solutions, we want to connect with you!Over $28M raised in funding$0 to $6M ARR achieved in under a year, serving 4000+ customersJoin an elite team with backgrounds from Airbnb, Palantir, Pinterest, IMC Trading, Shopify, LinkedIn, Microsoft, and more
At Plaid, we envision a future where people's financial interactions are profoundly enhanced. We are committed to driving this change by creating innovative tools and experiences that empower thousands of developers in building their own products. Plaid supports the financial well-being of millions by facilitating seamless connections between users and their financial accounts with the applications they rely on. Our network integrates with over 12,000 financial institutions across the US, Canada, the UK, and Europe. Established in 2013, Plaid is headquartered in San Francisco, with additional offices in New York, Washington D.C., London, and Amsterdam.The Platform Engineering division at Plaid consists of diverse teams focused on essential infrastructure, data management, storage solutions, privacy, and enhancing developer efficiency. Together, these teams ensure that our technology platform remains scalable, robust, and secure to support our rapid expansion. As a Platform Engineer, you will play a vital role in designing, developing, and maintaining the foundational infrastructure and internal platforms that empower all engineering teams at Plaid to innovate swiftly and securely. You will collaborate cross-functionally with product engineering teams to launch new features and uphold operational excellence throughout each product's lifecycle.
Full-time|$179.4K/yr - $224.3K/yr|On-site|San Francisco, CA; New York, NY
In a world where software is rapidly evolving, artificial intelligence (AI) is at the forefront, transforming how we interact with technology. At Scale AI, we recognize the immense potential of AI to enhance human capabilities, offering personalized support across various aspects of life—from coaching and tutoring to shopping and travel guidance. As enterprises, startups, and governments rush to integrate large language models (LLMs) into their operations, it is crucial to ensure these systems are safe, aligned, and effective. This involves rigorous human evaluation and reinforcement learning through human feedback (RLHF) during all stages of model development.Our innovative products, including the Generative AI Data Engine, SGP, and Donovan, are designed to empower the most advanced LLMs and generative models globally. By leveraging world-class RLHF, human data generation, model evaluation, safety, and alignment, we are shaping the future of human-AI interaction.As a member of our Platform Engineering team, you will play a pivotal role in designing and developing the foundational platforms that support Scale's operations. Your responsibilities will include architecting our core cloud infrastructure, enhancing our data lifecycle, and transforming the software development process for engineers at Scale. You will gain invaluable insights into the AI landscape as it develops within diverse sectors.
Mercor’s platform team is looking for a Software Engineer in San Francisco or New York City. The focus is on building and improving the scalable software systems that support the company’s main products. Key responsibilities Design and develop software systems to help the platform grow and stay reliable Add new features and refine existing ones to strengthen platform functionality Collaborate with engineering, product, and design teams to deliver solutions Location This role is based in San Francisco or New York City.
MidiHealth is seeking a Senior Software Engineer to join the Platform Engineering team. This hybrid role is based in the SF Bay Area and centers on building and enhancing the software that drives MidiHealth’s healthcare technology platform. The work contributes directly to improving patient outcomes through technology. Key responsibilities Design and develop software solutions for the core platform Collaborate with engineering, product, and cross-functional teams to deliver integrated features Support the reliability and scalability of the platform Location This position requires regular on-site work in the SF Bay Area as part of a hybrid schedule.
Full-time|$405K/yr - $485K/yr|On-site|San Francisco, CA | New York City, NY | Seattle, WA
About AnthropicAt Anthropic, we are dedicated to developing AI systems that are not only safe but also interpretable and steerable. Our mission is to ensure that AI serves the best interests of our users and society at large. Our rapidly growing team comprises passionate researchers, engineers, policy experts, and business leaders, all collaborating to create beneficial AI technologies.About the Role:We are seeking skilled software engineers to join our Platform team. This team is responsible for building the essential foundations that enhance product development across Anthropic. We manage the infrastructure and systems that teams rely on for reliable and scalable deployment to both internal users and hundreds of thousands of external clients worldwide.In this role, you will independently define and scope complex, multi-month projects, fostering cross-organizational alignment in ambiguous problem areas. Your architectural decisions will significantly influence the way Anthropic develops and scales its products. You will collaborate closely with research teams to transform cutting-edge capabilities into market-ready solutions, ensuring a lasting impact on our platforms that serve both internal and external engineering teams.Multiple teams are actively hiring, and team assignments will be determined post-interview based on your skills, interests, and the needs of the organization. This flexible approach helps us align talented engineers with backend product initiatives where they can make the most significant impact and experience growth.Your Responsibilities:Platform Acceleration: You will enhance developer productivity for product engineers at Anthropic. This includes architecting and optimizing critical development infrastructure that fuels our AI product development, encompassing development environments, observability, and CI/CD pipelines. You will work closely with product teams to understand their workflows and eliminate friction points, resulting in a profound multiplier effect across the product organization, thus accelerating our mission.Service Infrastructure: You will build and maintain the core infrastructure that supports Anthropic's engineering efforts, ranging from service mesh and observability systems to deployment pipelines and shared libraries. Your contributions will empower product teams to construct and operate reliable services at scale, positioning you as a critical force multiplier within the company.Multicloud: You will develop and sustain the infrastructure that allows Anthropic to function seamlessly across various cloud providers, focusing on cloud-agnostic solutions that enhance our operational efficiency.
Join the Team at Bretton AIBretton AI stands at the forefront of artificial intelligence in the financial services sector, providing an essential platform that organizations like Robinhood, Mercury, and Gusto rely on to streamline critical tasks, including anti-money laundering and counter-terrorism efforts.With over $95 million raised from renowned investors such as Greylock, Y Combinator, and Thomson Reuters Ventures, we are located in the heart of San Francisco. Our talented team hails from prestigious companies including SpaceX, Google, Netflix, Stripe, and Plaid.Your RoleAs a Platform Engineer, you will be instrumental in constructing and maintaining the infrastructure that underpins our compliance solutions at scale. You will take charge of reliability, performance, and scalability across our core platforms.Your responsibilities will include designing Postgres schemas, optimizing queries, developing high-throughput services, orchestrating long-running workflows with Temporal, and ensuring we maintain a commitment to a remarkable 99.9%+ uptime for our clients who process thousands of cases daily.In this role, you'll produce backend code for production, design distributed systems, and manage the infrastructure that allows our product teams to deploy rapidly and efficiently.Key ResponsibilitiesDevelop APIs and services capable of managing thousands of concurrent requests.Create ingestion → assessment → delivery pipelines for our case management.Implement Temporal workflows to manage long-running, stateful processes.Design and refine PostgreSQL schemas and queries for compliance workflows.Oversee CI/CD, deployments, and infrastructure-as-code while enhancing observability through logging, tracing, and metrics.Troubleshoot slow queries, memory leaks, and race conditions in our production systems.What We're SeekingEssential Qualifications4+ years for Engineer / 6+ years for Sr. Engineer / 8+ years for Staff Engineer in backend or infrastructure roles.Strong understanding of systems fundamentals, including distributed systems, databases, APIs, and concurrency.Demonstrated production experience with live systems, including debugging, incident response, and participation in on-call rotations.
Full-time|$190K/yr - $230K/yr|Remote|Remote with offices in San Francisco, CA / New York, NY / Minneapolis, MN
Dagster Labs develops tools that enable organizations to build scalable and efficient data platforms. The company’s core offerings include Dagster, an open-source project popular among developers, and Dagster+, a managed cloud solution. These products support thousands of teams, ranging from early-stage startups to established enterprises, in their analytics, machine learning, and AI initiatives. With the rapid growth of AI, the need for reliable, high-quality data has never been greater. Dagster Labs is dedicated to simplifying the testing, comprehension, and usability of data platforms. Many top AI companies have adopted Dagster as a foundational part of their technology stack. Team culture The team operates with strong funding and a collaborative spirit. High standards, open communication, and a focus on trust and curiosity shape the work environment. The company values a workplace free from egos and unnecessary drama. Locations This is a remote-first company with offices in San Francisco, New York, and Minneapolis.
Aura is seeking a talented and experienced Senior Software Engineer, Platform to join our innovative team. In this role, you will be responsible for designing and implementing scalable software solutions that enhance our platform capabilities. You will work closely with cross-functional teams to ensure the delivery of high-quality software that meets the needs of our users.
Role overview MidiHealth seeks a Staff Software Engineer for its Platform Engineering team. This hybrid role is based in the San Francisco Bay Area and centers on building software that supports the platform’s growth, performance, and usability. What you will do Design and develop software solutions that address platform engineering challenges Collaborate with experienced engineers to improve reliability and expand platform features Contribute to efforts that boost performance and create a smooth user experience Location This position requires a hybrid schedule, including on-site work in the SF Bay Area.
Full-time|$248.4K/yr - $310.5K/yr|On-site|San Francisco, CA; New York, NY
In a world where software is rapidly evolving, artificial intelligence is at the forefront of this transformation. At Scale AI, we recognize the immense potential of AI to significantly enhance human capabilities. Imagine having a personal tutor, coach, assistant, shopper, travel guide, and therapist available to you throughout your life. As global industries adapt to this revolutionary landscape, leading platform companies are racing to develop large language models (LLMs) at an unprecedented scale, while enterprises strive to integrate these technologies into their existing products. Ensuring these models are safe, aligned, and genuinely useful requires meticulous human evaluation and reinforcement learning from human feedback (RLHF) throughout their development cycle. This innovative approach has given ChatGPT a substantial advantage in the marketplace.At Scale, our offerings include the Generative AI Data Engine, SGP, Donovan, and other advanced tools that drive the most sophisticated LLMs and generative models globally. Our world-class RLHF, human data generation, model evaluation, safety, and alignment are crucial in shaping how humanity will interact with AI.The backbone of these products is our Platform Engineering team. As a Staff Software Engineer, you will play a pivotal role in defining and executing the architectural roadmap and implementation of essential platforms and software systems. You will provide a visionary approach while promoting adoption across our engineering organization for orchestration, data abstraction, data pipelines, identity and access management, and foundational cloud infrastructure. This role offers the opportunity to engage with cutting-edge AI advancements as Scale collaborates with enterprises, startups, governments, and leading tech companies.
Join our innovative team at Unify as a Senior Software Engineer, Platform, where you will play a crucial role in enhancing our platform capabilities. You will collaborate with cross-functional teams to design, develop, and implement high-quality software solutions that meet our clients' needs.
About the Role sfcompute is looking for a Software Engineer focused on the Developer Platform in San Francisco, CA. This role works closely with teams across the company to design, build, and maintain software that supports developers and improves user experience. What You'll Do Work with engineers, product managers, and other partners to deliver platform features Develop and maintain software that enables and supports developer workflows Contribute to the quality and reliability of the Developer Platform Help shape the direction of platform technology at sfcompute
Join our dynamic team at Benchling as a Software Engineer focused on our Developer Platform. In this role, you will work collaboratively to design, develop, and enhance our platform tools, ensuring they meet the evolving needs of our users. Your contributions will directly impact our mission to accelerate life sciences research.
Airbyte stands at the forefront of open-source data movement, enabling data teams to seamlessly transfer information from diverse applications, APIs, unstructured sources, and databases to data warehouses, lakes, and AI applications. With tens of thousands of connectors and a widespread adoption across hundreds of thousands of companies, we have demonstrated the viability of large-scale data integration. Our ongoing mission is to construct an advanced agentic data infrastructure, meticulously designed for AI agents requiring swift and accurate access to data across numerous sources. We aim to make data universally accessible and actionable.Having secured $181M from leading investors such as Benchmark, Accel, Altimeter, Coatue, and Y Combinator, we are committed to a product-led growth strategy where we create exceptional solutions that resonate with our users. This funding empowers us to explore innovative avenues while maintaining a nimble and experimental approach in an AI-driven landscape.The Role:As a critical member of the Data Replication team, you will serve as an infrastructure and reliability engineer within a full-stack product team that executes over 3 million sync jobs weekly, facilitating thousands of data use cases across various regions and cloud environments. You will be responsible for building and maintaining the infrastructure, establishing reliability standards, reducing incidents, and streamlining the shipping process for engineers through enhanced tooling. You should feel equally at home working with Terraform files, Kubernetes clusters, and postmortem documentation.We encourage our engineers to actively leverage AI as a force multiplier—utilizing agentic tools to automate repetitive tasks, enhance incident response, and develop smarter internal tooling. If you haven't yet embraced this approach, we hope you're eager to start. We value how you work just as much as what you produce. Trust, transparency, and craftsmanship are paramount here.What You’ll Do:Take ownership of the infrastructure that supports the Data Replication platform, including Kubernetes clusters, CI/CD pipelines, secrets management, networking, and cloud resource configuration across AWS and GCP.Collaborate with product engineers to ensure reliable integration of product features with infrastructure.Enhance observability, alerting, and anomaly detection systems with a focus on LLM automation.Develop and improve AI-augmented release and internal tooling, including canary deployments, progressive rollouts, automated release qualification, and rollback automation—all with a focus on LLM automation.Establish high standards for infrastructure within the team by creating self-serve tools, writing runbooks, and mentoring engineers.
Mar 17, 2026
Sign in to browse more jobs
Create account — see all 5,822 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.