Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Entry Level
Qualifications
Proficient in programming languages such as JavaScript, Python, or Go. Strong understanding of observability principles, including logging, monitoring, and tracing. Experience with cloud platforms and serverless architectures. Excellent problem-solving abilities and a strong analytical mindset. Ability to work collaboratively in a fast-paced, agile environment.
About the job
Join our dynamic team at Cloudflare as a Software Engineer focused on Workers Observability. In this pivotal role, you'll be instrumental in enhancing the observability features of our Workers platform, ensuring optimal performance and reliability for our users. You will collaborate with cross-functional teams, tackle complex technical challenges, and contribute to the advancement of our innovative cloud solutions.
About Cloudflare, Inc.
Cloudflare is a leading web performance and security company dedicated to helping businesses build a better internet. With cutting-edge technology and a commitment to innovation, we empower organizations to enhance their online presence while safeguarding them against threats. Join us as we shape the future of the internet!
Join Gusto as a Staff Software Engineer specializing in Observability, where you will play a pivotal role in enhancing our software's performance and reliability. Utilize your expertise to develop and implement monitoring solutions that provide insights into application behavior, ensuring a seamless experience for our users.Your contributions will directly impact our engineering processes and product quality. Collaborate with cross-functional teams to identify and resolve issues proactively, while also driving initiatives to improve system observability.
Full-time|On-site|San Francisco, CA | New York City, NY | Seattle, WA
Join Anthropic as a Staff+ Software Engineer specializing in Observability, where you will play a crucial role in enhancing our systems to ensure high-performance and reliability. Collaborate with cross-functional teams to develop innovative solutions, implement observability metrics, and drive improvements that enable better decision-making and user experiences.
Become part of the innovative engineering teams at OpenAI, where we create and deliver groundbreaking AI technologies responsibly and safely to the world!Our Applied Engineering team collaborates across research, engineering, product, and design disciplines to deploy OpenAI's cutting-edge technology for both consumers and businesses. We are committed to learning from our deployments and ensuring that AI is utilized ethically while maximizing its benefits. To us, safety takes precedence over unchecked growth.About the RoleWe are in the process of developing OpenAI's observability product, which encompasses everything from scalable infrastructure to an intuitive, AI-enhanced user interface. Our systems process petabytes of logs and billions of time series metrics throughout our infrastructure. We are now integrating intelligence to create features like agents that summarize service events, auto-generate dashboards, and assist engineers in debugging through user-friendly notebook-like interfaces.We are looking to hire software engineers at all levels of our stack—be it infrastructure, backend, or product. You will be part of a dynamic, resourceful team that develops both foundational infrastructure and innovative internal tools, ensuring the reliability, performance, and observability of OpenAI's production systems.What You’ll DoLead the development of core observability infrastructure, focusing on distributed logging, time series, and trace storage.Create AI-integrated tools that empower engineers to autonomously identify, comprehend, and resolve issues.Enhance user interface experiences including dashboards, notebooking, and interactive debugging.Work collaboratively with engineers, researchers, user operations, and various teams to craft the next generation of the observability product.You Might Be a Fit If You:Have experience operating large-scale distributed systems in production, particularly logging systems or time series databases.Excel in ambiguous environments and tackle unscoped challenges head-on.Possess full-stack development skills or a strong product sensibility; you are eager to build practical tools that users will engage with.Demonstrate robust knowledge of systems, networking, and cloud infrastructure (Kubernetes, AWS, etc.).Bonus: Have built or contributed to observability systems (e.g., Prometheus, OpenTelemetry, etc.).Why This Team?We combine infrastructure and product development to create real AI applications for in-house use.Your contributions will directly enhance the reliability of GPT-based products at OpenAI.
Full-time|On-site|San Francisco, CA • New York, NY • United States
Join Figma as a Software Engineering Manager specializing in Observability. In this pivotal role, you will lead a dynamic team of engineers in developing cutting-edge solutions that enhance visibility and performance across our platform. Your expertise will drive the design and implementation of observability tools that empower our engineering teams to optimize their workflows, ensuring the robustness and reliability of our applications.
Join our dynamic team at Cloudflare as a Software Engineer focused on Workers Observability. In this pivotal role, you'll be instrumental in enhancing the observability features of our Workers platform, ensuring optimal performance and reliability for our users. You will collaborate with cross-functional teams, tackle complex technical challenges, and contribute to the advancement of our innovative cloud solutions.
Full-time|$170K/yr - $240K/yr|On-site|San Francisco, CA
About the Role Sigma Computing is growing its engineering team in San Francisco, CA. The company builds technology to help users access data with ease. As a Senior Software Engineer focused on Observability and Reliability, you will work alongside engineers who value high standards and collaboration. What You Will Do Design and build observability platforms and tools, including metrics collection, logging, distributed tracing, dashboards, alerting, and application performance monitoring. Work with technologies such as Go, OpenTelemetry, and Kubernetes to solve reliability challenges. Take part in on-call rotations to help maintain strong uptime for Sigma’s services. Create tools and processes to improve cloud incident triage and reduce downtime. Define and promote practices that make systems and services measurable and observable. Join design and code reviews with peers and stakeholders to reinforce quality and effective collaboration.
Role overview Adyen seeks a Senior Software Engineer in San Francisco to focus on Customer Developer Observability. This position aims to enhance the tools and systems that let clients monitor and analyze their performance across the Adyen platform. What you will do Collaborate with cross-functional teams to design and build observability solutions. Create and implement features that provide customers with deeper insights into their systems and data. Help improve the customer experience by making monitoring and analysis more effective and accessible.
Full-time|$166K/yr - $201K/yr|On-site|San Francisco, CA - US
At Crusoe, we are on a mission to accelerate the availability of energy and intelligence. We are building the foundational technology that empowers individuals to innovate boldly with AI while maintaining speed, scale, and sustainability.Join us in the AI revolution with sustainable technology at Crusoe, where you will lead significant innovations, make a real impact, and collaborate with a team that is pioneering responsible and transformative cloud infrastructure.About the Role:We are seeking a highly proficient engineer with extensive experience in designing and managing observability platforms at scale. You will be responsible for architecting, developing, and operating Crusoe’s next-generation observability stack, which will allow engineers to gain insights into the internal state of distributed systems through metrics, logs, and traces. Your contributions will guarantee reliability, performance, and actionable insights across Crusoe’s global infrastructure and cloud platform.Key Responsibilities:Design and manage scalable observability systems (metrics, logging, tracing) in multi-datacenter Kubernetes environments.Architect comprehensive telemetry pipelines, covering ingestion, storage, querying, and visualization.Enhance monitoring and alerting mechanisms with Prometheus, Alertmanager, Thanos/Cortex, Grafana, and OpenTelemetry.Develop scalable log collection and processing pipelines utilizing Fluent Bit, Vector, Loki, or ELK/Opensearch stacks.Implement distributed tracing platforms (Tempo, Jaeger, OpenTelemetry) and integrate with service meshes, load balancers, and APIs.Establish and promote the adoption of SLOs, SLIs, and error budgets across various services and teams.Automate the provisioning and scaling of observability infrastructure using Kubernetes, Terraform, and custom tools (Go, Python).Ensure the reliability and cost-effectiveness of telemetry pipelines while supporting high-volume workloads (AI/ML, HPC clusters, GPU infrastructure).Integrate security best practices into observability platforms, including RBAC, TLS, secret management, and multi-tenant access controls.Collaborate with engineering teams to embed observability into applications, services, and infrastructure.Mentor engineers and influence Crusoe’s observability strategy and technical roadmap.
Full-time|Remote|Remote with offices in San Francisco, CA / New York, NY / Minneapolis, MN
Join Dagster Labs as a Software Engineer specializing in our Observability Product. In this fully remote role, you will play a crucial part in enhancing the visibility and performance of our software solutions. Collaborate with cross-functional teams to develop and implement innovative observability features that empower our users to monitor and optimize their applications effectively.
Join Crusoe as a Senior Software Engineer specializing in Observability, where you will play a pivotal role in enhancing our systems and ensuring robust performance across our platforms. You will collaborate with cross-functional teams to develop innovative solutions that improve the visibility and reliability of our software applications.
Full-time|$194K/yr - $267K/yr|On-site|San Francisco, California
Discover OktaOkta is recognized as The World’s Identity Company, empowering individuals to securely leverage any technology across various devices and applications. Our versatile Okta Platform and Auth0 Platform provide reliable access, authentication, and automation, placing identity at the forefront of business security and expansion.At Okta, we value diverse perspectives and experiences. We seek continuous learners and individuals who can enhance our team with their distinct backgrounds.Join us as we create a world where identity is truly yours.We are in search of a highly skilled Observability Site Reliability Engineer specializing in Google Cloud, to take charge of and elevate our Observability ecosystem within GCP. In this position, you will progress beyond basic monitoring to develop a world-class, comprehensive, and scalable Observability Platform that supports our SRE teams and business collaborators. You will implement infrastructure as code by employing Terraform and demonstrating strong coding skills in Go, Python, or Ruby to automate the deployment of agents and collectors across intricate distributed systems.Key ResponsibilitiesAutomated Infrastructure: Design, build, and maintain scalable observability infrastructure utilizing tools such as Terraform.GCP Observability Engineering: Enhance the collection, processing, and storage of Observability data to guarantee high reliability and low latency for our Splunk and Grafana services.Incident Response: Engage in on-call rotations and conduct post-incident reviews to foster systemic improvements and promote 'observability-driven development.'Automation: Minimize 'toil' by automating the deployment and scaling of observability agents and collectors.
Join Adyen as an Engineering Manager for our Developer Observability team! In this pivotal role, you will lead a dynamic group of engineers dedicated to enhancing the observability of our developer platforms. You will be responsible for driving technical innovation, mentoring your team, and collaborating closely with cross-functional partners to deliver exceptional developer experiences.As a leader, you will empower your team to excel in building tools and solutions that provide insights into system performance, ensuring our developers have everything they need to thrive. If you are passionate about technology, leadership, and fostering a culture of excellence, we want to hear from you!
About GridwareGridware is an innovative technology firm based in San Francisco, committed to safeguarding and optimizing the electrical grid. We have pioneered a revolutionary grid management approach known as Active Grid Response (AGR), which emphasizes the monitoring of electrical, physical, and environmental factors that influence grid reliability and safety. Our cutting-edge AGR platform leverages high-precision sensors to identify potential issues early, facilitating proactive maintenance and fault prevention. This holistic strategy aids in enhancing safety, minimizing outages, and ensuring the grid operates with maximum efficiency. Gridware is supported by prominent climate-tech and Silicon Valley investors. For further details, please visit www.Gridware.io.Role OverviewWe are looking for a talented Staff Software Engineer to act as a pivotal technical force within our team, enhancing the overall software engineering capabilities through architectural innovation, mentorship, and fostering a culture of excellence. In this role, you will design and develop the essential software systems that drive Gridware's platform. This encompasses everything from backend services that oversee our distributed network of devices to the front-end interfaces that visualize grid health, fleet diagnostics, and real-time field events.Your responsibilities will span the entire technology stack, building and scaling systems that integrate hardware, firmware, and cloud infrastructure to enable dependable communication, fleet visibility, and expedited decision-making. This position offers significant ownership and impact, allowing you to influence how our technology supports and protects critical infrastructure at scale.
About BroccoliBroccoli is revolutionizing the $500 billion home services industry by developing an AI operating system designed to empower trades businesses such as HVAC and roofing. Our intelligent AI agents handle customer interactions, manage job bookings, and ensure every lead is effectively captured.With the backing of prominent venture capital firms and a successful $27 million Series A funding round, we are on an aggressive growth trajectory. Collaborating with top private equity-backed home service platforms, we anticipate expanding our team fivefold by 2026, presenting a unique opportunity to join us early and make a significant impact.Why Join Broccoli?As a Staff Engineer, you will be instrumental in establishing the technical backbone of Broccoli AI. Your responsibilities will include ownership of critical systems, influencing architectural decisions, and shaping our development and deployment processes on a large scale.Immediate Impact: Your contributions will directly enhance production systems, benefiting hundreds of customers.Category Creation: Play a pivotal role in defining a new category of AI-powered workforce within an expansive market.Speed & Ownership: Enjoy the advantages of a small team with rapid feedback loops and substantial decision-making authority.Founder Collaboration: Partner closely with experienced founders to drive product and technical vision.What You’ll DoDesign, develop, and scale backend systems and internal tools for our AI agent platform.Take ownership of essential APIs and integrations, including systems like ServiceTitan.Lead complex features from initial design through to production deployment.Enhance real-time voice capabilities, reliability, and intelligence of AI agents.Mentor fellow engineers and help implement best practices across the team.Balance speed and quality while scaling systems to accommodate live customer traffic.What We’re Looking For7+ years of experience in backend or full-stack engineering.Strong system design and architectural skills.Proven experience in deploying and maintaining production systems at scale.Ability to thrive in high-growth, ambiguous startup environments.A proactive approach with a strong execution mindset.
Full-time|Hybrid|San Francisco, CA; Santa Clara, CA; Seattle, WA; New York, NY
Join Carta's engineering team as a Staff Software Engineer, where you will play a crucial role in developing innovative solutions that enhance our platform. You will collaborate with cross-functional teams to design, implement, and maintain scalable systems, ensuring high performance and responsiveness to requests from the front-end.We're looking for a passionate engineer who thrives in a fast-paced environment and is excited about tackling complex challenges. If you are eager to contribute to cutting-edge technology and drive impactful projects, we want to hear from you!
Role overview The Staff Software Engineer position at Amplitude, Inc. is based in San Francisco, CA. This role centers on developing and enhancing software to broaden the platform’s features. Day-to-day work includes direct software development and frequent collaboration with colleagues from various teams. What you will do Design, build, and maintain scalable software applications that support the platform’s growth. Collaborate with product managers and designers to deliver features that address user needs. Mentor junior engineers and contribute to their technical and professional development. Review code and help improve engineering practices throughout the team. Stay current with emerging technologies and industry trends to guide technical choices.
Join our innovative team at Crusoe as a Staff Software Engineer. In this pivotal role, you will leverage your advanced software engineering skills to design, develop, and optimize cutting-edge solutions that enhance our technology stack. Collaborate with cross-functional teams to drive projects from concept to completion, ensuring high-quality deliverables that meet user needs and business objectives.
Why Join AngelListAt AngelList, we tackle some of the most challenging issues in venture capital and private markets. Our team is driven by precision, urgency, and a vision for the future. If you're passionate about transforming the startup funding landscape, this is your opportunity.About AngelListOur mission is to fuel innovation by enhancing the success rate of startups globally. We achieve this by creating the financial infrastructure that facilitates investment in transformative companies. AngelList stands at the intersection of venture capital and the startup ecosystem, supporting over $171 billion in assets and facilitating investments in more than 13,000 startups, including over 300 unicorns. With 57% of premier U.S. VC deals involving AngelList investors, we are ambitious in our goals.If you are excited about shaping the future of private markets, we invite you to join our journey.About the Role: As a Staff Software Engineer for our Nova platform, you will be pivotal in defining and designing our development processes. You will be responsible for establishing the domain model and architectural patterns that will guide our engineering team. If you are a catalyst who can empower a team of eight to perform like a team of twenty by establishing clear patterns, embedding engineering context into our codebase, and driving vital initiatives forward, we want to hear from you.
About UsAt Imprint, we are revolutionizing the landscape of co-branded credit cards and financial products, making them smarter, more rewarding, and fundamentally brand-centric. We collaborate with esteemed brands like Crate & Barrel, Rakuten, Booking.com, H-E-B, Fetch, and Brooks Brothers to create innovative credit programs that enhance customer loyalty, unlock savings, and stimulate growth. Our sophisticated platform integrates cutting-edge payments infrastructure, intelligent underwriting, and a seamless user experience, enabling brands to offer impactful financial products without the need to become a bank.With co-branded cards accounting for over $300 billion in annual spending in the U.S., the majority are still managed by traditional banks. Imprint stands as the modern solution: agile, technology-driven, and tailored for today’s consumers. Supported by industry leaders like Kleiner Perkins, Thrive Capital, and Khosla Ventures, we are assembling a top-tier team to redefine payment methods and empower brand growth. If you're eager to work at a fast pace, tackle complex challenges, and make a significant impact, we want to hear from you.Discover more about us on Imprint's Technology Blog.Your RoleDesign and spearhead the development of secure, reliable, and scalable backend systemsProvide technical leadership and mentorship across diverse teams and projectsSet and advocate for coding standards, architectural principles, and technical vision within the engineering organizationCollaborate closely with product, design, and engineering leadership to align technical strategy with business objectivesDrive the continuous enhancement of system performance, scalability, and reliability to accommodate rapid growth and evolving business needsLead complex, high-impact projects from inception through deployment and ongoing optimizationEnhance developer experience through advanced tooling, comprehensive observability frameworks, and improvements in platform reliabilityMentor and nurture engineers through code reviews, design discussions, and technical guidanceContribute to the technical roadmap and identify innovative opportunities
Full-time|$200K/yr - $275K/yr|On-site|San Francisco, CA
Supported by prominent Silicon Valley investors, Peregrine Technologies empowers public safety organizations, state and local governments, federal agencies, and private-sector institutions to address societal challenges with unparalleled speed and precision. Our AI-driven platform transforms fragmented and isolated data into actionable operational intelligence, quickly surfacing mission-critical information to facilitate informed and timely decisions that enhance outcomes across various contexts. Currently, Peregrine serves hundreds of customers across more than 30 states and two countries, impacting over 125 million individuals—and we are amplifying our influence as we expand into enterprise markets and internationally.Our TeamAs an engineering team, we place a strong emphasis on empathy, believing it enhances our solutions. Understanding how users interact with our product is essential, and engineers will collaborate closely onsite to grasp the diverse use cases that Peregrine addresses.We are seeking a Staff Software Engineer to join our core engineering teams. In this role, you will collaborate cross-functionally with design and product management to develop robust, scalable, and user-centric systems. Our teams confront a variety of challenges, from enabling real-time user collaboration on intricate maps to constructing high-scale backend architectures capable of processing billions of data points.We value ownership and teamwork; you will take complete responsibility for significant features and work closely with fellow engineers to ensure successful completion. We believe that humility and empathy are vital for crafting the right solutions—you will work directly with our deployment team and users as we refine our offerings to meet their needs. Creativity and perseverance will be key to executing our vision.RoleWe are looking for a Staff Software Engineer to join our expanding team, lead impactful projects, cultivate an inclusive team culture, and steer technical decision-making.This position is ideal for someone who excels in both people management and hands-on technical leadership. You will build a high-performing team, guide them through complex technical challenges, and ensure their work aligns with our business goals. The ideal candidate will effectively mentor while upholding high standards in technical execution.You will drive significant work that delivers value to our customers, from supporting emergency responders during hurricanes to de-escalating intricate organized crime situations. We are developing innovative capabilities that enable...
Jan 13, 2026
Sign in to browse more jobs
Create account — see all 5,813 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.