Staff+ Software Engineer, Observability

AnthropicSan Francisco, CA | New York City, NY | Seattle, WA

On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Entry Level

Qualifications

Proven experience in software engineering with a focus on observability and monitoring systems. Strong programming skills in languages such as Python, Go, or Java. Experience with cloud platforms and container orchestration technologies. Ability to analyze and troubleshoot complex systems issues. Excellent communication and teamwork skills.

About the job

Join Anthropic as a Staff+ Software Engineer specializing in Observability, where you will play a crucial role in enhancing our systems to ensure high-performance and reliability. Collaborate with cross-functional teams to develop innovative solutions, implement observability metrics, and drive improvements that enable better decision-making and user experiences.

About Anthropic

Anthropic is a forward-thinking technology company committed to building safe and beneficial artificial intelligence. We foster a collaborative environment that encourages innovation and values diverse perspectives, making it a great place for driven individuals to thrive.

Similar jobs

1 - 20 of 7,337 Jobs

Search for Ai Observability Research Engineer

7,337 results

Select all on this page (20)

Apply

AI Observability Research Engineer

Anthropic

Full-time|$320K/yr - $405K/yr|On-site|San Francisco, CA

About AnthropicAt Anthropic, we are dedicated to developing AI systems that are reliable, interpretable, and controllable. Our mission is to ensure that artificial intelligence remains safe and beneficial for individuals and society at large. Our rapidly expanding team comprises passionate researchers, engineers, policy experts, and business leaders collaborating to create positive AI solutions.About the TeamAs the scale of AI training and deployment increases, so does the volume of data that requires monitoring and comprehension. Our team utilizes Claude to interpret this data effectively. We manage an integrated suite of tools that empowers Anthropic to pose open-ended inquiries, identify unexpected patterns, and maintain significant human oversight over extensive datasets.Our tools are widely utilized internally, driving ongoing enforcement, threat intelligence investigations, model audits, and much more. We are seeking skilled engineers and researchers to enhance existing applications and innovate new ones from the ground up.About the RoleAs a Research Engineer on our team, you will design and develop systems that enable AI to analyze vast, unstructured datasets—think tens or hundreds of thousands of conversations or documents—and generate structured, reliable insights. You will engage with the entire technology stack, from foundational analysis frameworks to user-facing applications and interfaces.This is a high-impact position. The tools you create will be utilized by numerous researchers and investigators, directly influencing our capacity to assess and counteract both misuse and misalignment.

Feb 20, 2026

Apply

Engineering Manager, AI Observability & Evaluations Platform

LangChain

Full-time|$200K/yr - $250K/yr|On-site|San Francisco, CA

About Us:At LangChain, we are dedicated to making intelligent agents a standard part of everyday life. Our goal is to provide the essential framework for agent engineering, empowering developers to transition their ideas from prototypes to production-ready AI agents that teams can trust. Initially launched as a widely embraced open-source initiative, our evolution has led us to offer a robust platform tailored for building, evaluating, deploying, and managing agents at scale.Our platforms, including LangChain, LangGraph, LangSmith, and Agent Builder, are now instrumental for teams delivering innovative AI solutions across diverse sectors, from startups to major corporations. Industry leaders such as Replit, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, and Vanta, along with 35% of the Fortune 500, rely on LangChain for their AI initiatives.Having successfully secured $125M in Series B funding from prominent investors like IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we are poised for continued growth and innovation. At LangChain, every team member plays a vital role in shaping our projects and collaborative work environment, making it a place where your input can significantly influence the future of technology.About The Role:We are seeking a dynamic Engineering Manager to spearhead the development of LangSmith, our observability and evaluation platform designed for LLM applications. In this role, you will set the technical vision, cultivate and mentor a high-performing engineering team, and collaborate closely with product and design teams to deliver features that enable developers to construct and deploy reliable AI systems with assurance.You will: Build, mentor, and expand a talented team of engineers, fostering a culture of collaboration, ownership, and accountability.Enhance LangChain’s engineering culture through mentorship, commitment to high-quality code, and technical excellence.Define long-term technical strategy and guarantee the scalability and reliability of the LangSmith AI Observability Platform.Work alongside product and design teams to outline project scope, sequence, and success metrics for key initiatives.Uphold a high standard of technical excellence while ensuring the team remains focused and operates with urgency.Lead by example in producing clean, maintainable, and thoroughly tested code using Go/Python and TypeScript.Engage directly with customers to grasp their needs and translate those insights into actionable product enhancements.

Feb 6, 2026

Apply

Senior Frontend Engineer for AI Observability & Evals Platform

LangChain

Full-time|$175K/yr - $225K/yr|On-site|San Francisco, CA

About Us:At LangChain, we are dedicated to making intelligent agents a common part of everyday technology. Our goal is to provide a robust foundation for agent engineering that empowers developers to transition from prototypes to production-ready AI agents that teams can depend on. Initially starting as a widely embraced open-source toolset, we have expanded our offerings to include a comprehensive platform for the building, evaluating, deploying, and managing of agents at scale.Currently, our tools—LangChain, LangGraph, LangSmith, and Agent Builder—are utilized by teams developing real AI products in both startups and large enterprises. Millions of developers rely on LangChain to power AI initiatives at notable companies such as Replit, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, Vanta, and 35% of the Fortune 500.Having secured $125M in Series B funding from leading investors like IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we are in an exciting phase of product development and rapid growth, where every team member has a substantial impact on our projects and collaborative efforts. At LangChain, your contributions will play a crucial role in shaping how this technology manifests in the real world.About the Role:This position requires in-person attendance 5 days a week in San Francisco, CA, as well as options in New York and Boston.We are seeking a seasoned frontend engineer to innovate and improve features on LangSmith, our enterprise platform designed for LLM application observability, testing, and debugging.What You Will Do:Create new user-facing features utilizing React and TypeScript.Develop reusable components and front-end libraries for future projects.Convert designs and wireframes into high-quality, maintainable code.Optimize components for peak performance across diverse web-capable devices and browsers.Collaborate with fullstack and backend developers as well as UX/UI designers to enhance usability and experience.You’re a Good Fit If You Have:Extensive frontend engineering experience, with strong command of React, JavaScript, and TypeScript.Practical experience with frontend development tools such as Babel, Vite, Webpack, NPM, and Yarn.Familiarity with REST APIs and experience collaborating closely with fullstack and backend developers.

Jun 9, 2025

Apply

Founding Audio AI Research Engineer

David AI

Full-time|On-site|San Francisco

Join Our Innovative Team at David AIDavid AI is pioneering the audio data research landscape. We adopt a rigorous R&D methodology for developing datasets that parallels the standards upheld by leading AI laboratories. Our vision is to seamlessly integrate AI into everyday experiences, with audio serving as the perfect conduit. The evolution of audio AI is rapidly unfolding, yet the availability of high-quality training data remains a critical challenge. This is where David AI steps in.Founded in 2024 by a talented group of former engineers and operators from Scale AI, we have quickly become a trusted partner to numerous FAANG companies and AI research labs. Recently, we secured $50 million in a Series B funding round with notable investors, including Meritech, NVIDIA, and Alt Capital.Our culture is built on sharp intellect, humility, ambition, and a close-knit community. We invite exceptional minds in research, engineering, product development, and operations to join us as we advance the field of audio AI.Research Team OverviewAt David AI, we are convinced that superior model capabilities stem from high-quality, differentiated data. Our research team is dedicated to conducting ambitious, long-term studies into audio technology while collaborating with both internal and external partners to implement cutting-edge research insights into practical applications.Your Role as a Founding Audio AI Research EngineerIn this position, you will establish the research framework that influences how premier AI labs develop their audio models. You will have access to a top-tier team of human AI trainers, robust computing resources, and the autonomy to shape your research agenda.Key ResponsibilitiesCreate and implement comprehensive evaluation frameworks for assessing audio AI capabilities in areas such as speech, emotion detection, conversational dynamics, and acoustic patterns.Investigate and prototype innovative methodologies for audio quality assessment, automated labeling, and optimizing data collection processes.Design focused data collection pipelines aimed at capturing novel, high-value audio capabilities.Develop automated systems for ongoing classifier enhancement and prompt engineering evaluation.Assess cutting-edge models and formulate actionable research strategies.Publish your findings in prestigious conferences.

Jun 24, 2025

Apply

Senior Fullstack Engineer for AI Observability & Evals Platform

LangChain

Full-time|$175K/yr - $225K/yr|On-site|San Francisco, CA

About Us:LangChain is dedicated to making intelligent agents commonplace. We are pioneering the foundations of agent engineering in the real world, empowering developers to transition from prototypes to production-ready AI agents that teams can depend on. Initially known for our widely embraced open-source tools, we have expanded to provide a comprehensive platform for constructing, assessing, deploying, and managing agents at scale.Our products, including LangChain, LangGraph, LangSmith, and Agent Builder, are utilized by teams delivering genuine AI solutions in both startup environments and large corporations. Millions of developers trust our technology to elevate AI initiatives at organizations such as Replit, Clay, Coinbase, Workday, Lyft, Cloudflare, Harvey, Rippling, Vanta, and 35% of the Fortune 500.With $125M raised in our Series B funding from IVP, Sequoia, Benchmark, CapitalG, and Sapphire Ventures, we are poised for continued product development and accelerating growth, where each team member plays a significant role in shaping our technology and collaborative culture.About the Role:On-site 5 days a week in San FranciscoWe are seeking a Senior Fullstack Engineer for our commercial product, LangSmith, which serves as an observability and evaluation platform. In this role, you will have the chance to influence the technical direction of our platform while engaging with enterprise clients, developer end-users, and internal stakeholders.Lead the technical architecture and implementation of essential product features for LangSmith, utilizing our entire stack of Go, Python, and TypeScript.Work closely with product and design teams to iterate and refine new features.Mentor and support junior team members, driving ambitious project timelines while upholding high engineering standards.Set an example by producing clean, maintainable, and thoroughly tested code.

Feb 19, 2025

Apply

Software Engineer, Observability

OpenAI

Full-time|On-site|San Francisco

Become part of the innovative engineering teams at OpenAI, where we create and deliver groundbreaking AI technologies responsibly and safely to the world!Our Applied Engineering team collaborates across research, engineering, product, and design disciplines to deploy OpenAI's cutting-edge technology for both consumers and businesses. We are committed to learning from our deployments and ensuring that AI is utilized ethically while maximizing its benefits. To us, safety takes precedence over unchecked growth.About the RoleWe are in the process of developing OpenAI's observability product, which encompasses everything from scalable infrastructure to an intuitive, AI-enhanced user interface. Our systems process petabytes of logs and billions of time series metrics throughout our infrastructure. We are now integrating intelligence to create features like agents that summarize service events, auto-generate dashboards, and assist engineers in debugging through user-friendly notebook-like interfaces.We are looking to hire software engineers at all levels of our stack—be it infrastructure, backend, or product. You will be part of a dynamic, resourceful team that develops both foundational infrastructure and innovative internal tools, ensuring the reliability, performance, and observability of OpenAI's production systems.What You’ll DoLead the development of core observability infrastructure, focusing on distributed logging, time series, and trace storage.Create AI-integrated tools that empower engineers to autonomously identify, comprehend, and resolve issues.Enhance user interface experiences including dashboards, notebooking, and interactive debugging.Work collaboratively with engineers, researchers, user operations, and various teams to craft the next generation of the observability product.You Might Be a Fit If You:Have experience operating large-scale distributed systems in production, particularly logging systems or time series databases.Excel in ambiguous environments and tackle unscoped challenges head-on.Possess full-stack development skills or a strong product sensibility; you are eager to build practical tools that users will engage with.Demonstrate robust knowledge of systems, networking, and cloud infrastructure (Kubernetes, AWS, etc.).Bonus: Have built or contributed to observability systems (e.g., Prometheus, OpenTelemetry, etc.).Why This Team?We combine infrastructure and product development to create real AI applications for in-house use.Your contributions will directly enhance the reliability of GPT-based products at OpenAI.

Feb 19, 2026

Apply

Staff+ Software Engineer, Observability

Anthropic

Full-time|On-site|San Francisco, CA | New York City, NY | Seattle, WA

Mar 12, 2026

Apply

AI Research Engineer

Hex Technologies

Full-time|$150.4K/yr - $285K/yr|On-site|SF or NYC

About the Role Hex Technologies is at the forefront of the AI revolution, providing an innovative platform that transforms modern Data Science and Data Analytics workflows. As an AI Research Engineer, you will collaborate with product teams to create cutting-edge AI experiences, including the Notebook Agent. Your responsibilities will include conducting experiments, fine-tuning models, deploying AI infrastructure, and developing robust experimentation tools. Your primary focus will be enhancing Hex's context engine and advancing the capabilities of our Notebook Agent, designed for professionals engaged in complex and impactful data tasks. The Notebook Agent serves as a sophisticated data copilot, capable of writing SQL and Python, crafting visually stunning reports, and collaborating with analysts to explore new data inquiries. Your efforts will help data teams within Hex deliver highly accurate and tailored data experiences for their stakeholders, empowering data-driven decision-making across the organization. If you are a passionate builder eager to amplify these capabilities for thousands of users, join us on the leading Data Science platform with unparalleled user context.

Mar 17, 2026

Apply

AI Research Scientist/Engineer

Perplexity

Full-time|On-site|San Francisco

Join Perplexity in our mission to redefine the future of AI-powered search and agent experiences! We are on the lookout for exceptional AI Research Scientists and Engineers who are eager to push the boundaries of technology. Our innovative products, including Sonar models, Deep Research Agent, Comet Agent, and advanced search tools, are designed to handle hundreds of millions of queries and are scaling rapidly. If you have a passion for cutting-edge AI and want to contribute to state-of-the-art experiences, we would love to hear from you!Team StructureDepending on your interests and expertise, you will have the opportunity to collaborate within one of three specialized teams:1. Core Research TeamThis team focuses on the generation and enhancement of foundational models that underpin all our products, emphasizing core model capabilities and the development of infrastructure that serves the entire organization.2. Agent Products TeamThis team specializes in fine-tuning and optimizing models specifically for our Deep Research Agent and Labs/Canvas products, ensuring that our agent functionalities provide exceptional user experiences.3. Comet Agent TeamDedicated to the development and refinement of the Comet Agent product, this team addresses the unique requirements and optimizations necessary for Comet's specific use cases.ResponsibilitiesResearch & DevelopmentPost-train state-of-the-art large language models (LLMs) using cutting-edge supervised and reinforcement learning methodologies (SFT/DPO/GRPO).Utilize our comprehensive query-answer dataset to scale model performance across our Sonar, Deep Research, Comet, and Search products.Remain abreast of the latest advancements in LLM research, focusing on model training, optimization, and personalization techniques.Implement preference optimization and personalization features to elevate user experience.Innovate and develop in-house enhancements and optimizations for state-of-the-art models.Translate research ideas into algorithms and conduct experiments to deploy new models.Infrastructure & ImplementationOversee the full-stack data, training, and evaluation pipelines that are essential for model development.Create robust and efficient training frameworks (based on Megatron/PyTorch) for post-training LLMs.Establish necessary infrastructure components to support cutting-edge model development.

Sep 19, 2025

Apply

Machine Learning Research Engineer - Data at Liquid AI | San Francisco

Liquid AI

Full-time|On-site|San Francisco

About Liquid AIFounded as a spin-off from MIT CSAIL, Liquid AI specializes in creating versatile AI systems designed for optimal performance across various deployment platforms, including data center accelerators and on-device hardware. Our technology emphasizes low latency, minimal memory consumption, privacy, and dependability. We collaborate with leading enterprises in sectors such as consumer electronics, automotive, life sciences, and financial services. As we experience rapid growth, we are on the lookout for exceptional talent to join our team.The OpportunityThe Data team at Liquid AI drives the development of our Liquid Foundation Models, focusing on pre-training, vision, audio, and emerging modalities. With the stagnation of public data sources, the effectiveness of our models increasingly relies on specially curated datasets. We are seeking engineers with a machine learning mindset who can efficiently gather, filter, and synthesize high-quality data at scale.At Liquid AI, we regard data as a research challenge rather than an infrastructural issue. Our engineers conduct experiments, design ablations, and assess how data-related decisions impact model quality. We will align you with a team where you can experience rapid growth and make a significant impact, be it in pre-training, post-training reinforcement learning, vision-language, audio, or multimodal applications.While we prefer candidates in San Francisco and Boston, we are open to considering other locations.What We're Looking ForWe are in search of a candidate who:Thinks like a researcher and executes like an engineer: You should be able to formulate hypotheses, conduct experiments, and evaluate results. Our engineers produce research-level code while our researchers implement production systems.Learns quickly and adapts: You will be working in rapidly evolving modalities, so the ability to quickly grasp new domains and thrive in ambiguity is essential.Prioritizes data quality: We hold data quality in high regard; tasks such as filtering, deduplication, augmentation, and evaluation are key responsibilities, not afterthoughts.Solves problems autonomously: Data engineers operate within training groups (pre-training and multimodal). While collaboration is crucial, we expect ownership and self-direction.The WorkDevelop and maintain data processing, filtering, and selection pipelines at scale.Establish pipelines for pretraining, midtraining, supervised fine-tuning, and preference optimization datasets.Design synthetic data generation systems utilizing large language models (LLMs), structured prompting, and domain-specific generative techniques.

Jul 29, 2025

Apply

Senior Software Engineer - Cloud Availability Platform Engineering (Observability)

Crusoe

Full-time|$166K/yr - $201K/yr|On-site|San Francisco, CA - US

At Crusoe, we are on a mission to accelerate the availability of energy and intelligence. We are building the foundational technology that empowers individuals to innovate boldly with AI while maintaining speed, scale, and sustainability.Join us in the AI revolution with sustainable technology at Crusoe, where you will lead significant innovations, make a real impact, and collaborate with a team that is pioneering responsible and transformative cloud infrastructure.About the Role:We are seeking a highly proficient engineer with extensive experience in designing and managing observability platforms at scale. You will be responsible for architecting, developing, and operating Crusoe’s next-generation observability stack, which will allow engineers to gain insights into the internal state of distributed systems through metrics, logs, and traces. Your contributions will guarantee reliability, performance, and actionable insights across Crusoe’s global infrastructure and cloud platform.Key Responsibilities:Design and manage scalable observability systems (metrics, logging, tracing) in multi-datacenter Kubernetes environments.Architect comprehensive telemetry pipelines, covering ingestion, storage, querying, and visualization.Enhance monitoring and alerting mechanisms with Prometheus, Alertmanager, Thanos/Cortex, Grafana, and OpenTelemetry.Develop scalable log collection and processing pipelines utilizing Fluent Bit, Vector, Loki, or ELK/Opensearch stacks.Implement distributed tracing platforms (Tempo, Jaeger, OpenTelemetry) and integrate with service meshes, load balancers, and APIs.Establish and promote the adoption of SLOs, SLIs, and error budgets across various services and teams.Automate the provisioning and scaling of observability infrastructure using Kubernetes, Terraform, and custom tools (Go, Python).Ensure the reliability and cost-effectiveness of telemetry pipelines while supporting high-volume workloads (AI/ML, HPC clusters, GPU infrastructure).Integrate security best practices into observability platforms, including RBAC, TLS, secret management, and multi-tenant access controls.Collaborate with engineering teams to embed observability into applications, services, and infrastructure.Mentor engineers and influence Crusoe’s observability strategy and technical roadmap.

Oct 1, 2025

Apply

Applied AI Research Engineer

Netic

Full-time|On-site|San Francisco

At Netic, we are revolutionizing the essential services sector with our advanced AI-driven revenue engine, which supports the backbone of the American economy.Backed by $43M in funding from illustrious investors such as Founders Fund, Greylock, Hanabi, and Dylan Field, who spearheaded our Series B, we have empowered our clients to secure hundreds of thousands of jobs across various service industries throughout North America. Our platform has enabled companies to operate with an AI-first approach.Join our innovative team of relentless builders hailing from renowned organizations like Scale, Databricks, HRT, Meta, MIT, Stanford, and Harvard. Together, we are applying frontier AI to solve complex challenges in the physical economy, where data is intricate and the results are both immediate and impactful.As an Applied AI Research Engineer, you will immerse yourself in pioneering research, gain a thorough understanding of the business functions we automate, and lead targeted machine learning projects that yield remarkable outcomes.

May 30, 2025

Apply

Research Engineer, Applied AI Engineering

OpenAI

Full-time|On-site|San Francisco

Join Our Innovative TeamAt OpenAI, we are pioneering the field of artificial intelligence, empowering innovation and shaping the future through transformative research. Our mission is to democratize AI, ensuring its benefits are accessible to all. We are on the lookout for forward-thinking Research Engineers to join our Applied Group, where you will convert groundbreaking research into practical applications that can revolutionize industries, enhance human creativity, and tackle complex challenges.Your Impactful RoleAs a Research Engineer within OpenAI's Applied Group, you will collaborate with some of the brightest minds in AI. Your work will involve deploying cutting-edge models in production settings, transforming theoretical breakthroughs into impactful solutions. If you are passionate about making AI technology accessible and effective, this is your opportunity to leave a significant impact.In this role, you will:Innovate and Deploy: Create and implement advanced machine learning models addressing real-world issues. Translate OpenAI's research from theory to practice, developing AI-driven applications that make a meaningful difference.Collaborate with Experts: Engage closely with researchers, software engineers, and product managers to comprehend intricate business challenges and deliver AI-based solutions. Become part of a vibrant team where creativity and ideas flourish.Optimize and Scale: Develop scalable data pipelines, fine-tune models for peak performance and precision, and ensure readiness for production. Contribute to projects that leverage state-of-the-art technology and innovative methodologies.Learn and Lead: Stay at the forefront of advancements in machine learning and AI. Participate in code reviews, share insights, and exemplify best practices to maintain high standards in engineering.Make a Difference: Oversee and maintain deployed models, ensuring they consistently deliver value. Your contributions will directly shape how AI benefits individuals, businesses, and society as a whole.You may excel in this position if you possess:A Master's or PhD in Computer Science, Machine Learning, Data Science, or a related discipline.Proven experience in deep learning and transformer models.Expertise with frameworks such as PyTorch or TensorFlow.A robust understanding of data structures, algorithms, and software engineering principles.Experience with cloud platforms and deploying machine learning models in production.

May 22, 2024

Apply

Senior Observability Engineer

DigitalOcean

Full-time|Remote|San Francisco

Join DigitalOcean as a Senior Observability Engineer, where you will play a critical role in enhancing our monitoring and observability platforms. Your expertise will help us ensure that our systems are performant, reliable, and scalable, providing a seamless experience for our customers.

Mar 10, 2026

Apply

Researcher, Health AI

OpenAI

Full-time|Hybrid|San Francisco

Join Our Innovative TeamAt OpenAI, our Safety Systems team is at the forefront of ensuring that AI models are safe, robust, and reliable for real-world applications. We are committed to the principle set forth in our charter: to widely distribute the benefits of AI technology. Our Health AI unit is dedicated to providing equitable access to high-quality medical information. By bridging AI safety research with healthcare applications, we strive to develop trustworthy AI systems that empower medical professionals and enhance patient care.The OpportunityWe are on the lookout for passionate researchers who are eager to contribute to AI safety and improve health outcomes globally. As a Health AI Research Scientist, your role will involve developing safe and effective AI models tailored for healthcare applications. You will implement innovative methods to enhance the behavior, knowledge, and reasoning capabilities of our models. This necessitates research into safety and alignment techniques that can be generalized to ensure a beneficial AGI.This position is based in San Francisco, CA, utilizing a hybrid work model (3 days in the office per week) and we provide relocation assistance for new hires.Key Responsibilities:Design and implement scalable methods to enhance the safety and reliability of our models, such as Reinforcement Learning from Human Feedback (RLHF), automated red teaming, and scalable oversight.Assess methodologies using health-related data to ensure models deliver accurate, reliable, and trustworthy information.Develop reusable libraries to apply general alignment techniques across our models.Proactively analyze the safety of our models and systems, identifying potential risk areas.Collaborate with cross-functional teams to embed safety methods into core model training and drive safety improvements in OpenAI products.Ideal Candidate Profile:Align with OpenAI’s mission to ensure AGI benefits everyone and resonate with our charter.Exhibit a strong passion for AI safety and enhancing global health outcomes.Possess 4+ years of experience in AI research, with a focus on health applications.Demonstrate proficiency in machine learning frameworks and safety techniques.Showcase effective communication skills for cross-team collaboration.

Jan 29, 2025

Apply

Research Engineer at Resolve AI | San Francisco

Resolve AI

Full-time|On-site|San Francisco

About Resolve AIAt Resolve AI, we are redefining the role of software maintenance and production troubleshooting by creating a revolutionary, fully autonomous AI Production Engineer. Our technology is designed to diagnose and resolve intricate system issues from start to finish.Founded by industry leaders Spiros Xanthos and Mayank Agarwal, who are the masterminds behind OpenTelemetry and have previously spearheaded initiatives at Splunk Observability, our team boasts two successful exits to Splunk and VMware.Having successfully secured over $150M in funding from prestigious investors like Lightspeed, Greylock, and Unusual Ventures, alongside notable individuals such as Jeff Dean (Chief Scientist, Google DeepMind) and Fei-Fei Li (Professor, Stanford), we are well-positioned for growth.Joining Resolve AI now presents a unique opportunity to be part of an AI-driven company that is at the forefront of transforming engineering workflows.

Sep 9, 2024

Apply

Staff Software Engineer - Observability

Gusto

Full-time|Remote|San Francisco, CA

Join Gusto as a Staff Software Engineer specializing in Observability, where you will play a pivotal role in enhancing our software's performance and reliability. Utilize your expertise to develop and implement monitoring solutions that provide insights into application behavior, ensuring a seamless experience for our users.Your contributions will directly impact our engineering processes and product quality. Collaborate with cross-functional teams to identify and resolve issues proactively, while also driving initiatives to improve system observability.

Mar 27, 2026

Apply

AI Lab Research Engineer

Lila Sciences

Full-time|On-site|Cambridge, MA USA; San Francisco, CA USA

Join our innovative team at Lila Sciences as an AI Lab Research Engineer. In this role, you will contribute to cutting-edge AI research and development, focusing on enhancing the capabilities of our laboratory systems. Your expertise will help drive forward our mission to revolutionize scientific research through artificial intelligence.

Apr 7, 2026

Apply

Software Engineering Manager - Observability

Figma, Inc.

Full-time|On-site|San Francisco, CA • New York, NY • United States

Join Figma as a Software Engineering Manager specializing in Observability. In this pivotal role, you will lead a dynamic team of engineers in developing cutting-edge solutions that enhance visibility and performance across our platform. Your expertise will drive the design and implementation of observability tools that empower our engineering teams to optimize their workflows, ensuring the robustness and reliability of our applications.

Feb 27, 2026

Apply

Applied AI Researcher in AI Systems

Distyl AI

Full-time|On-site|San Francisco

About Distyl AIDistyl AI specializes in creating high-performance AI systems that enhance the fundamental operational processes of Fortune 500 companies. Through a strategic alliance with OpenAI, proprietary software accelerators, and extensive expertise in enterprise AI, we deliver effective AI solutions with swift time-to-value, often within a quarter.Our innovations have empowered Fortune 500 clients in various sectors, including insurance, consumer packaged goods, and non-profit organizations. Joining our team means you will assist organizations in recognizing, developing, and extracting value from their Generative AI investments, frequently for the first time. We prioritize customer needs, working backward from the client's challenges and ensuring we generate financial benefits while enhancing the experiences of end-users.Distyl is guided by seasoned leaders from top-tier companies like Palantir and Apple and enjoys backing from prominent investors including Lightspeed, Khosla, Coatue, Dell Technologies Capital, Nat Friedman (Former CEO of GitHub), Brad Gerstner (Founder and CEO of Altimeter), along with board members from numerous Fortune 500 firms.What We Are Looking ForAt Distyl, we are at the forefront of leveraging AI within enterprises. We seek imaginative researchers who aspire to go beyond incremental enhancements on benchmarks and are eager to redefine the application of software in innovative ways.Our researchers hail from diverse academic disciplines but possess a robust research background, operate in an AI-centric manner, and would find conventional research environments unfulfilling.Key ResponsibilitiesThe AI Systems team is dedicated to architecting complex, comprehensive solutions that integrate perception, reasoning, planning, and execution. Researchers amalgamate various components (LLMs, retrievers, evaluators, memory systems, and execution agents) into resilient, scalable systems that deliver consistent performance across dynamic enterprise workflows.Researchers in AI Systems examine the principles governing intricate system interactions. They analyze coordination, information flow, and emergent behavior across multiple agents and models. Their research reveals the foundational mechanics of robustness, composability, and alignment, ultimately establishing the design paradigm for constructing intelligent systems.

Oct 16, 2025

Create account — see all 7,337 results