Software Engineer Labs jobs in San Francisco – Page 4 | RoboApply Jobs

Software Engineer Labs jobs in San Francisco · Page 4

Results 61–80 of 5,650 for “Software Engineer Labs” in San Francisco.

Judgment Labs
Full-time|On-site|San Francisco

At Judgment Labs, we are at the forefront of developing infrastructure for Agent Behavior Monitoring (ABM). Unlike traditional observability, which primarily captures logging exceptions and latency, our innovative ABM technology identifies behavioral anomalies, such as instruction drifts and context retrieval loss, within scaled production environments. Numerous…

Nov 25, 2025

Lemurian Labs
Full-time|On-site|SF Bay Area

Join Lemurian Labs on our ambitious mission to harness the potential of artificial intelligence while minimizing our ecological impact. Our commitment to responsible innovation drives us to create sustainable AI solutions that benefit society and the environment alike. After all, innovation should empower the world, not compromise it.

We are developing a cutting-edge, high-performance compiler that enables developers to 'build once, deploy anywhere.' This means seamless cross-platform compatibility, allowing you to train your models in the cloud and deploy them at the edge, all while ensuring optimal resource efficiency and scalability.

If you are passionate about scaling AI sustainably and making AI development both powerful and accessible, we invite you to be a part of our team at Lemurian Labs. Collaborate with us as we build the future responsibly and innovatively.

Mar 13, 2025

TRM Labs
Full-time|$200K/yr - $275K/yr|On-site|San Francisco, CA

Contribute to a Safer World.

At TRM Labs, we specialize in cutting-edge blockchain analytics and AI solutions designed to assist law enforcement, national security agencies, financial institutions, and cryptocurrency businesses in identifying, investigating, and preventing crypto-related fraud and financial crimes. Our advanced blockchain intelligence and AI platforms provide essential tools for tracing the flow of funds, detecting illicit activities, building comprehensive cases, and mapping out potential threats. Trusted by top agencies and corporations worldwide, TRM Labs empowers a more secure and reliable environment for everyone.

Join our AI Engineering Team, dedicated to pioneering next-generation AI applications with a focus on Large Language Models (LLMs) and agentic systems. We strive to develop high-performance infrastructure, robust pipelines, and operational tools that facilitate the rapid, safe, and scalable deployment of AI systems.

Our work involves managing petabyte-scale data pipelines, delivering models with millisecond latency, and ensuring the observability and governance required for production-ready AI. We actively evaluate and integrate state-of-the-art tools in the LLM and agent domains, ranging from open-source stacks to vector databases and orchestration tools, to enhance TRM's innovation capabilities.

Your Impact:
Design and implement a robust agentic framework that supports tool usage, context retrieval, memory, and planning.
Create intelligent, modular agents that automate investigative responsibilities and enhance analyst decision-making.
Expand and optimize our LLM infrastructure (e.g., OpenAI, Anthropic, local models), including prompt engineering, RAG, and evaluation cycles.
Develop safe, observable, and auditable agent behaviors to ensure reliability in sensitive environments.
Assess performance based on metrics like reasoning, latency, success rate, and hallucination, iterating improvements based on user feedback and telemetry data.
Foster a culture of ownership, rapid experimentation, and ethical AI practices.

Qualifications:
Solid engineering background with significant experience in backend or systems development (Python preferred).
Practical experience in building LLMs, agents, and tooling frameworks (LangChain, semantic caches, vector databases, etc.).
Proficient with AI operational tooling and frameworks, ensuring effective deployment and management.

Jan 21, 2026

TRM Labs
Full-time|$200K/yr - $240K/yr|On-site|San Francisco, CA

Join Us in Building a Safer World.

At TRM Labs, we specialize in blockchain analytics and AI solutions aimed at assisting law enforcement, national security agencies, financial institutions, and cryptocurrency businesses in identifying, investigating, and preventing crypto-related fraud and financial crime. Our innovative platforms leverage blockchain intelligence and AI technology to trace funds, detect illicit activity, and construct comprehensive threat profiles. Trusted by leading organizations worldwide, TRM Labs is committed to enabling a safer and more secure environment for all.

Our AI Engineering Team is dedicated to pioneering next-generation AI applications, particularly in the realm of Large Language Models (LLMs) and agentic systems. Our goal is to develop resilient pipelines and high-performance infrastructure that facilitate the swift, safe, and scalable deployment of AI systems.

We manage extensive petabyte-scale pipelines, ensuring model serving with millisecond latency while providing the necessary observability and governance to make AI production-ready. Our team actively evaluates and integrates leading-edge tools in the LLM and agent space, including open-source stacks, vector databases, evaluation frameworks, and orchestration tools to accelerate TRM’s innovation pace.

As a Senior or Staff ML Systems Engineer – LLM, you will play a pivotal role in constructing and scaling our technical infrastructure for AI/ML systems.

Your responsibilities will include:
Creating reusable CI/CD workflows for model training, evaluation, and deployment, integrating tools such as Langfuse, GitHub Actions, and experiment tracking.
Automating model versioning, approval processes, and compliance checks across various environments.
Developing a modular and scalable AI infrastructure stack that encompasses vector databases, feature stores, model registries, and observability tools.
Collaborating with engineering and data science teams to embed AI models and agents into real-time applications and workflows.
Continuously assessing and incorporating state-of-the-art AI tools (e.g., LangChain, LlamaIndex, vLLM, MLflow, BentoML).
Promoting AI reliability and governance while enabling experimentation, ensuring compliance, security, and continuous uptime.
Enhancing AI/ML model performance and ensuring data accuracy and consistency, leading to improved model training and inference.
Implementing infrastructure to facilitate both offline and online evaluation of LLMs and agents.

Mar 12, 2026

Lemurian Labs
Full-time|On-site|SF Bay Area

About Us

At Lemurian Labs, we are dedicated to democratizing AI technology while prioritizing sustainability. Our mission is to create solutions that minimize environmental impact, ensuring that artificial intelligence serves humanity positively. We are committed to responsible innovation and the sustainable growth of AI.

We are in the process of developing a state-of-the-art, portable compiler that empowers developers to 'build once, deploy anywhere.' This technology ensures seamless cross-platform integration, allowing for model training in the cloud and deployment at the edge, all while maximizing resource efficiency and scalability.

If you are passionate about scaling AI sustainably and are eager to make AI development more powerful and accessible, we invite you to join our team at Lemurian Labs. Together, we can build a future that is innovative and responsible.

The Role

We are seeking a Senior ML Performance Engineer to take charge of designing and leading our Performance Testing Platform from inception. In this pivotal role, you will be recognized as the technical expert in measuring, validating, and enhancing the performance of large language models (including Llama 3.2 70B, DeepSeek, and others) prior to and following compiler optimization on cutting-edge GPU architectures.

This is a critical position that will significantly impact our product quality and customer success. You will work at the intersection of Machine Learning systems, GPU architecture, and performance engineering, constructing the infrastructure that substantiates the value of our compiler.

Oct 31, 2025

Pylon Labs
Full-time|On-site|San Francisco

Join Pylon Labs: Pioneering the Future of B2B Post-Sales

At Pylon, we're revolutionizing B2B post-sales support with our all-in-one platform, seamlessly powered by conversational data and enhanced with intelligent features. Our mission is to empower our customers to operate their businesses in real time.

With backing from renowned investors including a16z, BCV, General Catalyst, and Y Combinator, we proudly support over 1000 companies, such as Linear, Cognition (developers of Devin), Modal Labs, and Incident.io, who rely on Pylon for their customer success and support workflows. We are also honored to be featured on the Enterprise Tech 30 List.

This position is fully in-person, as we believe in the value of collaboration and teamwork.

Key Responsibilities
Facilitate the migration of new customers from their legacy support systems to Pylon.
Conduct customer migration meetings within your first week.
Act as the primary problem-solver for customers, guiding them in restructuring their processes and data on Pylon.
Become a product expert and assist customers in finding the right solutions, even when they are unsure of their needs.
Participate in pre-sales discussions as the go-to expert for potential solutions.
Develop expertise not just in Pylon, but also in systems we migrate from (like Zendesk, Intercom, etc.).
Manage multiple customer migrations concurrently and drive them to successful completion.
Continuously enhance the migration process for efficiency.

Qualifications
Based in or willing to relocate to San Francisco, with a strong desire to work in-person.
Highly organized, diligent, and capable of handling multiple migrations simultaneously.
Detail-oriented and quick to learn; you will need to master various tools swiftly.
A passion for experimenting with the product and recommending improvements.

Benefits
Comprehensive medical, dental, and vision insurance for employees
401(k) retirement savings plan
Commuter benefits
Parental leave

Feb 18, 2026

TRM Labs
Full-time|$220K/yr|On-site|San Francisco, CA

Create a Safer World.

TRM Labs specializes in blockchain analytics and artificial intelligence solutions designed to assist law enforcement, national security agencies, financial institutions, and cryptocurrency businesses in identifying, investigating, and combating crypto-related fraud and financial crime. Our blockchain intelligence and AI platforms offer innovative tools for tracing the origins and destinations of funds, detecting illicit activities, building comprehensive cases, and visualizing threats. Trusted by top agencies and organizations globally, TRM Labs is committed to fostering a safer, more secure world for everyone.

As an AI Engineering Manager on the Product Engineering team, you will redefine how users engage with software by leveraging cutting-edge technologies, free from the constraints of conventional SaaS. You will lead a diverse team of frontend, backend, and full-stack engineers to develop tools that empower crime fighters to effectively address the escalating risks posed by AI-driven criminal activities. Your mission will involve delivering workflows that are more autonomous, auditable, and ten times more efficient than existing SaaS solutions.

This position seamlessly blends leadership, technical expertise, and ownership of projects from inception to large-scale production systems. You will work closely with Product, Design, and AI-centric teams to transform vague concepts into user-friendly, scalable product experiences.

This role is an ideal match for you if…
You are an Engineering Manager who actively contributes by reviewing code, engaging in pull requests when necessary, and maintaining a deep understanding of the architecture.
You have a passion for pioneering advancements in AI, experimenting with large language models (LLMs), agents, and tools like Claude Code, and you excel at coordinating multiple agents simultaneously.
You thrive in dynamic environments and can turn ambiguities into actionable momentum without relying on detailed project specifications.
Engaging with customers excites you, and you leverage those interactions to inform product development.
You are dedicated to hiring and motivating exceptional engineers, recognizing that team quality is a critical product decision.
You aspire to have genuine ownership over the process, from initial concepts to scalable systems and the culture that supports both.

Your impact will include:
Leading and nurturing a team of engineers across various disciplines, setting high standards for craftsmanship, delivery speed, and accountability.
Taking full ownership of pivotal AI-driven product initiatives from conception through to execution, ensuring high-quality outcomes.

Mar 12, 2026

TRM Labs
Full-time|$200K/yr - $240K/yr|On-site|San Francisco, CA

Contribute to a Safer Future.

TRM Labs is at the forefront of blockchain analytics and AI technology, empowering law enforcement, financial institutions, and cryptocurrency enterprises to identify and combat cryptocurrency-related fraud and financial crime. Our innovative blockchain intelligence and AI tools are designed to trace fund flows, pinpoint illicit activities, build comprehensive cases, and provide actionable insights into potential threats. Trusted by prominent agencies and organizations globally, TRM is committed to fostering a safer and more secure environment for everyone.

Join our dynamic AI Engineering Team, dedicated to pioneering next-generation AI applications, with a particular emphasis on Large Language Models (LLMs) and agent-based systems. Our objective is to create efficient pipelines, high-caliber infrastructure, and operational tools that facilitate the rapid, safe, and scalable deployment of AI systems.

We oversee petabyte-scale data pipelines, deliver models with millisecond latency, and ensure the observability and governance necessary to make AI production-ready. Our team actively evaluates and integrates cutting-edge technologies in the LLM and agent domains, utilizing open-source stacks, vector databases, evaluation frameworks, and orchestration tools that enhance TRM’s agility and innovation capacity.

As a Senior or Staff AI Infrastructure Engineer, you will play a pivotal role in constructing and scaling the technical framework for AI and ML systems.

Your responsibilities will include:
Developing reusable CI/CD workflows for model training, evaluation, and deployment, integrating tools like Langfuse, GitHub Actions, and experiment tracking systems.
Automating model versioning, approval workflows, and compliance checks across various environments.
Building a modular and scalable AI infrastructure stack, encompassing vector databases, feature stores, model registries, and observability tools.
Collaborating with engineering and data science teams to embed AI models and agents into real-time applications and workflows.
Continuously assessing and integrating state-of-the-art AI tools (e.g., LangChain, LlamaIndex, vLLM, MLflow, BentoML).
Driving AI reliability and governance, facilitating experimentation while ensuring compliance, security, and uptime.
Enhancing the performance of AI and ML models.
Ensuring data accuracy, consistency, and reliability for improved model training and inference.
Deploying infrastructure to support both offline and online evaluations of LLMs and agents.

Mar 12, 2026

Multiply Labs
Full-time|On-site|San Francisco

Join Multiply Labs as a Senior Robotics Software Engineer

At Multiply Labs, we are at the forefront of innovation in San Francisco, California, backed by prestigious investors like Casdin Capital, Lux Capital, and Y Combinator. Our mission is to transform the landscape of cell therapy manufacturing through the development of advanced robotic systems that automate and scale production, making these life-changing treatments more accessible and affordable. Our solutions enable biopharma companies to streamline their processes while mitigating regulatory risks and lowering costs that can exceed $1M per patient.

To see our technology in action, visit www.multiplylabs.com and connect with us on LinkedIn. You can also explore our latest peer-reviewed study demonstrating the efficacy of our automated cell expansion processes at cytotherapy.org.

Position Overview

As a pivotal member of our Robotics Software Engineering Team, you will lead the development of intelligent software that drives our automated manufacturing systems. This role is ideal for a dedicated and hands-on engineer who is passionate about leveraging robotics and software to solve complex challenges in the biopharma sector. You will work alongside a world-class team, contributing to solutions that directly impact patient lives.

Oct 15, 2025

TRM Labs
Full-time|$200K/yr - $220K/yr|On-site|San Francisco, CA

Join Us in Building a Safer World.

At TRM Labs, we leverage blockchain analytics and AI solutions to empower law enforcement, national security agencies, financial institutions, and cryptocurrency businesses in their quest to detect, investigate, and thwart crypto-related fraud and financial crime. Our advanced platforms provide the intelligence needed to trace funds, identify illicit activities, build strong cases, and offer a comprehensive view of emerging threats. Trusted by top-tier agencies and businesses globally, TRM is committed to fostering a safer and more secure world for everyone.

We are seeking a talented Senior Frontend Platform Engineer to become a pivotal part of our dynamic Frontend Engineering team. This small yet agile team is dedicated to developing the visualization and rendering systems that drive TRM’s investigative platform. Your work will enable users to navigate complex blockchain activities through interactive graph exploration, entity networks, and data-rich investigative interfaces. In this influential role, you will contribute to the core frontend platform and visualization infrastructure, helping analysts explore extensive relational datasets and identify illicit activities on a large scale. As a key early team member, you will focus on high-performance rendering systems, graph visualization, and reusable frontend infrastructure that supports TRM’s investigative workflows.

Your Impact:
Design and implement the visualization platform that powers TRM’s investigative interfaces, including graph exploration and large-scale data visualization.
Develop high-performance rendering systems utilizing technologies such as Canvas, WebGL, and GPU-accelerated rendering to visualize intricate datasets.
Create reusable visualization libraries, SDKs, and platform primitives that empower teams across TRM to craft robust data exploration experiences.
Engage in technical design discussions and code reviews to enhance architecture, performance, and maintainability.
Deepen your understanding of crypto and blockchain investigation workflows to guide product design and platform capabilities.
Shape the future of data exploration tools used by investigators and financial institutions worldwide.

Qualifications:
Proficient in JavaScript and TypeScript with a strong background in building large-scale frontend systems.
Experience in developing data-dense web applications, such as analytics platforms and visualization tools.

Mar 12, 2026

Amari AI
Full-time|$150K/yr - $200K/yr|On-site|San Francisco

Join Our Team at Listen Labs

Surveys are often seen as a chore. They tend to gather superficial data, acting merely as a stand-in for genuine conversations. Unlike surveys, conversations allow for follow-up questions and deeper insights that make the experience enjoyable for the participant. However, engaging in one-on-one conversations with everyone is not feasible...

Listen Labs is revolutionizing this landscape with our innovative AI interviewer, designed to facilitate engaging dialogues while maintaining the simplicity and scalability of traditional surveys.

Technical Challenges Ahead

Transforming Qualitative Data into Quantitative Insights
At Listen Labs, we are focused on organizing free-form conversations into structured data. This complex task involves utilizing embeddings, fine-tuned large language models, and more to derive meaningful insights.

Crafting the Right Questions
Engaging in a productive conversation is an art. We are continuously refining our methods to ensure our LLMs generate the most relevant outputs.

Addressing Multi-Modality Challenges
We have developed a rapid speech-to-speech pipeline, yet there remain several challenges to enhance its performance. Our interviewer utilizes audio inputs and outputs to the LLM, a task that requires precision.

Our Investors
Online surveys have generated over $30 billion in market cap. Our partner at Sequoia, Bryan Schreier, was the initial investor in Qualtrics, a company valued at $12 billion.

Who We're Looking For
You thrive in dynamic environments and enjoy taking ownership of products from inception to completion.
You are eager to work with Next.js, TypeScript, and large language models.
Collaboration is in your DNA; you enjoy creating systems with fellow engineers in mind.
You possess strong written and verbal communication skills.
You recognize when to seek help, often to weigh trade-offs.
You approach all engineering tasks with seriousness, understanding that foundational work is essential.
You have a strong technical background.
You are passionate about building products from start to finish and enhancing user experiences.
You are excited about the potential of large language models.

What We Offer
Competitive salary ranging from $150,000 to $200,000.
Flexible work schedule and a supportive work environment.
Opportunities for professional growth and development.

Apr 4, 2024

TRM Labs
Full-time|$153K/yr - $220K/yr|On-site|San Francisco, CA

Join Us in Building a Safer World.

At TRM Labs, we are dedicated to providing cutting-edge blockchain analytics and AI solutions to empower law enforcement agencies, national security organizations, financial institutions, and cryptocurrency businesses. Our mission is to help these entities detect, investigate, and combat crypto-related fraud and financial crime. Through our innovative blockchain intelligence and AI platforms, we offer robust tools for tracing funds, identifying illicit activities, building cases, and understanding threat landscapes. Trusted by leading agencies and businesses around the globe, TRM is committed to fostering a safer, more secure world for everyone.

Our success hinges on our ability to attract, hire, and retain exceptional talent. Our Talent team comprises seasoned professionals with extensive experience in recruiting top-tier Engineering, Data Science, Go-To-Market (GTM), and People teams for some of the most renowned startups in Silicon Valley. In your role as a Senior Technical Recruiter, you will play a critical part in enhancing TRM's capability to engage and secure the best talent available, with your performance directly impacting the efficiency and quality of our talent acquisition efforts.

Your Impact:
Establish a dynamic recruiting process that identifies and attracts top talent for TRM.
Oversee the comprehensive recruiting lifecycle across multiple roles and teams.
Ensure a positive candidate experience at every interaction.
Craft innovative sourcing strategies to build a diverse and high-performing team.
Continuously enhance our recruiting processes to boost hiring efficiency.

Your Qualifications:
A minimum of 7 years of recruiting experience in high-growth technology startups based in the U.S.
Demonstrated expertise in filling challenging positions, utilizing sourcing tools, conducting market analysis, and creating tailored outreach strategies.
Ability to analyze and interpret recruitment metrics and market data, applying insights to refine strategies and offer actionable feedback to hiring managers.
Quick adaptability to new roles and requirements, exhibiting curiosity, flexibility, and resilience in a fast-paced, high-growth setting.
Exceptional communication skills, enabling effective collaboration with hiring managers, providing updates, and managing expectations while building trust as a strategic partner in the hiring process.

Jan 30, 2025

TRM Labs
Full-time|$200K/yr - $275K/yr|On-site|San Francisco, CA

Contribute to a Safer World.

At TRM Labs, we leverage blockchain analytics and AI innovations to empower law enforcement, national security entities, financial institutions, and cryptocurrency enterprises in identifying, investigating, and thwarting crypto-related fraud and financial misconduct. Our advanced blockchain intelligence and AI solutions facilitate the tracing of fund sources and destinations, recognition of illicit activities, case-building, and the development of a comprehensive threat landscape. We are a trusted partner to prominent organizations globally, dedicated to fostering a safer and more secure environment for all.

Our AI Engineering Team is focused on pioneering next-generation AI applications, with a particular emphasis on Large Language Models (LLMs) and agentic systems. Our goal is to create resilient pipelines, high-performance infrastructure, and operational tools that ensure AI systems can be deployed with speed, safety, and scalability.

We handle petabyte-scale data pipelines, deliver models with millisecond-level latency, and ensure the observability and governance necessary for AI to be production-ready. Our team is actively engaged in assessing and integrating state-of-the-art tools in the LLM and agent ecosystem, including open-source frameworks, vector databases, evaluation methodologies, and orchestration tools that empower TRM to innovate more rapidly than market demands.

Your Impact:
Design and implement a robust agentic framework that facilitates tool usage, context retrieval, memory integration, and strategic planning.
Develop intelligent, modular agents that automate investigative workflows and enhance analyst decision-making capabilities.
Expand and optimize our LLM infrastructure (e.g., OpenAI, Anthropic, on-premise models), including prompt engineering, retrieval-augmented generation, and user feedback loops.
Create safe, observable, and auditable agent behaviors, ensuring reliability in sensitive operational environments.
Assess performance based on metrics such as reasoning efficacy, latency, success rates, and hallucination instances, iterating based on user insights and system telemetry.
Foster a culture of ownership, rapid experimentation, and ethical AI deployment.

What We Seek:
Proven engineering expertise with extensive experience in backend or systems development (Python preferred).
Practical experience in building with LLMs, agent systems, and tooling frameworks (LangChain, semantic caches, vector databases, etc.).
Strong understanding of AI principles, particularly in relation to agentic systems and LLMs.

Mar 24, 2026

Pylon Labs
Full-time|On-site|San Francisco

Join Pylon Labs and Shape the Future of B2B Post-Sales Support!

At Pylon, we are revolutionizing the B2B post-sales landscape with our innovative all-in-one support platform. Our solution harnesses the power of conversational data and advanced intelligence, enabling our clients to streamline operations in real time.

Backed by prominent investors such as a16z, BCV, General Catalyst, and Y Combinator, we proudly serve over 1000 companies, including Linear, Cognition (creators of Devin), Modal Labs, and Incident.io. We are also featured on the Enterprise Tech 30 List.

About the Role
This is a unique opportunity to establish and enhance the support function at a pioneering customer support company!

You will not only utilize our support product daily but will also play a crucial role in providing feedback, suggesting roadmap enhancements, and improving processes.

Your Responsibilities
Address customer inquiries regarding our product across various topics.
Create and update knowledge base articles, including troubleshooting guides and feature descriptions.
Actively engage with Pylon's suite of support tools, providing valuable feedback to influence our product roadmap.
Collaborate with product and engineering teams to resolve bugs and troubleshoot issues.
Contribute to the development of a scalable support process.
Experiment with new processes, features, and leverage AI technologies.

Qualifications
Must be located in (or willing to relocate to) San Francisco and eager to work in-person.
Comfortable engaging with customers via chat and video calls.
A passion for exploring and experimenting with our product.
1-8 years of relevant experience.
A technical background and enthusiasm for technology is a significant advantage.

Our Benefits
Comprehensive medical, dental, and vision insurance for employees
401(k) retirement plan
Commuter benefits
Generous parental leave
14 company holidays and more!

Sep 26, 2025

Judgment Labs
Full-time|On-site|San Francisco

At Judgment Labs, we are revolutionizing the monitoring of agent behavior through our innovative infrastructure for Agent Behavior Monitoring (ABM). Unlike traditional observability metrics focused solely on logging exceptions and latency, our approach identifies behavioral anomalies including instruction drifts and context retrieval losses within scaled production environments.

Numerous teams developing autonomous agents depend on Judgment Labs to gain insights into their systems' performance after deployment. Rather than merely reacting to incidents, they can cluster patterns across conversations and workflows, correlate regressions with specific interaction types, and accurately identify where reliability falters in their operational contexts.

We are proud to announce that we have raised over $30 million in two funding rounds over the last five months. Our esteemed investors include Lightspeed, SV Angel, Valor Equity Partners, Nova Global, and notable individuals like Chris Manning and Michael Ovitz.

The Role:
We seek passionate Research Engineers to help us develop AI systems that utilize agent interaction data to enhance our understanding of agent behavior, facilitate large-scale evaluations, and drive improvements through iterative learning and feedback.

Your research will have a tangible impact. You will engage directly with real-world agent data, implement cutting-edge methodologies in production, and witness your contributions being deployed in real time. By enhancing the measurability and debuggability of agent behavior, your work will empower teams across finance, legal, operations, and other critical domains. You will lead projects from inception to completion, enjoying substantial autonomy while collaborating closely with our team to create self-improving agent systems.

What You'll Do:
Develop systems that aggregate, index, and analyze extensive agent interaction data to derive valuable evaluation metrics.
Create agent-based systems for the analysis and evaluation of complex, long-term behaviors.
Design and execute post-training and optimization workflows aimed at enhancing agent performance.
Build internal tools and infrastructure that promote rapid experimentation, analysis, and training.

What We're Looking For:
You should resonate with at least one of the following:
A strong focus on data quality, evaluation, and benchmarking, with a hands-on approach to working with complex datasets.
Experience in developing agent systems and applying them in real-world or production environments.
A robust background in machine learning or related fields, with an eagerness to advance agent technology.

Jan 11, 2026
Apply
Judgment Labs logo
Full-time|On-site|San Francisco

At Judgment Labs, we are pioneering the way that Agent Behavior Monitoring (ABM) is approached. Unlike conventional observability methods that primarily focus on logging exceptions and latency, our innovative ABM technology identifies behavioral anomalies such as instruction drifts and context retrieval losses in large-scale production environments.
Our platform is trusted by numerous teams developing autonomous agents, enabling them to gain insights into system behavior post-deployment. By moving beyond reactive incident management, our users can analyze patterns across conversations and workflows, correlate regressions to specific interaction types, and accurately identify where reliability issues arise within their operational context.
Recent funding success: We have successfully raised over $30M across two funding rounds within the last five months, attracting notable investors such as Lightspeed, SV Angel, Valor Equity Partners, and more.

Jan 11, 2026
Apply
TRM Labs logo
Full-time|$220K/yr - $220K/yr|On-site|San Francisco, CA

Join Us in Building a Safer Financial System.
At TRM Labs, we are at the forefront of blockchain analytics and AI technology, dedicated to empowering law enforcement, national security, financial institutions, and cryptocurrency businesses in the fight against crypto-related fraud and financial crime. Our advanced platforms leverage blockchain intelligence and AI to trace the flow of funds, identify illicit activities, build robust cases, and provide a comprehensive understanding of threats. Trusted globally, TRM Labs is committed to creating a safer and more secure environment for everyone.
Our mission is to develop an innovative financial system that benefits billions around the globe. By integrating threat intelligence with machine learning, our next-generation platform enables institutions and governments to detect cryptocurrency fraud and financial crimes on an unmatched scale.
As a Machine Learning Infrastructure Engineer at TRM Labs, you will collaborate with a talented team of data scientists, engineers, and product managers. Your role will involve designing and maintaining scalable GPU-powered infrastructure that supports our AI systems. You will work at the intersection of distributed systems, cloud infrastructure, and applied machine learning, laying the groundwork for high-throughput, production-level ML workloads.

Feb 25, 2026
Apply
Merge Labs logo
Full-time|On-site|San Francisco Bay Area

At Merge Labs, we are pioneering research at the intersection of biological and artificial intelligence, aiming to enhance human potential, autonomy, and experience. Our innovative approach focuses on developing cutting-edge brain-computer interfaces that communicate with the brain at high bandwidth, seamlessly integrate with advanced AI, and are designed to be safe and accessible for everyone.
About Our Team
The Lab Operations team serves as the essential operational foundation of Merge Labs, overseeing the complete lifecycle of our R&D environment, from the conceptualization and construction of our physical infrastructure to its large-scale operation. Our responsibilities encompass Equipment Management, Safety, Lab Facilities, Procurement, and Logistics, ensuring that the laboratory operates without a hitch. We view our operations role as a driving force, with a mission to establish a top-tier Lab Operations System that not only empowers our scientists and engineers but also accelerates their progress through optimal integrated systems and operational excellence.
Key Responsibilities:
Manage daily lab operations by executing essential tasks that keep the lab organized and functional: overseeing bench organization, glassware management, reagent and media inventory, and maintaining the meticulous upkeep necessary for a high-performing wet lab.
Ensure accurate lab systems by logging equipment into asset databases, completing maintenance tracking sheets, and conducting inventory counts while adhering to our scheduled protocols to reflect real-time operations.
Implement the equipment management strategy, which includes tagging new assets, updating the database, and scheduling preventive maintenance to maintain the accuracy of our operational records. This includes hands-on work such as running instrument calibrations, executing scheduled freezer defrost cycles, and routine equipment maintenance to keep the lab operational.
Support the lab's supply chain by placing orders, receiving and logging shipments, restocking workstations and cold storage, and proactively identifying low inventory levels before they impact operations.
Uphold a safe working environment by conducting daily facility checks, such as walkthroughs and hazard assessments, to ensure compliance with safety protocols.

Feb 2, 2026
Apply
World Labs logo
Full-time|$250K/yr - $325K/yr|On-site|San Francisco

About World Labs: At World Labs, we are revolutionizing the way artificial intelligence interacts with the world around us. Our foundational world models are designed to perceive, generate, reason, and engage with the 3D environment, thereby unlocking the full potential of AI through spatial intelligence. We are committed to transforming the realms of storytelling, creativity, design, simulation, and immersive experiences, both in virtual and physical worlds. Our team is composed of world-class professionals, each united by a shared curiosity and passion for technology. With backgrounds spanning AI research, systems engineering, and product design, we create a dynamic feedback loop that seamlessly integrates cutting-edge research with user-centric products.
Role Overview: We are seeking a Senior Full Stack Product Engineer who excels at the intersection of product development and engineering. You should possess robust coding skills across the tech stack, a keen eye for crafting intuitive user experiences, and the technical prowess to develop sophisticated 3D web applications. In this role, you will take ownership of product features from inception to deployment, building responsive, user-centric web applications as well as backend generative AI services and APIs. Collaboration with our research team will be crucial as we work together to translate innovative AI research into practical applications. As a player-coach, your engineering expertise and passion for product development will drive our success.

Feb 18, 2026
Apply
Planet Labs logo
Full-time|$144.5K/yr - $180.6K/yr|Hybrid|San Francisco, CA

Welcome to Planet Labs. We are dedicated to leveraging space technology to enhance life on Earth.
As the architects of the largest constellation of imaging satellites ever created, Planet Labs provides an unparalleled dataset of empirical information through our innovative cloud-based platform. Our impact spans commercial, environmental, and humanitarian sectors, merging our identity as both a space and data company.
Our data empowers customers globally to innovate, drive revenue, conduct research, and tackle the most pressing challenges our world faces.
At Planet, we manage every aspect of hardware design, manufacturing, data processing, and software engineering, fostering a dynamic environment filled with experts from diverse fields.
We prioritize a people-first approach in our corporate culture and community, striving for continuous improvement to support our team members and prepare for future growth. Join us at Planet Labs and contribute to our mission of transforming global perspectives.
With a global workforce, our employees work remotely from locations including San Francisco, Washington DC, Germany, Austria, Slovenia, and The Netherlands.
About the Role:
Planet Labs aims to capture daily images of the entire world, making significant global changes visible, accessible, and actionable. We are at a pivotal moment, transitioning from extensive AI research to a focused delivery model. To facilitate this, we are establishing a new product group dedicated to launching an AI Geospatial Assistant that will revolutionize how our customers utilize global imagery to address critical challenges in forensics and daily change detection.
Our objective is to simplify complex insights through an intuitive interface, requiring no user training. Operating with a startup mentality, our team emphasizes rapid learning and customer-driven milestones to efficiently progress from private alpha to general availability.
As a Software Engineer, you will play a key role in developing the backend systems that will bring our AI Geospatial Assistant to fruition. While our research teams focus on creating core models, you will be tasked with the 'last mile' of delivery, designing high-throughput backend services, scaling our systems, and ensuring that our workflows operate swiftly, reliably, and cost-effectively on a global scale.
This position is full-time and hybrid, requiring you to work from our San Francisco office three days a week.

Mar 6, 2026
