Llm Engineer Shape The Future Of Natural Language Computing jobs in San Francisco – Browse 5,294 openings on RoboApply Jobs

Llm Engineer Shape The Future Of Natural Language Computing jobs in San Francisco

Open roles matching “Llm Engineer Shape The Future Of Natural Language Computing” with location signals for San Francisco. 5,294 active listings on RoboApply Jobs.

5,294 jobs found

1 - 20 of 5,294 Jobs
Apply
companySoftware Apps Inc. logo
Full-time|Hybrid|San Francisco

About UsAt Software Apps Inc., we are pioneering the field of natural-language computing with our flagship product, Sky, designed specifically for Mac users. Our team is driven by a shared commitment to innovation, collaboration, and excellence. To learn more about our mission and values, visit www.software.inc/jobs.The RoleWe are seeking a talented LLM Engineer to be a vital contributor to our product development. In this role, you will design and optimize our data pipelines, refine our system architecture, and implement evaluation mechanisms. Your expertise will guide strategic decisions, balancing ambition with practicality, while you manage the iterative process of fine-tuning, evaluation, and deployment.Your Daily Responsibilities Will Include......Innovating Exceptional Software. Your enthusiasm and insight will transform visionary concepts into actionable strategies, even if it sometimes means taking risks without a clear path ahead. Your capacity to learn and adapt is more crucial than existing knowledge....Taking Full Ownership of Projects. You will be the driving force behind the success of your systems and features, demonstrating a commitment to delivering results. Your proactive approach will ensure continual improvement through feedback and quality enhancements....Influencing Architectural Decisions. Leverage your familiarity with the latest model architectures to create robust systems for data collection, training, and inference. Your role will include optimizing model performance while carefully considering user privacy and effective data gathering....Thinking Both Broadly and Precisely. You recognize that your infrastructure choices significantly affect the end-user experience. With a focus on performance, you will develop large-scale models for cloud inference or finely tuned on-device models that prioritize efficiency.

May 14, 2025
Apply
companySoftware Apps Inc. logo
Full-time|On-site|San Francisco

About UsAt Software Apps Inc., we are pioneering the future of technology with our groundbreaking product, Sky, which utilizes natural-language computing tailored for your Mac. Join us in our mission to innovate and transform how users interact with technology.Discover more about our team, values, and vision on our careers page: www.software.inc/jobsOur ValuesCollaboration is Key: We thrive on teamwork and believe in the power of in-person collaboration. Every team member is seen as a leader, and we encourage ownership of projects to foster growth.Honest Communication: Empathetic and open communication is vital for our close-knit team. We strive to listen as much as we talk, respecting every voice in the room.Cultivating Curiosity: In the ever-evolving landscape of AI and computing, staying curious is essential. We ask questions that guide our decisions, ensuring we stay aligned with our vision.The RoleWe are seeking a talented Software Engineer to play a pivotal role in shaping our product. You will be responsible for developing new, user-facing software. Your ability to balance ambition with feasibility will be crucial as you engage in an iterative process of building, testing, gathering feedback, and refining your work.Your Daily Responsibilities Will Include:Creating Innovative Software: Utilize your skills and passion to transform visionary ideas into actionable plans. Sometimes, this requires taking bold steps without knowing the final outcome.Taking Ownership: You'll have full responsibility for the success of your projects. Your commitment to incorporating feedback and improving quality is essential. We trust you to handle significant responsibilities.Thinking Big and Small: Understanding that every choice impacts user experience, you’ll focus on details that create seamless and magical interactions.Documenting Your Work: Keeping thorough and clear documentation is key to our collaborative approach.

May 14, 2025
Apply
companyWhatnot logo
Full-time|On-site|San Francisco, CA

Join Whatnot as an LLM Platform Engineer where you'll be at the forefront of developing and optimizing cutting-edge language models. In this role, you will collaborate with a dynamic team of engineers and data scientists to enhance our machine learning infrastructure and algorithms. Your contributions will directly impact the efficiency and effectiveness of our language understanding capabilities.

Mar 3, 2026
Apply
companyFuture logo
Full-time|$200K/yr - $250K/yr|Remote|Remote

Future builds digital personal training experiences that connect people with expert coaches through a seamless app. Since 2017, the company has grown from an idea in a San Francisco café to the largest provider of personal training sessions in the US. In January 2025, Future merged with Autograph, founded by Tom Brady, and is expanding its reach through new partnerships and AI-driven coaching tools. Future continues to invest in technology, grow its coaching roster, and form partnerships with leading athletes. The team is focused on shaping the future of fitness by making expert coaching accessible to more people. Role overview This remote Cloud Infrastructure Engineer position centers on designing, building, and maintaining the cloud platform that underpins Future’s products. The role is hands-on and impacts daily operations for engineering teams, focusing on reliability, security, and efficiency. What you will do Develop and maintain infrastructure-as-code best practices using AWS CDK, keeping cloud resources version-controlled, repeatable, and peer-reviewed. Design and manage AWS infrastructure components, such as ECS, RDS Aurora, API Gateway, S3, and networking, with attention to reliability, performance, and cost efficiency. Build and support an observability stack, including structured logging, distributed tracing, and monitoring, to provide insights into system performance. Requirements Strong experience with AWS and a focus on building resilient, automated systems. Commitment to operational excellence, security, and cost efficiency. Emphasis on enabling engineering teams to deliver work quickly and confidently.

Apr 29, 2026
Apply
companyPlaud Inc. logo
Full-time|On-site|San Francisco, CA

About Plaud Inc.Plaud is at the forefront of developing the most reliable AI work companion designed for professionals, enhancing productivity through innovative note-taking solutions. Since our inception in 2023, we have garnered the trust of over 1,500,000 users globally. Our mission is to enhance human intelligence by creating state-of-the-art intelligence infrastructure and interfaces that effectively capture, extract, and utilize information from verbal, auditory, visual, and cognitive inputs.Headquartered in San Francisco, Plaud Inc. is a Delaware-incorporated company that is redefining human-AI collaboration through a unique synergy of hardware and software solutions. We prioritize the highest standards of data security and privacy, being compliant with SOC 2, HIPAA, GDPR, ISO27001, ISO27701, and EN18031.To discover more about our offerings, please visit https://www.Plaud.ai and connect with us on Instagram, X, Facebook, LinkedIn, and YouTube.

Dec 12, 2025
Apply
companyLlamaIndex logo
Full-time|On-site|San Francisco

Join our innovative team and help define the future of AI, focusing on the narrative of document understanding.About the Role:We are on the lookout for talented AI engineers to become a part of our dedicated document understanding team. In this role, you'll be at the crossroads of computer vision, natural language processing, and production machine learning systems, driving advancements in document parsing and comprehension.Our team powers LlamaParse, LlamaExtract, and other advanced processing solutions, handling millions of intricate documents such as PDFs, PowerPoints, Word files, and spreadsheets. Your contributions will significantly influence numerous developers who are creating RAG applications and document agents, in addition to enhancing our open-source frameworks that revolutionize industry standards in document processing.Depending on your expertise and interests, you may concentrate on data curation, model fine-tuning, or ML infrastructure. We are hiring multiple candidates and will collaborate with you to identify the perfect role for your skills.

Nov 21, 2025
Apply
companyYutori logo
Full-time|On-site|San Francisco, California, United States

At Yutori, we are transforming the way individuals engage with the digital realm by developing AI agents capable of efficiently performing everyday online tasks. Our approach is to create a comprehensive, agent-first ecosystem, encompassing everything from training proprietary models to designing innovative generative product interfaces.To further this mission, we are seeking a skilled AI Engineer to join our pioneering team. Ideal candidates should possess strong technical expertise and a passion for crafting superhuman AI agents that can navigate the web autonomously.Our founders — Devi Parikh, Abhishek Das, and Dhruv Batra — bring a wealth of experience in AI research and product development, particularly in generative, multimodal, and embodied AI, honed during their time at Meta. Our team merges AI proficiency with a design-oriented approach to advance Yutori’s objectives.Yutori is proudly supported by a distinguished group of visionary investors, including Elad Gil, Sarah Guo, Jeff Dean, Fei-Fei Li, Amjad Masad, Guillermo Rauch, Akshay Kothari, Soleio, Oliver Cameron, Julien Chaumond, Logan Kilpatrick, Bryan McCann, Vladlen Koltun, Jamie Cuffe, Michele Catasta, and many others.

Mar 26, 2025
Apply
companygleanwork logo
Full-time|Remote|San Francisco Bay Area

Join gleanwork as a Machine Learning Engineer specializing in LLM evaluations and observability. In this role, you will be instrumental in developing cutting-edge machine learning systems that enhance our understanding and effectiveness of language learning models. You will collaborate with cross-functional teams to drive the integration of advanced analytics and machine learning solutions.

Mar 16, 2026
Apply
company
Full-time|$200K/yr - $240K/yr|On-site|San Francisco, CA

Join Us in Building a Safer World.At TRM Labs, we specialize in blockchain analytics and AI solutions aimed at assisting law enforcement, national security agencies, financial institutions, and cryptocurrency businesses in identifying, investigating, and preventing crypto-related fraud and financial crime. Our innovative platforms leverage blockchain intelligence and AI technology to trace funds, detect illicit activity, and construct comprehensive threat profiles. Trusted by leading organizations worldwide, TRM Labs is committed to enabling a safer and more secure environment for all.Our AI Engineering Team is dedicated to pioneering next-generation AI applications, particularly in the realm of Large Language Models (LLMs) and agentic systems. Our goal is to develop resilient pipelines and high-performance infrastructure that facilitate the swift, safe, and scalable deployment of AI systems.We manage extensive petabyte-scale pipelines, ensuring model serving with millisecond latency while providing the necessary observability and governance to make AI production-ready. Our team actively evaluates and integrates leading-edge tools in the LLM and agent space, including open-source stacks, vector databases, evaluation frameworks, and orchestration tools to accelerate TRM’s innovation pace.As a Senior or Staff ML Systems Engineer – LLM, you will play a pivotal role in constructing and scaling our technical infrastructure for AI/ML systems. Your responsibilities will include:Creating reusable CI/CD workflows for model training, evaluation, and deployment, integrating tools such as Langfuse, GitHub Actions, and experiment tracking.Automating model versioning, approval processes, and compliance checks across various environments.Developing a modular and scalable AI infrastructure stack that encompasses vector databases, feature stores, model registries, and observability tools.Collaborating with engineering and data science teams to embed AI models and agents into real-time applications and workflows.Continuously assessing and incorporating state-of-the-art AI tools (e.g., LangChain, LlamaIndex, vLLM, MLflow, BentoML).Promoting AI reliability and governance while enabling experimentation, ensuring compliance, security, and continuous uptime.Enhancing AI/ML Model Performance and ensuring data accuracy and consistency, leading to improved model training and inference.Implementing infrastructure to facilitate both offline and online evaluation of LLMs and agents.

Mar 12, 2026
Apply
companyWaymo logo
Full-time|Hybrid|Mountain View, CA USA; San Francisco, CA USA;

Join Waymo as a Senior Machine Learning Engineer focusing on Perception LLM/VLM. In this role, you will leverage cutting-edge machine learning techniques to enhance our autonomous driving technology. You will collaborate with a talented team of engineers and researchers to develop algorithms that improve our perception systems, ensuring safety and efficiency on the road.

Mar 12, 2026
Apply
company
Full-time|On-site|San Francisco Bay Area

About Retell AI Retell AI builds voice AI technology that helps businesses transform their call center operations. In just 18 months, thousands of companies have adopted Retell’s AI voice agents to streamline sales, support, and logistics, work that once required large human teams. Backed by investors including Y Combinator and Alt Capital, Retell has grown annual recurring revenue from $5M to $36M with a focused team of 20. The company’s goal for 2026: a modern customer experience platform where AI powers entire contact centers. Retell is developing AI “workers” that can serve as frontline agents, quality assurance analysts, and managers, handling, evaluating, and improving customer interactions on their own. Named a top 50 AI app by a16z: https://tinyurl.com/5853dt2x Ranked #4 on Brex’s Fast-Growing Software Vendors of 2025: https://www.brex.com/journal/brex-benchmark-december-2025 Featured on the Lean AI Leaderboard: https://leanaileaderboard.com/ Role Overview: Research Scientist – LLM Retell AI is hiring a Research Scientist focused on large language models (LLMs) and audio processing. This role suits machine learning researchers who want to push the boundaries of real-time AI and see their work in production. What You Will Do Investigate new approaches in large language models and audio processing for human-like voice agents Design and implement evaluation methods for complex, real-world conversational systems Prototype systems to improve reasoning, reduce latency, and enhance conversation quality Work closely with engineering and product teams to bring research advances into production Impact Research at Retell directly shapes the capabilities of voice AI agents for thousands of businesses. The work blends advanced research with practical deployment, improving how customers interact with automated systems across industries. Location This position is based in the San Francisco Bay Area.

Apr 14, 2026
Apply
companyDatabricks logo
Full-time|$190K/yr - $253.8K/yr|On-site|Mountain View, California; San Francisco, California

P-931 At Databricks, we are dedicated to empowering data teams to tackle some of the most challenging problems in the world—from revolutionizing transportation to fast-tracking medical innovations. We achieve this by developing and managing the foremost data and AI infrastructure platform, enabling our clients to leverage profound data insights to enhance their enterprises. Founded by engineers with a customer-centric approach, we seize every chance to resolve technical challenges, from crafting next-generation UI/UX for data interactions to scaling our services and infrastructure across millions of virtual machines. And we’re just getting started. Within Databricks, the Compute Infrastructure organization is responsible for building and operating the essential framework that supports all Data, AI, and stateful workloads across major cloud platforms. Our system launches tens of millions of VMs daily, manages thousands of Kubernetes clusters, and must deliver exceptional elasticity, reliability, and cost-effectiveness. We are in search of an Engineering Manager to lead a team focused on pivotal components of this platform. Your contributions will significantly impact product delivery speed, customer satisfaction, and our company's scalability. The impact you will have: Own and enhance the compute platform to support all Databricks workloads, enabling engineers to create top-tier products with high velocity and superior performance. Recruit exceptional engineers and nurture their development through guidance, feedback, and career advancement opportunities. Elevate the technical and operational standards through robust design practices, rigorous testing, and a culture of engineering excellence and platform thinking. Collaborate with engineering and product leadership to establish long-term strategies and roadmaps. Lead cross-functional initiatives encompassing both product and infrastructure domains. Influence architectural decisions that extend beyond your immediate team.

Feb 13, 2026
Apply
companyGlacier logo
Full-time|$175K/yr - $250K/yr|Hybrid|San Francisco Office

Join our innovative team at Glacier! This hybrid role requires in-office presence on Tuesdays and Thursdays.At Glacier, we're on a mission to address one of the most pressing challenges of our time: waste management. Did you know that over half of recyclables in the U.S. end up in landfills? We're committed to changing that narrative. Our efforts not only aim to enhance recycling practices, but also to mitigate carbon emissions, conserve energy, and protect our natural resources.We develop advanced sorting robots tailored to efficiently separate recyclables, combined with AI-driven business analytics that empower recyclers to optimize their operations and promote a more circular economy.Our technology has garnered the trust of major clients, including Colgate, Amazon, and municipal recycling facilities, as we turn recycling data into actionable insights. Our innovations have been featured in TIME's Best Inventions, a TIME documentary, and various leading publications like TechCrunch, Fortune, and CBS.The Role:We are seeking a dynamic and experienced technical leader to spearhead Glacier's Computer Vision strategy and manage our engineering team. This is a hands-on leadership position overseeing a distributed team across the U.S. and globally.Computer Vision is integral to our product offerings and significantly influences our company's achievements. This position will report directly to the co-founder and CTO.What You’ll Do:Drive the vision, strategy, and execution of Glacier's Computer Vision roadmap.Lead and cultivate our distributed Computer Vision engineering team through hiring, onboarding, and performance management.

Feb 19, 2026
Apply
companyfal logo
Full-time|$180K/yr - $250K/yr|On-site|San Francisco

Join our innovative team at fal as a Staff Software Engineer specializing in large-scale computation platforms. We are seeking a seasoned software engineer with extensive experience in developing backend systems that efficiently orchestrate workloads and manage resource constraints. Your expertise in foundational cloud infrastructure and Linux provisioning will be crucial as you work towards achieving high reliability and scalability with minimal operational overhead.

Dec 16, 2025
Apply
companyPinterest, Inc. logo
Full-time|Remote|San Francisco, CA, US; Remote, US

Join Pinterest as a Principal Engineer for our Compute Platform, where you'll play a crucial role in driving the architecture and implementation of scalable systems that power our services. You will lead a team of talented engineers, guiding them in building innovative solutions that enhance our platform's performance and reliability.In this position, you will have the opportunity to collaborate with cross-functional teams, mentor junior engineers, and contribute to the development of best practices and high-quality code. If you are passionate about technology and eager to make an impact, we would love to hear from you!

Mar 4, 2026
Apply
companyCrusoe logo
Full-time|On-site|San Francisco, CA - US

Join Crusoe as a Senior Engineering Manager in Compute where you will play a pivotal role in leading cutting-edge engineering teams. You will be responsible for overseeing the development and execution of our innovative computing solutions, ensuring performance and reliability across various platforms.Your leadership will guide teams toward achieving engineering excellence, fostering a collaborative environment, and driving strategic initiatives. This is an opportunity to make a significant impact within a rapidly growing company at the forefront of technology.

Feb 25, 2026
Apply
companysim logo
Full-time|On-site|San Francisco

About the RoleIn this pivotal position, you will spearhead the complete development of our core agentic workflow engine and Copilot—an innovative AI assistant designed to facilitate developers in creating and troubleshooting workflows through natural language, transforming abstract intentions into actionable, dependable agents. This role encompasses our backend (Next.js), orchestration layer, and all integrations with LLMs and external APIs.As the current leading tool for constructing workflows using natural language, your mission is to preserve and enhance its status—ensuring it remains swift, reliable, and adept at managing increasingly intricate agentic architectures.This foundational role allows you to establish architectural decisions, reliability benchmarks, and coding patterns that will define the functionality of our core product. Collaborating within a team of five, your contributions will be deployed to tens of thousands of developers on rapid release cycles.What You'll DoTake ownership of the agentic workflow engine: the runtime responsible for executing multi-step, tool-utilizing agent workflows in production.Develop and enhance Copilot—our natural language interface for creating, modifying, and debugging workflows.Design and sustain integrations with LLM providers (OpenAI, Anthropic, Google, local models via Ollama) and external APIs.Architect the orchestration layer that transforms visual flows into reliable, observable agent executions.Establish technical standards for the codebase—covering reliability, testing, code quality, and architectural patterns.Engage across the stack in our Next.js monorepo, delivering to production daily.Troubleshoot complex issues at the nexus of LLMs, distributed systems, and developer tooling.What We're Looking ForA capable generalist engineer with a history of deploying complex distributed systems or developer tools into production.Extensive experience with TypeScript/JavaScript and Bun; familiarity with working in a Next.js monorepo.Proven record of owning and managing production systems—you've been on call, debugged challenging issues, and ensured reliability.Experience integrating with LLMs or building upon foundational model APIs.Demonstrated high agency and ownership—within a small team, you help define the roadmap as much as you execute it.Strong architectural opinions that are adaptable—you prioritize building correctly while maintaining rapid development.Experience with AWS infrastructure is a plus.

Mar 2, 2026
Apply
company
Full-time|On-site|San Francisco Bay Area

About OpenArtOpenArt is an innovative AI Storytelling and Visual Creation Platform that empowers millions of creators globally. We are on a mission to develop the next generation of creative tools powered by advanced AI technology, enabling users to create videos, visuals, characters, and narratives with unmatched speed and creativity. We envision a future where creativity is driven by AI, and we are at the forefront of this transformation. Why Join OpenArtTake ownership of the measurement and attribution layer for a rapidly growing AI enterprise.Make a direct impact on growth—your contributions enhance acquisition efficiency and revenue generation.Engage in a highly collaborative role across product, marketing, and data.Build systems from scratch—define our tracking, attribution, and growth optimization strategies.Enjoy a high-ownership, low-process, fast-paced environment.Experience 7–10X revenue growth over the past two years as we expand our growth infrastructure. About the RoleWe are seeking a passionate Growth Data Engineer to develop and manage OpenArt's marketing data and attribution systems, ensuring precise measurement and optimization of all acquisition and lifecycle efforts.This pivotal role bridges data engineering, marketing technology, and analytics, focusing on ensuring reliable data flow between product, data warehouse, CRM, and advertising platforms.You will guarantee that every marketing investment is measurable, attributable, and optimizable—tracking the journey from first touch to revenue generation. What You’ll DoDesign and maintain marketing attribution systems (including UTM tracking, user journeys, and conversion mapping).Implement and oversee first-party data passback to ad platforms (e.g., conversions, revenue, LTV signals).Configure and maintain tracking infrastructure:Google Tag Manager (client + server-side)Google Ads (Enhanced Conversions)Meta Pixel + Conversions APILinkedIn, TikTok, and various other advertising platforms.Establish and maintain conversion events and schemas across platforms.Ensure consistent event definitions and naming conventions across product, CRM, and ad platforms.Collaborate with teams to optimize data utilization and reporting.

Mar 26, 2026
Apply
companyRylo logo
Full-time|On-site|San Francisco, CA

At Rylo, we are revolutionizing the way you capture and share your experiences. Our state-of-the-art camera is designed to record your surroundings with breathtaking clarity and stability, eliminating the hassle of traditional video capture. Created by a team of visionary engineers from Instagram and Apple, our innovative stabilization software and user-friendly smartphone app ensure that every shot you take is a masterpiece. With Rylo, you can focus on enjoying the moment while we handle the technicalities of creating stunning videos.Experience Rylo in actionAs a Software Engineer specializing in Computational Photography, you will play a crucial role in enhancing the core algorithms that power the Rylo camera and future products. Your work will fundamentally enhance the photography and cinematography experience, focusing on improving image quality and developing groundbreaking computational photography features. You will engage in the complete lifecycle of algorithm development, from design and implementation to quality evaluation and performance optimization, culminating in successful deployment.Your collaboration with software engineers, hardware engineers, and designers will allow you to push the boundaries of consumer camera technology.

Mar 1, 2026
Apply
companyUncountable logo
Full-time|$130K/yr - $175K/yr|Hybrid|New York, San Francisco, Munich or London

We appreciate your interest in joining the Uncountable Engineering team!About the RoleAt Uncountable, we are on a mission to revolutionize R&D by creating an AI-first ecosystem. Our cutting-edge tools are designed to empower scientists at Fortune 500 companies to enhance their discovery processes utilizing Generative AI.As an LLM Applications Engineer, you will play a pivotal role in shaping our LLM infrastructure. You will not only build user interfaces but also architect retrieval systems, agentic workflows, and data pipelines that connect intricate experimental data with actionable AI insights, significantly modernizing the data interaction for top-tier researchers.

Jan 19, 2026

Sign in to browse more jobs

Create account — see all 5,294 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.