Technical Staff Member Post Training Reinforcement Learning jobs in Palo Alto – Browse 225 openings on RoboApply Jobs

Open roles matching “Technical Staff Member Post Training Reinforcement Learning” with location signals for Palo Alto. 225 active listings on RoboApply Jobs.

xAI
Full-time|On-site|Palo Alto, CA

xAI is seeking a Technical Staff Member focused on Post-Training and Reinforcement Learning at its Palo Alto, CA location. This position centers on advancing AI technology through hands-on project work and collaboration. Role overview: This role involves contributing to projects that explore and extend the capabilities of AI systems. The work emphasizes post-training techniques and reinforcement learning methods, supporting the ongoing development of advanced solutions. Collaboration: Teamwork is central to this position. You will work closely with colleagues to share ideas, refine approaches, and help shape the next generation of AI systems at xAI.

Apr 29, 2026
xAI
Full-time|$180K/yr - $440K/yr|On-site|Palo Alto, CA; San Francisco, CA

About xAI: At xAI, we are on a mission to develop cutting-edge AI systems that not only comprehend the complexities of the universe but also empower humanity in its quest for knowledge. Our team is small yet highly driven, dedicated to achieving engineering excellence. We welcome individuals who relish challenges and have an insatiable curiosity. Operating with a flat organizational structure, we encourage hands-on contributions from all team members towards our collective mission. Leadership is earned through initiative and consistent high performance, making work ethic and prioritization crucial. Strong communication skills are essential, as sharing knowledge effectively with teammates is a key expectation. About the Role: xAI is on the lookout for skilled software engineers to construct robust data pipelines, develop comprehensive evaluation frameworks for benchmarking large language models (LLMs), and create automation solutions that enhance the productivity of our researchers and engineers. Focus Areas: Developing and maintaining frameworks for agent, data, and model evaluation tasks. Creating environments for AI agents. Designing tools to automate common workflows. Enhancing alerts, metrics, and error handling for large-scale reinforcement learning tasks. Refactoring existing agent, data, evaluation, and training frameworks for improved modularity. Establishing operational procedures and coding standards to facilitate the transition from small-scale experiments to large-scale reinforcement learning training. Implementing unit tests and CI/CD frameworks to support rapid development cycles. Ideal Experience: Proven experience in building and maintaining frameworks utilized by multiple engineers. Expertise in creating high-performance sandboxes, virtual machines, and simulations. Experience in developing full-stack applications for workflow automation and data visualization. Capability in rapidly iterating research into production cycles. Knowledge in test automation and CI/CD practices. Typical Challenges You Will Encounter: Exploring new agentic model capabilities...

Dec 29, 2025
Odyssey AI Lab
Full-time|On-site|Palo Alto

About Us: Odyssey is at the forefront of artificial intelligence research, specializing in general-purpose world models that are revolutionizing consumer, enterprise, and intelligence applications. Our innovative models, such as the Odyssey-2 Pro, represent the next significant advancement in AI technology. Position Overview: We are on the lookout for passionate individuals who are dedicated to maximizing performance from complex systems. Our goal is to develop inference infrastructure capable of scaling to hundreds of thousands of users within a year while handling vast and continuously expanding datasets. Your role will be critical in ensuring our models achieve outstanding speed, reliability, and scalability during both training and inference phases, optimizing efficiency to minimize TFLOPS per user and the costs associated with training compute. Key Responsibilities: Enhance models for real-time use by a user base in the hundreds of thousands. Design and execute distributed training strategies aimed at reducing training time and resource usage across extensive GPU clusters. Collaborate with a high-caliber team of ML researchers and engineers to ensure model architectures are performance-driven from the start. Create advanced tools to pinpoint performance issues and stability challenges in both training and deployment environments. Innovate new approaches, frameworks, and system designs that improve performance metrics throughout our model development and inference infrastructure. Enjoy a considerable degree of autonomy in making technical decisions. Utilize state-of-the-art GPUs in your work.

Mar 11, 2026
xAI
Full-time|$180K/yr - $440K/yr|On-site|Palo Alto, CA; San Francisco, CA

Join xAI as a Member of the Technical Staff specializing in Inference Engineering. Our mission is to engineer cutting-edge AI systems that enhance humanity's understanding of the universe. We are a dynamic, compact team dedicated to excellence, where each member is encouraged to take initiative and contribute significantly to our objectives. The ideal candidate will thrive in a collaborative environment, showcasing their expertise in optimizing model inference and developing robust systems capable of serving billions of users. If you are passionate about pushing the boundaries of AI technology and enjoy tackling complex challenges, we want you on our team.

Dec 29, 2025
xAI
Full-time|$180K/yr - $440K/yr|On-site|Palo Alto, CA

Join Our Innovative Team at xAI: At xAI, we are on a mission to develop advanced AI systems capable of deep comprehension of the universe, ultimately assisting humanity in its quest for knowledge. Our team is small yet highly motivated, dedicated to engineering excellence and driven by curiosity. We encourage individuals who thrive in challenging environments and seek to push boundaries. Embracing a flat organizational structure, we empower all employees to be hands-on contributors to our mission, rewarding those who demonstrate initiative and commitment to exceptional results. Strong communication skills are essential, enabling team members to share insights effectively. About the Role: The Mid-Training Team at xAI is tasked with creating an omni model that comprehensively understands the universe through various modalities: text, image, video, and audio. We are seeking talented engineers with expertise in multimodal mid-training data to help us achieve this goal. Technical Expertise Required: Python; JAX and XLA; Spark; Ray. Location: This position is located in the Bay Area, specifically in Palo Alto and San Francisco. Candidates should either be based in the Bay Area or be open to relocation. Key Responsibilities: Scale synthetic coding data to trillions of tokens utilizing large-scale Docker verification. Transform flagship model intelligence into streamlined flash models through synthetic data generation. Optimize mid-training data mixtures to enhance reinforcement learning outcomes. Engineer innovative long-context data recipes. Develop comprehensive and diverse evaluation methods for mid-training checkpoints. Preferred Qualifications: Expertise in machine learning and large model scaling, with a solid understanding of various scaling laws. Proven ability to design and execute machine learning experiments. Familiarity with state-of-the-art techniques for curating training data across text, image, audio, and video modalities. Strong analytical and problem-solving skills.

Jan 8, 2026
Simile
Full-time|On-site|Palo Alto

About Simile: At Simile, we believe in revolutionizing decision-making by simulating the complexities of society. Just as pilots and surgeons train in controlled environments, we aim to equip organizations with the ability to anticipate human behavior through advanced AI simulations. Our pioneering research has established a new frontier in AI-based modeling, allowing us to forecast human behavior across various scenarios and scales. With substantial backing of $100 million from prominent investors including Index Ventures and AI luminaries such as Andrej Karpathy and Fei-Fei Li, we are committed to pushing the boundaries of artificial intelligence. Join Our Infrastructure Team: The Infrastructure team at Simile is crucial to our platform's success. We design and implement the foundational systems that enable our AI agents to function securely and efficiently on a large scale. We specialize in high-scale cloud networking and distributed systems, ensuring enterprise-grade privacy. Our Work is Organized Around Three Key Pillars: Cloud Foundation: Overseeing our multi-cloud environment (AWS/GCP) to ensure high availability and cost-efficiency through Infrastructure-as-Code. Enterprise Deployments: Creating streamlined pathways for VPC peering, PrivateLink, and BYOC (Bring Your Own Cloud) architectures tailored for our largest clients. Platform & Reliability: Developing CI/CD pipelines and observability stacks (including p99 latency tracking and SLOs) that empower our entire engineering organization to deliver safely and effectively. Role Overview: We are on the lookout for a driven Infrastructure Engineer who is passionate about navigating the intricacies of modern deployment strategies. You will take charge of our infrastructure roadmap from conception through to operational execution, ensuring our platform remains resilient, compliant, and primed for global scalability. Key Responsibilities: Architect Multi-Cloud Environments: Design and expand multi-region architectures across AWS and GCP, addressing global data residency and failover needs. Enhance Engineering Velocity: Collaborate with Product Engineering, Research, and Security teams to develop internal tools and paved pathways that accelerate development and empower engineering teams.

Feb 27, 2026
1X
Full-time|$180K/yr - $250K/yr|On-site|Palo Alto, California, United States

AI Research Engineer specializing in Reinforcement Learning | AI & Robotics. Location: Palo Alto, CA (on-site). About 1X: At 1X, we are pioneering the development of humanoid robots that collaborate with humans to address labor shortages and foster abundance in various industries. The Role: As an AI Research Engineer with a focus on Reinforcement Learning, you will play a vital role in enhancing NEO's capabilities through advanced RL algorithms. This position involves working in both simulated and real-world environments to create robust behaviors and implement them within home settings. Your contributions will be crucial in increasing the safety, efficiency, and versatility of our robotic systems.

Mar 19, 2024
xAI
Full-time|$150K/yr - $450K/yr|On-site|Palo Alto, CA

About xAI: At xAI, our vision is to develop AI systems that deeply comprehend the universe and assist humanity in its quest for understanding. Our team is a close-knit, highly driven group committed to engineering excellence. We welcome individuals who relish challenges and thrive on curiosity. Operating within a flat organizational structure, we expect all employees to be hands-on contributors to our mission. Proactive leadership is recognized, and a strong work ethic combined with exceptional prioritization skills is essential. Effective communication is crucial, as employees must be able to share knowledge clearly and precisely with colleagues. Role Overview: Join our Grok Voice Model team to engineer the leading voice AI technology. We aim to facilitate seamless, natural, low-latency spoken interactions that are expressive, multilingual, and reliable across devices and real-time applications. We manage the entire training pipeline, encompassing extensive data curation, high-quality audio processing, cutting-edge speech-language pre-training, and rigorous post-training to maximize quality, speed, and stability. Our aspiration is to make conversing with AI feel like engaging with the most charming, knowledgeable, and kind individual imaginable. We are in search of exceptionally intelligent, execution-focused engineers to help us achieve this goal.

Mar 16, 2026
Simile
Full-time|$100K/yr - $400K/yr|On-site|Palo Alto

Simile develops AI simulations that capture complex societal dynamics by using generative agents modeled on real human behavior. The company aims to help organizations make more responsible decisions by offering realistic simulations, similar to how pilots or surgeons train before facing real-world situations. Simile is building a Foundation Model to predict human behavior in a variety of contexts and at different scales. The company is supported by $100 million in funding from investors including Index Ventures, Hanabi, A*, and Bain Capital Ventures. Role overview: The Member of Technical Staff (Research) joins the Research team in Palo Alto. This position involves hands-on work throughout the lifecycle of Simile’s human behavior models: training, evaluation, deployment, and monitoring. The team bridges research and product, with a strong focus on experimental rigor and system reliability to support critical decision-making. Researchers are expected to take full ownership of their projects, from designing experiments to deploying models in production. The work has a direct impact on real-world applications. Key responsibilities: Redesigning data schemas: Overhaul foundational data architecture by redefining schemas across multiple databases to strengthen system capabilities. Improving computational performance: Refactor algorithms in internal data pipelines, resolving bottlenecks to enable efficient, large-scale model training. Building and managing infrastructure: Design and operate model training infrastructure, write high-performance code for NVIDIA hardware, and ensure smooth data ingestion and workflow efficiency. Developing scientific evaluations: Create advanced evaluation tools that extend beyond standard benchmarks to verify model reliability and accuracy. Location: This position is based in Palo Alto.

Apr 21, 2026
Simile
Full-time|On-site|Palo Alto

Join Our Team at Simile: At Simile, we are revolutionizing decision-making in society by providing AI simulations that accurately model human behavior. Just as pilots and surgeons rely on simulations for training, we believe that businesses deserve the same rigor when making high-stakes decisions. Our groundbreaking work has led to the creation of the first AI simulation of society, featuring generative agents that reflect real human experiences. Backed by $100 million from top investors, including Index Ventures and renowned AI experts, we are on a mission to predict human behavior with unparalleled accuracy. The Role: As an Applied Research Engineer and Member of Technical Staff (MTS), you will be integral in refining our models of human behavior. With a strong emphasis on scientific rigor, you will participate in the entire research cycle, from designing experiments to implementing them in production systems that influence real-world decisions. Your Responsibilities Will Include: Data Insight Extraction: Analyze extensive proprietary datasets, including unstructured interviews and behavioral data, to uncover meaningful insights. Hardware Proficiency: Develop and optimize algorithms for cutting-edge NVIDIA hardware, conducting experiments that inform our model training. Scientific Leadership: Design thorough evaluations that validate our behavioral simulations against industry standards. Pushing Boundaries: Engage with the latest research in simulation and AI, continuously enhancing our methodologies and documentation.

Mar 18, 2026
xAI
Full-time|$180K/yr - $440K/yr|On-site|Palo Alto, CA; San Francisco, CA

Join xAI as a Technical Staff Member focused on Pre-training Data Infrastructure. In this pivotal role, you will design and implement large-scale data processing systems that handle massive datasets with both CPU and GPU processing. Your responsibilities will include creating tools for orchestrating complex data pipelines, enhancing data discoverability and quality, and managing innovative data pipelines for high-quality training data. We seek a proactive individual with a robust understanding of distributed data systems and an eagerness to contribute to groundbreaking AI technologies.

Dec 29, 2025
xAI
Full-time|$180K/yr - $440K/yr|On-site|Palo Alto, CA

About xAI: At xAI, we are on a mission to develop advanced AI systems that enhance our understanding of the universe and help humanity achieve its knowledge goals. Our dedicated team is small yet highly driven, emphasizing engineering excellence. We value individuals who thrive on curiosity and are eager to tackle challenges head-on. With a flat organizational structure, we empower all employees to take initiative and contribute meaningfully to our mission. Exceptional work ethic, prioritization skills, and strong communication abilities are essential for success in our collaborative environment. About the Role: We are in search of outstanding engineers eager to embark on an innovative project aimed at integrating Grok into every aspect of our Advertising Platform. We seek individuals with extensive experience in developing high-performance advertising products and systems at scale, encompassing bidding, auction, marketplace dynamics, ranking, prediction, and product functionalities. Your expertise will help us leverage xAI’s technology stack to revolutionize our advertising solutions. Your Responsibilities: Utilize state-of-the-art Grok models to enhance all facets of our advertising stack, including candidate selection, ranking, auctions, campaign optimization, creative development, and improving the advertiser experience. Take ownership of systems and products that drive significant revenue for the company. Who You Are: You possess 3+ years of industry experience in creating large-scale, high-throughput, AI-driven advertising solutions. Technical Proficiencies: Proficient in Python, JAX, and Rust. Location: Our engineering team is based in Palo Alto, CA. While we typically work from the office five days a week, we offer flexible work-from-home options when needed. Interview Process: Upon submitting your application, our team will review your resume and documentation of your outstanding work. If your application meets our criteria...

Jan 20, 2026
Space Exploration Technologies Corp.
Full-time|On-site|Palo Alto, CA

Join SpaceX as a Member of Technical Staff, where you will play a crucial role in supporting government initiatives and projects that require top-secret clearance. Your expertise will contribute to the development of cutting-edge technologies that are transforming space exploration and enabling humanity's future in space.

Apr 8, 2026
xAI
Full-time|On-site|Palo Alto, CA

Join our innovative team at xAI as a Member of Technical Staff specializing in Web Foundations. In this role, you will collaborate with cross-functional teams to develop and enhance our web infrastructure, ensuring high performance and scalability. You will have the opportunity to leverage cutting-edge technologies to build and maintain robust web applications that serve our global user base.

Mar 20, 2026
vinci4d
Full-time|On-site|Palo Alto HQ

About Us: At vinci4d, we are pioneering a revolutionary co-pilot for hardware designers, aiming to empower 9 million mechanical engineers to accelerate their design processes by a factor of 1000. Our innovative approach involves developing a geometry and physics-driven foundation model tailored for diverse part designs. We are proud to have secured initial funding from Khosla Ventures, propelling us forward in our mission. About You: You are a passionate Computational Fluid Dynamicist skilled in developing flow solvers for intricate geometrical applications. Your expertise encompasses external flows over dynamic vehicles (aircraft, missiles, automobiles), as well as fluid movement through machinery such as ducts, turbo-machinery, refrigeration systems, and computer heat sinks. Terms like conjugate heat transfer, multi-physics applications, and complex geometries excite you. Your Responsibilities: Design, develop, and deploy simulation codes for single and multi-physics configurations, emphasizing Conjugate Heat Transfer. Optimize code performance on contemporary architectures, including GPUs. Implement advanced linear and nonlinear solvers to reduce time to solution. Collaborate in strategizing, planning product roadmaps, and prioritizing development alongside early customers and design partners. Develop and launch critical product features. Engage in continuous learning while creating products that resonate with engineers, gaining insights into entrepreneurship.

Nov 17, 2025
SpaceX
Full-time|On-site|Palo Alto, CA

SpaceX seeks a Technical Staff Member to join its government sector projects in Palo Alto, CA. This position focuses on designing, developing, and implementing technical solutions for government clients, with a strong emphasis on aerospace technology. The work directly supports SpaceX’s mission to advance multi-planetary life. Key responsibilities: Develop and deliver technical solutions that meet government requirements. Collaborate with a team to solve complex engineering problems. Contribute expertise to projects that extend the frontiers of aerospace technology. Location: This role is based in Palo Alto, CA.

Apr 20, 2026
Mithril
Full-time|$170K/yr - $230K/yr|On-site|Palo Alto / San Francisco Bay Area

Mithril is building AI infrastructure to make GPU computing accessible for enterprises, AI startups, and research organizations. The company’s customers include LG AI Research, Saronic, and the Broad Institute. Mithril was founded by a former Google DeepMind research scientist and a Stanford CS PhD, and has raised $80 million in seed and Series A funding from Sequoia Capital, Lightspeed Venture Partners, and others. Platform revenue has grown more than sixfold in the past year. Fast Company recognized Mithril as the 8th Most Innovative Company in Artificial Intelligence for 2026. The team is transitioning from bare-metal operations to a cloud-native, multi-provider platform, introducing an auction and flexibility model. This is an opportunity to help shape the platform from its early stages. Role overview: The Software Engineer - Technical Staff Member will work across three main areas. Consumption: developer-facing product, billing, and API. Platform: orchestration and marketplace solutions. Supply: cloud provider integrations and capacity management. Engineers at Mithril take on significant ownership, building features end-to-end that support critical customer workloads and drive revenue. The scope includes backend systems, marketplace logic, and customer interfaces. Architectural decisions here have a direct impact on Mithril’s growth and scalability. What makes this role unique: This position blends deep systems work with product-facing challenges. Engineers contribute to the orchestration engine that manages GPU capacity across providers, as well as the interfaces customers use to reserve, bid, and utilize resources. The systems built in this role handle financial transactions, real workloads, and market mechanisms such as spot auctions, reservation pricing, and capacity allocation. For those interested in the mechanics of GPU infrastructure markets and building the technology behind them, this role offers direct involvement. Location: This role is based in Palo Alto or the San Francisco Bay Area.

Apr 22, 2026
xAI
Full-time|$180K/yr - $440K/yr|On-site|Palo Alto, CA

About xAI: At xAI, our mission is to develop AI systems that genuinely comprehend the universe and support humanity's quest for knowledge. We pride ourselves on having a compact, highly driven team dedicated to engineering excellence. Our environment is perfect for individuals who relish challenges and flourish through curiosity. We embrace a flat organizational structure where every employee is encouraged to be proactive and to directly contribute to our mission. Leadership opportunities are awarded to those who demonstrate initiative and consistently achieve outstanding results. Strong work ethic and effective prioritization skills are key attributes we value. Additionally, all team members must possess excellent communication skills to share insights and knowledge clearly and concisely with their colleagues. About the Role: We are looking for talented Applied Engineers to join a pivotal project that serves around 600 million users monthly. This is a unique opportunity for professionals with an engineering or scientific background to leverage their expertise in recommendation systems, ranking algorithms, search technologies, and more. You will be at the crossroads of cutting-edge AI development and tangible real-world impact, enhancing our ability to connect users with relevant content, accounts, and experiences. What You'll Do: Design and architect innovative recommendation algorithms for diverse product surfaces. Utilize xAI’s extensive infrastructure and AI tools to significantly enhance user experiences. Develop data pipelines and training jobs that continuously adapt from product data. Iterate and refine algorithms using real-time user feedback through experimentation. Ensure the scalability and efficiency of machine learning systems. Who You Are: Familiarity with data infrastructure technologies such as Kafka, ClickHouse, and Spark. Proven experience in implementing recommender systems and/or deep learning applications at scale. Proficient in one or more deep learning software frameworks, such as JAX or PyTorch. Exceptional candidates may have experience in writing CUDA kernels.

Jan 20, 2026
vinci4d
Full-time|On-site|Palo Alto HQ

About Us: At vinci4d, we are revolutionizing the hardware design landscape with our cutting-edge AI assistant, aimed at empowering engineers to accelerate their design iterations by a staggering 1000 times. Our innovative foundation model, driven by geometry and physics, is tailored for each category of part design. We are on the lookout for passionate individuals who thrive on product development to enhance our Minimum Viable Product (MVP). Your Responsibilities: As a pivotal member of our team, you will design and develop the essential pipelines and tools that transform our vision into reality. Your contributions will significantly expedite our development processes by facilitating smooth transitions from code development to deployment. Key responsibilities include: Enhancing product features for scalability and developing advanced APIs to support intricate engineering workflows. Establishing and integrating LLM or VLM infrastructure. Implementing an MLOps framework for training deep learning models using geometry and physics data. Collaborating with early customers and design partners to strategize and prioritize the development roadmap. Building and deploying critical product features. Gaining invaluable experience while creating products that resonate with engineers, and learning about the entrepreneurial journey. Qualifications: A minimum of 6 years in developing and delivering features within the high-performance computing domain. Expertise in C++, Python, or any relevant language necessary for system setup, along with familiarity with tools such as gRPC, Protocol Buffers, Docker, Kubernetes, and Bazel. Experience with cloud computing for data generation, scraping, and assisting data science teams in model training with MLOps is a plus. CUDA experience is desirable but not mandatory. Frontend development experience is a bonus. Prior experience in a startup environment will be highly valued. You’ll Thrive in This Role If You: Are enthusiastic about entrepreneurship and the process of transforming ideas from concept to reality.

Jul 18, 2025
xAI
Full-time|$180K/yr - $440K/yr|On-site|Palo Alto, CA

About xAI: xAI is focused on building advanced AI systems capable of understanding complex problems and supporting humanity’s search for knowledge. The team values curiosity, hands-on problem solving, and strong communication. Leadership comes from initiative and results, not hierarchy. Team members share insights openly and work closely together. Role Overview: Technical Staff Member - Multimodal Intelligence. This position sits within the multimodal team at xAI in Palo Alto, CA. The goal: push the boundaries of multimodal intelligence by building systems that understand and generate image, video, audio, and text data. What You Will Do: Work on every stage of the multimodal pipeline, including data acquisition, tokenizer training, large-scale pre-training, infrastructure scaling, and tooling. Develop and deliver end-to-end product experiences that showcase advanced multimodal capabilities. Collaborate with teams across xAI to advance multimodal reasoning, world modeling, tool use, and interactive human-AI collaboration. Help build models that perceive, understand, and interact with the world in real time. Team Culture: The structure is flat, and leadership is earned by initiative and performance. Open communication and collaboration are essential. Curiosity and a drive to tackle tough challenges are highly valued.

Apr 17, 2026
