Reinforcement Learning Software Engineer

Preference ModelSan Francisco

On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.

Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

QualificationsTo succeed in this role, candidates should meet the following qualifications:Strong analytical and technical skills. Ability to work collaboratively in a fast-paced environment. Excellent communication skills.

About the job

About Us

At Preference Model, we are at the forefront of developing advanced training data essential for the evolution of artificial intelligence. While today's AI models exhibit significant power, they often fall short in diverse applications due to limitations in their training data. We specialize in creating reinforcement learning environments that present AI with authentic research and engineering challenges, enabling them to iterate and learn through realistic feedback loops.

Our founding team boasts experience from Anthropic’s data department, where we established the data infrastructure, tokenizers, and datasets that supported Claude. We collaborate with top-tier AI research labs to bring AI closer to its groundbreaking potential and are proudly backed by a16z.

About the Role

As a Software Engineer on our team, your responsibilities will include:

Designing and Developing Reinforcement Learning Environments: Architect comprehensive simulation platforms that encompass environmental context, task definitions, and reward functions to facilitate AI agents' learning and performance of intricate tasks.
Building Robust Training Infrastructure: Create scalable systems for post-training AI models, focusing on orchestration, performance optimization, and monitoring capabilities.
Implementing Realistic Model Evaluations: Develop metrics for evaluating AI agent performance and establish the infrastructure and tools necessary for conducting these evaluations.
Influencing Technical Strategy: Take charge of architectural decisions, impact product roadmaps, and contribute significantly to our engineering culture as an early-stage team member.

About You

You might be a great fit for this role if you possess the following qualities:

Adept at leveraging language models effectively.
Ability to innovate and think outside the box.
A minimum of 4 years of software engineering experience, showcasing your ability to take ownership of projects.
Proficiency in Python, Rust, or TypeScript, with the capability to work across the entire software stack.
Hands-on experience with modern deployment practices, containerization, and cloud infrastructure (such as Kubernetes, AWS, or GCP).
Strong problem-solving skills demonstrated through algorithmic challenges or complex system design tasks.

Nice-to-Haves

Preferred candidates will have experience in:

Machine learning infrastructure or reinforcement learning.

About Preference Model

Preference Model is a pioneering company dedicated to creating the next generation of training data for AI. Our mission is to harness the potential of artificial intelligence through innovative reinforcement learning environments that address real-world challenges.

Similar jobs

1 - 20 of 3,203 Jobs

Search for Learning Development Specialist Contract

3,203 results

Select all on this page (20)

Apply

Learning & Development Specialist (Contract)

Unity Technologies

Contract|$60K/yr - $110K/yr|On-site|San Francisco, CA, USA

Join Our Team!This is a remarkable opportunity for an experienced Learning & Development Specialist to contribute to Unity's transformative journey. As a key member of the Learning Programs team, you will leverage your expertise to create and facilitate dynamic learning experiences that resonate. Your role will be pivotal in converting both technical and behavioral concepts into impactful, scalable training solutions designed to enhance business performance.Your Responsibilities:Collaborate with colleagues and stakeholders to design, develop, and implement comprehensive learning programs covering leadership development, team effectiveness, and interpersonal skills.Utilize your strategic insight to craft scalable learning pathways tailored for various business units, including technical teams.Assist in the design and enhancement of Unity's Learning Experience Platform (LXP), including the management of our content catalog.Facilitate engaging and interactive workshops and training sessions (in-person and virtual) for diverse audiences, from first-time managers to senior executives.

Mar 6, 2026

Apply

Customer Support Learning & Enablement Specialist

Mercury

Full-time|Remote|San Francisco, CA, New York, NY, Portland, OR, or Remote within Canada or United States

Role Overview Mercury is looking for a Customer Support Learning & Enablement Specialist to strengthen the skills and knowledge of our customer support team. This role shapes how our team helps clients by building and delivering training programs that keep support practices sharp and effective. What You Will Do Design and roll out training initiatives for the customer support team Equip team members with the tools and knowledge needed to assist clients confidently Promote a culture where learning and continuous improvement are part of daily work Location This position is open to candidates in San Francisco, CA, New York, NY, Portland, OR, or remote within Canada or the United States.

Apr 14, 2026

Apply

Community Contracts and Funding Specialist

GGRC

Full-time|$63K/yr - $75.6K/yr|Hybrid|San Francisco, California, United States

Community Contracts and Funding SpecialistSalary Range: $63,011 - $75,613Join GGRC as a Community Contracts and Funding Specialist, where you will play a pivotal role in our Community Services Department. This position involves collaborating with service providers and community partners to create, manage, and monitor contracts, while also overseeing the distribution of special funds.This is a hybrid position that allows for flexibility, as you will work from our San Francisco office with occasional travel required between San Francisco, San Mateo, and Marin counties.Key Responsibilities: Develop and implement Purchase of Service contracts for various community services, including Housing and Supported Living Services. Oversee invoice payments and funding transfers, ensuring all required documentation is submitted prior to authorizing payments. Serve as a liaison between GGRC and legal counsel to update contract language and content. Establish and maintain a contract tracking system to ensure timely updates and compliance. Manage contracts related to Requests for Proposal and other community service funding. Collaborate with team members to refine contracts, including language, scope of work, milestones, and budgets for Home and Community Based Services Compliance and Language Access funding. Monitor funding expenditures, review invoices, and approve payments in coordination with the fiscal department. Act as a liaison with the Department of Developmental Services to secure funding approvals and modifications. Report on contract progress to relevant GGRC staff members. Review and implement updates based on DDS guidance and directives. Utilize GrantVantage and similar software for tracking and managing grants. Conduct training sessions for staff regarding DDS directives and funding usage. Ensure confidentiality of all client information in compliance with HIPAA and the California Lanterman Act.

Mar 2, 2026

Apply

Learning & Development Partner

Mixpanel

Full-time|Hybrid|San Francisco, US (Hybrid)

As a Learning & Development Partner at Mixpanel, you will play a pivotal role in enhancing the skills and capabilities of our team members. You will collaborate with various departments to identify training needs and implement effective learning programs that align with our organizational goals. Your expertise in adult learning principles and instructional design will be crucial in fostering a culture of continuous improvement and professional growth.

Mar 2, 2026

Apply

Learning and Development Lead

Cardless, Inc.

Full-time|On-site|San Francisco

Join Cardless, Inc. as a Learning and Development Lead, where you will spearhead initiatives to enhance employee skills and growth within our dynamic team. You will be responsible for designing and implementing comprehensive training programs that align with our organizational goals. Collaborate with leaders to identify learning needs and cultivate a culture of continuous improvement.

Mar 19, 2026

Apply

Machine Learning Specialist in Behavioral Modeling

Palladio AI

Full-time|On-site|San Francisco Bay Area

Join the Revolution in Behavioral IntelligenceAmplify Your InfluenceYou have achieved remarkable success in your career, creating robust behavioral or neuroscience models that have driven significant outcomes. You possess a talent for discerning patterns in user behavior, comprehending motivations, and optimizing end-to-end user experiences.Now, envision extending your impact across multiple products and organizations, enhancing the entire app ecosystem. Every application at your fingertips becomes smarter, more engaging, and indispensable to its users.Your expertise can empower product teams to innovate more rapidly, delight users, and boost revenue, all thanks to the behavioral intelligence you develop once and deploy universally.We share this vision: our team has accomplished this repeatedly at industry leaders like Uber, Apple, Google, and Chime, generating tens of billions of dollars in value for products vital to billions globally. We are poised to elevate our impact even further.Does this resonate with the next chapter you're seeking? If so, continue reading.Palladio: Pioneering BreakthroughsPalladio AI is an innovative AI platform aimed at transforming product-led growth and enhancing the value our clients provide in users’ daily lives.Our initial focus is on mobile gaming, where development is swift, user engagement is high, and experimentation yields immediate results—making it the perfect testing ground for our platform.Your ContributionsOur team is constructing foundational systems in behavioral modeling, causal inference, forecasting, and agentic platforms. You will play a pivotal role in extending these areas: creating machine learning and AI-driven behavioral models to identify and highlight product opportunities while deploying self-improving learning loops with each iteration. Your work will analyze user sentiments, thoughts, decisions, and actions—translating behavioral insights into opportunities that enhance product intuitiveness, engagement, and rewards. In essence, you will convert first-principles data science, neuroscience, cognitive science, and machine learning into scalable solutions across various industries.Your ProfileUser-Focused. You empathize with users' challenges, needs, and goals throughout their journeys, measure success through user outcomes, and convert insights into innovative and engaging product experiences.Scientific Innovator. You...

Feb 14, 2026

Apply

Quote-to-Order Operations Specialist (Contract, Remote)

jobmobz1

Contract|Remote|San Francisco

We are seeking a dedicated and detail-oriented Quote-to-Order Operations Specialist to join our dynamic team on a contract basis. As a specialist in this role, you will be responsible for managing the transition of quotes into orders, ensuring accuracy and efficiency in the order processing workflow.Your expertise will be crucial to support our sales and operations teams, helping to streamline processes and enhance customer satisfaction. This is a fully remote position that offers flexibility and the opportunity to work with a talented group of professionals.

Mar 19, 2026

Apply

Senior Learning & Development Manager - Enterprise Capabilities

Ripple

Full-time|$184K/yr - $205K/yr|On-site|San Francisco, CA, United States

At Ripple, we are revolutionizing the way value is exchanged across the globe, creating a world where financial transactions flow as seamlessly as information. Our innovative crypto solutions empower financial institutions, businesses, governments, and developers to enhance the global financial landscape, fostering greater economic equity and opportunity for individuals everywhere. Join us and experience unparalleled professional growth in an environment where your contributions are valued and supported.If you are eager to make a significant impact and unlock exceptional career advancement, we invite you to join our team and help build tangible value in the world.THE ROLE:As a Senior Learning & Development Manager focusing on enterprise capabilities, you will spearhead a transformative Learning & Development function that acts as a strategic partner to our business. This role is pivotal in shaping the learning experience at Ripple, driving initiatives that enhance AI fluency, embedding cultural frameworks, and identifying capability gaps to elevate our organizational performance.

Mar 17, 2026

Apply

Senior Python Developer - AI/ML

Sustainable Talent

Contract|On-site|San Francisco, Ca

Location: San Francisco, Bay Area, CAEmployment Type: ContractAbout UsSustainable Talent specializes in providing adaptable workforce solutions, linking elite talent with prominent global enterprises. We collaborate with some of the world's most innovative companies to assist them in expanding their teams with the right expertise. This role is with one of our esteemed high-tech clients, a frontrunner in AI technology and digital transformation, dedicated to leveraging advanced data science and machine learning to address complex issues.About the RoleWe are looking for a Senior Python Developer who will be responsible for developing and optimizing applications that facilitate machine learning and data science projects. You will focus on constructing scalable data pipelines, deploying AI models, and ensuring optimal performance and efficiency in production settings. Close collaboration with data scientists and cross-functional teams will be essential for transitioning research into practical solutions. If you are enthusiastic about AI, cloud technologies, and writing clean, efficient code, this is an exceptional opportunity to engage in groundbreaking innovation.What You’ll DoDevelop and optimize Python-based applications that support data science and machine learning initiatives.Implement and deploy machine learning models utilizing TensorFlow, PyTorch, and Scikit-learn.Design and construct scalable data pipelines to preprocess and transform large datasets for AI/ML applications.Collaborate closely with data scientists to convert research into production-ready ML solutions.Develop predictive analytics and deep learning solutions to enhance business intelligence and decision-making.Ensure model performance optimization and scalability in production environments.Integrate ML models into cloud-based platforms (AWS, GCP, Azure) and microservices architectures.Write clean, efficient, and well-documented Python code adhering to software engineering best practices.Collaborate with cross-functional teams to drive innovation and improve AI-driven products.What We’re Looking For7+ years of software engineering experience with a strong focus on Python development.Proven expertise in machine learning frameworks such as TensorFlow, PyTorch, and Scikit-learn.Experience in developing scalable data pipelines and cloud-based architectures.Strong problem-solving skills and ability to work in a collaborative environment.Excellent communication skills and a passion for AI technologies.

Feb 23, 2026

Apply

Reinforcement Learning Environment Engineer

AfterQuery

Full-time|On-site|San Francisco

About AfterQuery AfterQuery develops training data and evaluation frameworks that leading AI labs use to improve their models. The team partners with major research institutions to build datasets and run assessments that go beyond standard benchmarks. As a post-Series A company based in San Francisco, AfterQuery values contributions from every team member. Work here directly shapes the next generation of AI models. Role Overview The Reinforcement Learning Environment Engineer designs datasets and evaluation systems that influence how advanced AI models learn and improve. This role involves close collaboration with research teams, hands-on experimentation with new data collection methods, and the creation of metrics to track model progress. Work moves from theoretical analysis to practical experiments, feeding directly into large-scale model training efforts. What You Will Do Develop data segments that expose key failure modes in sectors such as finance, software engineering, and enterprise operations. Refine reward signals for Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning from Value Reinforcement (RLVR) systems. Define quantitative metrics for dataset quality, diversity, and their effects on model alignment and capability. Work closely with research teams to translate training objectives into concrete data requirements and evaluation criteria. This position is based in San Francisco.

Apr 14, 2026

Apply

L&D Specialist, Manager Development

OpenAI

Full-time|On-site|San Francisco

About Our TeamThe People Programs team at OpenAI is integral to fostering an environment where every employee can excel, evolve, and contribute to our vision of making AGI beneficial for all of humanity. This dynamic team is dedicated to crafting and implementing human-centric strategies and programs that align with our rapid growth, distinctive culture, and innovation-driven ethos.Our Coaching, Learning & Development team oversees a variety of initiatives, including onboarding for new hires, manager development, and the learning systems utilized throughout OpenAI.About the RoleWe are seeking a dedicated Manager Development Specialist to enhance our manager development programming and coaching through exceptional program management and logistical coordination at OpenAI.This position will oversee the comprehensive operations and systems that ensure the efficient, scalable delivery of manager development programs. The role encompasses both virtual and in-person initiatives, management of learning systems and data, and the scaling of impact through coaching operations, facilitation support, and resources tailored for managers. Collaboration with Learning & Development, HR Business Partners, Workplace, and external partners will be crucial.Your Responsibilities:Program Operations and Facilitation: Spearhead the daily logistics for manager development programs, including scheduling, communications, stakeholder coordination, and seamless execution across both virtual and in-person formats; regularly lead live sessions.Systems & Data: Oversee program setup, enrollment, and tracking within our Learning Management System (Sana) to ensure accurate measurement, reporting, and scalable delivery of learning initiatives.Instructional Design: Collaborate in the design and enhancement of manager development programs using adult learning principles to adapt to the evolving needs of the organization.Coaching Operations: Supervise the intake, onboarding, scheduling, and tracking of coaching participants, cohorts, and timelines in partnership with HR Business Partners, leadership, and external coaching providers.Learning Content & Enablement: Develop, maintain, and broaden access to resources for managers—including asynchronous training, job aids, templates, toolkits, and intranet content—to reinforce learning beyond live sessions.Team Support: Provide ad hoc program, facilitation, and operational assistance to the team as needed.

Feb 3, 2026

Apply

Developer Relations Specialist

Chalk

Full-time|$50K/yr - $70K/yr|On-site|SF

About ChalkChalk is revolutionizing the data platform landscape to empower the next generation of machine learning applications. We dismantle the complexities, latency issues, and scalability challenges that have historically limited ML capabilities. Our state-of-the-art platform seamlessly integrates Rust-speed performance with user-friendly tools that developers appreciate. Top-tier companies rely on Chalk for various applications, including preventing fraudulent credit card transactions, identity verification, and optimizing renewable energy capture. We are proud to have recently secured a $50 million Series A funding round, led by Felicis.About the RoleWe are seeking a passionate Developer Relations Specialist to become an integral part of our expanding Go-To-Market (GTM) team. This role serves as the technical liaison between Chalk and the AI/ML and data community. We need someone with a profound understanding of modern data infrastructure, experience in sales-driven environments, and the ability to create engaging and informative content.You will collaborate closely with the sales, product, and marketing teams to articulate how Chalk can enhance the technical stacks of our users—through product launches, community outreach, enablement efforts, and proactive engagement. Your contributions will range from crafting in-depth technical articles to developing proof-of-concept projects, producing instructional videos, and conducting live demonstrations. You will play a vital role in shaping the narrative around Chalk.Our team works in the office five days a week, but we are flexible with unavoidable conflicts. Please note, this position is not hybrid.What You Will DoAct as the technical ambassador for Chalk among data engineers, ML teams, and infrastructure leaders.Produce and disseminate impactful content including technical blogs, field guides, explanatory materials, demonstrations, tweet threads, and short videos.Collaborate with product and sales departments to create resources that cater to enterprise clients—such as diagrams, presentations, proof-of-concepts, and ROI calculators.Represent Chalk at conferences, meetups, and customer interactions.Engage with prospects and customers to define best practices and relay insights back to the product team.Cultivate and expand a community focused on real-time data infrastructure and production ML.What Excites YouA robust technical background in data infrastructure, ML tools, or developer platforms.

Oct 3, 2023

Apply

Developer Relations Specialist

Chalk Inc.

Full-time|$150K/yr - $250K/yr|On-site|NY or SF

About Chalk Chalk is revolutionizing the data landscape by constructing a powerful data platform designed for the next generation of machine learning applications. We simplify complexities, eliminate latency, and scale barriers that have historically limited ML capabilities. Our platform integrates Rust-speed performance with intuitive tools that developers appreciate. Leading organizations rely on Chalk for a diverse range of applications, including preventing fraudulent credit card transactions, validating identities, and optimizing clean energy utilization. Recently, we secured a $50 million Series A funding round led by Felicis. About the Role We are seeking a Developer Relations Specialist to join our expanding Go-To-Market (GTM) team. In this role, you will act as the crucial technical liaison between Chalk and the AI/ML and data communities. This is a hands-on position for someone with a profound understanding of contemporary data infrastructure, experience in sales-driven environments, and the ability to produce engaging educational content. Your collaboration with sales, product, and marketing teams will be essential in helping technical audiences comprehend how Chalk integrates into their technology stack—spanning product launches, community building, and outreach initiatives. From crafting in-depth technical articles to developing proof-of-concept projects, creating engaging walkthrough videos, or conducting live demonstrations, you will play a vital role in narrating the Chalk story. Please note that we operate in-office five days a week, with flexibility for unavoidable conflicts. This is not a hybrid role. What You Will Do Act as the technical representative of Chalk to data engineers, machine learning teams, and infrastructure leaders. Generate and disseminate impactful content: technical blog posts, field guides, explainers, demos, tweet threads, and short-form videos. Partner with product and sales teams to create resources that assist enterprise clients—diagrams, presentations, proofs of concept, ROI calculators, etc. Represent Chalk at external events through talks, meetups, and customer engagement channels. Engage with prospects and customers to establish benchmarks for success and relay insights back to the product team. Build and nurture a community focused on real-time data infrastructure and production machine learning. What Excites You A strong technical foundation in data infrastructure and ML tools.

Mar 10, 2026

Apply

Lead Learning Designer - Spanish Curriculum Development

Speak

Full-time|On-site|San Francisco

About UsOur mission is to transform language learning for everyone.Learning a new language can profoundly impact one’s life, bridging gaps to diverse cultures, careers, and communities. With two billion individuals globally striving to learn a language, the most effective method (one-on-one tutoring) remains largely inaccessible and stagnant for decades. Speak is pioneering a human-level, AI-supported tutor available in your pocket: a conversation-driven experience that empowers learners to engage, receive immediate feedback, and progress through meticulously crafted lessons. The result is a comprehensive journey from novice to confident speaker across a multitude of languages.Launched in South Korea in 2019, Speak has rapidly become the leading language learning app, now catering to learners in numerous markets across 15+ languages. As one of the foremost AI companies worldwide, we have successfully secured over $150 million in venture funding from notable investors like OpenAI, Accel, Founders Fund, and Khosla Ventures, with a globally distributed team across San Francisco, Seoul, Tokyo, Taipei, and Ljubljana.About This RoleWe are seeking a dynamic Lead Learning Designer to enhance and expand one of Speak's largest markets: Spanish. In this pivotal role, you will empower our team to innovate faster, explore new possibilities, and deliver exceptional Spanish learning experiences on a grand scale.You will work closely with members of the Learning Design team, as well as Product, Engineering, Analytics, Business Operations, and Marketing, to shape the future of our Spanish curriculum. You will be entrusted with leading high-priority Spanish curriculum initiatives, requiring minimal oversight.If you are a Spanish pedagogy specialist with a proven track record in the ed-tech sector, capable of executing projects independently and thriving in a collaborative, creative environment, we would love to connect with you.Your ResponsibilitiesOversee the Spanish curriculum and high-priority content projects from inception to completion—ensuring timely delivery of premium lessons and experiences through strategic planning, proactive communication, and effective execution.Lead cross-functional initiatives with EPD and Marketing as the designated Spanish content DRI; apply a holistic content perspective, operational discipline, and creativity to test innovative learning experiences.Exercise sound judgment on complex cross-language projects—acknowledging distinct implications between languages, identifying potential risks early, and proactively asking critical questions to maintain project momentum.Develop curriculum and prototype content that resonates with learners and meets educational standards...

Dec 17, 2025

Apply

Senior Machine Learning Engineer - Applied Research & Model Development

Lightfield

Full-time|On-site|HQ: San Francisco

About LightfieldAt Lightfield, we are pioneering the future of CRM with our AI-native platform that seamlessly integrates with your email, calendar, and meetings. Our innovative solution captures every interaction, transforming it into structured context, including accounts, tasks, follow-ups, and insights, ensuring that nothing is overlooked.We are fundamentally reimagining CRM by employing a flexible approach that adapts to how teams operate, rather than imposing rigid systems. Lightfield continuously learns, automates processes, and surfaces valuable insights that fuel growth. We are dedicated to creating a CRM platform that is not only fast and intelligent but also genuinely helpful.Our team is backed by prestigious investors like Greylock, Lightspeed, and Coatue, and has a rich history in building successful products, including Tome, a generative AI presentation tool utilized by over 25 million users. Our collective experience spans notable companies such as Llama, Instagram, Facebook Messenger, Pinterest, Google, and Salesforce.About the RoleJoin our dynamic AI/ML team at Lightfield, where we are developing the core experiences of our product through cutting-edge applications that amaze our customers. We are currently focused on creating a robust, domain-specific AI that surpasses conventional LLMs.We thrive on the challenge of crafting innovative AI solutions for professionals engaged in significant work, and we're eager to expand our AI/ML team to rise to this challenge.Your ResponsibilitiesDesign and deliver extraordinary AI experiences that empower sales teams.Collaborate closely with founders and executives to shape Lightfield's AI/ML strategy.Lead the training of new models utilizing both historical and synthetic training data.Develop and prototype innovative LLM-driven experiences, transforming them into robust product features.Contribute to building a top-tier AI/ML engineering team through recruitment and mentorship.Your Profile5+ years of industry experience in Natural Language Processing (NLP) with a strong portfolio of model training.Solid understanding of deep learning AI/ML frameworks and cloud services.Hands-on experience in ML Operations (ML Ops).Deep expertise in NLP and model training, particularly with Large Language Models (LLMs).Demonstrated ability to adapt open-source generative models for specific applications, with a comprehensive understanding of their architecture.

Oct 10, 2024

Apply

Contract to Hire .NET Developer Position at 360IT Professionals | San Francisco

360IT Professionals

Contract|On-site|San Francisco

Join our dynamic team at 360IT Professionals as a .NET Developer on a contract-to-hire basis. This role offers a unique opportunity for professional growth within a supportive and innovative environment. You will be responsible for developing and maintaining high-quality applications using the .NET framework.

Mar 15, 2017

Apply

Machine Learning Engineer - Imitation & Reinforcement Learning for Robotics

Bedrock Robotics

Full-time|On-site|San Francisco, CA

Be Part of the Future of Autonomous RoboticsAt Bedrock Robotics, we are pioneering the transition of AI from theoretical frameworks to practical applications in the built environment. Our team is comprised of seasoned professionals who have been instrumental in the success of innovative companies such as Waymo, Segment, and Uber Freight. We are at the forefront of deploying autonomous technologies in heavy construction machinery, significantly enhancing the efficiency and safety of multi-billion dollar infrastructure projects across the nation.With backing from $350 million in funding, our mission is to address the urgent need for housing, data centers, and manufacturing facilities, while simultaneously responding to the construction industry's labor shortages.This position is where cutting-edge algorithms meet the practical world of construction. You will work alongside industry experts and top-tier engineers to tackle complex real-world challenges that cannot be simulated. If you are eager to leverage advanced technology for impactful problem-solving within a skilled team, we encourage you to apply.

Jan 31, 2026

Apply

Reinforcement Learning Software Engineer

Preference Model

Full-time|On-site|San Francisco

About UsAt Preference Model, we are at the forefront of developing advanced training data essential for the evolution of artificial intelligence. While today's AI models exhibit significant power, they often fall short in diverse applications due to limitations in their training data. We specialize in creating reinforcement learning environments that present AI with authentic research and engineering challenges, enabling them to iterate and learn through realistic feedback loops.Our founding team boasts experience from Anthropic’s data department, where we established the data infrastructure, tokenizers, and datasets that supported Claude. We collaborate with top-tier AI research labs to bring AI closer to its groundbreaking potential and are proudly backed by a16z.About the RoleAs a Software Engineer on our team, your responsibilities will include:Designing and Developing Reinforcement Learning Environments: Architect comprehensive simulation platforms that encompass environmental context, task definitions, and reward functions to facilitate AI agents' learning and performance of intricate tasks.Building Robust Training Infrastructure: Create scalable systems for post-training AI models, focusing on orchestration, performance optimization, and monitoring capabilities.Implementing Realistic Model Evaluations: Develop metrics for evaluating AI agent performance and establish the infrastructure and tools necessary for conducting these evaluations.Influencing Technical Strategy: Take charge of architectural decisions, impact product roadmaps, and contribute significantly to our engineering culture as an early-stage team member.About YouYou might be a great fit for this role if you possess the following qualities:Adept at leveraging language models effectively.Ability to innovate and think outside the box.A minimum of 4 years of software engineering experience, showcasing your ability to take ownership of projects.Proficiency in Python, Rust, or TypeScript, with the capability to work across the entire software stack.Hands-on experience with modern deployment practices, containerization, and cloud infrastructure (such as Kubernetes, AWS, or GCP).Strong problem-solving skills demonstrated through algorithmic challenges or complex system design tasks.Nice-to-HavesPreferred candidates will have experience in:Machine learning infrastructure or reinforcement learning.

Mar 18, 2026

Apply

Machine Learning Engineer - API Multicloud

OpenAI

Full-time|On-site|San Francisco

Role overview OpenAI seeks a Machine Learning Engineer to focus on API development in multicloud settings. This role is based in San Francisco and centers on advancing the capabilities of API products across different cloud providers. What you will do Use advanced machine learning techniques to enhance API offerings Collaborate with colleagues from various specialties to launch new features and refine existing ones Support ongoing improvements and innovation for API services that operate across multiple cloud environments

Apr 22, 2026

Apply

Developer Events Specialist

Buildkite

Full-time|Remote|San Francisco Bay Area, Remote

About Buildkite Buildkite's Continuous Integration (CI) platform is the solution of choice for some of the world's top engineering teams, powering software delivery for over one billion daily users. Job Overview We are on the lookout for a passionate Developer Events Specialist to design, coordinate, and execute engaging events and community initiatives that elevate Buildkite’s presence in crucial tech spaces. You will be responsible for managing logistics and creating memorable experiences for user conferences, community meetups, workshops, and sponsored events, ensuring that attendees leave feeling energized and connected. This is a hands-on role focused on logistics, requiring you to oversee the entire event lifecycle from venue selection and vendor management to day-of execution. Collaboration with our Go-to-Market (GTM) and Brand teams will be essential to ensure seamless execution that aligns with our brand identity. While coding skills are not a requirement, a solid understanding of software team dynamics and best practices in the SaaS and DevTools sectors is necessary. What You’ll Do Manage Comprehensive Event Logistics Plan and facilitate developer-centric events such as user conferences, community meetups, workshops, dinners, and roundtable discussions. Oversee vendor relationships, venue selections, budget allocations, timelines, and every operational aspect from initial planning to post-event follow-ups. Create and maintain standard processes and checklists to ensure that each event surpasses the last in execution and impact. Establish a Strong Community Presence Contribute to the development and expansion of Buildkite's presence within developer communities, initially targeting the Bay Area. Maintain an organized calendar of recurring sponsored and co-marketing programs, coordinating all details for each initiative. Act as a dependable point of contact for community partners, co-sponsors, and speakers.

Mar 13, 2026

Create account — see all 3,203 results