About the job
About Us
At Vercept, we are an energetic, mission-focused team with a proven record of academic excellence. Our researchers have made significant contributions to artificial intelligence, earning best paper awards at leading AI conferences and top citation rankings. We are committed to pioneering transformative research that sets new standards for the industry, and we aim to change the world one breakthrough at a time.
What We Seek & Why You Should Join Us
We are looking for a Backend Engineer specializing in Inference Optimization who is passionate about tackling some of the hardest systems problems in AI. In this role, you will focus on improving the performance of foundation model inference, working at the intersection of machine learning and high-performance systems engineering. This is an exciting opportunity to set new standards for latency, throughput, and efficiency at scale.
Role Overview
As a Backend Engineer, you will own the design and optimization of inference pipelines for large-scale models. Working closely with researchers and infrastructure engineers, you will identify bottlenecks, implement techniques such as quantization and KV caching, and ship high-performance serving systems to production. Your work will directly shape how quickly and cost-effectively users interact with next-generation AI.
What We Expect From You
Essential Qualifications:
Extensive experience in optimizing model inference pipelines, including model quantization and KV caching.
Strong proficiency in backend systems and high-performance programming languages (Python, C++, or Rust).
Familiarity with distributed serving, GPU acceleration, and large-scale system architectures.
Proven ability to debug complex performance issues across model, runtime, and hardware layers.
Adaptability to work in fast-paced environments with ambitious technical objectives.
Preferred Qualifications:
Practical experience with vLLM or similar inference frameworks.
Background in GPU kernel optimization (CUDA, Triton, ROCm).
Experience in scaling inference across multi-node or heterogeneous clusters.
Prior experience with model compilation toolchains (e.g., TensorRT, TVM, ONNX Runtime).
Hands-on experience with model quantization strategies.