1 - 20 of 94,558 Jobs

Search for Machine Learning Engineer - Model Distillation

94,558 results

Apply
Featherless AI logoFeatherless AI logo
Full-time|Remote|Remote (world)

About the PositionWe are on the lookout for a Machine Learning Engineer specialized in model distillation to assist us in developing compact, rapid, and efficient models while maintaining high-quality standards. This role will involve blending research with practical applications—transforming state-of-the-art methodologies into scalable systems.This is a pro…

Jan 22, 2026
Apply
Ekumen Labs logoEkumen Labs logo
Full-time|Remote|LATAM (Remote)

Join our dynamic team at Ekumen Labs as a Machine Learning Data Engineer. In this role, you will leverage your expertise in machine learning and data engineering to design and implement robust data pipelines that facilitate the deployment of machine learning models. Your contributions will be crucial in transforming raw data into valuable insights that drive business decisions and enhance the efficiency of our operations.

Mar 20, 2026
Apply
Poolside logoPoolside logo
Full-time|Remote|Remote (EMEA/East Coast)

Poolside is dedicated to advancing Artificial General Intelligence (AGI), with a focus on innovation and engineering that drives economic and scientific progress. The company brings together experts in research and software development, all working toward high-quality systems. As a remote-first team, Poolside’s staff are based across Europe and North America. Team members meet in person for three days each month, with longer offsite sessions twice a year. This structure supports collaboration among people with both research and engineering backgrounds. Role overview The Reinforcement Learning Infrastructure Engineer joins the reinforcement learning team to improve reasoning and coding capabilities in Large Language Models (LLMs). The role covers the full process: researching new algorithms, designing and scaling RL environments, and implementing solutions across the stack. Work is supported by access to thousands of GPUs. Core mission Build and scale infrastructure for reliable, efficient LLM training using advanced reinforcement learning methods. Key responsibilities Stay current on research and developments in LLMs, reinforcement learning, and code generation. Develop strategies for fine-tuning training and inference, ensuring integration throughout the development process. View GDPR Policy

Apr 27, 2026
Apply
Blue Rose Research logoBlue Rose Research logo
Full-time|$165K/yr - $210K/yr|Remote|Remote

About Us: Blue Rose Research is dedicated to creating innovative data and AI tools that empower Democrats to secure electoral victories. Our multidisciplinary team merges engineering, data science, and strategic political insight to fuel decision-making for leading campaigns and progressive organizations. We analyze electoral trends, test advertising strategies, and leverage generative AI to enable campaigns to respond effectively to current events with impactful messaging. Our guidance has influenced the allocation of hundreds of millions in campaign spending. As a compact, mission-driven team, we prioritize rapid development, bold experimentation, and assist progressives in communicating effectively and achieving success, all while driven by a sense of curiosity and a commitment to utilizing technology for positive change. Machine Learning Lead (LLM & Applied AI) We are seeking a Machine Learning Lead to oversee a team of skilled data scientists developing ML-driven solutions that inform strategies for civic leaders and organizations. Reporting directly to the Director of Engineering, you will take charge of the team’s roadmap and technical vision. This role is hands-on, requiring collaboration with your team to construct infrastructure, train models, and deploy them into production. If you are eager to apply your technical skills in a meaningful context that promotes public welfare, this position offers the opportunity to create a significant impact. Key Responsibilities: Lead a team of senior data scientists dedicated to optimizing large language models, conducting innovative R&D, and developing production inference systems. Work in partnership with senior leadership to establish the team’s roadmap and align priorities with organizational objectives. Facilitate weekly meetings and stand-ups to ensure team progress and remove any obstacles to execution. Provide technical guidance across projects utilizing open-weight and proprietary LLMs along with other advanced ML methodologies. Oversee testing, optimization, and data integrity to guarantee the accuracy, reliability, and readiness of models for production. Encourage creative problem-solving and methodological rigor when custom solutions are necessary beyond standard ML techniques. Convert complex model outputs into actionable insights for stakeholders, ensuring that our technical efforts yield real-world benefits. About You: 1+ years of experience leading data science teams; 6+ years in machine learning or data engineering. Strong foundation in applied statistics, model selection, tuning, and performance evaluation. Proficient in Python, SQL, and contemporary ML frameworks.

Dec 20, 2025
Apply
Featherless AI logoFeatherless AI logo
Full-time|Remote|Remote (world)

About the RoleWe are seeking a passionate Machine Learning Engineer to spearhead the enhancement of model inference performance at scale. In this role, you will bridge the gap between theoretical research and practical application by transforming cutting-edge models into efficient, scalable, and user-centric systems.This position is perfect for individuals who thrive in a technically challenging environment, enjoy in-depth system profiling down to the kernel and GPU levels, and excel at converting innovative research ideas into production-ready performance improvements.What You’ll DoEnhance inference latency, throughput, and cost-efficiency for large-scale ML models deployed in productionAnalyze and troubleshoot GPU/CPU inference pipelines focusing on memory, kernels, batching, and I/O performanceImplement and optimize techniques including:Quantization strategies (fp16, bf16, int8, fp8)KV-cache optimization and reuseSpeculative decoding, batching, and streamingModel pruning and architectural simplifications for optimized inferenceCollaborate closely with research engineers to transition novel model architectures to productionConstruct and uphold inference-serving systems using frameworks such as Triton, custom runtimes, or bespoke stacksBenchmark performance across various hardware setups (NVIDIA / AMD GPUs, CPUs) and cloud configurationsEnhance the reliability, observability, and cost efficiency of systems under real workload conditionsWhat We’re Looking ForSignificant experience in ML inference optimization or high-performance machine learning systemsStrong grasp of deep learning fundamentals (attention mechanisms, memory architecture, compute graphs)Practical experience with PyTorch (or similar frameworks) and model deployment techniquesFamiliarity with GPU performance enhancements (CUDA, ROCm, Triton, or kernel-level optimizations)Proven capability in scaling inference systems for real-world users beyond research benchmarksAdaptability to work in a fast-paced startup atmosphere, embracing ownership and navigating ambiguityPreferred QualificationsExperience with LLM or long-context model inferenceKnowledge of various inference frameworks (TensorRT, ONNX Runtime, vLLM, Triton)

Jan 22, 2026
Apply
Agero logoAgero logo
Full-time|$150K/yr - $200K/yr|Remote|Remote

About Agero:At Agero, we're redefining the vehicle ownership experience. Our mission is to enhance the relationship between clients and their customers through innovative people and data-driven technology. As a leading B2B provider of digital driver assistance services, we are transforming traditional processes into efficient, digital, and connected solutions. Our offerings include a state-of-the-art dispatch management platform powered by Swoop, comprehensive accident management services, and a growing ecosystem of consumer support. Partnering with top automobile manufacturers and insurance carriers, Agero oversees 150 million vehicle coverage points and responds to approximately 12 million service events annually. Headquartered in Medford, MA, and part of The Cross Country Group, we operate across North America. To learn more, visit https://www.agero.com/.Note: For our technical roles, we prefer to start in person! You may need to travel to Medford for onboarding, but we will manage all travel arrangements and expenses for you.Role Description and Mission:The Engineering Manager for Data Science and Machine Learning is a pivotal leadership position responsible for overseeing a talented team of Data Scientists, ML Engineers, and Software Engineers dedicated to architecting, building, and operating our next-generation Dispatch Optimization platform. This role requires extensive expertise in Data Science, Machine Learning, Operations Research, and scalable cloud-native service development.You will champion scientific rigor and engineering excellence to convert model outputs into real-time, impactful dispatch decisions that enhance cost efficiency and service quality.

Feb 11, 2026
Apply
Hudson Manpower logoHudson Manpower logo
Contract|Remote|Remote job

Hello,We are excited to present a fantastic opportunity for a GenAI Engineer within the dynamic field of AI and Machine Learning. This role is fully remote, allowing you to work from anywhere, and is a perfect fit for someone with extensive experience in developing prototypes and proofs of concept.Position OverviewAs a GenAI Engineer, you will leverage your deep understanding of AI and machine learning principles to create innovative solutions. Your expertise in Python and familiarity with tools such as Hugging Face, Langchain, and OpenAI API will be essential in driving project success.Key Responsibilities:Develop prototypes, PoCs, and MVPs using advanced AI/ML methodologies.Utilize deep learning frameworks including TensorFlow, Keras, and PyTorch to implement solutions.Engage with cloud platforms such as Google Model Garden, Amazon Bedrock, and Nvidia Nim to enhance project outcomes.Work collaboratively in a fast-paced environment, bringing innovative solutions to complex problems.Ideal Candidate:The successful candidate will possess a strong foundation in AI and machine learning, coupled with a passion for problem-solving and a collaborative mindset.

Jun 19, 2025
Apply
Agero logoAgero logo
Full-time|$150K/yr - $200K/yr|Remote|Remote

About Agero:At Agero, we are at the forefront of transforming the vehicle ownership experience. Our mission is to innovate and enhance this experience through a unique blend of passionate individuals and data-driven technology, which strengthens the bonds between our clients and their customers. As the leading B2B, white-label provider of digital driver assistance services, we are redefining the industry by turning manual processes into digital, transparent, and connected solutions. Our offerings include an industry-leading dispatch management platform powered by Swoop, comprehensive accident management services, expert consumer affairs, and connected vehicle capabilities, along with a growing marketplace of services, discounts, and support powered by a strong partner ecosystem. We are proud to cover over 150 million vehicles in collaboration with major automobile manufacturers, insurance carriers, and more. Managing one of the largest national networks of service providers, Agero handles approximately 12 million service events annually. Headquartered in Medford, Mass., with operations across North America, we are a proud member of The Cross Country Group. To learn more, visit https://www.agero.com/.Note: For our technical roles, we encourage an in-person start! You may need to travel to Medford for your initial onboarding. Don't worry about the logistics; once you’re hired, we take care of all travel arrangements and expenses.Role Description and Mission:As the Engineering Manager for Data Science and Machine Learning, you will play a pivotal leadership role overseeing a talented team of Data Scientists, ML Engineers, and Software Engineers. Your focus will be on architecting, building, and operating our next-generation Dispatch Optimization platform. This position requires profound expertise in Data Science, Machine Learning, constrained Optimization (Operations Research), and the development of scalable cloud-native services.You will lead scientific rigor and engineering excellence to convert model outputs into real-time, high-impact dispatch decisions that optimize cost efficiency and enhance service levels.

Feb 11, 2026
Apply
poolside logopoolside logo
Full-time|Remote|Remote (EMEA/East Coast)

Poolside is committed to building Artificial General Intelligence within this decade. The company emphasizes rapid innovation, applied research, and large-scale deployment. By increasing the scale and capability of its models, Poolside aims to create economic value while focusing on user and customer success. The broader vision is to make AI central to meaningful work and scientific progress. The team works remotely across Europe and North America, meeting in person for three days each month and gathering for longer offsites twice a year. Researchers and engineers collaborate closely, sharing responsibility for building high-quality systems. Strong engineering practices support fast development and amplify results. Role overview The Reinforcement Learning Engineer role focuses on advancing the reasoning and coding abilities of Large Language Models using reinforcement learning. This position combines research and hands-on engineering: designing new training algorithms, developing and scaling RL environments, and implementing solutions throughout Poolside’s technology stack. Substantial GPU resources are available to support these projects. Mission Expand the reasoning and coding capabilities of foundational models through reinforcement learning. Key responsibilities Design and run experiments to improve reasoning and code generation in large language models. Oversee projects from initial idea through to integration. Stay current with the latest research in the field and contribute to ongoing research efforts. This is a remote position open to candidates based in EMEA or on the East Coast of North America. View GDPR Policy

Apr 27, 2026
Apply
Prolific logoProlific logo
Full-time|Remote|Remote

Join Prolific as an AI Training - Machine Learning Specialist! This is a unique opportunity to work in a fully remote environment where you will contribute to the development and enhancement of machine learning models. You will collaborate closely with data scientists and engineers to ensure the quality and efficiency of AI training processes.As a key member of our team, your responsibilities will include optimizing training datasets, experimenting with various algorithms, and fine-tuning models to achieve high-performance outcomes. Your insights and expertise will help shape the future of AI at Prolific.

Feb 11, 2026
Apply
Prolific logoProlific logo
Full-time|Remote|Remote

Join Prolific as a Machine Learning Specialist focused on AI Training! We are seeking a talented individual to contribute to the development and implementation of AI models. This is a fully remote position offering the opportunity to work with cutting-edge technology in a dynamic environment.As a key member of our team, you will analyze data, improve algorithms, and ensure the high performance of machine learning systems. Your expertise will help us drive innovation and optimize processes.

Feb 11, 2026
Apply
Prolific logoProlific logo
Full-time|Remote|Remote

*]:pointer-events-auto scroll-mt-[calc(var(--header-height)+min(200px,max(70px,20svh)))]"> *]:pointer-events-auto scroll-mt-[calc(var(--header-height)+min(200px,max(70px,20svh)))]" data-turn-id="788a8b78-cfcd-42e1-beb1-30da5949a95d" data-testid="conversation-turn-8" data-scroll-anchor="true" data-turn="assistant">Join us at Prolific as a Machine Learning Specialist focused on AI Training! In this fully remote role, you'll leverage your expertise in machine learning to develop and enhance AI models, contributing to innovative projects that impact our clients globally. Collaborate with a talented team, participate in cutting-edge research, and help us drive the future of AI technology.

Feb 11, 2026
Apply
plantingspace logoplantingspace logo
Full-time|Remote|Remote

Join us at Plantingspace as we develop an advanced AI system tailored for analysts and scientists, leveraging a revolutionary approach to reasoning and knowledge representation. Our innovative platform surpasses the capabilities of traditional large language models (LLMs) by integrating algorithms symbolically, enabling multi-step analyses, verifiable reasoning paths, and uncertainty assessments. We aim to create transformative applications that enhance and automate research across diverse fields such as Finance, Strategy Consulting, Engineering, and Material Sciences.We are seeking talented Product Software Engineers who have a proven track record in the commercialization of research software. Your role will focus on the productization of our cutting-edge system, creating compelling use-case demonstrations that highlight its unique advantages. Key responsibilities include implementing applications in quantitative domains (like financial analysis and physics simulations) within our system's framework, identifying its strengths, and developing showcase demos. Additionally, you will play a crucial role in identifying limitations, closing gaps, and enhancing the product based on your insights.

Dec 17, 2025
Apply
Canonical logoCanonical logo
Full-time|Remote|Home based - Worldwide

Canonical, a trailblazer in open source software and operating systems, is redefining the tech landscape for enterprises globally. Our flagship platform, Ubuntu, plays a pivotal role in transformative initiatives across public cloud, data science, AI, engineering innovation, and IoT. We proudly serve top-tier public cloud providers and prominent industry leaders across various sectors. Embracing a model of global collaboration, our diverse team of over 1,200 professionals spans 75+ countries, with minimal in-office roles. We convene bi-annually in unique global locations to align our strategies and execute our vision.As a growing, founder-led, and profitable company, we are excited to welcome a MLOps Solutions Engineer to empower enterprises to harness the power of AI/ML through cutting-edge open source technologies on both public and private cloud infrastructures, including Linux and Kubernetes. Our team offers expert insights to tackle real-world challenges, facilitating the adoption of Ubuntu, Kubeflow, MLFlow, Feast, DVC, and other advanced analytics and machine learning technologies. We are committed to developing the premier open source data platform, integrating traditional SQL databases with contemporary NoSQL data solutions while transforming data into actionable insights and executable models.This role is ideal for MLOps engineers who thrive on engaging with customers and addressing their challenges during the presales cycle. As solutions architects, you will innovate customer-centric solutions through architecture design, presentations, and training. This is primarily an architectural role focused on developing ML frameworks for external clients, rather than software development.We seek candidates with a robust technical foundation who possess a business-oriented mindset and are motivated by commercial success. As part of our global Field Engineering team, you will collaborate closely with enterprise sales leaders, tackling some of the toughest challenges in contemporary data architecture. Whether it’s training LLMs across hybrid cloud infrastructures with GPU sharing or processing millions of real-time financial transactions, you will be at the forefront of solving complex problems daily.Location: Most of our team operates remotely. We are expanding our teams across EMEA, Americas, and APAC time zones, making this opportunity accessible to candidates from nearly any location.

Jan 20, 2026
Apply
Hudson Manpower logoHudson Manpower logo
GenAI/ML Engineer

Hudson Manpower

Contract|Remote|Remote job

GenAI/ML Engineer100% Remote PositionW2 Candidates OnlyWe are seeking a talented GenAI/ML Engineer to join our innovative team at hudsonmanpower. This fully remote role allows you to leverage your expertise in artificial intelligence and machine learning to create prototypes, proofs of concept (PoCs), and minimum viable products (MVPs).Key Responsibilities:Develop and prototype cutting-edge AI/ML solutions.Utilize your strong foundation in AI, deep learning, and machine learning principles.Leverage programming skills in Python, along with tools such as Hugging Face, Langchain, and OpenAI API.Work with deep learning frameworks including TensorFlow, Keras, and PyTorch.Engage with cloud platforms like Google Model Garden, Amazon Bedrock, and Nvidia Nim.Handle multi-modal data and intelligent agent-based tools.Qualifications:Self-motivated and passionate about solving complex problems using AI/GenAI.Collaborative mindset with a flair for innovation.

May 28, 2025
Apply
Tether logoTether logo
Full-time|Remote|Remote job

Join Tether and Shape the Future of Digital FinanceAt Tether, we are not merely developing products; we are at the forefront of a financial revolution. Our innovative solutions empower businesses—from exchanges and wallets to payment processors and ATMs—to effortlessly integrate reserve-backed tokens across various blockchains. By leveraging blockchain technology, Tether enables instant, secure, and global storage, transfer, and receipt of digital tokens, all at a significantly reduced cost. Transparency is our foundation, fostering trust in every transaction.Innovate with TetherTether Finance: Our groundbreaking product suite includes the world’s most reliable stablecoin, USDT, trusted by hundreds of millions worldwide, along with pioneering services in digital asset tokenization.Tether Power: Committed to sustainable growth, our energy solutions maximize excess power for Bitcoin mining through eco-friendly practices in cutting-edge, geographically diverse facilities.Tether Data: Accelerating advancements in AI and peer-to-peer technology, we minimize infrastructure costs and enhance global communications with innovative solutions like KEET, our flagship application that redefines secure and private data sharing.Tether Education: We democratize access to premium digital learning, equipping individuals to thrive in the digital age and gig economy, driving global growth and opportunity.Tether Evolution: At the convergence of technology and human potential, we are redefining possibilities, crafting a future where innovation and human capabilities synergize in powerful, unprecedented ways.Why Join Us?Our team is a global talent powerhouse, operating remotely from diverse locations across the globe. If you are eager to make an impact in the fintech landscape, this is your chance to collaborate with some of the brightest minds, breaking barriers and establishing new benchmarks. We have rapidly expanded, maintained our agility, and solidified our reputation as an industry leader.If you possess exceptional English communication skills and are prepared to contribute to the most innovative platform globally, Tether is your destination.Are you ready to be part of the future?

Feb 17, 2026
Apply
plantingspace logoplantingspace logo
Full-time|Remote|Remote

At plantingspace, we are pioneering an innovative AI system tailored for analysts and scientists, leveraging a revolutionary approach to reasoning and knowledge representation. Our technology surpasses traditional state-of-the-art LLMs by intricately combining algorithms in a symbolic manner, enabling groundbreaking features such as multi-step analysis, transparent reasoning paths, and uncertainty assessment. We envision our applications transforming research and analysis across various fields, including Finance, Strategy Consulting, Engineering, Material Sciences, and beyond.We seek passionate Bayesian Software Engineers equipped with a solid background in Bayesian statistics to contribute to the development of our advanced models and algorithms for statistical inference and machine learning. Your role will involve designing, implementing, and optimizing statistical procedures that can be applied to a diverse range of models, all of which will be integrated within our expansive software system.

Sep 30, 2025
Apply
0g logo0g logo
Full-time|Remote|Remote

Key Responsibilities:Platform Development: Architect, implement, and uphold a robust and scalable AI platform to effectively integrate, train, and deploy machine learning models.Infrastructure Optimization: Oversee and enhance cloud infrastructure (e.g., AWS, GCP, Azure) to meet high-performance AI workload requirements while ensuring cost efficiency.Cross-Functional Integration: Work collaboratively with various teams to merge AI capabilities with blockchain systems, prioritizing data security and compliance with decentralization principles. Contribute to decentralized app (DApp) and smart contract development as necessary.Security & Documentation: Establish and enforce stringent data security and privacy protocols for the platform. Develop and sustain comprehensive technical documentation to facilitate knowledge sharing and support future scalability.Innovation: Keep abreast of the latest advancements in AI, blockchain, and cloud technologies to promote ongoing innovation.Required Qualifications:Bachelor's degree or higher in Computer Science or a related discipline.Demonstrated experience in developing and deploying large-scale AI/ML platforms.Strong expertise in Python and familiarity with AI/ML frameworks such as TensorFlow and PyTorch.Practical experience with leading cloud platforms (AWS, GCP, or Azure).Comprehensive understanding of distributed systems and cloud-native technologies.Exceptional problem-solving abilities along with strong attention to detail and communication skills.Preferred Qualifications (Bonus Points):Knowledge of blockchain technology and smart contract development (e.g., Solidity).Experience with blockchain security best practices.Familiarity with DevOps tools and CI/CD pipelines.A record of contributions to open-source projects.What We Offer:Purpose: Be part of an initiative aimed at making AI a public good.Growth: A self-directed environment where you can take initiative to shape your role and career.Compensation: Competitive remuneration package.

Nov 17, 2025
Apply
Platacard logoPlatacard logo
Full-time|Remote|Worldwide

About Our Team: Join the expanding Financial Assistant team at Platacard, where we are dedicated to creating intelligent systems designed to empower users in managing their finances, grasping spending habits, and seamlessly interacting with financial products. Our team is integral to enhancing customer experience, engagement, and the overall value of our core offerings. Operating in a regulated environment, we prioritize accuracy, safety, and trust. Utilizing AWS, Go, Python, and cloud-based models, we remain adaptable, integrating both off-the-shelf tools and custom solutions. Our focus is on crafting systems that yield significant and valuable results for our organization. As a pivotal member of our cross-functional team, you will collaborate with backend, mobile, and LLM engineers to drive innovation.

Apr 6, 2026
Apply
fal logofal logo
Full-time|Remote|Remote

Join our innovative team as a Research Scientist specializing in Engineering, where your expertise in machine learning and generative media will drive the development of cutting-edge products. You will leverage your extensive knowledge of the latest advancements in the field to identify gaps in the market and create solutions that address real customer challenges. This role may involve pioneering new training methods or architectures, as well as fine-tuning existing models with unique datasets. Your ability to assess the return on investment for various approaches will be crucial, as we prioritize research that leads to tangible product development.

Dec 16, 2025

Sign in to browse more jobs

Create account — see all 94,558 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.