1 - 20 of 54,378 Jobs

Search for [독파모] AI Research Engineer - Vision Language Model

Upstage
Full-time|Remote|Remote job

At Upstage, we are dedicated to our vision of "Making AI Beneficial" and our mission of "Building Intelligence for the Future of Work." We are developing next-generation AI solutions based on Vision-Language Models (VLM) that go beyond merely reading text to comprehensively understand visual information such as images, charts, and tables. By extracting hidde…

Jun 18, 2025
Nace.ai
Full-time|On-site|Palo Alto, CA

Position Overview: Join our innovative team at Nace.ai as we push the boundaries of artificial intelligence through cutting-edge research in large language models (LLMs) and vision-language models (VLMs). We are in search of a talented AI Research Engineer with a strong focus on adaptive learning methodologies, including meta-learning and hypernetworks. Your role will encompass the design and implementation of advanced architectures for dynamic model adaptation, enhancing model reasoning capabilities, and effectively sharing insights with both research and engineering teams.
Essential Qualifications: Demonstrated experience with LLMs or VLMs in both research and production environments. Strong foundational knowledge in Natural Language Processing, Machine Learning, or related fields, particularly in language model development. A proven history of tackling complex challenges in language understanding and generation, employing rigorous quantitative methods. Exceptional communication skills for conveying research findings to varied technical audiences. Proficiency in Python and familiarity with deep learning frameworks such as PyTorch, JAX, or TensorFlow, alongside experience in distributed training and model optimization.
Desirable Qualifications: PhD in Computer Science, Computational Linguistics, or a closely related discipline with an emphasis on language models and adaptive learning frameworks. Substantial research and engineering background with LLMs/VLMs, particularly in meta-learning or parameter-efficient adaptation, supported by grants, fellowships, patents, or contributions to open-source initiatives. First-author publications in recognized peer-reviewed conferences (ACL, EMNLP, NeurIPS, ICML, ICLR) or journals focusing on language models, meta-learning, hypernetworks, or adaptive AI.
Preferred Technical Expertise: In-depth research knowledge in LLM reasoning, hypernetworks, multi-task learning, meta-learning, and the design of innovative LLM adaptation techniques, including online continual learning.

Mar 17, 2026
Bosch Group
Full-time|On-site|Sunnyvale

Bosch Group is looking for a Research Scientist in Sunnyvale to focus on Vision, Language, and Action (VLA) Models. This role centers on research and development at the intersection of machine learning, computer vision, and natural language processing. The aim is to create advanced models that improve how people interact with technology.
What you will do: Advance research in Vision, Language, and Action models through original work and experimentation. Collaborate with colleagues across different fields to design and implement new approaches. Use knowledge in machine learning, computer vision, and NLP to address practical challenges. Support projects that seek to enhance user experience and technology interaction.
Location: This is an on-site position in Sunnyvale.

Apr 28, 2026
Zoox
Internship|On-site|Foster City, CA

Zoox seeks a PhD Research Intern in Vision Language Action Models for its Foster City, CA office. This internship focuses on research that bridges visual perception, language, and action, with an emphasis on developing new algorithms in this area.
Responsibilities: Research topics that combine computer vision, language understanding, and action modeling. Create and evaluate algorithms that connect visual data with language-driven tasks. Work alongside researchers and engineers on shared projects. Apply creative problem-solving to help advance Zoox’s technology.
Collaboration and Learning: Interns join a team that encourages fresh ideas and continuous learning. The environment supports close collaboration with experts focused on advancing autonomous systems.

Apr 20, 2026
Black Canyon Consulting
Full-time|On-site|NIH-Bethesda

Location: Bethesda, Maryland (on-site, not remote)
Overview: Join our innovative team at Black Canyon Consulting as a Senior AI and Large Language Model (LLM) Engineer. We are on the lookout for a seasoned professional to spearhead the design, customization, and integration of cutting-edge large language models (LLMs) into biomedical research workflows and information retrieval systems. The ideal candidate will possess substantial hands-on experience in training, fine-tuning, augmenting, and deploying LLMs in production settings, particularly within the biomedical or life sciences sectors. This role is product-focused and aims to shape the future of AI-driven search, retrieval, and knowledge discovery tools.
In this pivotal position, you will act as a subject matter expert (SME) across diverse product and engineering teams. You will be instrumental in defining, architecting, and implementing LLM-driven functionalities across a suite of NCBI services. Strong technical acumen, sound architectural judgment, and effective collaboration within existing product and technical ecosystems are essential.
This hands-on role emphasizes building and strategic influence, requiring a candidate who can guide both the development process and the architectural choices made.
Please note: We are only considering serious candidates with a minimum of 3 years of relevant experience following their most recent degree. Recent graduates or those with less experience need not apply.

Feb 24, 2026
Mirage
Full-time|On-site|Union Square, New York City

About Mirage: Mirage builds an AI-powered video platform that connects production and editing through natural language processing. Our models use contextual awareness to mirror the choices of skilled editors, streamlining workflows for experienced teams and making video creation more accessible to a wider audience.
Learn More: Our Product (Captions by Mirage); Our Research (Seeing Voices, technical white paper); Latest Updates (Mirage on X / Twitter). Mirage has been featured in TechCrunch, Forbes AI 50, and Fast Company.
Our Investors: Mirage is backed by leading venture firms and entrepreneurs, including Index Ventures, Kleiner Perkins, Sequoia Capital, Andreessen Horowitz, and others.
Location Requirement: All roles at Mirage require in-person work at our Union Square headquarters in New York City.
Role Overview: Research Engineer – Large Language Models. Mirage seeks a Research Engineer to design, build, and scale systems for training and deploying large language models, with a focus on multimodal creative applications in video analysis. This role works closely with researchers to turn new ideas into efficient, production-ready systems that strengthen our platform.

Apr 14, 2026
Spellbrush
Full-time|On-site|San Francisco or Tokyo

Overview: At Spellbrush, we are revolutionizing the gaming experience by crafting a 3D first-person adventure game where an AI companion plays a pivotal role. Imagine MiSide enhanced with large language models, seamlessly integrated into gameplay rather than merely serving as a role-play chatbot.
About Us: At Spellbrush, we are dedicated to creating exceptional anime games, and we proudly stand as a global leader in generative AI. Our flagship project is niji・journey. Our mission is straightforward: to use AI to animate characters and redefine narrative-driven gaming.
What We’re Creating: We have engineered an innovative in-house LLM storytelling system that merges AI, narrative, and gameplay, transcending the limitations of conventional chat-only encounters. This results in an AI companion that collaborates with players in solving puzzles, retains memories across different worlds, and alters the progression of each chapter.
About The Role: Join our elite team to redefine video game experiences. Collaborate with leading minds in the industry, including the creator of Warudo and Cytoid, as well as a Google DeepMind veteran behind Project Astra, and top-tier AI researchers. As an integral early member of this team, you will enjoy significant artistic and research autonomy in shaping what could be the next era of LLM-driven storytelling.

Sep 2, 2025
Oumi
Full-time|On-site|Seattle, WA

Join Oumi as a Research Scientist
About Oumi: Oumi is committed to democratizing frontier AI, believing that collective and open development is essential for its safe and efficient advancement. Our mission is to provide the safest, highest quality, and most adaptable AI technologies to empower individuals and organizations, ultimately benefiting humanity as a whole.
Our Solutions: Oumi offers a comprehensive platform for building advanced AI models, facilitating every stage from data preparation to model training and deployment. We collaborate with academic partners and the broader community to enhance open foundation models.
Our Values: At Oumi, our foundation is built on open-source principles and collaboration. Our endeavors are:
Research-driven: Conducting and sharing original AI research in partnership with global academic institutions.
Community-focused: Encouraging contributions from researchers and developers worldwide.
Accessibility-oriented: Designing our platform to lower barriers for organizations of all sizes to engage with AI.
Position Summary: As a Research Scientist at Oumi, you will play a crucial role in our research team, focusing on the advancement of Large Language Models (LLMs), Vision Language Models (VLMs), and associated technologies. Your work will involve pioneering research, engaging in open-source projects, and collaborating with fellow researchers and engineers on various dimensions of LLM/VLM development including training, evaluation, data curation, and benchmark formulation.
Key Responsibilities:
Model Development: Engage in research to develop and assess innovative LLMs, VLMs, and other AI models, exploring novel architectures and training methodologies.
Data Curation: Create strategies for assembling high-quality datasets for LLM training and evaluation, employing techniques such as data synthesis.

May 5, 2025
Bosch Group
Full-time|Remote|Sunnyvale

Role overview: Bosch Group seeks an AI Research Scientist with expertise in Large Language Models (LLMs) and Agentic AI for its Sunnyvale office. The role centers on applying advanced research methods to develop new AI solutions that address real-world challenges across industries.
Collaboration and team: This position joins a team of specialists who combine creativity with technical skill. Open collaboration shapes the group’s approach, and team members regularly share ideas to tackle complex problems together.

Apr 23, 2026
Intrinsic Robotics
Internship|$57.69/hr|On-site|Mountain View, California

Intrinsic Robotics, part of Google's AI robotics division, is dedicated to advancing industrial robotics by combining artificial intelligence, perception, and simulation. The team’s mission is to make intelligent robotics more accessible and practical for businesses, entrepreneurs, and developers. Interns at Intrinsic Robotics work alongside engineers, roboticists, designers, and technologists in a collaborative environment. Projects here directly influence real-world robotics applications and contribute to new economic opportunities in the field.
Role overview: The Vision Foundation Model Research Intern will focus on designing, training, and evaluating vision foundation models for industrial robotics. Responsibilities include deploying these solutions with industry partners and aiming for high model performance in real-world scenarios.
What you will do: Contribute to robotics software development, including projects on the Flowstate platform. Work closely with experienced professionals and follow modern software development practices. Research and implement deep learning models in Python, using PyTorch or JAX. See the results of your work applied with customers in the industrial robotics sector.
Requirements: Current enrollment in a Master's program in Computer Science or a related discipline (PhD candidates are preferred). Strong programming skills in Python and practical experience with PyTorch or JAX. Knowledge of training foundation models across multiple nodes. Interest or background in end-to-end robotics applications or integrating robotic hardware in industrial settings. Ability to troubleshoot, launch, and debug training and inference workflows. Clear verbal and written communication abilities.
Location: This internship takes place in Mountain View, California.

Apr 28, 2026
Zyphra
Full-time|On-site|San Francisco

Zyphra is an innovative leader in artificial intelligence, located in the heart of San Francisco, California.
Role Overview: As a Research Engineer specializing in Language Model Pre-Training, you will play a pivotal role in defining our language model strategy through comprehensive pretraining development. Your close collaboration with our pretraining team will ensure that your insights contribute to the advancement of our next-generation models.
Key Responsibilities: Conduct large-scale training runs and implement model parallelization techniques. Optimize the performance of our pretraining stack. Oversee dataset collection, processing, and evaluation. Research architecture and methodologies, including optimizer ablations.
Qualifications: Demonstrated engineering prowess in developing reliable and robust systems. A quick learner with a passion for implementing innovative ideas. Exceptional communication and collaboration skills, capable of working effectively on both research and engineering implementations at scale.
Preferred Skills: Profound expertise in addressing machine learning challenges and training models. Experience training on large-scale (multi-node) GPU clusters. In-depth understanding of model training pipelines, including model/data parallelism and distributed optimizers. Strong methodology for conducting rigorous ablations and hypothesis testing. Familiarity with large-scale, high-performance data processing pipelines. High proficiency in PyTorch and Python programming. Ability to navigate and understand extensive pre-existing codebases swiftly. Published research in machine learning in reputable venues is an advantage. Postgraduate degree in a relevant scientific field (Computer Science, Electrical Engineering, Mathematics, Physics).
Why Join Zyphra? We value a research methodology that emphasizes thoughtful, methodical progress towards ambitious objectives. Both deep research and engineering excellence are given equal importance. Join us in an environment that fosters innovation, collaboration, and professional growth.

Aug 28, 2025
Anthropic
On-site|New York City, NY; San Francisco, CA; Seattle, WA

About Anthropic: At Anthropic, we are dedicated to developing AI systems that are reliable, interpretable, and steerable. We aim to ensure that AI is safe, beneficial, and aligned with the needs of both our users and society. Our expanding team consists of passionate researchers, engineers, policy experts, and business leaders collaborating to create groundbreaking AI solutions.
About the Role: We are seeking a talented Research Engineer with a solid foundation in computer vision, who shares our belief that visual and spatial reasoning are essential for unleashing the full potential of large language models (LLMs). In this collaborative role, you will engage in research, development, and evaluation of cutting-edge Claude models, with a specific emphasis on enhancing visual and spatial capabilities. You will contribute across multiple facets of our research initiatives, employing a full-stack approach that encompasses pretraining, reinforcement learning (RL), and runtime techniques such as agentic harnesses. Additionally, you will work closely with our product team to ensure that your vision enhancements positively influence Claude's performance in real-world applications.

Jan 29, 2026
Basis
Full-time|On-site|New York Office

About Basis: Basis is a nonprofit organization dedicated to applied artificial intelligence research. Our mission is twofold: to understand and build intelligence and to advance society’s problem-solving capabilities. We strive to unravel the mathematical principles behind reasoning, learning, decision-making, understanding, and explanation, while also developing software that embodies these principles.
Our commitment extends to enhancing our ability to tackle complex issues that are beyond today's capabilities and accelerating our potential to address future challenges. We are creating a technological framework inspired by human reasoning, alongside fostering a collaborative organization that prioritizes human values.
About the Role: As a Research Scientist, you will spearhead Basis’ initiatives to deepen our understanding of the conceptual, mathematical, and computational principles of intelligence. We seek individuals who excel technically and are passionate about exploring foundational concepts. Our research team values rigorous, high-quality scientific endeavors while encouraging experimentation and exploration of innovative ideas.
Basis thrives on collaboration, both internally and with external partners. We are looking for team players who enjoy tackling significant problems that require collective effort.
Research Focus: Despite the growing acknowledgment that acquiring and understanding world models is crucial for intelligence, current AI systems face challenges in mirroring this human capability. Key uncertainties remain regarding the essence of a world model, methods for reliably detecting its presence in agents, and approaches to develop agents capable of learning these models effectively.
Our research, particularly within the MARA project, seeks to establish new foundations and technologies for modeling, abstraction, and reasoning in AI systems. MARA's goal is to identify principled methods for how intelligence constructs, refines, and employs world models through interactive experimentation. Achieving this will require advancements in knowledge representation, abstraction, reasoning, active learning, and reinforcement learning, necessitating a first-principles reevaluation of world modeling.

Oct 31, 2025
Mirage
Full-time|On-site|Union Square, New York City

Mirage builds an AI-native platform for video production and editing, centered in Union Square, New York City. The platform uses natural language to guide intelligent orchestration, allowing advanced models to understand context and mimic the creative decisions of experienced editors. This approach aims to boost productivity for professional teams and open up video creation to a wider audience.
About the Team: The team at Mirage brings together people from a range of backgrounds, blending technical and artistic skills to solve tough challenges in generative media. The work goes beyond routine model development, focusing on problems that remain unsolved across the industry.
Role Overview: Research Scientist, Large Language Models. This early team role offers the chance to shape the core technology behind Mirage. The position involves tackling foundational questions in generative AI and creative tooling, with the potential to influence how people create and edit video for years to come.

Apr 14, 2026
Basis
Full-time|On-site|New York Office

About Basis: Basis is a pioneering nonprofit organization dedicated to applied AI research, driven by a dual mission.
Firstly, we aim to deepen our understanding and development of intelligence. This involves establishing the mathematical foundations of reasoning, learning, decision-making, understanding, and explanation, along with creating software that embodies these principles.
Secondly, we strive to enhance society’s capacity to tackle complex challenges. This means broadening the scale and complexity of the problems we can address today, while also accelerating our future problem-solving capabilities.
To realize these aims, we are constructing an innovative technological framework inspired by human reasoning and fostering a collaborative organization that prioritizes human values.
About the Role: As a Research Scientist at Basis, you will play a crucial role in advancing our understanding of the theoretical, mathematical, and computational principles underlying intelligence.
Our Research Scientists possess key characteristics:
Outstanding technical expertise: strong mathematical and computational foundations.
A creative builder’s mindset: capable of designing, constructing, and refining complex systems based on foundational principles.
Commitment to scientific rigor: engaging in high-quality, robust scientific inquiry without hesitance to experiment, learn from mistakes, and explore unconventional ideas.
At Basis, collaboration is key, both within our teams and with external partners. We seek individuals who thrive in collaborative environments and are eager to tackle challenges that transcend individual capabilities.
Programming Languages Research Scientists: This role is tailored for experts in the design, implementation, and analysis of programming languages. You will assist in the design and implementation of the foundational computational reasoning systems under development at Basis.
Key focus areas of programming languages research include compiler design, partial evaluation, program analysis, abstract interpretation, and program transformation. This work is conducted within the context of constructing reasoning systems, leading to engagement with topics such as probabilistic programming, automatic differentiation, and SAT/SMT solvers.
Expectations: Possess a PhD (or equivalent experience) in a relevant field.

Jan 30, 2025
Bosch Group
Full-time|On-site|Sunnyvale

Role overview: Bosch Group is looking for a Senior Research Scientist in Sunnyvale to focus on Vision, Language, and Action (VLA) Models. The position involves leading research efforts that advance artificial intelligence and machine learning, with a particular focus on integrating visual, language, and action-based data into unified models.
What you will do: Lead research initiatives in VLA models, helping to shape the next generation of AI applications. Collaborate with a team of experts to design and build models that connect vision, language, and action understanding. Contribute to Bosch Group’s technology strategy and support progress in AI research.
Impact: The research from this role will directly inform Bosch Group’s strategic choices and drive advancements in artificial intelligence technology.

Apr 28, 2026
ifm-us
Full-time|On-site|Sunnyvale, CA

About the Institute of Foundation Models: We are a premier research facility focused on the development, comprehension, application, and risk mitigation of foundation models. Our objective is to propel research, cultivate the upcoming generation of AI innovators, and contribute significantly to a knowledge-centric economy.
As a vital member of our team, you will engage with state-of-the-art foundation model training alongside distinguished researchers, data scientists, and engineers, addressing critical and transformative challenges in AI development. Participate in creating revolutionary AI solutions poised to redefine entire sectors. Your strategic and innovative problem-solving abilities will play a crucial role in establishing MBZUAI as a global leader in high-performance computing for deep learning, fostering impactful discoveries that motivate the next wave of AI visionaries.
The Role: We are the AllWorld Team within the Institute of Foundation Models (IFM) at MBZUAI. Our team is at the forefront of developing the PAN (Physical, Agentic, and Networked) world models, next-generation foundation models designed to unlock machine intelligence beyond traditional linguistic capabilities.
Our mission is to confront the fundamental issues of world modeling and to establish a new paradigm for next-generation machine reasoning. We are seeking enthusiastic individuals who align with our vision and are excited to explore the boundaries of AI with us.

May 2, 2025
Anthropic
Full-time|Remote|Remote-Friendly (Travel-Required) | San Francisco, CA | New York City, NY

Anthropic is looking for a Research Engineer focused on model evaluations. This position involves research and development to assess and strengthen the performance of AI models. Teams are based in San Francisco and New York City, and the role supports remote work with required travel.
Key responsibilities: Design and implement evaluations for Anthropic's AI models. Collaborate with team members to enhance model performance. Contribute to research that pushes the boundaries of AI systems.
Location: Remote-friendly (travel required); San Francisco, CA; New York City, NY.

Apr 28, 2026
Cohere
Full-time|On-site|New York

About Us: At Cohere, our mission is to amplify intelligence to benefit humanity. We specialize in training and deploying cutting-edge models for developers and enterprises, enabling them to create extraordinary AI experiences such as content generation, semantic search, retrieval-augmented generation (RAG), and intelligent agents. Our work is pivotal in driving the widespread adoption of artificial intelligence.
We are deeply passionate about our creations. Each team member plays a crucial role in enhancing our models and maximizing the value they deliver to our clients. We thrive on hard work and agility, always prioritizing the needs of our customers.
Cohere is made up of a diverse team of leading researchers, engineers, designers, and more, all dedicated to their craft. We value unique perspectives as essential for developing exceptional products.
Join us in our journey to shape the future of AI!
Role Overview: As Large Language Models (LLMs) redefine the capabilities of AI, inference remains a critical bottleneck. Our Model Efficiency team is at the forefront of enhancing LLM inference efficiency across our foundational models. We focus on groundbreaking advancements in the model execution stack, encompassing: optimization of model architecture and mixture of experts (MoE) routing; innovations in decoding and inference-time algorithms; co-design of software and hardware for GPU acceleration; performance enhancements without sacrificing model quality.
Note: We have offices in Toronto, Montreal, San Francisco, New York, Paris, Seoul, and London. We embrace a remote-friendly culture, strategically distributing teams based on interests, expertise, and time zones to foster collaboration and flexibility. Our Model Efficiency team primarily operates in the EST and PST time zones.
As a Staff Research Engineer, you'll be instrumental in developing, prototyping, and deploying methodologies that significantly enhance the speed and efficiency of our models in production.
Ideal Candidate Profile: You may be an excellent fit for our Model Efficiency team if you: hold a PhD in Machine Learning or a closely related discipline; possess a deep understanding of LLM architecture and optimization techniques for inference under resource constraints; bring substantial experience in model optimization and performance enhancement strategies.

Nov 7, 2025
ifm-us
Full-time|On-site|Sunnyvale, CA

About the Institute of Foundation Models: At the Institute of Foundation Models, we are on a mission to innovate and enhance the development of foundation models. Our research lab is committed to advancing AI through understanding, utilization, and effective risk management of these models. We aim to empower the next generation of AI developers and contribute significantly to a knowledge-driven economy.
Joining our team means you will work at the forefront of foundation model training, collaborating with elite researchers, data scientists, and engineers. You will tackle pivotal challenges in AI development and contribute to the creation of revolutionary AI solutions that could transform various industries. Your strategic and innovative problem-solving skills will play a key role in establishing MBZUAI as a global leader in high-performance computing for deep learning, fostering impactful discoveries that will inspire future AI visionaries.
The Role: We are in search of a Foundation Model DevOps Engineer who will focus on Operational Stability to support our AI research infrastructure. You will be responsible for creating an efficient environment that facilitates model development. Your role involves building tooling, release pipelines, and storage policies that alleviate burdens on our research team. You will manage the foundational layer, ensuring that researchers have immediate, secure, and reliable access to essential tools, data, and computational resources.
Key Responsibilities: Model Release Engineering
High-Fidelity Release Management: You will uphold the standards of our public presence, ensuring that all releases (weights, code, training logs, data) are reproducible, comprehensively documented, and presented with the professionalism of a leading open-source product.
CI/CD for Research: You will design and implement pipelines that automate the testing and packaging of intricate model releases, transitioning us from manual procedures to automated validation.
Repo Administration: You will administer the organization’s Git repositories, ensuring optimal performance and accessibility.

Jan 16, 2026
