Join our innovative team as a Generative AI Engineer at Sia, where you will leverage the power of advanced AI models to develop transformative algorithms and solutions across diverse sectors.As a key player in our team, you will bridge the gap between Data Scientists, ML Engineers, and Platform Engineers to harness the full potential of Generative AI technology, crafting business-oriented solutions that deliver optimal value. Your expertise will guide our clients in navigating the complexities of semantic search, retrieval-augmented generation (RAG), and fine-tuning processes, ensuring they achieve their goals in the most efficient manner.Your responsibilities will extend beyond prompt engineering; you will design and construct robust, scalable products that begin with benchmarks of candidate foundational models and involve rapid prototyping and validation of innovative product ideas. Your mastery of the entire AI workflow will facilitate the seamless integration of these advanced models into applications, enhancing performance, security, compliance, scalability, and operational efficiency. You will adeptly manage interactions between prompts, chains, and agents while addressing infrastructure challenges.This role encompasses research, design, implementation, and optimization of AI systems, requiring proficiency in two key areas:Understanding the model stack, including foundational models (FM), vision models (VLM), and speech models (SLM), utilizing both proprietary and open-source solutions such as GPT-x, Claude, and Gemini, along with smaller edge models like Mistral or Arcee AI, to deliver cost-effective solutions.Familiarity with AI application frameworks that utilize chaining, retrieval, autonomous agents, and vector search technologies (e.g., LangChain, LlamaIndex, pgvector).We are committed to your success, providing extensive training that combines internal programs with resources from our technology partners. If you are passionate about advancing AI technology and making a lasting impact by empowering our clients to create GenAI-driven applications efficiently, we invite you to apply.
May 4, 2026