Senior Ai Ml Specialist Solutions Architect Ai Infrastructure Cloud jobs in San Francisco – Browse 7,495 openings on RoboApply Jobs

Senior Ai Ml Specialist Solutions Architect Ai Infrastructure Cloud jobs in San Francisco

Open roles matching “Senior Ai Ml Specialist Solutions Architect Ai Infrastructure Cloud” with location signals for San Francisco. 7,495 active listings on RoboApply Jobs.

7,495 jobs found

1 - 20 of 7,495 Jobs
Apply
companyLavendo logo
Full-time|$225K/yr - $315K/yr|Remote|San Francisco

About the CompanyLavendo is a pioneering publicly traded company leading the charge in the AI revolution. With an AI-centric cloud platform, we are transforming the artificial intelligence landscape. Our state-of-the-art infrastructure, including extensive GPU clusters and advanced cloud services, supports developers in harnessing the explosive growth of the global AI industry, catering to Fortune 1000 firms, innovative startups, and AI researchers alike.Company type: Publicly tradedIndustry: AI/ML, Cloud Computing, Infrastructure-as-CodeCandidate Location: Remote U.S.Our mission is to democratize AI infrastructure access and empower organizations to innovate, optimize, and deploy AI solutions seamlessly at any scale. By simplifying the complexities of AI development, we provide a comprehensive full-stack AI platform that marries robust hardware with easy-to-use tools and services.The OpportunityWe are on the lookout for a Senior AI/ML Specialist Solutions Architect to become a crucial part of our client's dynamic team. This role presents an exciting opportunity to design and implement scalable AI solutions tailored for AI-centric clients, leveraging cutting-edge technologies and contributing to one of the most powerful commercially available supercomputers.What You'll DoArchitect and enhance distributed training and inference systems for large-scale AI models.Design and deliver customer-centric solutions that optimize performance and drive business value.Lead the migration of ML pipelines from Proof of Concept to scalable production environments.Foster long-term relationships with clients, ensuring satisfaction and alignment with their strategic objectives.Produce whitepapers, conduct technical presentations, and facilitate webinars to disseminate insights and best practices.Provide technical guidance and mentorship to teams regarding AI infrastructure and deployment strategies.Collaborate with engineering and product teams to prioritize customer feedback and shape product roadmaps.

Feb 23, 2026
Apply
companyDigitalOcean logo
Full-time|On-site|San Francisco

Join DigitalOcean as a Senior Solutions Architect II specializing in AI and Machine Learning. In this pivotal role, you will leverage your expertise to design and implement innovative solutions that drive our customers' success. You'll collaborate with cross-functional teams to ensure our clients harness the full potential of our cloud services, enabling them to scale and optimize their operations effectively.

Apr 10, 2026
Apply
company
Full-time|On-site|San Francisco

About Us:At novita-ai, we are a rapidly growing global provider of AI cloud infrastructure, leading the charge in the artificial intelligence revolution. Our innovative platform equips developers and enterprises with powerful, scalable, and user-friendly solutions such as Model APIs, GPU Instances, and Serverless Computing. As organizations around the globe strive to integrate AI into their offerings, we serve as the essential engine that fuels their innovative efforts.Join our world-class team and contribute to our expanding customer base. This unique opportunity allows you to be part of a dynamic company in a hyper-growth market, where your technical skills will directly impact customer success and drive our business forward.The Role:As a Solutions Engineer, you will act as the primary technical leader and trusted advisor for our clients throughout their journey. You will collaborate closely with the sales team to bridge the gap between complex customer challenges and our sophisticated technical solutions. Your mission is to build technical credibility, demonstrate the capabilities of our platform, and design tailored solutions that empower our clients to achieve their AI-related business objectives.What You'll Do:Technical Discovery & Solution Design: Collaborate with Account Executives to gain a deep understanding of customer needs, technical requirements, and business goals. Develop elegant and effective solutions utilizing our AI infrastructure stack (Model APIs, GPU Instances, Serverless).Product Demonstration & Proof of Concept (POC): Conduct engaging, customized product demonstrations and interactive workshops. Plan, manage, and execute successful POCs, showcasing the value and performance of our platform within the client’s environment.Technical Evangelism & Trusted Advisory: Communicate the value proposition of our platform to diverse audiences, including both technical and non-technical stakeholders, from engineers to C-level executives. Establish yourself as the go-to expert for customers on best practices in AI infrastructure.Sales Enablement & Market Feedback Loop: Create and maintain technical sales materials, including whitepapers, best practice guides, and demo scripts. Serve as the voice of the customer, relaying valuable feedback from the field to our Product and Engineering teams to influence our product roadmap.Onboarding & Implementation Guidance: Facilitate a seamless post-sales transition by providing initial onboarding support and architectural guidance, setting customers up for sustained success.

Aug 27, 2025
Apply
companyDigitalOcean logo
Full-time|$147K/yr - $230K/yr|On-site|San Francisco

Embark on a transformative career journey with DigitalOcean, where you will collaborate with a vibrant community of exceptional talent dedicated to crafting the most user-friendly and scalable cloud solutions. If you possess a growth mindset, a penchant for thinking big and bold, and thrive in a high-energy environment that challenges the status quo, you will find your ideal fit with us. We believe in winning together while learning, enjoying the journey, and making a significant impact for the dreamers and builders around the globe.We are seeking a Senior Solutions Architect (AI/ML) who is enthusiastic about addressing intricate cloud infrastructure challenges, particularly within the burgeoning AI/ML domain.In your role as a Senior Solutions Architect at DigitalOcean, you will be an integral part of a dynamic team committed to revolutionizing cloud computing and AI. Reporting to the Senior Manager of Solutions Architecture, you will collaborate with our most strategic accounts and partners to design and implement scalable, reliable, and innovative cloud solutions. You will play a crucial role in influencing both technical and business decisions, assisting customers in modernizing their workloads, and establishing DigitalOcean as the top choice for their AI/ML and GPU requirements.In this capacity, you will work closely with Sales, Technical Account Management, and Partner teams to deliver technical excellence that fuels new customer acquisition and strengthens existing relationships. As the technical subject matter expert, your responsibilities will include designing, implementing, and optimizing high-performing, resilient cloud solutions tailored to customer needs.Furthermore, you will engage cross-functionally with Product, Engineering, and Operations to ensure that customer feedback informs the continuous enhancement of DigitalOcean’s offerings. Your technical leadership and consultative approach will be pivotal in fostering trusted relationships, shaping strategic decisions, and positioning DigitalOcean as the preferred cloud partner for expanding businesses and key partners.This is a remarkable opportunity to merge your technical expertise, business insight, and communication skills to drive substantial impact on DigitalOcean’s growth and success.

Feb 18, 2026
Apply
companylavendo logo
Full-time|$225K/yr - $315K/yr|On-site|San Francisco

About UsAt Lavendo, we are at the forefront of AI cloud infrastructure, rapidly expanding with a significant global presence that includes R&D centers in North America, Europe, and Israel. Our exceptional team of engineers and AI researchers is dedicated to creating innovative solutions that provide the essential infrastructure for the next wave of AI-driven enterprises.We empower organizations, from Fortune 500 companies to pioneering AI startups and research institutions, allowing them to address complex AI challenges without incurring heavy infrastructure costs or the need to develop extensive in-house AI/ML teams.Our MissionWe aim to democratize access to top-tier AI infrastructure, enabling organizations of all sizes to transform ambitious AI goals into tangible outcomes. Our culture fosters creativity, embraces challenges, and thrives on teamwork.Your RoleAs a Cloud Solutions Architect (Pre-Sales), you will serve as a vital technical partner to some of the most forward-thinking AI teams globally. You will engage directly with cutting-edge GPU infrastructure, including the latest NVIDIA technology, to assist clients in designing, deploying, and optimizing AI workloads at scale. This high-profile position lies at the intersection of deep technical expertise and strategic customer interaction, significantly shaping customer experiences and platform adoption.Key ResponsibilitiesAct as a trusted technical advisor to customers throughout the entire pre-sales and onboarding process.Lead proof-of-concept initiatives, architectural workshops, presentations, and training on GPU cloud technologies and industry best practices.Work closely with customers to understand their business needs and translate them into scalable solution architectures.Develop and document Infrastructure as Code solutions, reference architectures, and technical guides in collaboration with support engineers and technical writers.Assist clients in optimizing machine learning pipeline performance and resource efficiency.Serve as a cross-functional technical expert, connecting product, technical support, and marketing teams with customer requirements.Represent Lavendo at external events, including hackathons, conferences, and industry showcases.

Feb 25, 2026
Apply
companyPrime Intellect logo
Full-time|On-site|San Francisco

About Prime Intellect Prime Intellect builds the backbone for advanced AI labs, delivering infrastructure that supports the next generation of artificial intelligence. Our platform, Lab, brings together environments, evaluations, sandboxes, and high-performance training into one unified system for post-training at scale. Teams use Lab for Reinforcement Learning (RL), Supervised Fine-Tuning (SFT), tool integration, agent workflows, and deployment. We rely on our own technology to train leading-edge models, ensuring our clients benefit from proven solutions. Prime Intellect recently raised $15 million in new funding, bringing our total to $20 million. Investors include Founders Fund, Menlo Ventures, and notable angels such as Andrej Karpathy, Tri Dao, and Dylan Patel. Role Overview: Solutions Architect - AI Infrastructure This San Francisco-based role connects client success, technical operations, and AI infrastructure delivery. The Solutions Architect works directly with customers to guide onboarding, deployment, and scaling on the Prime Intellect platform. Internally, this position helps build systems and workflows that make our services more scalable and efficient. Beyond supporting individual accounts, this role shapes how we serve technically advanced clients, turning high-touch deployments into repeatable processes and better tools for sustainable growth. Key Responsibilities Customer Success and Account Management Serve as the main point of contact for customers using our AI infrastructure Oversee onboarding and coordinate deployments from post-sale through production readiness Build trusted relationships with customer teams, proactively resolving issues and managing risks Work closely with engineering and operations to ensure customer needs are understood and addressed Systems Development and Operational Efficiency Improve internal systems for onboarding, deployment tracking, escalation management, and account health monitoring Spot repetitive tasks and convert them into streamlined processes, tools, or automation Help define operational cadence, support infrastructure, and scalable service models for account management Develop structure and introduce solutions that enhance the customer experience

Apr 14, 2026
Apply
companyVaromoney logo
Full-time|On-site|San Francisco, CA

Join Varomoney as a Principal AI/ML Architect, where you will lead groundbreaking projects that leverage artificial intelligence and machine learning to transform financial services. Your expertise will guide our engineering teams in developing innovative solutions that not only meet but exceed client expectations. You will be at the forefront of AI/ML technology, driving strategic initiatives and ensuring the highest standards of technical excellence.

Mar 20, 2026
Apply
company
Full-time|On-site|San Francisco

About Liquid AIFounded as a spinoff of MIT CSAIL, Liquid AI specializes in developing versatile AI systems designed for optimized performance across various deployment platforms, from data center accelerators to on-device hardware. Our commitment to low latency, minimal memory consumption, privacy, and reliability sets us apart. We collaborate with enterprises in sectors such as consumer electronics, automotive, life sciences, and financial services. As we experience rapid growth, we seek remarkable talent to join our journey.The OpportunityAs we establish our solutions architecture function from the ground up, you will play a pivotal role as one of our inaugural Solutions Architects. Collaborating closely with the Head of Solutions Architecture and the go-to-market organization, you will manage customer engagements from inception to completion.Our models are specifically engineered for environments constrained by memory, latency, and power, encompassing edge devices, mobile applications, embedded systems, and on-premises infrastructure where traditional models cannot operate. You will engage with this boundary daily.Our clientele ranges from AI-native startups to established enterprises venturing into AI for the first time. Your mission is to bridge the gap between our models' capabilities and customers' expectations, delivering on that promise from technical validation through to go-live.

Apr 13, 2026
Apply
company
Full-time|On-site|NYC or SF Bay Area

Genesis Molecular AI is building the GEMS molecular AI platform, driving advances in foundation model training and industrial screening. Strategic partnerships and a strong compute infrastructure are central to the company’s growth and mission. Role Overview The Director of AI Infrastructure Partnerships will lead efforts to secure and manage critical technology alliances, investments, and compute resources. This leader will work closely with top AI organizations, hardware providers, and investors, including firms like a16z and NVIDIA, to support Genesis’s technical and business goals. The role is based in either New York City or the San Francisco Bay Area. What You Will Do Oversee partnerships with NVIDIA and identify new opportunities with leading AI organizations. Structure contracts, equity deals, technical collaborations, co-publications, and data-sharing agreements for both public and proprietary experimental and synthetic data. Create presentations and written materials that clearly communicate Genesis’s platform vision and technical strengths to partners and investors, and integrate these messages into broader external communications. Serve as the business lead and chief negotiator for major cloud computing and AI infrastructure deals. Secure high-performance compute at competitive rates and maintain strong relationships with key partners. Monitor the AI compute market, evaluating providers for cost, reliability, and availability to support research and deployment needs. Work with ML Engineering to forecast compute requirements for model training, synthetic data generation, fine-tuning, and large-scale inference. Optimize performance and budget across multiple cloud environments and track usage to maximize value. Manage the internal budgeting process for compute spend. Translate technical needs into financial forecasts and present capital allocation recommendations to company leadership. What We’re Looking For Significant experience in AI and cloud computing, including managing high-value negotiations and partnerships. Strong analytical and strategic skills, with the ability to assess market trends and make informed decisions. Excellent communication and interpersonal abilities, comfortable explaining complex topics to a range of audiences.

Apr 15, 2026
Apply
company
ML Infrastructure Engineer

Sygaldry Technologies

Full-time|On-site|San Francisco

About Sygaldry Technologies Sygaldry Technologies develops quantum-accelerated AI servers in San Francisco, focusing on faster AI training and inference. By combining quantum technology with artificial intelligence, the team addresses challenges in computing costs and energy efficiency. Their AI servers integrate multiple qubit types within a fault-tolerant system, aiming for a balance of cost, scalability, and speed. The company values optimism, rigor, and a drive to solve complex problems in physics, engineering, and AI. Role Overview: ML Infrastructure Engineer The ML Infrastructure Engineer joins the AI & Algorithms team, which includes research scientists, applied mathematicians, and quantum algorithm specialists. This role centers on building and maintaining the compute infrastructure that powers advanced research. The systems you build will support reliable GPU access, reproducible experiments, and scalable workloads, so researchers can focus on their core work without needing deep cloud expertise. Expect to design and manage compute platforms for a range of tasks, including quantum circuit simulation, large-scale numerical optimization, model training, tensor network contractions, and high-throughput data generation. These workloads span multiple cloud providers and on-premises GPU servers. Key Responsibilities Develop compute abstractions for diverse workloads, such as GPU-accelerated simulations, distributed training, high-throughput CPU jobs, and interactive analyses using frameworks like PyTorch and JAX. Set up infrastructure to support experiment tracking and reproducibility. Create developer tools that make cloud computing feel local, streamlining environment setup, job submission, monitoring, and artifact management. Scale experiments from single-GPU prototypes to large, multi-node production runs. Multi-Cloud GPU Orchestration Design orchestration strategies for workloads across multiple cloud providers, optimizing job routing for cost, availability, and capability. Monitor and improve cloud spending, keeping track of credit balances, burn rates, and expiration dates.

Apr 14, 2026
Apply
companyHyperbolic Labs logo
Full-time|On-site|San Francisco, CA

Join Our MissionAt Hyperbolic Labs, we are dedicated to democratizing artificial intelligence by eliminating barriers to computing power through our Open-Access AI Cloud. We aggregate global computing resources to provide an innovative GPU marketplace and AI inference service, making AI affordable and accessible for everyone. As pioneers at the crossroads of AI and open-source technology, we envision a future where AI innovation is driven by imagination, not resource limitations. We invite forward-thinking individuals who share our vision of making AI universally accessible, secure, and cost-effective to join us in crafting a platform that empowers innovators to realize their groundbreaking AI projects.As we gear up for expansion following our Series A funding, our team, led by co-founders with PhDs in AI, Mathematics, and Computer Science, is set to transform the landscape of computing.The RoleWe are on the lookout for a Senior Infrastructure Engineer to drive the development and scaling of Hyperbolic's GPU Cloud Marketplace. In this pivotal role, you will create a multi-tenancy provisioning and virtualization solution that transforms raw GPUs from diverse global suppliers into a programmable, orchestrated resource pool serving thousands of AI developers and researchers. You will work at the forefront of cloud infrastructure, building the core orchestration layer that allows our platform to deliver cost savings of up to 75% compared to traditional cloud providers.

Mar 26, 2026
Apply
company
Full-time|On-site|San Francisco Bay Area

Join the Revolution at Retell AIRetell AI is pioneering the future of call centers through innovative voice AI, driven by first principles thinking.In just 18 months since our inception, we have empowered thousands of businesses with our AI voice agents, transforming how sales, support, and logistics calls are managed—previously requiring extensive human teams. Supported by prestigious investors such as Y Combinator and Alt Capital, we've rapidly scaled from $5M ARR to an impressive $36M ARR with a compact yet dynamic team of 20.Our ambition for 2026 is to create a revolutionary customer experience platform, where entire contact centers are powered by AI. Moving beyond basic automation, we aim to develop intelligent AI “workers” that serve as frontline agents, QA analysts, and managers, continuously enhancing customer interactions without the need for constant human oversight.As we expand, we are seeking passionate engineers who are eager to solve challenging technical problems, act swiftly, and make a significant impact in one of the fastest-growing voice AI startups. Let’s shape the future together.

Aug 12, 2025
Apply
companyAnthropic logo
Full-time|$240K/yr - $315K/yr|On-site|San Francisco, CA | New York City, NY

About Anthropic Anthropic builds AI systems with a focus on reliability, interpretability, and steerability. The team includes researchers, engineers, policy experts, and business leaders, all working together to ensure AI benefits both users and society. Role Overview The Commercial Solutions Architect, Applied AI, joins Anthropic’s Applied AI team as a Pre-Sales Architect. This role centers on demonstrating the value of Claude and helping customers integrate and deploy it within their technology stacks. The position combines technical expertise with customer engagement to design LLM solutions for complex business needs, always maintaining Anthropic’s standards for safety and reliability. As a Commercial Solutions Architect, expect to work closely with key accounts, building reusable solution blueprints, demos, and enablement materials that support the wider adoption of Claude across Anthropic’s commercial clients. Collaboration is central: work alongside Sales, Product, and Engineering teams to guide clients from early technical discovery through to deployment. Use your knowledge to help customers understand Claude’s capabilities, develop evaluation strategies, and design scalable architectures that unlock the full potential of Anthropic’s AI systems. Location San Francisco, CA or New York City, NY

Apr 17, 2026
Apply
companynexxa logo
Full-time|On-site|SF Bay area

Join nexxa as a Senior AI Architect, where you'll lead groundbreaking projects and drive innovation in the realm of artificial intelligence. Your expertise will guide our engineering teams in creating scalable AI solutions, ensuring our products meet the highest standards of quality and performance. Collaborate with cross-functional teams to translate business needs into AI strategies, and influence the future of technology at nexxa.

Apr 8, 2026
Apply
companyIntercom logo
Full-time|On-site|San Francisco, California

Join Intercom as a Senior AI Deployment Architect, where you will play a pivotal role in designing and implementing AI solutions that revolutionize customer communication. You will work alongside a talented team of engineers and data scientists to build state-of-the-art AI models and systems. Your expertise will be essential in deploying scalable AI applications that enhance user experiences across various platforms.

Apr 3, 2026
Apply
companylavendo logo
Full-time|$225K/yr - $315K/yr|Remote|San Francisco

About UsAt Lavendo, we are pioneering an infrastructure that most engineers only dream of. We operate an AI-centric cloud platform that integrates expansive GPU clusters, high-speed networking, and cloud-native tools, catering to enterprises, innovative startups, and leading research teams. Our mission is straightforward: empower our clients to efficiently train and execute complex AI and simulation workloads without the need to construct their own supercomputers.As a publicly traded company, we are rapidly expanding, with R&D centers across North America, Europe, and the Middle East. Our culture emphasizes engineering excellence: minimal bureaucracy, significant ownership, and a focus on tackling challenging infrastructure problems while witnessing the impact of our work on real customer operations.Your Role as HPC Specialist Solutions ArchitectIn this pivotal role, you will be the go-to expert for customers looking to establish or enhance advanced GPU and HPC environments in the cloud. This includes multi-rack clusters, high-speed interconnects, intricate scheduling, and strict SLAs regarding throughput and latency.As an HPC Specialist Solutions Architect, you will design and optimize cutting-edge platforms for AI training, extensive simulations, and data-intensive workloads. You will collaborate closely with NVIDIA's latest hardware (Hopper, Blackwell, and future versions), NVLink/NVSwitch topologies, and InfiniBand/RoCE fabrics, having a substantial influence on the evolution of our platform and reference architectures. If you thrive on translating workloads into optimized clusters and maximizing performance, this is the ideal position for you.Your ResponsibilitiesCluster Design: Architect and implement HPC clusters for AI, simulation, and distributed training using Kubernetes and schedulers like Slurm. Your considerations will include node types, GPU topology, queues, partitions, and failure scenarios.Infrastructure Optimization: Integrate NVIDIA Hopper and Blackwell-class GPUs with NVLink/NVSwitch and InfiniBand/RoCE, ensuring the hardware layout aligns with the communication patterns of the workloads.Automation: Deploy and manage GPU and Network Operators to standardize drivers, CUDA, firmware, and high-speed networking across extensive fleets, rather than managing on a box-by-box basis.Supercomputer Cloud Functionality: Design and validate cloud-native HPC environments that emulate supercomputer capabilities.

Feb 6, 2026
Apply
companyAnthropic logo
Full-time|Remote|San Francisco, CA

Join Anthropic as a Solutions Architect specializing in Applied AI, focusing on state and local government initiatives. You will leverage cutting-edge AI technologies to develop innovative solutions that enhance public services. Collaborate with government agencies to identify challenges and implement effective AI strategies that improve efficiency and accessibility.

Mar 12, 2026
Apply
companyScale AI logo
Full-time|On-site|San Francisco, CA; Seattle, WA; New York, NY

Scale AI is seeking a Senior AI Infrastructure Engineer to help build and refine the company’s Training Platform. This position centers on designing, implementing, and improving infrastructure that supports machine learning teams as they train and deploy models. Role overview This engineer will work closely with colleagues across different functions to create solutions that make AI systems more efficient. The focus is on enabling faster, more reliable model training and deployment. Key responsibilities Design and build infrastructure for AI model training Implement and optimize systems to support machine learning workflows Collaborate with teams throughout the company to improve platform capabilities Locations This role is based in San Francisco, Seattle, or New York.

Apr 29, 2026
Apply
companyAir Apps logo
Full-time|On-site|San Francisco

Join Our Team at Air AppsAt Air Apps, we are on a mission to revolutionize resource management through innovative technology. Founded in 2018 in Lisbon, Portugal, we have expanded our reach with offices in both Lisbon and San Francisco, boasting over 100 million downloads globally. Our vision is to create the world’s first AI-powered Personal & Entrepreneurial Resource Planner (PRP), and we are looking for passionate individuals to help us achieve this goal.Our commitment to challenging the status quo drives us to push the boundaries of AI-driven solutions that make a real impact. Here, you will have the opportunity to be a creative force, developing products that empower individuals worldwide.Join us as we embark on this journey to redefine how people plan, work, and live.

Feb 25, 2025
Apply
companyAnthropic logo
On-site|On-site|San Francisco, CA | New York City, NY | Washington, DC

Join Anthropic as a Solutions Architect in our Applied AI team, where you will play a pivotal role as a Pre-Sales architect. Your mission will be to empower federal civilian agencies with the insights and technical expertise needed to harness the full potential of our AI system, Claude. You will bridge the gap between complex technical solutions and customer needs, ensuring smooth integration and deployment within their technology frameworks. Collaborating closely with Sales, Product, and Engineering teams, you will guide clients from initial discovery through to successful implementation, helping them understand the capabilities of Claude and designing innovative architectures that address their unique challenges while adhering to our stringent safety and reliability standards.

Jan 29, 2026

Sign in to browse more jobs

Create account — see all 7,495 results

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.