Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.
Unlock Your Potential
Generate Job-Optimized Resume
One Click And Our AI Optimizes Your Resume to Match The Job Description.
Is Your Resume Optimized For This Role?
Find Out If You're Highlighting The Right Skills And Fix What's Missing
Experience Level
Mid to Senior
Qualifications
Key Responsibilities:
Develop and enhance user-facing AI applications for major enterprise clients, including prominent media and Fortune 500 companies.
Contribute to the development and refinement of features for Scale’s GenAI Platform, enabling businesses to build and manage AI-driven agents.
Design, construct, and optimize high-performance, user-friendly UIs using Next.js, React, TypeScript, and Tailwind.
Collaborate closely with product managers, designers, and AI/ML teams to deliver seamless and impactful user experiences.
Integrate frontend applications with backend services, working with APIs, authentication systems, and cloud infrastructure.
Deliver features at a rapid pace while maintaining exceptional code quality, performance, and accessibility.
Preferred Qualifications:
5+ years of experience in developing frontend or full-stack applications using modern tech stacks.
Strong expertise in Next.js, React, TypeScript, and Tailwind, with a passion for crafting polished, user-friendly interfaces.
Experience in building high-visibility, customer-facing applications and making informed trade-offs between speed and quality in fast-paced environments.
About the job
Join Our Team as a Staff Software Engineer, Full-Stack - Enterprise Generative AI
At Scale AI, we are revolutionizing enterprise workflows through our Scale Generative AI Platform (Scale GP), an advanced AI solution that provides APIs for knowledge retrieval, inference, evaluation, and much more. We are seeking a talented full-stack engineer, with a strong front-end focus, to develop innovative AI-powered applications that transform how businesses operate.
In this dynamic role, you will collaborate on diverse projects, ranging from cutting-edge customer-facing applications to internal SaaS products. Be part of our engineering team that powers initiatives such as TIME’s Person of the Year AI experience (see it in action), showcasing how our technology impacts the media landscape. Additionally, you will enhance Scale’s GenAI Platform (SGP), a robust system enabling businesses to create and manage AI agents at scale. Whether developing interactive AI assistants or refining our core SaaS platform, your contributions will shape the future of AI in real-world applications.
About Scale AI
Scale AI is at the forefront of generative AI technology, providing enterprises with the tools they need to harness AI for their workflows. Our mission is to empower businesses through innovative solutions that redefine the way they operate and engage with their customers. Join us and be part of a team that is shaping the future of AI.
Similar jobs
1 - 20 of 7,936 Jobs
Search for Infrastructure Software Engineer Enterprise Generative Ai
Full-time|$216.2K/yr - $270.3K/yr|On-site|San Francisco, CA; New York, NY
Join Scale AI's innovative team as an Infrastructure Software Engineer for our Enterprise Generative AI Platform (SGP). In this dynamic role, you will help design and enhance our enterprise-grade AI platform, which offers robust APIs for knowledge retrieval, inference, evaluation, and more. We're seeking an exceptional engineer who thrives in fast-paced environments and is eager to contribute to the scaling of our core infrastructure. The ideal candidate will possess a solid foundation in software engineering principles and extensive experience with large-scale distributed systems. Your role will involve implementing solutions across various cloud providers (GCP, Azure, AWS) for clients in highly regulated sectors, including healthcare, telecommunications, finance, and retail.
Full-time|$248.4K/yr - $310.5K/yr|On-site|New York, NY; San Francisco, CA
Join Our Team as a Staff Software Engineer, Full-Stack - Enterprise Generative AI At Scale AI, we are revolutionizing enterprise workflows through our Scale Generative AI Platform (Scale GP), an advanced AI solution that provides APIs for knowledge retrieval, inference, evaluation, and much more. We are seeking a talented full-stack engineer, with a strong front-end focus, to develop innovative AI-powered applications that transform how businesses operate. In this dynamic role, you will collaborate on diverse projects, ranging from cutting-edge customer-facing applications to internal SaaS products. Be part of our engineering team that powers initiatives such as TIME’s Person of the Year AI experience (see it in action), showcasing how our technology impacts the media landscape. Additionally, you will enhance Scale’s GenAI Platform (SGP), a robust system enabling businesses to create and manage AI agents at scale. Whether developing interactive AI assistants or refining our core SaaS platform, your contributions will shape the future of AI in real-world applications.
Full-time|$248.4K/yr - $310.5K/yr|On-site|San Francisco, CA; New York, NY
Join Scale's innovative team and contribute to the development of the Scale Generative AI Platform (Scale GP), an enterprise-level Generative AI platform offering robust APIs for knowledge retrieval, inference, evaluation, and beyond. We are on the lookout for a talented Staff Software Engineer who is eager to tackle challenging engineering problems and help scale our product in a dynamic environment. The ideal candidate will possess a solid grasp of software engineering principles, coupled with experience in large-scale distributed systems. You will take ownership of significant new components within our product, working across both backend and frontend systems while engaging with LLMs and machine learning models. Your role will involve solving complex issues related to scalability and reliability.
Full-time|$216.2K/yr - $270.3K/yr|On-site|New York, NY; San Francisco, CA
Join Scale GP (Scale Generative AI Platform), a cutting-edge enterprise-level Generative AI platform, where we provide robust APIs for knowledge retrieval, inference, evaluation, and much more. We are seeking a talented Software Engineer to become a pivotal part of our dynamic team, contributing to the development and expansion of our innovative product in a fast-paced setting. The ideal candidate will possess a deep understanding of software engineering principles and practices, along with hands-on experience in large-scale distributed systems. As a Software Engineer, you will take charge of major components of our product, engaging with both backend and frontend technologies, and collaborating with LLMs and machine learning models. You will tackle complex engineering challenges related to scalability and reliability, ensuring we meet the demands of our growing customer base.
Full-time|$138K/yr - $259.4K/yr|On-site|San Francisco, CA; St. Louis, MO; New York, NY; Washington, DC
Scale AI is on the lookout for an exceptionally talented and driven Software Engineer, Frontier AI Infrastructure to become an integral part of our innovative Public Sector Engineering team. In this role, you will take charge of the model inference layer, enabling cutting-edge AI models, troubleshooting the latest AI tools, managing networking tasks, addressing latency issues, and monitoring pricing and usage metrics for AI models. You will spearhead technical discussions with cloud vendors and clients to fulfill critical contracts and resolve platform challenges. Additionally, you will collaborate closely with Product teams to anticipate feature requirements, transitioning from reactive 'infra-only debugging' to proactive integration testing.Your Responsibilities Include:Designing and implementing secure, scalable backend systems tailored for Public Sector clients, utilizing Scale's advanced cloud-native AI infrastructure.Owning services or systems while defining long-term health objectives and enhancing the health of related components.Redesigning the architecture to operate in compliant or restrictive environments, which entails creating swappable components (authentication, storage, logging) to adhere to government and security regulations without compromising product integrity.Collaborating with Product teams to develop integration tests that identify issues early, shifting focus from 'infra-only debugging' to preventing upstream failures.Actively participating in customer engagements, liaising with stakeholders to comprehend requirements and deliver innovative solutions.Contributing to the platform roadmap and product strategy for Scale AI's Public Sector division, playing a vital role in shaping the future trajectory of our offerings.
Full-time|Remote|North America Remote / San Francisco, CA
Join Our Team as a Software Engineer - AI InfrastructureLocation: North America Remote / San Francisco · Full-TimeAt Andromeda Cluster, we are dedicated to democratizing access to advanced AI infrastructure that was once only available to hyperscalers. Founded by industry leaders Nat Friedman and Daniel Gross, we have evolved from a singular managed cluster to a global platform that connects top AI labs, data centers, and cloud providers around the world. Our orchestration layer efficiently manages training and inference tasks globally, enhancing flexibility and efficiency in this rapidly expanding sector. We aim to create a global marketplace for AI computing, empowering AGI with the same fluidity as global financial markets.As we continue to grow, we are on the lookout for talented individuals in the fields of AI infrastructure, research, and engineering.Your RoleIn the position of Infrastructure Product Engineer, you will be integral in constructing the foundational framework of Andromeda’s platform. Your challenge will be to simplify complex, real-world infrastructure issues into scalable product solutions that our customers will benefit from.Key ResponsibilitiesArchitect and develop essential platform components, focusing on infrastructure orchestration, provisioning, and lifecycle management solutions.Create robust APIs, services, and control planes that abstract diverse infrastructure types, including VMs, Kubernetes, bare metal, and schedulers.Convert customer usage patterns into actionable product requirements, delivering impactful features and enhancements.Design automation and internal tools to mitigate manual and ad-hoc operational tasks.Improve platform reliability, performance, and observability, focusing on sustainable enhancements rather than quick fixes.Collaborate with other teams to establish clear ownership boundaries between platform features and customer-specific solutions.Write clean, maintainable, and well-documented code with a focus on long-term sustainability.Engage in technical design discussions and contribute to the architectural advancements of our platform.
Join the Revolution at Retell AIRetell AI is pioneering the future of call centers through innovative voice AI, driven by first principles thinking.In just 18 months since our inception, we have empowered thousands of businesses with our AI voice agents, transforming how sales, support, and logistics calls are managed—previously requiring extensive human teams. Supported by prestigious investors such as Y Combinator and Alt Capital, we've rapidly scaled from $5M ARR to an impressive $36M ARR with a compact yet dynamic team of 20.Our ambition for 2026 is to create a revolutionary customer experience platform, where entire contact centers are powered by AI. Moving beyond basic automation, we aim to develop intelligent AI “workers” that serve as frontline agents, QA analysts, and managers, continuously enhancing customer interactions without the need for constant human oversight.As we expand, we are seeking passionate engineers who are eager to solve challenging technical problems, act swiftly, and make a significant impact in one of the fastest-growing voice AI startups. Let’s shape the future together.
About EventualAt Eventual, we are reimagining how AI applications process vast amounts of data, from images to complex datasets. Traditional data platforms are not equipped to handle the petabytes of multimodal data essential for AI, causing teams to struggle with inadequate infrastructure. Founded in 2022, our mission is to simplify data querying, making it as intuitive as working with tables while ensuring scalability for production workloads.Our open-source engine, Daft, is specifically designed for real-world AI systems. It efficiently manages external APIs, GPU clusters, and addresses failures that traditional engines cannot handle. Daft is already integral to operations at leading companies such as Amazon, Mobileye, Together AI, and CloudKitchens.We pride ourselves on our exceptional team, which includes talents from Databricks, AWS, Nvidia, Pinecone, GitHub Copilot, Tesla, and others. We have quadrupled our team size in just a year, supported by Series A and seed funding from notable investors like Felicis, CRV, Microsoft M12, and Y Combinator. We are now eager to expand further. Join us—Eventual is just getting started.We are seeking passionate individuals who are excited to collaborate in a close-knit team environment, working together four days a week in our San Francisco Mission district office.Your Role:As a Software Engineer, you will take charge of developing Eventual's core products and architecture. You’ll deliver features that our customers will use immediately and collaborate with a dedicated team that values open communication and cross-functional teamwork. Our fast-paced environment is focused on solving a variety of complex technical and product challenges. While our experienced team is here to provide guidance and mentorship, we appreciate engineers who can independently identify and tackle challenging technical issues.Key Responsibilities:Design and develop highly reliable and resilient products and features.Collaborate closely with cross-functional product and customer-facing teams to understand requirements and deliver thoughtful solutions.Write high-quality, extensible, and maintainable code.Create and build scalable applications and components.Architect and manage Kubernetes clusters optimized for our needs.
Join our innovative team at Abridge as a Software Engineer specializing in Generative AI. In this role, you will be at the forefront of developing cutting-edge AI applications that transform the way users interact with technology. With a focus on creativity and functionality, you will collaborate with cross-functional teams to design and implement robust software solutions that harness the power of AI.
About AbridgeFounded in 2018, Abridge is dedicated to enhancing understanding in healthcare through our innovative AI-powered platform. We specialize in transforming medical conversations into structured clinical notes in real-time, enabling clinicians to prioritize patient care. Our enterprise-grade technology seamlessly integrates with electronic medical records (EMRs) to ensure accuracy and trust in AI-generated summaries.As pioneers in generative AI for healthcare, we are setting the industry benchmarks for responsible AI deployment across health systems. Our diverse team consists of practicing MDs, AI scientists, PhDs, creatives, technologists, and engineers united in their mission to empower patients and make healthcare more comprehensible. We have offices located in San Francisco's Mission District, New York's SoHo neighborhood, and East Liberty in Pittsburgh.The RoleJoin us as an AI Platform Engineer, where your work will significantly impact the healthcare sector. You will collaborate with a multidisciplinary team of researchers, clinical scientists, and product engineers to design and develop the runtime, orchestration engine, and evaluation platform necessary for agentic orchestration and LLM-driven workflows.What You’ll DoCreate GenAI systems that transform LLMs into composable, reliable tools, utilizing retrieval, tool use, agentic reasoning, and structured outputs.Develop a highly reliable and scalable agent runtime that includes orchestration, shared state and memory, tool-calling interfaces, and scheduling focused on cost, latency, and quality.Build secure, sandboxed environments for agent actions and code, optimizing for cold start, isolation, and observability.Deliver unified interfaces for multiple model sizes and providers; integrate with open tool ecosystems such as MCP-style connectors.Create an evaluation platform for both online and offline assessments, A/B testing, safety checks, and regression gates that enhance agent reliability over time.Collaborate with Research to bring new agent capabilities from prototype to production.What You’ll BringDemonstrated experience in building agent applications with tool-calling, context engineering, and related technologies.Strong problem-solving skills and the ability to work in a fast-paced, collaborative environment.Familiarity with generative AI technologies and their applications in healthcare.
About UsAt Sierra, we are revolutionizing the way businesses engage with their customers by building a cutting-edge platform that harnesses the power of AI. Our headquarters is located in the vibrant city of San Francisco, with additional offices expanding in Atlanta, New York, London, France, Singapore, and Japan.Our company culture is deeply rooted in our core values: Trust, Customer Obsession, Craftsmanship, Intensity, and Family. These principles guide our actions and foster an environment where innovation thrives.Sierra was co-founded by visionary leaders Bret Taylor, who currently serves as the Board Chair of OpenAI and has a rich history with Salesforce and Facebook, and Clay Bavor, who previously led Google Labs and spearheaded initiatives like Google Lens and Project Starline.Your RoleAs a Software Engineer focusing on Infrastructure at Sierra, you will play a pivotal role in designing, constructing, and maintaining the foundational systems that empower our AI platform. Your expertise will ensure that our infrastructure is not only secure and reliable but also scalable, allowing product teams to execute their work with agility and confidence.Guarantee the reliability, scalability, and performance of our platform and LLM inference serving in response to increasing traffic demands.Develop and oversee cloud infrastructure using Terraform to create secure, scalable, and reproducible environments.Establish and manage a self-service infrastructure platform to empower engineering teams in deploying and operating services independently.Take ownership of and improve CI/CD pipelines and release management processes, facilitating rapid and reliable deployments across Sierra’s platform.Design and manage distributed systems utilizing distributed databases, retrieval systems, and machine learning models.Develop and sustain core data serving abstractions along with essential authentication and security features (SSO, RBAC, authentication controls).Effectively navigate and integrate our technology stack with enterprise customer environments in a scalable and maintainable manner.
At Exa, we are on a mission to create a cutting-edge search engine from the ground up, designed to cater to the diverse needs of AI applications. Our team is building a robust infrastructure that enables us to crawl the internet, train advanced embedding models for indexing, and develop high-performance vector databases using Rust. Additionally, we manage a significant $5M H200 GPU cluster that powers tens of thousands of machines.The Infrastructure Team at Exa is responsible for developing the essential tools and infrastructure that support our entire system. We are looking for talented infrastructure engineers to help us scale our capabilities rapidly. Your work could involve orchestrating GPU clusters with Kubernetes, implementing map-reduce batch jobs on Ray, or creating top-tier observability tools that set industry standards.
Full-time|$180K/yr - $225K/yr|On-site|San Francisco, CA; New York, NY
About Scale AIAt Scale AI, we are dedicated to accelerating the advancement of AI technologies. For over eight years, we have been a frontrunner in AI data solutions, driving groundbreaking developments in areas such as generative AI, defense systems, and autonomous vehicles. Following our recent Series F funding, we are committed to expanding access to frontier data, paving the way toward Artificial General Intelligence (AGI), while enhancing our model evaluation capabilities for both corporate and governmental clients.About Data EngineOur Generative AI Data Engine fuels the world's most sophisticated language models and generative systems through exceptional Reinforcement Learning with Human Feedback (RLHF), human data creation, model assessment, safety measures, and alignment strategies. The data we generate is pivotal in shaping how humanity interacts with AI.Our ApproachDuring the interview process, candidates may be considered for positions across various teams within the Generative AI Engineering division, depending on their interests, skills, and organizational needs. Potential teams include Allocation, Growth, Frontier Data, Trust & Safety, Pay, Operator, or Tasking Experience. Together, these teams drive Scale’s AI data operations—from developing impactful datasets that expand LLM capacities to refining contributor engagement and ensuring data integrity through advanced safety and security protocols. They operate at the intersection of machine learning, operations, and analytics to guarantee the delivery of high-quality data at scale.
Full-time|$300K/yr - $300K/yr|On-site|San Francisco
ABOUT BASETENAt Baseten, we empower leading AI companies such as Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer with our state-of-the-art inference solutions. Our unique blend of applied AI research, versatile infrastructure, and intuitive developer tools allows organizations at the forefront of AI innovation to deploy cutting-edge models effectively. Recently, we have experienced significant growth, securing a $300M Series E funding round, backed by renowned investors like BOND, IVP, Spark Capital, Greylock, and Conviction. Become a part of our journey to create the ultimate platform for engineers to launch AI products seamlessly.THE ROLEAs a Senior Software Engineer focused on our Enterprise Platform, you will play a pivotal role in designing and developing robust infrastructure and platform features tailored for our enterprise clientele and cloud partners. Your contributions will encompass enabling self-hosted and single-tenant environments, implementing region-aware request routing, and ensuring enterprise-grade data security and integration capabilities.EXAMPLE INITIATIVESJoin our Infrastructure team and tackle exciting projects such as:Multi-cloud capacity managementOptimizing inference on B200 GPUsImplementing multi-node inference solutionsLeveraging fractional H100 GPUs for efficient model servingRESPONSIBILITIESDesign and implement infrastructure and platform features customized for enterprise clients, covering self-hosted clusters, single-tenant environments, and cross-cloud orchestration.Lead strategic initiatives to enhance secure and scalable private connectivity solutions.Craft and execute solutions that address complex regulatory and compliance requirements for enterprise environments.
Full-time|$300K/yr - $300K/yr|On-site|San Francisco
ABOUT BASETENJoin Baseten, where we drive mission-critical AI inference for leading companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. Our unique blend of applied AI research, robust infrastructure, and intuitive developer tools empowers organizations at the forefront of AI innovation to deploy state-of-the-art models into production. Recently, we secured a $300M Series E funding round, backed by esteemed investors such as BOND, IVP, Spark Capital, Greylock, and Conviction. Be a part of our rapid growth and help shape the platform that engineers trust for launching AI products.THE ROLEAs an Infrastructure Software Engineer at Baseten, you will play a pivotal role in developing and maintaining our ML inference platform that powers AI applications in production. Your contributions will enhance the core infrastructure, enabling developers to deploy, scale, and monitor machine learning models with exceptional performance.EXAMPLE INITIATIVESYou will engage in innovative projects within our Infrastructure team, including:Multi-cloud capacity managementInference on B200 GPUsMulti-node inferenceFractional H100 GPUs for efficient model servingRESPONSIBILITIESDesign and develop infrastructure components for our ML inference platform, primarily using Python and Go.Implement and maintain Kubernetes deployments for optimal model serving.Contribute to the orchestration layer for model deployments.Build and enhance monitoring systems to track model performance metrics effectively.Develop efficient resource management solutions to optimize performance.
Who We AreServal is an innovative AI-driven automation platform redefining operational efficiency for enterprises. Our intelligent agents seamlessly comprehend and execute real-world workflows, replacing outdated manual processes with adaptive, self-learning software. Since our inception in early 2024, we have garnered the trust of industry leaders such as General Motors, Notion, Perplexity, Vercel, Mercor, LangChain, and Verkada, streamlining high-volume operational tasks across their organizations.At the heart of Serval is a cutting-edge agentic AI platform that transforms natural language into actionable workflows. Our agents not only respond to queries but also reason, act across various systems, and continuously enhance their performance. What started as a solution for operational tasks has rapidly expanded into a versatile AI automation layer utilized across IT, HR, Finance, Security, Legal, and Engineering sectors.Our mission is to eradicate repetitive, manual tasks within enterprises, empowering teams through intelligent automation. In the long run, we aim to establish a universal AI operations layer—a system of agents that integrates across business functions, maintaining the momentum of modern companies.We are proud to be backed by renowned investors including Sequoia Capital, Redpoint Ventures, Meritech, First Round, General Catalyst, and Elad Gil, and founded by seasoned product and engineering leaders from Verkada.Role OverviewAs a Senior Software Engineer in Infrastructure at Serval, you will be pivotal in developing and scaling the core systems that empower our AI agents and workflow automation platform. A crucial aspect of this role involves enabling and supporting self-hosted deployments for enterprise clients needing on-premises or private cloud environments. We are looking for engineers with profound expertise in distributed systems, infrastructure-as-code, production operations, and customer-facing support, who aspire to influence the technical architecture of a rapidly evolving platform.What You'll DoDesign, implement, and operate large-scale distributed systems that power Serval's AI agents, workflow orchestration, and data pipelines.Create and maintain Terraform modules to provision and manage cloud infrastructure across AWS, GCP, or Azure environments.Develop and sustain deployment packages, installation scripts, and infrastructure templates, enabling customers to self-host Serval in their own environments.Provide technical support and guidance to enterprise customers during installation and deployment phases.
Full-time|$216.2K/yr - $270.3K/yr|On-site|San Francisco, CA; New York, NY
About Scale AIAt Scale AI, we are dedicated to revolutionizing the development of artificial intelligence applications. For eight years, we have established ourselves as the foremost AI data foundry, driving groundbreaking advancements in areas such as generative AI, defense applications, and autonomous vehicles. With our recent Series F funding round, we are poised to enhance the availability of frontier data, paving the way towards Artificial General Intelligence (AGI). Our commitment extends to refining our model evaluation expertise for enterprise clients and government entities, thereby enriching our capabilities for both public and private assessments.About the Generative AI Data EngineOur Generative AI Data Engine empowers the most sophisticated LLMs and generative models through premier Reinforcement Learning with Human Feedback (RLHF), human data generation, model evaluation, safety, and alignment. The data we generate is pivotal for shaping humanity's interaction with artificial intelligence.Our ApproachDuring the interview process, candidates may be considered for various roles across different teams within the GenAI Engineering organization based on their skills, interests, and business needs. Potential placements include Allocation, Growth, Frontier Data, Trust & Safety, Pay, Operator, or Tasking Experience. These teams are instrumental in scaling Scale AI’s operations - from curating impactful datasets that enhance LLM capabilities to optimizing contributor onboarding and ensuring data integrity through advanced safety and security protocols. They operate at the crossroads of machine learning, operations, and analytics to guarantee that we deliver top-tier data at scale.Key Responsibilities:Design, develop, and maintain robust, scalable systems across the entire stack, including front-end, back-end, and infrastructure.Implement high-impact features using contemporary technologies such as TypeScript, React, Node.js, MongoDB, Elasticsearch, and Temporal.Work collaboratively with internal operators to identify bottlenecks and deliver rapid, effective solutions.Take ownership of core systems crucial to our contributor platform, directly influencing Scale’s GenAI data pipeline and overall business outcomes.Architect and scale infrastructure to manage millions of tasks weekly with high reliability and low latency.Collaborate cross-functionally with ML teams, Forward Deployed Engineers, and Product to maintain data quality and operational excellence.Contribute to fostering a robust engineering culture while setting best practices for peers through mentorship, code reviews, and process improvement.
Join Ivo's Engineering Team!At Ivo, we are pioneers in the tech industry. Our engineers are innovators who have created groundbreaking solutions such as:• An AI agent that seamlessly integrates with MS Word to enhance document editing [2023]• Revolutionizing embedding models with agentic RAG technology [2023]• Advanced LLM-based legal fact extraction capabilities [2024]• A legal assistant designed to search extensive contract databases without compromising accuracy [2024]• Clustering legal documents from the same lineage [2025]• Automatic deviation analysis to uncover hidden risks in vast contract databases [2025]• Merging contracts with their amendments to create a “composite” contract timeline that has moved our clients to tears [2025]Role OverviewAs an Infrastructure Engineer at Ivo, you will lay the groundwork for our platform's future. Your responsibilities will include:• Designing and owning the future of our infrastructure, allowing you the freedom to innovate.• Managing multiple customer deployments, ensuring each receives tailored containers, databases, and VPCs.• Instrumenting our systems to identify performance bottlenecks and errors.• Aggregating metrics and logs into visually appealing dashboards and setting up pager alerts.• Leading infrastructure-related incidents and being on-call as necessary.• Enhancing our CI/CD system to reduce deployment time from ~12 minutes.If you're passionate about LLMs, you'll thrive in our engineering team, where you’ll have the opportunity to:• Develop real-time LLM evaluations to monitor the accuracy of our responses.• Collaborate with talented engineers to push the boundaries of DevOps.
About the RoleJoin our pioneering team at vooma as a Backend & Infrastructure Software Engineer, where you will play a critical role in shaping the technical infrastructure of a transformative company.If you are passionate about creating not only resilient systems but also the foundational architecture of a groundbreaking enterprise from the outset, this position is ideal for you.We are looking for someone who excels at crafting infrastructure that is elegant, dependable, and secure, even under high-demand scenarios. You thrive on the challenge of scaling systems that enable intelligent agents and take pride in establishing reliable foundations that others can rely on.Your Key Responsibilities Include:Design and maintain secure, scalable infrastructure tailored for AI-powered agents in production environments.Deploy and optimize AI-driven services to meet high availability and performance standards.Manage infrastructure as code, alongside cloud environments and CI/CD pipelines.Implement monitoring, observability, and alerting systems to ensure the reliability of our infrastructure.Contribute to infrastructure security and adhere to best practices.You Should Have:Experience in deploying and productionizing machine learning or AI-centric workloads.Proficiency in developing secure, scalable infrastructures on platforms such as AWS, Azure, or GCP.In-depth knowledge of backend systems, networking, and container orchestration technologies (e.g., Kubernetes).Understanding of infrastructure security principles and compliance standards (e.g., SOC2).A proactive and hands-on mindset, with a strong drive to solve challenges from start to finish.
Full-time|$170K/yr - $200K/yr|On-site|San Francisco, CA - US
At Crusoe, our vision is to enhance the availability of energy and intelligence. We are at the forefront of developing solutions that empower individuals to innovate boldly with AI, all while ensuring that we uphold principles of scalability, speed, and sustainability.Join us in driving the AI revolution through sustainable technology at Crusoe. You will play a pivotal role in fostering innovation, making a significant impact, and collaborating with a team that is leading the way in responsible and transformative cloud infrastructure.About the RoleWe are on the lookout for a Senior Software Engineer to take on the role of founding engineer within our new Enterprise Software Engineering team in IT. You will work closely with the Director to develop internal tools, automation processes, and integrations that remove manual tasks and address critical operational challenges across the organization.This role transcends traditional enterprise IT positions. You will leverage AI-assisted development as your primary method, delivering production-quality software within days instead of months, while laying the groundwork for future team members. As you grow in this role, your primary contributions will focus on defining specifications, context, and reusable patterns to guide AI agents, while still engaging in hands-on coding for the most complex challenges.Your ResponsibilitiesDesigning and deploying internal tools, automation, and integrations that provide measurable benefits across Finance, Operations, HR, and other business functions.Utilizing AI-assisted development as your standard workflow, which includes drafting specifications, creating prompts, reviewing AI-generated code, and iterating quickly.Establishing foundational technical patterns such as coding standards, project conventions, reusable components, and context files that facilitate AI-driven development.Creating and managing integrations between enterprise systems through APIs, middleware, and data pipelines.Developing solutions across a versatile environment that combines cloud application platforms with internal GPU compute for AI workloads.Collaborating with business teams during rapid prototyping sessions, then refining prototypes into production-ready systems complete with testing, monitoring, and documentation.Setting up CI/CD pipelines, automated testing, and quality assurance measures for all solutions delivered.Taking ownership of solutions from end to end, covering everything from specification and implementation to deployment and production support.
Feb 12, 2026
Sign in to browse more jobs
Create account — see all 7,936 results
Tailoring 0 resumes…
Tailoring 0 resumes…
We'll move completed jobs to Ready to Apply automatically.