Experience Level
Senior
Qualifications
The ideal candidate will possess a strong background in software engineering principles, with proven experience in developing scalable applications. Familiarity with modern programming languages such as Java, Python, or JavaScript is essential. A passion for healthcare technology and the ability to work in a collaborative environment will set you apart.
About the job
MidiHealth is seeking a Senior Software Engineer to join the Platform Engineering team. This hybrid role is based in the SF Bay Area and centers on building and enhancing the software that drives MidiHealth’s healthcare technology platform. The work contributes directly to improving patient outcomes through technology.
Key responsibilities
Design and develop software solutions for the core platform
Collaborate with engineering, product, and cross-functional teams to deliver integrated features
Support the reliability and scalability of the platform
Location
This position requires regular on-site work in the SF Bay Area as part of a hybrid schedule.
About MidiHealth
MidiHealth is at the forefront of healthcare technology, dedicated to delivering innovative solutions that enhance patient care. Our team values creativity, collaboration, and a commitment to making a positive impact in the healthcare industry. Join us in our mission to transform healthcare through technology.
Similar jobs
About Our Team
The Scaling team at OpenAI is dedicated to designing, constructing, and managing essential infrastructure that powers groundbreaking research.
Our mission is straightforward: to expedite the advancement of research towards Artificial General Intelligence (AGI). We achieve this by developing foundational systems that researchers depend on, spanning from core infrastructure elements to specialized applications tailored for research. Our systems are designed to scale efficiently with the growing complexity and size of our workloads while ensuring reliability and user-friendliness.
About the Position
We are seeking a Senior Software Engineer to take charge of critical production infrastructure from start to finish.
This role primarily focuses on backend and systems engineering, with a strong emphasis on low-level performance, distributed systems, and the hands-on management of vital services at scale. You will be responsible for transforming ambiguous challenges into actionable plans, delivering pragmatic solutions promptly, and refining them based on real-world feedback and iterations.
This position goes beyond a standard Python backend role; we are specifically on the lookout for candidates with robust systems experience in Rust or C++, particularly in performance-sensitive infrastructure.
This is an in-office role based in San Francisco, CA, following a hybrid model of three days in the office per week. We also provide relocation assistance for new hires.
Your Responsibilities
Manage critical infrastructure throughout its lifecycle, including design, implementation, deployment, operation, and ongoing improvements.
Develop and maintain high-performance backend systems in Rust or C++ that facilitate core research operations.
Design and optimize distributed data and serving systems, considering partitioning, replication, consistency, retries, backpressure, and failure isolation.
Identify and resolve production bottlenecks related to latency, throughput, contention, hot spots, and overload scenarios.
Oversee mission-critical services, including on-call duties, incident management, postmortems, observability, deployment safety, and zero-downtime migrations.
Enhance the reliability of services running on Kubernetes, focusing on resource tuning and failure management.
Collaborate closely with engineers and researchers to deliver fast, dependable, and effective systems.
Elevate standards through strong technical judgment, ownership, and commitment to quality.
You Will Excel in This Role If You Have:
A proven track record of owning and delivering operationally critical systems end to end in ambiguous settings.
Experience with systems programming in Rust or C++.
Strong analytical skills and a problem-solving mindset.
Excellent communication and collaboration skills.
Full-time|$190K/yr - $280K/yr|Hybrid|San Francisco, California
About Sentry
At Sentry, we are determined to eradicate bad software. Our mission is to empower developers to create superior software more efficiently, allowing us all to enjoy technology again.
With over $217 million in funding and a community of more than 100,000 organizations, including industry leaders like Disney, Microsoft, and Atlassian, we are crafting performance and error monitoring tools that enable teams to focus on building innovative products rather than squashing bugs.
Embracing a hybrid work model, Sentry encourages collaboration by designating Mondays, Tuesdays, and Thursdays as in-office days across our global hubs. If you're passionate about creating tools that enhance the digital experience, we invite you to join us in building the future of software monitoring.
About the Role
As a member of our Event Analytics Platform (EAP) team, you will be at the forefront of developing the infrastructure that supports Sentry’s capabilities to handle time-series data and search functionalities across billions of events with remarkable speed. This initiative includes our innovative Snuba system, which acts as the primary storage and query service fueled by ClickHouse.
In this role, you will expand the data visibility at Sentry by enhancing our search infrastructure, developing new capabilities atop our advanced storage solutions, and boosting the performance and integrity of our core data services. You will play a key role in shaping the technical vision of our Infrastructure team and collaborate closely with Product and Engineering teams to transform that vision into reality.
If you are eager to tackle the complexities of scaling event data into the petabyte range, this opportunity is ideal for you.
Key Responsibilities:
Enhance EAP's capabilities to deliver data with exceptional speed and reliability.
Design and automate systems and services to ensure reliable scaling amidst rising demand.
Evaluate architectural decisions that harmonize product goals with engineering limitations.
Promote and uphold high code quality standards through regular code reviews and contributions to design discussions.
Lead the development of innovative data solutions to enhance user insights.
Full-time|$196K/yr - $220.5K/yr|On-site|San Francisco Bay Area
At Discord, we connect over 200 million users monthly for diverse experiences, with gaming being the predominant activity. Our platform supports more than 90% of our users in enjoying games, collectively logging 1.5 billion hours each month across various titles. As we shape the future of gaming, our mission is to enhance interactions before, during, and after gaming sessions.
The Platform Infrastructure teams are pivotal in constructing and upholding the essential systems that energize Discord's core functionalities. We manage systems that process hundreds of thousands of requests per second and handle tens of billions of transactions daily, enabling seamless connections for millions of users. By developing foundational platform components, we empower internal developers to deploy new features swiftly and securely, ensuring Discord remains reliable, efficient, and scalable.
As a Senior Software Engineer on our team, you will play a crucial role in continuously refining our codebase, processes, and infrastructure, directly impacting user interactions on Discord!
The Scaling team at OpenAI builds and maintains the core infrastructure that supports research efforts. This group focuses on enabling rapid progress toward Artificial General Intelligence by providing the systems and tools researchers rely on every day. Their work covers everything from foundational infrastructure to specialized applications, all designed to handle increasing complexity and scale without sacrificing reliability or ease of use.
Role overview
OpenAI is seeking a Site Reliability Engineer to manage and improve the infrastructure behind its analytics platform. This position centers on supporting production systems that handle data-intensive, low-latency workloads. Key technologies include large-scale ClickHouse clusters, high-throughput Kafka pipelines, and stable integrations with Snowflake. The engineer in this role will turn ambiguous operational challenges into concrete solutions, deliver improvements quickly, and iterate based on real-world feedback. Success in this role means independently setting and raising operational standards, working closely with production systems, and collaborating across teams to ensure reliability at scale.
Key responsibilities
Manage the full lifecycle of infrastructure: provisioning, upgrades, scaling, and decommissioning using Infrastructure as Code (IaC).
Operate and scale ClickHouse clusters, including sharding, replication, capacity planning, tuning, and maintenance.
Run Kafka as the primary data ingestion layer, improving throughput, managing lag and backpressure, and ensuring robust failure recovery.
Improve latency and reliability for workloads involving heavy data serving and querying.
Develop and maintain monitoring and alerting systems, including SLIs/SLOs, dashboards, alert policies, and actionable runbooks.
Create and refine incident response protocols, on-call procedures, and postmortem practices.
Oversee backup, restore, and disaster recovery strategies, including regular drills.
Plan and execute safe rollouts across development, staging, and production environments, using canary deployments and rollback plans.
Work daily with software engineers to embed reliability into system design, implementation, and release cycles.
Set and promote standards for operational readiness and runbooks, encouraging adoption across teams.
Enhance CI/CD pipelines and improve the developer experience for greater speed and safety.
Full-time|$160K/yr - $225K/yr|Hybrid|San Francisco, CA (Hybrid)
About Fable Security
In today’s digital landscape, AI-driven threats and human errors represent the most significant risks to enterprise security. Cybercriminals exploit human behavior, contributing to 70% of security breaches. At Fable, we empower individuals to transform from potential targets to active defenders with innovative tools.
Fable is at the forefront of human risk management, offering a platform that effectively influences employee behavior. Our user-friendly, scalable solution analyzes complex employee data, identifies high-risk behaviors, and delivers timely interventions directly to users in their work environment.
Supported by notable investors like Redpoint Ventures and Greylock Partners, and founded by former members of the Abnormal Security team, Fable is tackling one of cybersecurity's greatest challenges in a rapidly expanding market. Our team comprises alumni from esteemed organizations such as Meta, Twitter, and Flexport, as well as top universities including Waterloo, Columbia, and Stanford. This is an exceptional opportunity for you to join us at a time of rapid growth and help shape the future of security.
Why Join Us
Build and scale the foundational data infrastructure that drives a groundbreaking product.
Collaborate closely with engineering, data science, and product teams to operationalize data at scale.
Become part of a small, high-caliber team where your contributions will have a significant impact.
As part of an early-stage company, every engineer plays a crucial role in shaping the evolution of our products and the company's approach to data management.
Your Role
As a Platform and Infrastructure Engineer, you will be instrumental in developing and scaling the core systems that underpin Fable’s product and data operations.
Your responsibilities will span backend systems including real-time services and data pipelines. You will ensure reliability, scalability, and optimal performance across all layers. This highly collaborative role involves working closely with data and ML teams, contributing to systems that effectively manage data ingestion, processing, and delivery.
This role demands cross-functional collaboration with engineering, data, and product teams to create robust, production-grade systems that grow alongside the company.
Responsibilities
Design, develop, and maintain scalable backend and infrastructure systems.
Collaborate with cross-functional teams to deliver high-quality software solutions.
Ensure system reliability, performance, and security through rigorous testing and monitoring.
Senior Software Engineer, Infrastructure & Platform
Role Overview
In the role of Senior Software Engineer, Infrastructure & Platform at AfterQuery, you will take on the exciting challenge of designing and constructing the essential infrastructure that drives our innovative data generation, evaluation, and agentic systems.
Your responsibilities will include developing shared platforms that empower our engineering and research teams to execute large-scale human-in-the-loop workflows, evaluation harnesses, and automated data pipelines essential for training cutting-edge AI models.
This position demands a high level of technical expertise and offers extensive ownership. You will be responsible for architecting and building the foundational infrastructure relied upon by numerous engineers, ensuring that systems are scalable, reliable, and capable of handling high-throughput workloads.
Collaboration with the founding team will be key as you define system architecture, establish best engineering practices, and create the infrastructure that supports the evolution of AI development.
Full-time|$162K/yr - $216K/yr|Hybrid|San Francisco, California, United States
Who We Are
Baton is Ryder’s innovative product development division dedicated to leveraging cutting-edge technologies to transform the transportation and logistics landscape. Managing over $10 billion in freight, our technology has a significant impact across the U.S. economy.
We are committed to creating and delivering software that not only meets but exceeds the needs of Ryder and its 50,000+ clients, which includes some of the most recognized brands globally. Our projects range from user-centric applications to the robust data platform that will drive the future of Ryder’s innovations.
Baton’s mission: To enable a supply chain that operates on autopilot.
Since Ryder’s acquisition of Baton in 2022, we have been operating with the agility of a startup while benefiting from the extensive reach of a Fortune 500 company. If you're passionate about tackling intricate challenges and making a real impact in the backbone of the American economy, you’ll thrive with us.
Role: Software Engineer - Infrastructure
Department: Data Platform
Location: Hayes Valley, San Francisco, CA
Full-time|$300K/yr - $300K/yr|On-site|San Francisco
ABOUT BASETEN
At Baseten, we empower leading AI companies such as Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer with our state-of-the-art inference solutions. Our unique blend of applied AI research, versatile infrastructure, and intuitive developer tools allows organizations at the forefront of AI innovation to deploy cutting-edge models effectively. Recently, we have experienced significant growth, securing a $300M Series E funding round, backed by renowned investors like BOND, IVP, Spark Capital, Greylock, and Conviction. Become a part of our journey to create the ultimate platform for engineers to launch AI products seamlessly.
THE ROLE
As a Senior Software Engineer focused on our Enterprise Platform, you will play a pivotal role in designing and developing robust infrastructure and platform features tailored for our enterprise clientele and cloud partners. Your contributions will encompass enabling self-hosted and single-tenant environments, implementing region-aware request routing, and ensuring enterprise-grade data security and integration capabilities.
EXAMPLE INITIATIVES
Join our Infrastructure team and tackle exciting projects such as:
Multi-cloud capacity management
Optimizing inference on B200 GPUs
Implementing multi-node inference solutions
Leveraging fractional H100 GPUs for efficient model serving
RESPONSIBILITIES
Design and implement infrastructure and platform features customized for enterprise clients, covering self-hosted clusters, single-tenant environments, and cross-cloud orchestration.
Lead strategic initiatives to enhance secure and scalable private connectivity solutions.
Craft and execute solutions that address complex regulatory and compliance requirements for enterprise environments.
Who We Are
Serval is an innovative AI-driven automation platform redefining operational efficiency for enterprises. Our intelligent agents seamlessly comprehend and execute real-world workflows, replacing outdated manual processes with adaptive, self-learning software. Since our inception in early 2024, we have garnered the trust of industry leaders such as General Motors, Notion, Perplexity, Vercel, Mercor, LangChain, and Verkada, streamlining high-volume operational tasks across their organizations.
At the heart of Serval is a cutting-edge agentic AI platform that transforms natural language into actionable workflows. Our agents not only respond to queries but also reason, act across various systems, and continuously enhance their performance. What started as a solution for operational tasks has rapidly expanded into a versatile AI automation layer utilized across IT, HR, Finance, Security, Legal, and Engineering sectors.
Our mission is to eradicate repetitive, manual tasks within enterprises, empowering teams through intelligent automation. In the long run, we aim to establish a universal AI operations layer: a system of agents that integrates across business functions, maintaining the momentum of modern companies.
We are proud to be backed by renowned investors including Sequoia Capital, Redpoint Ventures, Meritech, First Round, General Catalyst, and Elad Gil, and founded by seasoned product and engineering leaders from Verkada.
Role Overview
As a Senior Software Engineer in Infrastructure at Serval, you will be pivotal in developing and scaling the core systems that empower our AI agents and workflow automation platform. A crucial aspect of this role involves enabling and supporting self-hosted deployments for enterprise clients needing on-premises or private cloud environments. We are looking for engineers with profound expertise in distributed systems, infrastructure-as-code, production operations, and customer-facing support, who aspire to influence the technical architecture of a rapidly evolving platform.
What You'll Do
Design, implement, and operate large-scale distributed systems that power Serval's AI agents, workflow orchestration, and data pipelines.
Create and maintain Terraform modules to provision and manage cloud infrastructure across AWS, GCP, or Azure environments.
Develop and sustain deployment packages, installation scripts, and infrastructure templates, enabling customers to self-host Serval in their own environments.
Provide technical support and guidance to enterprise customers during installation and deployment phases.
Why Join Harvey?
At Harvey, we are revolutionizing the landscape of legal and professional services, not through minor adjustments but by rethinking the process from the ground up. By harnessing cutting-edge agentic AI, a robust enterprise-level platform, and profound sector expertise, we are redefining how essential knowledge work is performed for years to come.
This is a unique opportunity to contribute to the foundation of a transformative company at a pivotal moment. With over 1000 clients spanning more than 58 countries, proven product-market alignment, and exceptional investor backing, we are growing rapidly and establishing a new category in real-time. The challenges are ambitious, the standards are high, and the potential for personal, professional, and financial growth is unparalleled.
Our team is intelligent, driven, and passionately dedicated to our mission. We operate with urgency, take true ownership of our challenges, and deliver results from initial concepts to long-term goals. We maintain close relationships with our customers, from executives to engineers, collaborating to address real-world problems with urgency and care. If you excel in uncertain environments, strive for excellence, and want to influence the future of work alongside like-minded individuals, we invite you to join us in our mission.
At Harvey, we are actively shaping the future of professional services, and we’re just getting started.
Role Overview
As a Senior Software Engineer on the Core Infrastructure team at Harvey, you will be pivotal in architecting and constructing new infrastructure systems while enhancing and fortifying our existing frameworks. Our infrastructure underpins every user interaction with Harvey, managing billions of prompt tokens and millions of daily requests across our global legal AI platform.
You will thrive in a balanced environment focused on innovation (building new systems) and operational excellence, ensuring Harvey remains resilient and efficient as we scale our products, regions, clientele, and usage. Your contributions will directly influence the reliability, scalability, and security of our platform, which serves the world’s leading law firms and professional service providers.
This position is located in San Francisco, CA. We utilize an in-person work model and provide relocation assistance for new hires.
What You Will Do
Design and develop scalable, fault-tolerant infrastructure systems that power Harvey's AI platform across multiple cloud environments.
Manage and enhance our multi-cloud infrastructure (Azure, GCP), focusing on Kubernetes orchestration, networking, and container management.
Lead key technical initiatives concerning observability, incident response, and performance optimization.
Full-time|$179.4K/yr - $224.3K/yr|On-site|San Francisco, CA; New York, NY
In a world where software is rapidly evolving, artificial intelligence (AI) is at the forefront, transforming how we interact with technology. At Scale AI, we recognize the immense potential of AI to enhance human capabilities, offering personalized support across various aspects of life, from coaching and tutoring to shopping and travel guidance. As enterprises, startups, and governments rush to integrate large language models (LLMs) into their operations, it is crucial to ensure these systems are safe, aligned, and effective. This involves rigorous human evaluation and reinforcement learning through human feedback (RLHF) during all stages of model development.
Our innovative products, including the Generative AI Data Engine, SGP, and Donovan, are designed to empower the most advanced LLMs and generative models globally. By leveraging world-class RLHF, human data generation, model evaluation, safety, and alignment, we are shaping the future of human-AI interaction.
As a member of our Platform Engineering team, you will play a pivotal role in designing and developing the foundational platforms that support Scale's operations. Your responsibilities will include architecting our core cloud infrastructure, enhancing our data lifecycle, and transforming the software development process for engineers at Scale. You will gain invaluable insights into the AI landscape as it develops within diverse sectors.
Aura is seeking a talented and experienced Senior Software Engineer, Platform to join our innovative team. In this role, you will be responsible for designing and implementing scalable software solutions that enhance our platform capabilities. You will work closely with cross-functional teams to ensure the delivery of high-quality software that meets the needs of our users.
Join our innovative team at Astranis as a Senior Software Engineer specializing in Infrastructure. In this role, you will be responsible for designing, implementing, and maintaining robust infrastructure solutions that support our cutting-edge satellite technology. Your expertise will play a crucial role in enhancing the reliability and scalability of our systems.
Full-time|$190K/yr - $280K/yr|Hybrid|San Francisco, California
About Sentry
Sentry is dedicated to eliminating poor software experiences. Our mission is to empower developers to create high-quality software swiftly, allowing everyone to enjoy technology to its fullest.
With over $217 million raised in funding and a community of over 100,000 organizations, including giants like Disney, Microsoft, and Atlassian, we are developing state-of-the-art performance and error monitoring tools. Our solutions help our partners minimize time spent on bug fixes and maximize product development.
In our commitment to collaboration, Sentry follows a hybrid work model across our global offices. We have designated Mondays, Tuesdays, and Thursdays as in-office days to foster effective teamwork. If you are passionate about building tools that enhance the digital experience, join us in creating the next generation of software monitoring solutions.
About the Role
At Sentry.io, we offer vital services for diagnosing application health issues. Our tools are crucial for organizations aiming to respond adeptly in dynamic markets. We ensure a seamless and enjoyable experience in the development and deployment of these tools through a robust continuous integration environment and an insightful deployment pipeline.
As part of the Infrastructure Engineering team, your contributions will be instrumental in supporting Sentry's growth and enabling engineering teams to operate with agility and confidence.
Your responsibilities will include designing, developing, and maintaining internal software and platform capabilities that alleviate the cognitive load associated with infrastructure and developer tooling. You will create dependable, reusable abstractions that facilitate rapid shipping of features while incorporating durability, security, and operational excellence into service development and management.
This role demands strong engineering judgment: selecting reliable technologies, planning for scalability from the outset, and crafting solutions that serve multiple teams. Your focus will be on practical systems that enhance reliability and ownership across the organization, driving adoption through comprehensive documentation, well-designed APIs, and seamless developer experiences that integrate into daily workflows.
Ultimately, you will empower engineering teams to flourish within a culture of ownership, enabling them to deploy, manage, and evolve services confidently while minimizing operational burdens.
Key Responsibilities
Design systems that scale with company growth, ensuring a balance of reliability, performance, and cost-efficiency.
Develop platform services that enhance internal operations and developer productivity.
Join our innovative team at Unify as a Senior Software Engineer, Platform, where you will play a crucial role in enhancing our platform capabilities. You will collaborate with cross-functional teams to design, develop, and implement high-quality software solutions that meet our clients' needs.
Full-time|$180K/yr - $210K/yr|On-site|San Francisco, CA
About Sigma Computing
Sigma Computing builds AI-powered apps and analytics tools that connect directly to cloud data warehouses. Teams use Sigma to create applications, automate workflows, and analyze live data through a spreadsheet interface, SQL and Python editors, visual builders, and integrated AI features. The platform supports everything from interactive analyses to reports and embedded data experiences.
Role Overview: Senior Product Manager - Platform Performance & Infrastructure
Sigma is growing to serve larger enterprises with demanding, complex workloads. The Senior Product Manager for Platform Performance & Infrastructure will guide the development of core backend systems that keep Sigma responsive and reliable as usage scales.
This role focuses on driving improvements in:
Workbook performance
Query lifecycle management
Compute and caching strategies
Metadata services
Compiler components
New warehouse connectors
These systems are essential for Sigma’s ability to deliver consistent, high-quality performance to enterprise customers.
What You Will Do
Define and prioritize product enhancements for backend platform performance and scalability
Work closely with platform engineering and cross-functional teams to address technical challenges
Translate performance and scalability needs into clear product requirements and measurable objectives
Ensure Sigma’s infrastructure can support enterprise clients with reliability and speed
Who We’re Looking For
Experienced Senior Product Manager with strong technical background
Comfortable working hands-on with backend systems and infrastructure
Skilled at collaborating with engineering and cross-functional partners
Focused on delivering measurable improvements for customers
Location & On-Site Requirement
This position is based in San Francisco, CA. It requires working on-site at the Sigma office at least four days per week.
Full-time|$185K/yr - $400K/yr|On-site|San Francisco, California, United States
Join Our Team as an Infrastructure & Platform Engineer
We are seeking a talented Infrastructure & Platform Engineer to join our dynamic team at mlabs in San Francisco. As a rapidly growing technology company, we are at the cutting edge of the crypto derivatives market, an industry that generates tens of billions in annual revenue. Our exchange is one of the fastest-growing platforms for crypto derivatives, and we are committed to enhancing our offerings to meet the evolving needs of our users.
Your mission will be to develop the next critical feature: Multi-Asset Margin, which will streamline how users post collateral directly on-chain, thus improving trading efficiency. You will work alongside our Infrastructure & Platform team, focusing on designing and managing our high-performance systems that deliver exceptional speed and reliability.
Key Responsibilities:
Design and implement robust scripts and services that ensure optimal performance in real-time environments.
Manage and deploy computing resources and containers for tailored services and integrations.
Automate scaling, load balancing, and congestion control for both compute and database layers.
Establish and maintain CI/CD pipelines for streamlined deployments and continuous delivery.
Monitor and optimize system performance across multiple metrics to enhance throughput and resilience.
Develop and maintain indexing and explorer services for fast, real-time data access.
Provision and optimize diverse database systems, including time-series, relational, key-value, and in-memory databases.
Compensation: Competitive base salary + substantial equity
Benefits: Health & dental insurance, gym reimbursement, daily team lunches, 401(K)
About Julius
At Julius, we're pioneering advancements in applied AI by developing cutting-edge coding agents. Our platform executes approximately 1 million lines of code every 36 hours, serving over 1 million users and generating 3 million+ visualizations. We manage all code in isolated remote containers. As a revenue-generating entity, we are backed by AI Grant and founders with remarkable backgrounds from companies like Vercel, Notion, Perplexity, Palantir, Replit, Zapier, Intercom, and Dropbox.
The Role
Join us in building and scaling the robust code-execution platform that powers Julius, across both cloud and on-prem environments. We orchestrate over 500,000 containers/month and the demand is growing rapidly. You will take ownership of reliability, performance, and security within our multi-tenant compute environment.
Your Responsibilities
Design and manage a secure, multi-tenant container infrastructure that ensures quick startup and intelligent autoscaling.
Implement on-prem/private cloud deployments using Helm and Terraform, integrating SSO, network controls, and audit logging.
Enhance observability (metrics, traces, logs) with well-defined SLOs and lead incident response initiatives.
Optimize images, scheduling, networking, and costs, while developing fair-use and rate-limiting controls.
Your Qualifications
Strong experience with production Kubernetes and container internals (Docker/containerd); solid understanding of networking principles.
Familiarity with cloud environments (AWS/GCP/Azure) and Infrastructure as Code (Terraform/Helm).
Proficiency in monitoring and logging tools (Prometheus, Grafana, OpenTelemetry, ELK/Vector).
Understanding of security best practices for containerized, multi-tenant systems.
Preferred Qualifications
Experience with gVisor, Kata, Firecracker; Cilium/eBPF; GPU scheduling; serverless autoscaling (KEDA/Knative/Karpenter).
Proven experience delivering on-prem or air-gapped enterprise software solutions.
A passion for AI, with experience building side projects involving LLMs.
Why Join Julius?
Be part of a small, senior team where your contributions will have a massive impact. Tackle challenging infrastructure problems at a meaningful scale.
Join our dynamic team at Parafin as a Senior Software Engineer specializing in Infrastructure. In this pivotal role, you will design, develop, and maintain robust infrastructure solutions that support our scalable applications. Your expertise will help us enhance system performance, reliability, and security.
We are looking for innovative thinkers who thrive in a collaborative environment. You will work closely with cross-functional teams to implement cutting-edge technologies that drive our product forward.
Apr 3, 2026