About the job
Join Luminance, a leader in Legal-Grade™ AI for enterprises, and be part of a groundbreaking team that's revolutionizing the legal profession worldwide. With backing from prestigious venture capitalists and accolades like being featured in Forbes AI 50 as one of the 'Most Promising Private AI Companies' and Inc. 5000's 'Fastest Growing Companies in America', we're on the cutting edge of technology.
We are on the lookout for an innovative and skilled Infrastructure Engineer to join our Infrastructure Team in a role focused on DevOps and platform engineering. The successful candidate will possess a robust background in cloud-native infrastructure development, emphasizing automation, scalability, and security, as well as a comprehensive grasp of Infrastructure-as-Code (IaC). Your role will be pivotal in designing and evolving Luminance’s AWS platform, modernizing deployment patterns, automating environment creation, and ensuring our systems are reliable, repeatable, and resilient.
Key Responsibilities
- Design, build, and maintain scalable cloud infrastructure utilizing AWS services including EC2, EKS, RDS/Aurora, ElastiCache, OpenSearch, and CloudFront.
- Lead the development and adoption of Kubernetes on EKS for managing both production and internal workloads.
- Architect and implement Infrastructure-as-Code (IaC) pipelines, integrating Terraform or similar tools into CI/CD workflows for environment provisioning, validation, and automated testing.
- Implement zero-downtime deployment strategies (blue/green, rolling, canary) and automate rollback and recovery processes.
- Promote continuous improvement in infrastructure, focusing on eliminating single points of failure and enhancing autoscaling, high availability, and managed service adoption across the platform.
- Collaborate with SRE, Security, and Engineering teams to enhance observability, monitoring, and alerting using tools such as Prometheus, Grafana, and CloudWatch.
- Work closely with the Security team to incorporate best practices for IAM, secrets management, WAF, and overall posture management.
- Optimize performance and cloud expenditure through automation and cost visibility dashboards.
- Participate in on-call rotations, conduct post-incident reviews, and contribute to ongoing operational reliability improvements.
