About the job
Senior Infrastructure & Performance Engineer
As a Senior Infrastructure & Performance Engineer, you will take charge of enhancing the performance, reliability, and scalability of Nash's foundational infrastructure. Collaborating closely with the Engineering Leadership and both platform and product engineering teams, you will design and manage low-latency, mission-critical systems that facilitate real-time logistics for some of the world's largest retailers.
This is a key senior role focused on elastic capacity, high availability, cloud-native architectures, Postgres performance, and enterprise-grade CI/CD for multi-region deployments. You will define the technical roadmap, establish best practices, and implement systems that support the essential workflows of major retailers.
Key Responsibilities
Oversee infrastructure performance and reliability for Nash's production environments, ensuring low latency, high throughput, and consistent performance under load.
Design, develop, and enhance AWS infrastructure, utilizing managed services with a focus on ECS/Fargate.
Lead initiatives in Postgres performance engineering, including query optimization, indexing strategies, connection management, replication, cluster design, and failover.
Architect and maintain multi-region, highly available systems with robust resiliency and guaranteed disaster recovery.
Design and refine enterprise-grade CI/CD pipelines that enable safe, repeatable, and rapid deployments across environments and regions.
Establish observability standards (metrics, logs, tracing, SLOs) to proactively identify and resolve performance bottlenecks.
Collaborate with application engineers to inform system design choices that influence scalability, latency, and reliability.
Lead incident response efforts and postmortems, emphasizing root cause analysis, systemic improvements, and long-term resilience.
Set best practices for infrastructure and performance while mentoring engineers throughout the organization.
Qualifications
6+ years of experience in building and managing high-scale production infrastructure for mission-critical systems.
Proficiency with AWS, particularly with ECS/Fargate, and experience with cloud-native architecture.
Strong background in Postgres performance tuning and optimization.
Deep understanding of CI/CD practices and experience in multi-region deployments.
Exceptional analytical and problem-solving skills, with a proactive approach to performance management.

