companyDigitalOcean logo

Senior Engineer I - GPU Infrastructure

DigitalOceanHyderabad
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Senior

Qualifications

Key Responsibilities:Collaborate with key stakeholders to package and continuously test our GPU images across multiple platform services, coordinating with hardware engineering on firmware versions while maintaining an active compatibility matrix for Nvidia and AMD GPU drivers (H100/H200/MI300x/MI325x). Automation and Scripting: Create and maintain tools and scripts (e.g., HashiCorp Packer, Ansible, Terraform, Python, Shell scripting) for automating the creation, testing, and deployment of machine images. Security and Compliance: Ensure machine images meet security best practices including hardening, patch management, and compliance with organizational and industry standards (e.g., CIS benchmarks, GDPR, HIPAA). Optimization: Enhance machine images for performance, size, and boot time to improve scalability and minimize operational costs. CI/CD Integration: Incorporate machine image creation into CI/CD pipelines using tools like Jenkins, GitHub Actions, or GitLab CI for automated builds and deployments. Versioning and Documentation: Maintain comprehensive documentation for the image creation process.

About the job

Join DigitalOcean and embark on a remarkable journey where you can contribute to the development of the simplest scalable cloud. Our vibrant community of talented professionals is dedicated to making a significant impact. If you possess a growth mindset, are driven by bold ideas, and thrive in a dynamic environment, you’ll find a home here. We believe in collective success—learning, enjoying our work, and making a substantial difference for innovators and visionaries worldwide.

We are looking for an experienced Software Engineer specializing in GPU Infrastructure, Ubuntu systems, and automation. In this pivotal role, you will design, build, maintain, and optimize machine images across diverse cloud platforms. Your contributions will ensure the delivery of standardized, secure, and high-performance images, especially for GPU Bare Metal and GPU Droplet (VM) images.

Your focus will be on automating the image creation lifecycle, ensuring image integrity, and enhancing performance and security. The ideal candidate will have extensive experience in GPU infrastructure, automation, and testing systems, along with a passion for creating secure, scalable, and efficient machine images that support modern cloud-native applications and services.

About DigitalOcean

DigitalOcean is a leading cloud infrastructure provider dedicated to simplifying cloud computing for developers. With a focus on community and innovation, DigitalOcean empowers individuals and businesses to build, manage, and scale applications with ease.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.