companyAchira logo

Software Engineer - Distributed Systems

AchiraSan Francisco Office
On-site Full-time

Clicking Apply Now takes you to AutoApply where you can tailor your resume and apply.


Unlock Your Potential

Generate Job-Optimized Resume

One Click And Our AI Optimizes Your Resume to Match The Job Description.

Is Your Resume Optimized For This Role?

Find Out If You're Highlighting The Right Skills And Fix What's Missing

Experience Level

Experience

Qualifications

What You’ll DoArchitect & Build: Design, implement, and optimize distributed computing infrastructure for ML data processing, training, and fine-tuning. Optimize & Monitor: Enhance cluster observability, scheduling, and resource utilization (CPU/GPU/TPU). Compute Efficiency: Investigate and implement cost-effective computing solutions (spot instances, auto-scaling, multi-cloud strategies). Tooling: Create tools for monitoring, debugging, and performance tuning of large-scale ML workloads. Collaboration: Work closely with cross-functional teams to integrate machine learning into our systems.

About the job

Why Join Achira?

  • Become part of an exceptional team comprised of scientists, ML researchers, and engineers dedicated to transforming the landscape of drug discovery.

  • Engage with cutting-edge machine learning infrastructure at an unprecedented scale, leveraging extensive computing resources, vast datasets, and ambitious goals.

  • Take ownership of significant projects from conception through to architecture and deployment on large-scale infrastructures.

  • Thrive in a culture that values thoroughness, speed, and a proactive, builder-oriented mindset.

About the Role

At Achira, we are developing state-of-the-art foundation models that address the most complex challenges in simulation for drug discovery and beyond. Our atomistic foundation simulation models (FSMs) serve as comprehensive representations of the physical microcosm, encompassing machine learning interaction potentials (MLIPs), neural network potentials (NNPs), and various generative model classes.

We are looking for a Software Engineer who is enthusiastic about distributed computing and its applications in machine learning. You will play a pivotal role in designing and constructing the infrastructure for our ML data generation pipelines, model training, and fine-tuning workflows across large-scale distributed systems.

Your expertise will be crucial in ensuring our compute clusters are efficient, observable, cost-effective, and dependable, enabling us to advance the frontiers of ML development. If you are passionate about distributed systems, performance optimization, and cloud cost efficiency, we encourage you to apply.

You will be empowered to conceptualize and manage complex workloads across multiple vendors worldwide. Achira's mission revolves around computation, and providing seamless access to our uniquely tailored workloads at the lowest possible cost is critical to our success.

About Achira

Achira is at the forefront of revolutionizing drug discovery through innovative technology and advanced machine learning techniques. Our dynamic team combines scientific expertise with engineering prowess to create solutions that can change lives. Join us in our mission to harness the power of data and computation to solve some of the world's most pressing challenges.

Similar jobs

Tailoring 0 resumes

We'll move completed jobs to Ready to Apply automatically.