About the job
About Scale AI
At Scale AI, we are dedicated to revolutionizing the development of artificial intelligence applications. For eight years, we have established ourselves as the foremost AI data foundry, driving groundbreaking advancements in areas such as generative AI, defense applications, and autonomous vehicles. With our recent Series F funding round, we are poised to enhance the availability of frontier data, paving the way towards Artificial General Intelligence (AGI). Our commitment extends to refining our model evaluation expertise for enterprise clients and government entities, thereby enriching our capabilities for both public and private assessments.
About the Generative AI Data Engine
Our Generative AI Data Engine empowers the most sophisticated LLMs and generative models through premier Reinforcement Learning with Human Feedback (RLHF), human data generation, model evaluation, safety, and alignment. The data we generate is pivotal for shaping humanity's interaction with artificial intelligence.
Our Approach
During the interview process, candidates may be considered for various roles across different teams within the GenAI Engineering organization based on their skills, interests, and business needs. Potential placements include Allocation, Growth, Frontier Data, Trust & Safety, Pay, Operator, or Tasking Experience. These teams are instrumental in scaling Scale AI’s operations - from curating impactful datasets that enhance LLM capabilities to optimizing contributor onboarding and ensuring data integrity through advanced safety and security protocols. They operate at the crossroads of machine learning, operations, and analytics to guarantee that we deliver top-tier data at scale.
Key Responsibilities:
- Design, develop, and maintain robust, scalable systems across the entire stack, including front-end, back-end, and infrastructure.
- Implement high-impact features using contemporary technologies such as TypeScript, React, Node.js, MongoDB, Elasticsearch, and Temporal.
- Work collaboratively with internal operators to identify bottlenecks and deliver rapid, effective solutions.
- Take ownership of core systems crucial to our contributor platform, directly influencing Scale’s GenAI data pipeline and overall business outcomes.
- Architect and scale infrastructure to manage millions of tasks weekly with high reliability and low latency.
- Collaborate cross-functionally with ML teams, Forward Deployed Engineers, and Product to maintain data quality and operational excellence.
- Contribute to fostering a robust engineering culture while setting best practices for peers through mentorship, code reviews, and process improvement.

