About the job
g2i is hiring a Senior AI Interaction Evaluator for a contract role based in Miami. This position involves assessing the performance of advanced AI coding agents such as OpenAI Codex and Claude Code. The contract begins as soon as possible and continues through early May, requiring a weekly commitment of 10 to 20 hours. Compensation is set at $100–$200 per hour.
Watch this Loom video for more details!
What you will do
- Review AI-generated coding interactions and assess their overall quality and usefulness.
- Judge whether AI responses are relevant, practical, and accurate at a high level.
- Evaluate if responses reflect the approach of skilled engineers when solving problems.
- Examine the clarity and depth of the AI’s explanations and reasoning, beyond just the code output.
- Distinguish subtle differences in quality between different AI-generated outputs.
- Provide detailed, constructive feedback on the effectiveness of AI responses.
- Contribute to shaping criteria for excellence in AI-driven coding interactions.
Role focus
This role emphasizes evaluating engineering judgment and the ability to recognize thoughtful solutions. The focus is not on checking syntax or surface-level correctness, and there is no requirement to write production code.
