About the job
Please submit your resume in English and include your English proficiency level.
Mindrift, powered by Toloka AI, offers project-based freelance roles for specialists focused on testing, evaluating, and improving AI systems for major technology companies. These positions are not permanent but are structured around specific projects.
Role overview
This Senior Python Systems Developer role emphasizes functional testing of large codebases. The position requires significant experience with Python, Linux, and Docker, as well as the ability to work with multiple programming languages such as C, Rust, and Go. Using large language models (LLMs) like Roo Code or Claude Code to streamline development is a key part of the work.
What you will do
- Design and implement functional black box tests for codebases in various languages.
- Set up and maintain Docker environments to ensure reproducible builds and consistent testing across platforms.
- Monitor code coverage and configure automated scoring metrics to align with industry standards.
- Use LLMs (such as Roo Code and Claude) to automate repetitive tasks, enhance code quality, and accelerate development cycles.
Requirements
- Minimum 5 years of experience as a Software Engineer, primarily working with Python.
- Strong background with pytest, including fixtures, session-scoped tests, and timeouts, plus experience designing black-box functional tests for command-line tools.
- Advanced knowledge of Docker, including reproducible Dockerfiles, managing user contexts, and secure workspace practices.
- Proficient in Linux and Bash scripting, with the ability to debug within containers.
- Familiarity with modern Python tooling such as uv, pyproject.toml, and packaging.
- Ability to read and understand code in C, C++, Rust, or Go, with support from LLMs.
- Hands-on experience using LLMs (Claude Code, Roo Code, Cursor) to speed up development and generate test cases.
- English proficiency at B2 level or higher.
Preferred qualifications
- Background with agent evaluation platforms and MCP CLI.
Tools and technologies
Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Benefits and work structure
- Freelance, project-based assignments through Mindrift.
- Fully remote work with flexible hours, typically 20-30 hours per week.
- Compensation depends on project scope and expertise. For this project, AI trainers can earn up to $80 per hour.

