About the job
Please submit your CV in English and indicate your level of English proficiency.
Mindrift provides a platform that connects skilled professionals with project-based AI opportunities for some of the most innovative tech companies, primarily focusing on the testing, evaluation, and enhancement of AI systems. Note that participation is based on specific projects rather than permanent employment.
Role Overview
We are seeking a highly experienced Senior Python Engineer specializing in functional testing. The ideal candidate will possess exceptional skills in Linux and Docker, be adept at reading and understanding code in multiple programming languages (such as C, Rust, and Go), and have the capability to translate migration task requirements into actionable items. Familiarity with tools like Roo Code or Claude Code to expedite iterative development is also essential.
Key Responsibilities
- Design and implement functional black-box tests for extensive codebases in various source languages.
- Develop and oversee Docker environments, ensuring 100% reproducibility of builds and test executions across diverse platforms.
- Track code coverage and set up automated scoring metrics to align with industry standards.
- Use LLM-based tools (including Roo Code and Claude Code) to shorten development cycles, automate repetitive tasks, and improve code quality.
Required Qualifications
- Minimum of 5 years of experience as a Software Engineer, with a focus on Python.
- Extensive experience with pytest (including fixtures, session-scoped fixtures, and timeouts) and the design of black-box functional tests for CLI tools.
- Expertise in Docker, including reproducible Dockerfiles, user contexts, and secure workspaces.
- Strong proficiency in Linux and Bash scripting, along with the ability to debug within containers.
- Familiarity with modern Python tools (such as uv, pyproject.toml, and packaging).
- Ability to read and comprehend code in other programming languages (C, C++, Rust, or Go) with LLM support.
- Experience working with LLMs (such as Claude Code, Roo Code, Cursor) to speed up iterative development and generate test cases.
- English language proficiency at the B2 level or above.
Preferred Qualifications
- Prior experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Benefits
What We Offer:
- Freelance, project-based collaboration through the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible participation—choose your availability (20-30 hours per week).
- Compensation based on tasks, up to $80/hour* depending on performance and workload.
- Opportunity to work on groundbreaking AI projects for leading tech companies.

