About the job
Please submit your CV in English and specify your level of English proficiency.
Mindrift is your gateway to project-based AI opportunities, connecting skilled professionals with top-tier tech companies focused on testing, evaluating, and enhancing AI systems. This is a project-based collaboration, not a permanent position.
About the Role
We are looking for a seasoned Python Engineer with extensive functional testing expertise. The ideal candidate will have robust skills in Linux and Docker, a proficiency for reading code across multiple languages (such as C, Rust, and Go) with the aid of LLMs, and the capability to translate migration requirements effectively. Familiarity with tools like Roo Code or Claude Code to streamline iterative development is essential.
Key Responsibilities
- Develop and implement functional black box tests for sizable codebases across various programming languages.
- Set up and oversee Docker environments to guarantee fully reproducible builds and test executions across platforms.
- Monitor code coverage and develop automated scoring criteria aligning with industry benchmarks.
- Utilize LLMs (such as Roo Code and Claude) to enhance development cycles, automate repetitive tasks, and elevate overall code quality.
Requirements
- 5+ years of software engineering experience, primarily in Python.
- In-depth knowledge of pytest (including fixtures, session-scoped, timeouts) and experience in designing black-box functional tests for CLI tools.
- Advanced proficiency with Docker (including reproducible Dockerfiles, user contexts, and secure workspaces).
- Strong skills in Linux & Bash scripting and debugging within containers.
- Familiarity with modern Python tools (like uv, pyproject.toml, and packaging).
- Ability to interpret and understand multiple programming languages with LLM support (such as C, C++, Rust, or Go).
- Experience leveraging LLMs (Claude Code, Roo Code, Cursor) for accelerating iterative development and generating test cases.
- English proficiency at a B2 level or higher.
Preferred Qualifications
- Previous experience with agent evaluation platforms and MCP CLI.
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
Benefits
- Project-based freelance collaboration via the Mindrift platform (powered by Toloka AI).
- Fully remote and flexible participation—choose your working hours and commitment (20-30 hours per week).
- Compensation based on task performance, up to $80/hour*.

