About the job
We kindly request candidates to submit their CV in English along with their English proficiency level.
At Toloka AI, we bridge the gap between skilled professionals and exciting project-based AI opportunities for top technology companies. Our focus is on the testing, evaluation, and enhancement of AI systems. This role is project-based and does not constitute permanent employment.
What This Role Entails
Each project presents unique challenges, and contributors may be tasked with:
- Crafting original computational data science scenarios that emulate real-world analytical processes across diverse sectors, including telecom, finance, government, e-commerce, and healthcare.
- Designing problems that necessitate Python programming for resolution (utilizing libraries such as Pandas, NumPy, SciPy, Scikit-learn, Statsmodels, Matplotlib, and Seaborn).
- Ensuring that problems require significant computational resources, making manual solutions impractical within a reasonable timeframe (days or weeks).
- Engineering challenges that involve complex reasoning chains in data processing, statistical analysis, feature engineering, predictive modeling, and insights extraction.
- Creating deterministic problems with reproducible outcomes, avoiding stochastic elements unless fixed random seeds are employed for precise replication.
- Focusing on authentic business challenges, such as customer analytics, risk assessment, fraud detection, forecasting, optimization, and operational efficiency.
- Designing comprehensive end-to-end problems that encompass the entire data science pipeline (data ingestion → cleaning → exploratory data analysis → modeling → validation → deployment considerations).
- Incorporating big data scenarios that require scalable computational techniques.
- Validating solutions using Python alongside standard data science libraries and statistical methodologies.
- Clearly documenting problem statements with realistic business contexts and providing verified correct solutions.
What We Are Looking For
This opportunity is ideal for Data Science professionals with extensive Python experience seeking part-time, non-permanent projects. Preferred qualifications include:
- 5+ years of practical experience in data science with demonstrable business outcomes.
- A portfolio showcasing completed projects and publications that highlight real-world problem-solving capabilities.
- Advanced Python programming skills for data science (Pandas, NumPy, SciPy, Scikit-learn, Statsmodels).
- Deep expertise in statistical analysis and machine learning, with a thorough understanding of algorithms, methods, and their practical applications.
- Proficiency in SQL and database operations for data manipulation and analysis.
- Experience with generative AI technologies (LLMs, RAG, prompt engineering, vector databases).
- Familiarity with MLOps practices and model deployment workflows.
- Knowledge of contemporary frameworks (TensorFlow, PyTorch, LangChain).
- Strong written English proficiency (C1+).
Application Process
Apply → Pass qualifications → Join a project → Complete tasks → Receive payment
