About the job
About Us
At Odyssey, we are at the forefront of artificial intelligence, specializing in pioneering general-purpose world models that represent a groundbreaking form of multimodal intelligence. These innovations unlock a plethora of applications across consumer, enterprise, and intelligence sectors. Our flagship model, Odyssey-2 Pro, exemplifies our commitment to leading this revolutionary frontier.
Position Overview
We are seeking a seasoned Data Platform Lead who will take charge of our data practices. This pivotal technical leadership role emphasizes architectural strategy and execution. The ideal candidate will possess robust data engineering skills, adept at defining a long-term architectural vision while engaging directly with coding tasks. A comprehensive understanding of the data lifecycle is essential, from collaborating with Operations for data sourcing to designing efficient data processes that optimize our world models.
Key Responsibilities
- Define and implement the long-term technical architecture for our data platform, ensuring scalability and reliability for high-volume, multimodal datasets.
- Manage the complete data lifecycle from sourcing to delivery for machine learning model training.
- Design and construct resilient data processing pipelines, focusing on data cleaning, feature engineering, and normalization tailored for world models.
- Oversee the data curation system, including adaptable metadata schemas, evolving labels, and modular tagging pipelines for effective data categorization and resampling.
- Collaborate closely with ML Research and Engineering teams to ascertain current and future data needs, translating research requirements into actionable data infrastructure and acquisition strategies.
- Lead the integration of advanced quality filtering and signal analysis into the data workflow, ensuring datasets meet rigorous quality standards.
- Drive data acquisition strategies by evaluating various methods, aligning with budgetary constraints and quality requirements.
