About the job
Step into the future of artificial intelligence with Runware as a Senior Machine Learning Engineer, where you will spearhead the development of groundbreaking AI solutions across multiple media formats, including text, images, videos, 3D, and audio. Our innovative AI media creation platform is designed to transform the landscape of content generation.
In this pivotal role, you will oversee essential projects, managing the complete lifecycle from research and experimentation through to production deployment and performance evaluation. Your contributions will be instrumental in enhancing the capabilities of our platform and improving the experiences of users who rely on our state-of-the-art AI technologies.
Key Responsibilities
- Integrate open-source and third-party models into our inference platform.
- Lead fine-tuning initiatives, including parameter-efficient methods (PEFT) such as LoRA and adapters, as well as domain adaptation.
- Optimize inference workloads for latency, batching, memory efficiency, and throughput.
- Benchmark model quality against cost and performance across various modalities.
- Reduce inference startup times and improve stability under heavy load.
- Develop evaluation frameworks and internal tools for model validation.
- Collaborate closely with Infrastructure and Backend teams to create scalable serving systems.
- Monitor production performance and drive continuous optimization efforts.
- Mentor junior engineers and raise ML engineering standards within the team.
