About the job
About the Role
Join PubMatic as a Senior Software Engineer focused on Data Analytics, where you will leverage your expertise to craft and refine AI agents. Your proficiency in technologies such as Hadoop, Spark, Scala, Kafka, and cloud solutions, alongside programming skills in Scala and Python, will be essential. In this role, you will develop sophisticated AI agents that elevate our data analytics capabilities, facilitating complex information retrieval, contextual comprehension, and adaptive interactions, thereby enhancing data-driven insights and analytical workflows.
What You'll Do:
- Design, build, and implement a highly scalable, fault-tolerant big data platform to process vast amounts of data and deliver comprehensive analytics to clients.
- Develop backend services utilizing Java, REST APIs, JDBC, and AWS.
- Construct and maintain Big Data pipelines using Spark, Hadoop, Kafka, and Snowflake.
- Architect real-time data processing workflows and automation frameworks.
- Create GenAI-powered agents for analytics, operational tasks, and data enrichment, employing frameworks such as LangChain and LlamaIndex.
- Integrate large language models (e.g., OpenAI, Claude, Mistral) into existing services for enhanced query understanding, summarization, and decision-making support.
- Oversee end-to-end GenAI workflows, including prompt engineering, fine-tuning, vector embeddings, and retrieval-augmented generation (RAG).
- Collaborate with cross-functional teams to improve the availability and scalability of large data platforms and enhance PubMatic software functionality.
- Engage in Agile/Scrum practices such as sprint planning, retrospective sessions, backlog grooming, and user story management.
- Regularly consult with product managers regarding the integration of new features into the PubMatic Data Analytics platform.
- Provide support for customer issues through email or JIRA, offering updates and patches to resolve problems.

