Data Engineer specializing in Natural Language Processing

ifm-us, Sunnyvale, CA
On-site, Full-time

Experience Level

Entry Level

Qualifications

Key Responsibilities:
Gather and preprocess large datasets for NLP applications.
Design and implement scalable data pipelines.
Collaborate with researchers to identify data needs.
Utilize web crawling and content refinement techniques.
Ensure data quality and integrity for research initiatives.

Qualifications:
Proficient in Python and related data technologies.
Experience in Natural Language Processing.
Strong problem-solving skills and attention to detail.
Ability to work collaboratively in a dynamic research environment.

About the job

About the Institute of Foundation Models
We are an innovative research institute focused on the development, understanding, application, and risk management of foundational models. Our mission is to propel research forward, cultivate the next generation of AI innovators, and contribute significantly to a knowledge-driven economy.

As a team member, you will engage with pioneering foundation model training, collaborating with leading researchers, data scientists, and engineers to address vital challenges in AI development. You will play a crucial role in creating revolutionary AI solutions that could transform entire sectors. Your strategic thinking and innovative problem-solving abilities will be key in positioning MBZUAI as a global leader in high-performance computing for deep learning, sparking impactful discoveries that will inspire future AI trailblazers.



The Role

As a Data Engineer focused on Natural Language Processing (NLP) and large-scale data processing, you will quickly and efficiently gather, curate, and prepare high-quality datasets that support advanced NLP research. You will give researchers the data they need through scalable engineering practices, including web crawling, refinement of LLM-generated content, and robust data pipelines, built primarily with Python and associated technologies.

About ifm-us

The Institute of Foundation Models is at the forefront of AI research, dedicated to creating transformative solutions and fostering the next generation of technology builders. Join a team that is shaping the future of deep learning and AI innovation.
