About the job
About NationGraph
At NationGraph, we are revolutionizing the accessibility and usability of public sector data for businesses targeting municipalities, state agencies, educational institutions, and specialized districts. Our advanced data intelligence engine extracts actionable insights from millions of public sector sources, empowering organizations to make informed decisions. Established in 2024, our mission is to democratize information, ensuring that public data is genuinely accessible to everyone. Discover more at nationgraph.com.
Our Team
Comprises seasoned entrepreneurs who have built, scaled, and exited multiple companies.
Has developed robust software infrastructure capable of processing billions in transactions.
Is backed by top-tier venture capitalists and experienced operating partners with a track record of investing in and nurturing iconic brands.
Role Overview
Design and implement end-to-end machine learning pipelines.
Extract and mine data from various online sources through large-scale web crawling and scraping techniques to enhance our models and insights.
Convert unstructured text data into structured knowledge using natural language processing (NLP), entity recognition, and bespoke models.
Develop and refine text classification models to systematically organize intricate datasets.
Enhance the retrieval-augmented generation (RAG) systems used in our products.
Drive our data strategy by identifying and integrating new data sources.
Tackle open-ended technical challenges, fostering a culture of learning and collaboration within the team.
Work primarily in Python and SQL.
Qualifications
A strong quantitative background in fields such as computer science, physics, mathematics, or engineering.
Solid foundation in mathematics and statistics.
A PhD in a quantitative discipline.
Expertise in Python programming.
A proactive ownership mentality and the ability to solve complex technical challenges in ways that create commercial value.
A genuine enthusiasm for continuous learning, growth, and uncovering insights from complex datasets.
Strong problem-solving, communication, and collaboration abilities in a dynamic work environment.

