Experience Level
Mid to Senior
About the job
Join Sonsoft Inc. as an Informatica ETL Developer in the vibrant city of San Francisco. We are seeking a skilled professional to enhance our data integration process using Informatica tools. In this role, you will be responsible for designing, developing, and maintaining ETL processes that transform raw data into actionable insights. If you are passionate about data and technology, this is the perfect opportunity to advance your career.
Full-time|On-site|San Francisco, CA; Sunnyvale, CA; Seattle, WA
Join DoorDash as a Staff Software Engineer specializing in Data Engineering, where you will play a critical role in designing and implementing data solutions that drive business insights and enhance operational efficiency. You will collaborate with cross-functional teams to create robust data pipelines and leverage cutting-edge technology to manage large-scale datasets.
About Our Team
At OpenAI, our Applied team is at the forefront of integrating cutting-edge AI technologies into the daily lives of consumers and businesses worldwide. We work collaboratively across research, engineering, design, and business domains to transform innovative AI advancements into significant real-world applications. Our team has been instrumental in the successful launch of products like ChatGPT, API, and Sora. We create tools that empower developers, enhance business efficiency, and promote individual learning and creativity. As AI technologies evolve, our commitment is to ensure that our products remain safe, accessible, and beneficial for everyone.
About the Position
We are seeking a Data Scientist to join our Applied Product team, where you will play a vital role in fostering a data-driven product development culture for our consumer and enterprise offerings. Your contributions will be crucial as our products serve millions globally. You will help align research with product development to create measurable impacts for users and businesses alike.
In this position, you will define key performance metrics, design and analyze A/B tests, and create comprehensive dashboards that empower teams across the organization to derive insights from product data. Most importantly, you will be an integral part of our product development team.
This role is primarily based in San Francisco, CA, or Seattle, WA, operating under a hybrid work model, allowing three days in the office per week. We also provide relocation assistance for new hires.
Key Responsibilities:
Collaborate closely with the product development team to identify opportunities for product enhancement and growth.
Design and interpret A/B tests to evaluate the effects of changes in models and user experience on our products.
Foster a data-driven culture by defining and operationalizing metrics at the feature, product, and company levels.
Create and promote dashboards and reports that enable the team and organization to address product data inquiries independently.
Ideal Candidate Profile:
5+ years of experience in a quantitative role, adept at navigating ambiguous environments, ideally with a focus on product analytics.
Strong proficiency in statistical analysis and experience with data visualization tools.
Excellent communication skills to convey complex data insights to diverse audiences.
A passion for AI and its applications in real-world scenarios.
Join OpenAI’s Codex Team as a Senior Data Scientist
About Our Team
Codex is an innovative first-party developer product by OpenAI, concentrating on agentic software engineering. Our mission is to create cutting-edge tools that empower engineers to design, write, test, and deploy code more efficiently and safely at scale. We work collaboratively with research and product teams to transform advancements in AI models into significant productivity enhancements for developers.
Role Overview
In the role of Senior Data Scientist at Codex, you will play a crucial part in evaluating and driving product-market fit for our AI-driven developer tools. You will define the metrics of “developer productivity” for our offerings, conduct experiments on new coding models and user experiences, and identify areas where our technology can enhance or hinder performance across various languages and tasks. Your findings will have a direct impact on how the software industry evolves.
This position is located in San Francisco, CA, and we follow a hybrid work model requiring 3 days in the office weekly. We also provide relocation assistance for new hires.
Your Responsibilities
Collaborate closely with the Codex product team to uncover opportunities for enhancing developer outcomes and fostering growth.
Design and analyze A/B tests and staged rollouts of new coding models and product features.
Establish and implement key performance metrics such as suggestion acceptance rates, edit distances, compile/test pass rates, task completion times, latency, and overall session productivity.
Create informative dashboards and analyses that enable the team to independently find answers to product-related questions (segmented by language, framework, repository size, task type).
Investigate failure modes and work with Research to identify targeted improvements (model quality signals, user feedback, evaluations).
Ideal Candidate Profile
5+ years of experience in a quantitative role within a developer-focused or high-growth product environment.
Proficiency in SQL and Python, with a solid understanding of experimental design and causal inference methodologies.
Demonstrated experience in defining product metrics that relate to user value.
Strong communication skills to effectively engage with product managers, engineers, and designers, and influence product direction.
Preferred Qualifications
A robust programming background with the ability to prototype, run simulations, and assess code quality.
Familiarity with IDE/extension telemetry or analytics related to developer tooling.
Previous experience with Natural Language Processing (NLP), Large Language Models (LLMs), or code models is a plus.
Join our dynamic team as a Microstrategy Developer, where you will play a crucial role in designing, developing, and implementing business intelligence solutions. Your expertise will help drive data-driven decision making and enhance our data analytics capabilities.
Sonsoft Inc. is seeking a skilled Informatica Developer to join our dynamic team in San Francisco. In this role, you will leverage your expertise in Informatica to design, develop, and implement data integration solutions that drive our clients' success. You will work collaboratively with cross-functional teams to ensure the seamless flow of data across various platforms.
Full-time|On-site|CA - San Francisco; WA - Seattle; UT - Cottonwood Heights
Join SoFi as a Senior Software Engineer in our Data Foundations team, where you will play a pivotal role in shaping our data architecture and enhancing our data-driven capabilities. You will work closely with cross-functional teams to develop robust data solutions that empower our business decisions and improve customer experiences.
As a Senior Software Engineer, you will leverage your expertise in data engineering, software development, and cloud technologies to build scalable data pipelines and maintain high-quality data infrastructure. Your contributions will directly impact our ability to deliver innovative financial solutions.
Full-time|$140K/yr - $160K/yr|On-site|San Francisco, CA
Join our dynamic team at Cargomatic as a Senior Data Architect in Data Engineering, where you'll play a pivotal role in designing and constructing scalable, cloud-native data infrastructures. Your work will empower analytics, machine learning, and AI-driven applications that revolutionize the local trucking industry.
In this fast-paced environment, you'll leverage your deep expertise in data architecture alongside hands-on experience with modern data platforms and LLM-enabled application development. You will be responsible for leading the design of enterprise-grade data models, architecting RAG systems, implementing agentic workflows, and integrating secure, production-ready LLM capabilities into our ecosystem. This high-impact position offers significant ownership and visibility, allowing you to shape the future of intelligent logistics technology.
Azumo is in search of a dynamic Big Data Engineer to spearhead the development and enhancement of our data and analytics infrastructure. This role is fully remote and based in Latin America.
As a member of our innovative team, you will collaborate with forward-thinking engineers in the field of big data computing. If you are passionate about designing and developing scalable, high-performance big data infrastructure leveraging technologies such as Spark, Kafka, Snowflake, or similar frameworks, both on-premise and in the cloud, this position is perfect for you. We are seeking candidates with experience in building data pipelines, data services, data warehouses, as well as BI and ML platforms.
At Azumo, we are committed to excellence and believe in fostering both professional and personal growth. We strive for each individual’s success and are dedicated to helping our team achieve their goals during their tenure at Azumo and beyond. Embracing challenges and acquiring new technologies is at the core of our mission. We also value giving back to the community through philanthropy, open-source initiatives, and knowledge sharing.
Full-time|$124.1K/yr - $223.5K/yr|On-site|San Francisco, CA; Seattle, WA; New York, NY; Los Angeles, CA; Chicago, IL; Austin, TX; Washington D.C.
Join the dynamic Analytics team at DoorDash as a Data Scientist or Senior Data Scientist, where your expertise will directly influence strategic decisions and operational improvements. You will uncover valuable insights from vast datasets, transforming them into actionable recommendations that drive company-wide initiatives. Collaborate with diverse teams in areas like Consumer & Growth, Business Operations, and Customer Experience to elevate our analytics capabilities and impact our business in meaningful ways.
Full-time|$120K/yr - $140K/yr|On-site|San Francisco, California
The Baseball Data Platform team at Major League Baseball is seeking a Senior Data Analyst to help us tell the story of baseball through data. In this role, you will utilize play-by-play and Statcast tracking data to support our customers, enhance data quality, and generate actionable insights. You will collaborate with top-tier software engineers, data scientists, and industry experts while working with one of the most comprehensive datasets in sports.
Key Responsibilities
Proactively identify and investigate potential data quality issues.
Lead significant data initiatives aimed at achieving strategic research and development objectives, including enhancing scalability for data alerts and outlier resolution.
Implement impactful changes based on internal operations data and logging in collaboration with our support team.
Develop a deep understanding of the Baseball Data Platform, including schema, data lineage, and optimal usage patterns of critical tables (such as pitch-by-pitch data, tracking data, and weather data).
Address inquiries from MLB clubs, broadcasters, and MLB leadership directly.
Establish and uphold high standards for data documentation, code review, and statistical best practices within the R&D data team.
Create optimized queries and statistical reports/models to detect trends and present findings to MLB and Club personnel.
Design and produce insightful reports.
Collaborate with Statcast engineering, product, data science, and content teams to innovate storytelling metrics.
Manage user accounts, permissions, and security protocols across key systems, ensuring compliance with data security standards.
Test and validate new data and metrics, contributing to their development.
Serve as the primary resource for complex data challenges, assisting teammates with debugging inefficient queries and understanding nuanced data definitions.
Full-time|$90K/yr - $120K/yr|On-site|San Francisco, California
Major League Baseball seeks a Senior Data Analyst for its Baseball Data Platform team in San Francisco. This team blends baseball expertise with technology, working closely with play-by-play and Statcast tracking data. The group supports MLB clubs, broadcasters, and internal partners by ensuring data quality and delivering actionable insights.
Role overview
This position focuses on maintaining high standards for data quality, collaborating with engineers, data scientists, and industry experts. The Senior Data Analyst will address data issues, lead significant projects, and act as a key resource for both technical and non-technical stakeholders.
Key responsibilities
Identify and investigate data quality issues, both proactively and in response to reports.
Lead major data projects to advance research and development, such as improving alerting systems and resolving outliers.
Collaborate with the support team to implement process changes based on internal operations data.
Develop deep expertise in the Baseball Data Platform, including schema, data lineage, and best practices for key tables like pitch-by-pitch, tracking, and weather data.
Handle inquiries from MLB clubs, broadcasters, and leadership, providing direct support and solutions.
Maintain high standards for data documentation, code review, and statistical practices within the R&D data team.
Write optimized queries and statistical reports or models to uncover trends and share findings with MLB and Club staff.
Produce clear, insightful reports and work closely with Statcast engineering, product, data science, and content teams to develop new storytelling metrics.
Manage user accounts, permissions, and security protocols across core systems to ensure compliance with data security requirements.
Test and validate new data sets and metrics, supporting their ongoing development.
Serve as a resource for complex data questions, including query debugging, identifying obscure data points, and clarifying nuanced data definitions for teammates.
Collaboration and impact
This role involves frequent interaction with engineers, product managers, data scientists, and content teams. The Senior Data Analyst helps shape the way baseball data is used across the league, contributing to both operational improvements and new forms of baseball storytelling.
Join usm2 as a Senior Data Modeler / Data Architect with expertise in Big Data and Hadoop. In this pivotal role, you will harness the power of data to drive strategic decisions and enhance business outcomes. Your experience with data modeling and architecture will be essential in building robust data ecosystems.
Full-time|$192K/yr - $260K/yr|On-site|San Francisco, California
P-186
At Databricks, we are passionate about empowering data teams to tackle some of the world’s most challenging problems, from security threat detection to cancer drug development. Our mission is to build and operate the leading data and AI infrastructure platform, enabling our customers to concentrate on the high-value challenges that are integral to their own objectives. Founded in 2013 by the original creators of Apache Spark™, Databricks has rapidly evolved from a small office in Berkeley, California, to a global powerhouse with over 1000 employees. Trusted by thousands of organizations, from startups to Fortune 100 companies, we are recognized as one of the fastest-growing SaaS companies worldwide.
Our engineering teams create highly sophisticated products that address significant needs in the industry. We continuously push the limits of data and AI technology while maintaining the resilience, security, and scalability essential for our customers' success on our platform. We manage one of the largest-scale software platforms, consisting of millions of virtual machines that generate terabytes of logs and process exabytes of data daily. At this scale, we frequently encounter cloud hardware, network, and operating system faults, and our software must effectively shield our customers from these challenges.
Modern data analysis leverages advanced techniques, such as machine learning, that far exceed the capabilities of traditional SQL query engines. As a Software Engineer on the Runtime team at Databricks, you will be instrumental in developing the next generation of distributed data storage and processing systems that outshine specialized SQL query engines in relational query performance, while providing the flexibility and programming abstractions to support a variety of workloads, from ETL to data science.
Examples of projects you may work on include:
Apache Spark™: Contributing to the de facto open-source framework for big data.
Data Plane Storage: Developing reliable, high-performance services and client libraries for storing and accessing vast amounts of data on cloud storage backends like AWS S3 and Azure Blob Store.
Delta Lake: A storage management system that merges the scalability and cost-effectiveness of data lakes with the performance and reliability of data warehouses, featuring low latency streaming. Its higher-level abstractions and guarantees, including ACID transactions and time travel, significantly reduce the complexity of real-world data engineering architectures.
Delta Pipelines: Aiming to simplify the management of data engineering pipelines.
About Chalk
Chalk is revolutionizing the data platform landscape to empower the next generation of machine learning applications. We dismantle the complexities, latency issues, and scalability challenges that have historically limited ML capabilities. Our state-of-the-art platform seamlessly integrates Rust-speed performance with user-friendly tools that developers appreciate. Top-tier companies rely on Chalk for various applications, including preventing fraudulent credit card transactions, identity verification, and optimizing renewable energy capture. We are proud to have recently secured a $50 million Series A funding round, led by Felicis.
About the Role
We are seeking a passionate Developer Relations Specialist to become an integral part of our expanding Go-To-Market (GTM) team. This role serves as the technical liaison between Chalk and the AI/ML and data community. We need someone with a profound understanding of modern data infrastructure, experience in sales-driven environments, and the ability to create engaging and informative content.
You will collaborate closely with the sales, product, and marketing teams to articulate how Chalk can enhance the technical stacks of our users through product launches, community outreach, enablement efforts, and proactive engagement. Your contributions will range from crafting in-depth technical articles to developing proof-of-concept projects, producing instructional videos, and conducting live demonstrations. You will play a vital role in shaping the narrative around Chalk.
Our team works in the office five days a week, but we are flexible with unavoidable conflicts. Please note, this position is not hybrid.
What You Will Do
Act as the technical ambassador for Chalk among data engineers, ML teams, and infrastructure leaders.
Produce and disseminate impactful content including technical blogs, field guides, explanatory materials, demonstrations, tweet threads, and short videos.
Collaborate with product and sales departments to create resources that cater to enterprise clients, such as diagrams, presentations, proof-of-concepts, and ROI calculators.
Represent Chalk at conferences, meetups, and customer interactions.
Engage with prospects and customers to define best practices and relay insights back to the product team.
Cultivate and expand a community focused on real-time data infrastructure and production ML.
What Excites You
A robust technical background in data infrastructure, ML tools, or developer platforms.
AfterQuery partners with leading AI research labs to deliver high-quality training data and assessment frameworks. The company’s datasets and evaluation tools help research groups improve their models and set higher standards for performance. As a small, post-Series A team, every member directly shapes both the product and the business.
Role overview
The Business Development Representative will be the first point of contact for new clients, focusing on outreach to top AI research labs. This position combines hands-on prospecting with the chance to influence how AfterQuery approaches new business. Working closely with the founders, the BDR will help turn initial conversations into qualified leads while refining outreach strategies as the company grows.
What you will do
Identify and research decision-makers at leading AI labs and enterprise organizations.
Lead outbound prospecting through cold email, LinkedIn, Twitter/X, and industry events, tailoring messages for technical audiences.
Screen inbound leads and connect them with the appropriate team members.
Collaborate with the founding team to refine Ideal Customer Profiles, develop messaging, and build outbound sales strategies from the ground up.
Develop a strong understanding of AfterQuery’s products to clearly communicate their value, especially how data can improve AI models.
Track pipeline activity, maintain CRM records, and report on outbound metrics.
Represent AfterQuery at conferences, meetups, and events within the AI and ML community.
Requirements
1–3 years of experience in a BDR, SDR, or outbound sales role, ideally within B2B SaaS, DaaS, or AI sectors.
Technical knowledge to discuss AI/ML topics such as model training, evaluation, and data quality.
Strong written communication skills, with outreach that is clear, concise, and credible.
Self-motivated and proactive, comfortable working through ambiguity and building new processes.
Experience or genuine interest in selling to technical buyers.
Location: San Francisco
About Us
At Imprint, we are transforming the landscape of co-branded credit cards and financial products to be more innovative, rewarding, and brand-centric. We collaborate with distinguished companies such as Crate & Barrel, Rakuten, Booking.com, H-E-B, Fetch, and Brooks Brothers to launch contemporary credit programs that enhance customer loyalty, unlock savings, and stimulate growth. Our platform seamlessly integrates advanced payment infrastructure, intelligent underwriting, and a user-friendly experience, enabling brands to offer impactful financial products without the need to operate as a bank.
Co-branded cards represent over $300 billion in annual spending in the U.S., yet many are still reliant on outdated banking systems. Imprint stands out as the modern alternative: adaptable, technology-driven, and tailored for today’s consumers. Supported by renowned investors such as Kleiner Perkins, Thrive Capital, and Khosla Ventures, we are assembling a top-tier team to revolutionize payment methods and foster brand growth. If you're eager to work in a dynamic environment, tackle challenging problems, and make a significant impact, we invite you to join us.
Our Team
The Data Engineering team at Imprint is tasked with developing and expanding the data infrastructure that underpins product innovation, analytics, operations, and machine learning throughout the organization. We are responsible for creating the pipelines, platforms, and processes that enable our stakeholders to trust and leverage data effectively.
We are seeking a Data Engineer to advance our modern data architecture and deliver dependable, scalable data solutions. Your contributions will directly influence decision-making and innovation across various business areas, from financial operations to real-time personalization.
Full-time|$106.1K/yr - $162.1K/yr|On-site|Mountain View, California; New York City, New York; San Francisco, California; Seattle, Washington; Washington, D.C.
Databricks is a pioneering force in the realm of data and AI, rapidly transforming how companies leverage their data for impactful results. As we continue to scale globally, we are on the lookout for a seasoned and results-oriented professional in talent development who is eager to create and oversee learning programs within a fast-paced environment.
As a vital member of our Talent and Leadership Development team, you will be instrumental in designing, launching, and managing key programs that empower thousands of Bricksters around the world. This is a unique opportunity to influence talent development at a leading-edge technology company.
Our team operates within four essential pillars that support every facet of the employee and leadership journey:
New Bricksters: We craft and implement a transformative new hire orientation program that engages newcomers, connects them to our culture, and equips them with the essential knowledge for success.
Leaders: We provide leaders at all tiers with the necessary skills, mindsets, and frameworks to enhance performance, including our flagship annual leadership conference, Leadershift.
High-Potential Talent: We curate specialized programs to foster the advancement of our high-potential talent, preparing them for greater leadership responsibilities.
All Bricksters (Career & Growth): We facilitate career growth opportunities for every Brickster through initiatives focused on career navigation and long-term development.
The Impact You’ll Have
Oversee one of our talent development initiatives (e.g. Career Development enablement).
Facilitate significant global programs for the Talent and Leadership Development team (e.g. New Hire Orientation, Leadership development) across various functions and regions.
Utilize AI tools, analytics, and digital platforms to develop scalable, engaging learning experiences that drive tangible business outcomes.
Collaborate with leaders and teams to assess learning needs, devise targeted solutions, and align programs with Databricks’ ambitious growth and innovation objectives.
Evaluate and measure program effectiveness through data-driven insights and continuous iteration to enhance reach and adoption.
Full-time|$170K/yr - $178K/yr|Remote|New York, NY • United States; San Francisco, CA • New York, NY
Figma’s Core Data team is hiring a research-focused Data Scientist to help shape the frameworks and tools that power data science across the company. This group supports experimentation, analytics, and AI systems, partnering with Data Infrastructure, Machine Learning, and Applied Science teams. Their work strengthens Figma’s platforms and integrates AI into the daily workflows of data scientists.
Key responsibilities
Improve Figma’s experimentation platform and analytics tools
Build machine learning-based systems for product analytics
Define and refine metrics for AI-powered features using causal inference and statistical modeling
Work closely with engineers, analytics specialists, and other teams to support data-driven decisions
Requirements
PhD-level expertise in a relevant field
Strong research background with experience applying findings to real products
Experience with experimentation platforms, statistical modeling, or machine learning
Interest in developing tools for data scientists and product teams
Location and work arrangement
This full-time role is open to candidates based in any Figma U.S. hub (New York, NY or San Francisco, CA), or working remotely within the United States.
Full-time|$260K/yr - $320K/yr|On-site|San Francisco, CA - US
At Crusoe, our mission is to expedite the emergence of abundant energy and intelligence. We are developing the foundational technology that enables ambitious AI creation without compromising on scale, speed, or sustainability.
Join the AI revolution with Crusoe’s sustainable technology initiatives. In this role, you will spearhead significant innovation, make a real-world impact, and collaborate with a team that is leading the charge in responsible and transformative cloud infrastructure.
Position Overview:
Crusoe is in search of a Senior Director of Development to oversee the execution and delivery of our expansive AI data center projects, each with budgets exceeding $1 billion. This role is pivotal in our physical project delivery, encompassing all phases of the development lifecycle, including entitlements and permitting management, contractor oversight, and site execution. You will act as the primary development leader on key projects, ensuring seamless delivery while managing executional risks. This hands-on leadership role demands technical expertise, project management skills, and municipal coordination capabilities.
Note: This position requires in-office attendance five days a week at our San Francisco, CA location.
Key Responsibilities:
Project Delivery: Lead the execution of Crusoe’s multi-billion dollar data center campuses from initial concept to final completion.
Site Oversight: Manage construction managers, engineering teams, and third-party consultants effectively.
Project Scheduling: Develop and maintain comprehensive project schedules, ensuring alignment of design, permitting, and construction milestones.
Entitlements: Supervise entitlements and permitting processes in collaboration with governmental agencies.
Risk Management: Proactively identify and resolve risks that could affect the project timeline.
Stakeholder Coordination: Act as a key liaison with government entities and internal engineering teams.
Infrastructure Management: Oversee utility requirements and municipal approval processes.
Qualifications:
Experience: A minimum of 10 years in real estate development roles, ideally within industrial, life sciences, or data center sectors.
Join Sonsoft Inc. as an Informatica ETL Developer in the vibrant city of San Francisco. We are seeking a skilled professional to enhance our data integration process using Informatica tools. In this role, you will be responsible for designing, developing, and maintaining ETL processes that transform raw data into actionable insights. If you are passionate ab…
Full-time|On-site|San Francisco, CA; Sunnyvale, CA; Seattle, WA
Join DoorDash as a Staff Software Engineer specializing in Data Engineering, where you will play a critical role in designing and implementing data solutions that drive business insights and enhance operational efficiency. You will collaborate with cross-functional teams to create robust data pipelines and leverage cutting-edge technology to manage large-scale datasets.
About Our Team
At OpenAI, our Applied team is at the forefront of integrating cutting-edge AI technologies into the daily lives of consumers and businesses worldwide. We work collaboratively across research, engineering, design, and business domains to transform innovative AI advancements into significant real-world applications. Our team has been instrumental in the successful launch of products like ChatGPT, API, and Sora. We create tools that empower developers, enhance business efficiency, and promote individual learning and creativity. As AI technologies evolve, our commitment is to ensure that our products remain safe, accessible, and beneficial for everyone.
About the Position
We are seeking a Data Scientist to join our Applied Product team, where you will play a vital role in fostering a data-driven product development culture for our consumer and enterprise offerings. Your contributions will be crucial as our products serve millions globally. You will help align research with product development to create measurable impacts for users and businesses alike.
In this position, you will define key performance metrics, design and analyze A/B tests, and create comprehensive dashboards that empower teams across the organization to derive insights from product data. Most importantly, you will be an integral part of our product development team.
This role is primarily based in San Francisco, CA, or Seattle, WA, operating under a hybrid work model, allowing three days in the office per week. We also provide relocation assistance for new hires.
Key Responsibilities:
Collaborate closely with the product development team to identify opportunities for product enhancement and growth.
Design and interpret A/B tests to evaluate the effects of changes in models and user experience on our products.
Foster a data-driven culture by defining and operationalizing metrics at the feature, product, and company levels.
Create and promote dashboards and reports that enable the team and organization to address product data inquiries independently.
Ideal Candidate Profile:
5+ years of experience in a quantitative role, adept at navigating ambiguous environments, ideally with a focus on product analytics.
Strong proficiency in statistical analysis and experience with data visualization tools.
Excellent communication skills to convey complex data insights to diverse audiences.
A passion for AI and its applications in real-world scenarios.
Join OpenAI’s Codex Team as a Senior Data Scientist
About Our Team
Codex is an innovative first-party developer product by OpenAI, concentrating on agentic software engineering. Our mission is to create cutting-edge tools that empower engineers to design, write, test, and deploy code more efficiently and safely at scale. We work collaboratively with research and product teams to transform advancements in AI models into significant productivity enhancements for developers.
Role Overview
In the role of Senior Data Scientist at Codex, you will play a crucial part in evaluating and driving product-market fit for our AI-driven developer tools. You will define the metrics of “developer productivity” for our offerings, conduct experiments on new coding models and user experiences, and identify areas where our technology can enhance or hinder performance across various languages and tasks. Your findings will have a direct impact on how the software industry evolves.
This position is located in San Francisco, CA, and we follow a hybrid work model requiring 3 days in the office weekly. We also provide relocation assistance for new hires.
Your Responsibilities
Collaborate closely with the Codex product team to uncover opportunities for enhancing developer outcomes and fostering growth.
Design and analyze A/B tests and staged rollouts of new coding models and product features.
Establish and implement key performance metrics such as suggestion acceptance rates, edit distances, compile/test pass rates, task completion times, latency, and overall session productivity.
Create informative dashboards and analyses that enable the team to independently find answers to product-related questions (segmented by language, framework, repository size, task type).
Investigate failure modes and work with Research to identify targeted improvements (model quality signals, user feedback, evaluations).
Ideal Candidate Profile
5+ years of experience in a quantitative role within a developer-focused or high-growth product environment.
Proficiency in SQL and Python, with a solid understanding of experimental design and causal inference methodologies.
Demonstrated experience in defining product metrics that relate to user value.
Strong communication skills to effectively engage with product managers, engineers, and designers, and influence product direction.
Preferred Qualifications
A robust programming background with the ability to prototype, run simulations, and assess code quality.
Familiarity with IDE/extension telemetry or analytics related to developer tooling.
Previous experience with Natural Language Processing (NLP), Large Language Models (LLMs), or code models is a plus.
Join our dynamic team as a Microstrategy Developer, where you will play a crucial role in designing, developing, and implementing business intelligence solutions. Your expertise will help drive data-driven decision making and enhance our data analytics capabilities.
Sonsoft Inc. is seeking a skilled Informatica Developer to join our dynamic team in San Francisco. In this role, you will leverage your expertise in Informatica to design, develop, and implement data integration solutions that drive our clients' success. You will work collaboratively with cross-functional teams to ensure the seamless flow of data across various platforms.
Full-time|On-site|CA - San Francisco; WA - Seattle; UT - Cottonwood Heights
Join SoFi as a Senior Software Engineer in our Data Foundations team, where you will play a pivotal role in shaping our data architecture and enhancing our data-driven capabilities. You will work closely with cross-functional teams to develop robust data solutions that empower our business decisions and improve customer experiences.
As a Senior Software Engineer, you will leverage your expertise in data engineering, software development, and cloud technologies to build scalable data pipelines and maintain high-quality data infrastructure. Your contributions will directly impact our ability to deliver innovative financial solutions.
Full-time|$140K/yr - $160K/yr|On-site|San Francisco, CA
Join our dynamic team at Cargomatic as a Senior Data Architect in Data Engineering, where you'll play a pivotal role in designing and constructing scalable, cloud-native data infrastructures. Your work will empower analytics, machine learning, and AI-driven applications that revolutionize the local trucking industry.
In this fast-paced environment, you'll leverage your deep expertise in data architecture alongside hands-on experience with modern data platforms and LLM-enabled application development. You will be responsible for leading the design of enterprise-grade data models, architecting RAG systems, implementing agentic workflows, and integrating secure, production-ready LLM capabilities into our ecosystem. This high-impact position offers significant ownership and visibility, allowing you to shape the future of intelligent logistics technology.
Azumo is in search of a dynamic Big Data Engineer to spearhead the development and enhancement of our data and analytics infrastructure. This role is fully remote and based in Latin America.
As a member of our innovative team, you will collaborate with forward-thinking engineers in the field of big data computing. If you are passionate about designing and developing scalable, high-performance big data infrastructure leveraging technologies such as Spark, Kafka, Snowflake, or similar frameworks, both on-premise and in the cloud, this position is perfect for you. We are seeking candidates with experience in building data pipelines, data services, data warehouses, as well as BI and ML platforms.
At Azumo, we are committed to excellence and believe in fostering both professional and personal growth. We strive for each individual’s success and are dedicated to helping our team achieve their goals during their tenure at Azumo and beyond. Embracing challenges and acquiring new technologies is at the core of our mission. We also value giving back to the community through philanthropy, open-source initiatives, and knowledge sharing.
Full-time|$124.1K/yr - $223.5K/yr|On-site|San Francisco, CA; Seattle, WA; New York, NY; Los Angeles, CA; Chicago, IL; Austin, TX; Washington D.C.
Join the dynamic Analytics team at DoorDash as a Data Scientist or Senior Data Scientist, where your expertise will directly influence strategic decisions and operational improvements. You will uncover valuable insights from vast datasets, transforming them into actionable recommendations that drive company-wide initiatives. Collaborate with diverse teams in areas like Consumer & Growth, Business Operations, and Customer Experience to elevate our analytics capabilities and impact our business in meaningful ways.
Full-time|$120K/yr - $140K/yr|On-site|San Francisco, California
The Baseball Data Platform team at Major League Baseball is seeking a Senior Data Analyst to help us tell the story of baseball through data. In this role, you will utilize play-by-play and Statcast tracking data to support our customers, enhance data quality, and generate actionable insights. You will collaborate with top-tier software engineers, data scientists, and industry experts while working with one of the most comprehensive datasets in sports.
Key Responsibilities
Proactively identify and investigate potential data quality issues.
Lead significant data initiatives aimed at achieving strategic research and development objectives, including enhancing scalability for data alerts and outlier resolution.
Implement impactful changes based on internal operations data and logging in collaboration with our support team.
Develop a deep understanding of the Baseball Data Platform, including schema, data lineage, and optimal usage patterns of critical tables (such as pitch-by-pitch data, tracking data, and weather data).
Address inquiries from MLB clubs, broadcasters, and MLB leadership directly.
Establish and uphold high standards for data documentation, code review, and statistical best practices within the R&D data team.
Create optimized queries and statistical reports/models to detect trends and present findings to MLB and Club personnel.
Design and produce insightful reports.
Collaborate with Statcast engineering, product, data science, and content teams to innovate storytelling metrics.
Manage user accounts, permissions, and security protocols across key systems, ensuring compliance with data security standards.
Test and validate new data and metrics, contributing to their development.
Serve as the primary resource for complex data challenges, assisting teammates with debugging inefficient queries and understanding nuanced data definitions.
Full-time|$90K/yr - $120K/yr|On-site|San Francisco, California
Major League Baseball seeks a Senior Data Analyst for its Baseball Data Platform team in San Francisco. This team blends baseball expertise with technology, working closely with play-by-play and Statcast tracking data. The group supports MLB clubs, broadcasters, and internal partners by ensuring data quality and delivering actionable insights. Role overview This position focuses on maintaining high standards for data quality, collaborating with engineers, data scientists, and industry experts. The Senior Data Analyst will address data issues, lead significant projects, and act as a key resource for both technical and non-technical stakeholders. Key responsibilities Identify and investigate data quality issues, both proactively and in response to reports. Lead major data projects to advance research and development, such as improving alerting systems and resolving outliers. Collaborate with the support team to implement process changes based on internal operations data. Develop deep expertise in the Baseball Data Platform, including schema, data lineage, and best practices for key tables like pitch-by-pitch, tracking, and weather data. Handle inquiries from MLB clubs, broadcasters, and leadership, providing direct support and solutions. Maintain high standards for data documentation, code review, and statistical practices within the R&D data team. Write optimized queries and statistical reports or models to uncover trends and share findings with MLB and Club staff. Produce clear, insightful reports and work closely with Statcast engineering, product, data science, and content teams to develop new storytelling metrics. Manage user accounts, permissions, and security protocols across core systems to ensure compliance with data security requirements. Test and validate new data sets and metrics, supporting their ongoing development. 
Serve as a resource for complex data questions, including query debugging, identifying obscure data points, and clarifying nuanced data definitions for teammates. Collaboration and impact This role involves frequent interaction with engineers, product managers, data scientists, and content teams. The Senior Data Analyst helps shape the way baseball data is used across the league, contributing to both operational improvements and new forms of baseball storytelling.
Join usm2 as a Senior Data Modeler / Data Architect with expertise in Big Data and Hadoop. In this pivotal role, you will harness the power of data to drive strategic decisions and enhance business outcomes. Your experience with data modeling and architecture will be essential in building robust data ecosystems.
Full-time|$192K/yr - $260K/yr|On-site|San Francisco, California
P-186 At Databricks, we are passionate about empowering data teams to tackle some of the world’s most challenging problems, from security threat detection to cancer drug development. Our mission is to build and operate the leading data and AI infrastructure platform, enabling our customers to concentrate on the high-value challenges that are integral to their own objectives. Founded in 2013 by the original creators of Apache Spark™, Databricks has rapidly evolved from a small office in Berkeley, California, to a global powerhouse with over 1000 employees. Trusted by thousands of organizations, from startups to Fortune 100 companies, we are recognized as one of the fastest-growing SaaS companies worldwide. Our engineering teams create highly sophisticated products that address significant needs in the industry. We continuously push the limits of data and AI technology while maintaining the resilience, security, and scalability essential for our customers' success on our platform. We manage one of the largest-scale software platforms, consisting of millions of virtual machines that generate terabytes of logs and process exabytes of data daily. At this scale, we frequently encounter cloud hardware, network, and operating system faults, and our software must effectively shield our customers from these challenges. Modern data analysis leverages advanced techniques, such as machine learning, that far exceed the capabilities of traditional SQL query engines. As a Software Engineer on the Runtime team at Databricks, you will be instrumental in developing the next generation of distributed data storage and processing systems that outshine specialized SQL query engines in relational query performance, while providing the flexibility and programming abstractions to support a variety of workloads, from ETL to data science. Examples of projects you may work on include: Apache Spark™: Contributing to the de facto open-source framework for big data. 
Data Plane Storage: Developing reliable, high-performance services and client libraries for storing and accessing vast amounts of data on cloud storage backends like AWS S3 and Azure Blob Store. Delta Lake: A storage management system that merges the scalability and cost-effectiveness of data lakes with the performance and reliability of data warehouses, featuring low latency streaming. Its higher-level abstractions and guarantees, including ACID transactions and time travel, significantly reduce the complexity of real-world data engineering architectures. Delta Pipelines: Aiming to simplify the management of data engineering pipelines.
About Chalk
Chalk is revolutionizing the data platform landscape to empower the next generation of machine learning applications. We dismantle the complexities, latency issues, and scalability challenges that have historically limited ML capabilities. Our state-of-the-art platform seamlessly integrates Rust-speed performance with user-friendly tools that developers appreciate. Top-tier companies rely on Chalk for various applications, including preventing fraudulent credit card transactions, identity verification, and optimizing renewable energy capture. We are proud to have recently secured a $50 million Series A funding round, led by Felicis.
About the Role
We are seeking a passionate Developer Relations Specialist to become an integral part of our expanding Go-To-Market (GTM) team. This role serves as the technical liaison between Chalk and the AI/ML and data community. We need someone with a profound understanding of modern data infrastructure, experience in sales-driven environments, and the ability to create engaging and informative content.
You will collaborate closely with the sales, product, and marketing teams to articulate how Chalk can enhance the technical stacks of our users—through product launches, community outreach, enablement efforts, and proactive engagement. Your contributions will range from crafting in-depth technical articles to developing proof-of-concept projects, producing instructional videos, and conducting live demonstrations. You will play a vital role in shaping the narrative around Chalk.
Our team works in the office five days a week, but we are flexible with unavoidable conflicts. Please note, this position is not hybrid.
What You Will Do
Act as the technical ambassador for Chalk among data engineers, ML teams, and infrastructure leaders.
Produce and disseminate impactful content including technical blogs, field guides, explanatory materials, demonstrations, tweet threads, and short videos.
Collaborate with product and sales departments to create resources that cater to enterprise clients—such as diagrams, presentations, proof-of-concepts, and ROI calculators.
Represent Chalk at conferences, meetups, and customer interactions.
Engage with prospects and customers to define best practices and relay insights back to the product team.
Cultivate and expand a community focused on real-time data infrastructure and production ML.
What Excites You
A robust technical background in data infrastructure, ML tools, or developer platforms.
AfterQuery partners with leading AI research labs to deliver high-quality training data and assessment frameworks. The company’s datasets and evaluation tools help research groups improve their models and set higher standards for performance. As a small, post-Series A team, every member directly shapes both the product and the business. Role overview The Business Development Representative will be the first point of contact for new clients, focusing on outreach to top AI research labs. This position combines hands-on prospecting with the chance to influence how AfterQuery approaches new business. Working closely with the founders, the BDR will help turn initial conversations into qualified leads while refining outreach strategies as the company grows. What you will do Identify and research decision-makers at leading AI labs and enterprise organizations. Lead outbound prospecting through cold email, LinkedIn, Twitter/X, and industry events, tailoring messages for technical audiences. Screen inbound leads and connect them with the appropriate team members. Collaborate with the founding team to refine Ideal Customer Profiles, develop messaging, and build outbound sales strategies from the ground up. Develop a strong understanding of AfterQuery’s products to clearly communicate their value, especially how data can improve AI models. Track pipeline activity, maintain CRM records, and report on outbound metrics. Represent AfterQuery at conferences, meetups, and events within the AI and ML community. Requirements 1–3 years of experience in a BDR, SDR, or outbound sales role, ideally within B2B SaaS, DaaS, or AI sectors. Technical knowledge to discuss AI/ML topics such as model training, evaluation, and data quality. Strong written communication skills, with outreach that is clear, concise, and credible. Self-motivated and proactive, comfortable working through ambiguity and building new processes. Experience or genuine interest in selling to technical buyers. 
Location: San Francisco
About Us
At Imprint, we are transforming the landscape of co-branded credit cards and financial products to be more innovative, rewarding, and brand-centric. We collaborate with distinguished companies such as Crate & Barrel, Rakuten, Booking.com, H-E-B, Fetch, and Brooks Brothers to launch contemporary credit programs that enhance customer loyalty, unlock savings, and stimulate growth. Our platform seamlessly integrates advanced payment infrastructure, intelligent underwriting, and a user-friendly experience, enabling brands to offer impactful financial products without the need to operate as a bank.
Co-branded cards represent over $300 billion in annual spending in the U.S., yet many are still reliant on outdated banking systems. Imprint stands out as the modern alternative: adaptable, technology-driven, and tailored for today’s consumers. Supported by renowned investors such as Kleiner Perkins, Thrive Capital, and Khosla Ventures, we are assembling a top-tier team to revolutionize payment methods and foster brand growth. If you're eager to work in a dynamic environment, tackle challenging problems, and make a significant impact, we invite you to join us.
Our Team
The Data Engineering team at Imprint is tasked with developing and expanding the data infrastructure that underpins product innovation, analytics, operations, and machine learning throughout the organization. We are responsible for creating the pipelines, platforms, and processes that enable our stakeholders to trust and leverage data effectively.
We are seeking a Data Engineer to advance our modern data architecture and deliver dependable, scalable data solutions. Your contributions will directly influence decision-making and innovation across various business areas—from financial operations to real-time personalization.
Full-time|$106.1K/yr - $162.1K/yr|On-site|Mountain View, California; New York City, New York; San Francisco, California; Seattle, Washington; Washington, D.C.
Databricks is a pioneering force in the realm of data and AI, rapidly transforming how companies leverage their data for impactful results. As we continue to scale globally, we are on the lookout for a seasoned and results-oriented professional in talent development who is eager to create and oversee learning programs within a fast-paced environment.
As a vital member of our Talent and Leadership Development team, you will be instrumental in designing, launching, and managing key programs that empower thousands of Bricksters around the world. This is a unique opportunity to influence talent development at a leading-edge technology company.
Our team operates within four essential pillars that support every facet of the employee and leadership journey:
New Bricksters: We craft and implement a transformative new hire orientation program that engages newcomers, connects them to our culture, and equips them with the essential knowledge for success.
Leaders: We provide leaders at all tiers with the necessary skills, mindsets, and frameworks to enhance performance, including our flagship annual leadership conference, Leadershift.
High-Potential Talent: We curate specialized programs to foster the advancement of our high-potential talent, preparing them for greater leadership responsibilities.
All Bricksters (Career & Growth): We facilitate career growth opportunities for every Brickster through initiatives focused on career navigation and long-term development.
The Impact You’ll Have
Oversee one of our talent development initiatives (e.g. Career Development enablement).
Facilitate significant global programs for the Talent and Leadership Development team (e.g. New Hire Orientation, Leadership development) across various functions and regions.
Utilize AI tools, analytics, and digital platforms to develop scalable, engaging learning experiences that drive tangible business outcomes.
Collaborate with leaders and teams to assess learning needs, devise targeted solutions, and align programs with Databricks’ ambitious growth and innovation objectives.
Evaluate and measure program effectiveness through data-driven insights and continuous iteration to enhance reach and adoption.
Full-time|$170K/yr - $178K/yr|Remote|New York, NY • United States; San Francisco, CA • New York, NY
Figma’s Core Data team is hiring a research-focused Data Scientist to help shape the frameworks and tools that power data science across the company. This group supports experimentation, analytics, and AI systems, partnering with Data Infrastructure, Machine Learning, and Applied Science teams. Their work strengthens Figma’s platforms and integrates AI into the daily workflows of data scientists. Key responsibilities Improve Figma’s experimentation platform and analytics tools Build machine learning-based systems for product analytics Define and refine metrics for AI-powered features using causal inference and statistical modeling Work closely with engineers, analytics specialists, and other teams to support data-driven decisions Requirements PhD-level expertise in a relevant field Strong research background with experience applying findings to real products Experience with experimentation platforms, statistical modeling, or machine learning Interest in developing tools for data scientists and product teams Location and work arrangement This full-time role is open to candidates based in any Figma U.S. hub (New York, NY or San Francisco, CA), or working remotely within the United States.
Full-time|$260K/yr - $320K/yr|On-site|San Francisco, CA - US
At Crusoe, our mission is to expedite the emergence of abundant energy and intelligence. We are developing the foundational technology that enables ambitious AI creation without compromising on scale, speed, or sustainability.
Join the AI revolution with Crusoe’s sustainable technology initiatives. In this role, you will spearhead significant innovation, make a real-world impact, and collaborate with a team that is leading the charge in responsible and transformative cloud infrastructure.
Position Overview:
Crusoe is in search of a Senior Director of Development to oversee the execution and delivery of our expansive AI data center projects, each with budgets exceeding $1 billion. This role is pivotal in our physical project delivery, encompassing all phases of the development lifecycle, including entitlements and permitting management, contractor oversight, and site execution. You will act as the primary development leader on key projects, ensuring seamless delivery while managing executional risks. This hands-on leadership role demands technical expertise, project management skills, and municipal coordination capabilities.
Note: This position requires in-office attendance five days a week at our San Francisco, CA location.
Key Responsibilities:
Project Delivery: Lead the execution of Crusoe’s multi-billion dollar data center campuses from initial concept to final completion.
Site Oversight: Manage construction managers, engineering teams, and third-party consultants effectively.
Project Scheduling: Develop and maintain comprehensive project schedules, ensuring alignment of design, permitting, and construction milestones.
Entitlements: Supervise entitlements and permitting processes in collaboration with governmental agencies.
Risk Management: Proactively identify and resolve risks that could affect the project timeline.
Stakeholder Coordination: Act as a key liaison with government entities and internal engineering teams.
Infrastructure Management: Oversee utility requirements and municipal approval processes.
Qualifications:
Experience: A minimum of 10 years in real estate development roles, ideally within industrial, life sciences, or data center sectors.