Experience Level
Entry Level
Qualifications
Proficiency in C/C++ and Python programming languages. Strong understanding of operating systems, particularly kernel architecture. Experience with performance tuning and optimization techniques. Familiarity with AI frameworks and tools. Excellent problem-solving skills and a passion for technology. Bachelor's degree in Computer Science or a related field preferred.
About the job
About the Role
OpenAI is looking for a Software Engineer specializing in Kernel Performance and AI Tooling to join the team in San Francisco. This role centers on improving software systems for maximum efficiency and building advanced tools that support AI development.
What You Will Do
Optimize kernel-level performance across OpenAI's software stack.
Design and implement tools that accelerate AI research and deployment.
Work closely with engineers to identify bottlenecks and deliver practical solutions.
Contribute to technical discussions and share knowledge with teammates.
Team and Collaboration
Work alongside engineers who are committed to advancing AI technology. Collaboration and innovation are central to the team’s approach.
About OpenAI
OpenAI is at the forefront of artificial intelligence research and deployment. We are committed to ensuring that AI benefits all of humanity. Join our mission to create safe and powerful AI technologies that can transform industries and improve lives.
Full-time|$166K/yr - $225K/yr|On-site|San Francisco, California
P-97 At Databricks, we are dedicated to empowering data teams to tackle some of the most challenging problems in the world. We achieve this by creating and managing a leading data and AI infrastructure platform that enables our clients to leverage deep data insights for business enhancement. Our commitment to pushing the limits of data and AI technology is matched by our focus on resilience, security, and scalability, which are essential for our customers' success on our platform. Databricks operates one of the largest-scale software platforms, comprising millions of virtual machines that generate terabytes of logs and process exabytes of data daily. Given our scale, we frequently encounter cloud hardware, network, and operating system faults, and our software must adeptly protect our customers from these issues. As a Senior Performance Engineer, you will collaborate with various teams throughout the organization to assess product and feature performance, pinpoint performance bottlenecks, and partner with engineers to address performance and scalability challenges. This includes setting performance goals for different software releases, guiding teams in developing performance benchmarks, conducting competitive benchmark analyses for various Databricks products, and performing in-depth analyses to identify and resolve performance issues.
Join our talented team at Canva as a Senior Software Engineer specializing in Video Performance. We are looking for an innovative and solutions-oriented engineer who is passionate about optimizing video experiences for our users. In this role, you will collaborate with cross-functional teams to enhance performance, develop new features, and implement best practices in video engineering.
At ClickUp, we're not just developing software; we're shaping the future of work! In an era dominated by work sprawl, we identified a more efficient way. This led us to create the first truly integrated AI workspace, consolidating tasks, documents, chat, calendar, and enterprise search, all enhanced by context-driven AI. Our mission is to empower millions of teams to escape silos, reclaim their time, and reach unprecedented levels of productivity. At ClickUp, you'll have the chance to learn, innovate, and leverage AI in transformative ways that will not only influence our product but also the broader landscape of work itself. Join a daring, pioneering team that's challenging the limits of what's possible!
We are on the lookout for a technical leader in SaaS client performance who is passionate about enhancing the customer experience through top-tier performance solutions. As a Senior Performance Engineer, you will spearhead comprehensive strategies to optimize application speed, memory utilization, and reliability across our entire platform. You will be empowered to analyze, diagnose, and address performance bottlenecks wherever they arise (front-end, back-end, or infrastructure), ensuring ClickUp remains the fastest and most reliable productivity platform available.
The ideal candidate is a hands-on authority in browser and NodeJS performance, with a thorough understanding of how code influences rendering, memory management, and overall user experience. You excel in solving intricate challenges, collaborating across teams, and establishing new benchmarks for performance excellence. If you're driven to make a significant impact for millions of users, this is your chance to lead at scale.
Your Responsibilities:
Conduct root cause analysis on client performance issues and perform post-mortems.
Profile application code to identify inefficient algorithms, memory leaks, and other issues; propose and implement effective solutions.
Establish performance monitoring, alerting, and dashboards to proactively detect and resolve client performance challenges.
Examine client traffic patterns, load testing outcomes, and other metrics to set benchmarks and drive enhancements.
Champion performance best practices and set performance standards across the engineering organization.
Identify infrastructure upgrades (caching, CDNs, database optimization) to elevate the client experience.
Collaborate with development teams to incorporate performance as a core requirement in the development of new features.
Join Cloudflare as a Senior Software Engineer specializing in Network Performance & Reliability! In this role, you'll be at the forefront of enhancing the performance and stability of our global network, ensuring our customers benefit from unparalleled speed and reliability. You'll collaborate with experts across various teams to design and implement innovative solutions that optimize network operations.
Join Crusoe as a Senior Systems Performance Engineer, where you will play a crucial role in optimizing and enhancing our systems for superior performance. You will be responsible for diagnosing performance bottlenecks, implementing solutions, and ensuring that our infrastructure can scale efficiently. Work in a dynamic environment that encourages innovation and professional growth.
About Us
At Lemurian Labs, we are dedicated to democratizing AI technology while prioritizing sustainability. Our mission is to create solutions that minimize environmental impact, ensuring that artificial intelligence serves humanity positively. We are committed to responsible innovation and the sustainable growth of AI.
We are in the process of developing a state-of-the-art, portable compiler that empowers developers to 'build once, deploy anywhere.' This technology ensures seamless cross-platform integration, allowing for model training in the cloud and deployment at the edge, all while maximizing resource efficiency and scalability.
If you are passionate about scaling AI sustainably and are eager to make AI development more powerful and accessible, we invite you to join our team at Lemurian Labs. Together, we can build a future that is innovative and responsible.
The Role
We are seeking a Senior ML Performance Engineer to take charge of designing and leading our Performance Testing Platform from inception. In this pivotal role, you will be recognized as the technical expert in measuring, validating, and enhancing the performance of large language models (including Llama 3.2 70B, DeepSeek, and others) prior to and following compiler optimization on cutting-edge GPU architectures.
This is a critical position that will significantly impact our product quality and customer success. You will work at the intersection of Machine Learning systems, GPU architecture, and performance engineering, constructing the infrastructure that substantiates the value of our compiler.
ABOUT BASETEN
Baseten is at the forefront of AI technology, empowering leading-edge companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer to seamlessly integrate advanced AI models into their operations. Our unique blend of applied AI research, adaptable infrastructure, and intuitive developer tools enables innovators to bring their most ambitious AI products to life. With our recent $300M Series E funding from top-tier investors such as BOND, IVP, Spark Capital, Greylock, and Conviction, we are poised for rapid growth. Join us in shaping the platform that engineers rely on to deploy transformative AI solutions.
THE ROLE
Are you driven by a passion for enhancing artificial intelligence applications? We are seeking a proactive Software Engineer specializing in ML performance to join our energetic team. This position is perfect for backend engineers who thrive in a fast-paced startup environment and are eager to make substantial contributions to the realm of Large Language Model (LLM) Inference. If you're enthusiastic about optimizing open-source ML models, we can't wait to hear from you!
EXAMPLE INITIATIVES
As a member of our Model Performance team, you will have the opportunity to work on exciting projects, including:
Baseten Embeddings Inference: The quickest embeddings solution available
The Baseten Inference Stack
Driving model performance optimization
RESPONSIBILITIES
Develop, refine, and implement advanced techniques (quantization, speculative decoding, KV cache reuse, chunked prefill, and LoRA) for ML model inference and infrastructure.
Conduct thorough investigations into the codebases of TensorRT, PyTorch, TensorRT-LLM, vLLM, SGLang, CUDA, and other libraries to troubleshoot and resolve ML performance issues.
Scale and apply optimization techniques across a diverse array of ML models, with a focus on large language models.
Role overview
This Software Engineer position at OpenAI focuses on inference and performance optimization. Based in San Francisco, the role centers on increasing the speed and efficiency of advanced AI systems. Collaboration with experienced engineers is a key part of the work, with an emphasis on refining AI performance.
What you will do
Work on optimizing the performance of AI inference systems
Collaborate with other engineers to improve efficiency and speed
Contribute to solutions that enhance AI system capabilities
Location
This role is based in San Francisco.
OpenAI is seeking a Software Engineer in San Francisco to focus on improving productivity by optimizing model performance. This position centers on developing solutions that make machine learning models more efficient and effective.
Role overview
This role involves working closely with teams across different functions to identify and address areas where model performance can be improved. The aim is to deliver changes that have a measurable impact on both systems and workflows.
What you will do
Collaborate with engineers and other specialists to enhance model efficiency
Develop and implement solutions that improve the effectiveness of machine learning systems
Contribute to projects that streamline processes and drive productivity gains
Impact
Your work will help shape improvements in how models operate and how teams at OpenAI achieve their goals. The changes you help deliver will support more effective use of resources and better outcomes for the organization.
Join Cloudflare as a Software Engineer dedicated to enhancing our network performance and reliability. In this dynamic role, you will collaborate with cross-functional teams to develop innovative software solutions that optimize our network infrastructure and ensure high availability and performance for our users. Your contributions will directly impact millions of users worldwide, making the internet a safer place for everyone.
Full-time|$180K/yr - $250K/yr|On-site|San Francisco
Join fal in our pursuit to maintain a leading edge in model performance for generative media models. You'll be instrumental in designing and implementing innovative solutions for model serving architecture, built on our proprietary inference engine. Your focus will be on maximizing throughput while minimizing latency and resource consumption. In addition, you will create performance monitoring and profiling tools to identify bottlenecks and optimization opportunities. Collaborate closely with our Applied ML team and clients in the media sector to ensure their workloads leverage our accelerator effectively.
Full-time|$190.9K/yr - $232.8K/yr|On-site|San Francisco, California
P-1285
About This Role
Join our dynamic team at Databricks as a Staff Software Engineer specializing in GenAI Performance and Kernel. In this pivotal role, you will take charge of designing, implementing, and optimizing high-performance GPU kernels that drive our GenAI inference stack. Your expertise will lead the development of finely-tuned, low-level compute paths, balancing hardware efficiency with versatility, while mentoring fellow engineers in the intricacies of kernel-level performance engineering. Collaborating closely with machine learning researchers, systems engineers, and product teams, you will elevate the forefront of inference performance at scale.
What You Will Do
Lead the design, implementation, benchmarking, and maintenance of essential compute kernels (such as attention, MLP, softmax, layernorm, memory management) tailored for diverse hardware backends (GPU, accelerators).
Steer the performance roadmap for kernel-level enhancements, focusing on areas like vectorization, tensorization, tiling, fusion, mixed precision, sparsity, quantization, memory reuse, scheduling, and auto-tuning.
Integrate kernel optimizations seamlessly with higher-level machine learning systems.
Develop and uphold profiling, instrumentation, and verification tools to identify correctness, performance regressions, numerical discrepancies, and hardware utilization inefficiencies.
Conduct performance investigations and root-cause analyses to address inference bottlenecks, such as memory bandwidth, cache contention, kernel launch overhead, and tensor fragmentation.
Create coding patterns, abstractions, and frameworks to modularize kernels for reuse, cross-backend compatibility, and maintainability.
Influence architectural decisions to enhance kernel efficiency (including memory layout, dataflow scheduling, and kernel fusion boundaries).
Guide and mentor fellow engineers focused on lower-level performance, conducting code reviews and establishing best practices.
Collaborate with infrastructure, tooling, and machine learning teams to implement kernel-level optimizations in production and assess their impacts.
Join Canva as a Staff Software Engineer specializing in Video Performance. In this role, you will be instrumental in enhancing our video features, ensuring top-notch performance for our users. You will collaborate with cross-functional teams, leveraging your expertise to drive innovation and optimize our video products.
Full-time|$190K/yr - $288K/yr|Hybrid|San Francisco, California
About Sentry
In a world inundated with poor software, Sentry stands out with its commitment to ensuring developers can create high-quality software efficiently. Our mission is to empower developers to write better software faster, allowing everyone to rediscover the joy of technology.
With over $217 million in funding and a growing community of more than 100,000 organizations, including industry giants like Disney, Microsoft, and Atlassian, we are at the forefront of developing performance and error monitoring tools that streamline the development process.
Sentry supports a dynamic hybrid work environment across our global offices, with dedicated in-office days on Mondays, Tuesdays, and Thursdays to foster collaboration. If you have a passion for crafting tools that enhance the digital experience, join us in shaping the future of software monitoring.
About the Role
As a Senior Software Engineer on Sentry’s AI/ML team, you will play a pivotal role in developing the platform utilized by our debugging agents. This position is critical in incorporating AI and machine learning into our fundamental products, ranging from issue triage and resolution to predictive analytics for application performance monitoring. Your contributions will provide actionable insights to companies worldwide, enabling them to enhance their software development processes.
Your Responsibilities
Develop advanced agentic AI platforms to triage, debug, and resolve real-world production challenges.
Utilize Sentry’s extensive dataset of errors, spans, and profiles to inform your development.
Lead the development of significant projects within the AI/ML domain.
What You’ll Love About This Job
You are motivated by impact and relish working on high-stakes, visible projects.
You enjoy the process of building and will have the chance to be a foundational member of the AI/ML team.
You thrive in collaborative environments and enjoy developing features alongside cross-functional teams.
Senior Infrastructure & Performance Engineer
As a Senior Infrastructure & Performance Engineer, you will take charge of enhancing the performance, reliability, and scalability of Nash's foundational infrastructure. Collaborating closely with the Engineering Leadership and both platform and product engineering teams, you will design and manage low-latency, mission-critical systems that facilitate real-time logistics for some of the world's largest retailers.
This is a key senior role focused on elastic capacity, high availability, cloud-native architectures, Postgres performance, and enterprise-grade CI/CD for multi-region deployments. You will define the technical roadmap, establish best practices, and implement systems that support the essential workflows of major retailers.
Key Responsibilities
Oversee infrastructure performance and reliability for Nash's production environments, ensuring low latency, high throughput, and consistent performance under load.
Design, develop, and enhance AWS infrastructure, utilizing managed services with a focus on ECS/Fargate.
Lead initiatives in Postgres performance engineering, including query optimization, indexing strategies, connection management, replication, cluster design, and failover.
Architect and maintain multi-region, highly available systems with robust resiliency and guaranteed disaster recovery.
Design and refine enterprise-grade CI/CD pipelines that enable safe, repeatable, and rapid deployments across environments and regions.
Establish observability standards (metrics, logs, tracing, SLOs) to proactively identify and resolve performance bottlenecks.
Collaborate with application engineers to inform system design choices that influence scalability, latency, and reliability.
Lead incident response efforts and postmortems, emphasizing root cause analysis, systemic improvements, and long-term resilience.
Set best practices for infrastructure and performance while mentoring engineers throughout the organization.
Qualifications
6+ years of experience in building and managing high-scale production infrastructure for mission-critical systems.
Proficiency with AWS, particularly with ECS/Fargate, and experience with cloud-native architecture.
Strong background in Postgres performance tuning and optimization.
Deep understanding of CI/CD practices and experience in multi-region deployments.
Exceptional analytical and problem-solving skills, with a proactive approach to performance management.
Full-time|$170K/yr - $200K/yr|On-site|San Francisco, CA - US
At Crusoe, our vision is to enhance the availability of energy and intelligence. We are at the forefront of developing solutions that empower individuals to innovate boldly with AI, all while ensuring that we uphold principles of scalability, speed, and sustainability.
Join us in driving the AI revolution through sustainable technology at Crusoe. You will play a pivotal role in fostering innovation, making a significant impact, and collaborating with a team that is leading the way in responsible and transformative cloud infrastructure.
About the Role
We are on the lookout for a Senior Software Engineer to take on the role of founding engineer within our new Enterprise Software Engineering team in IT. You will work closely with the Director to develop internal tools, automation processes, and integrations that remove manual tasks and address critical operational challenges across the organization.
This role transcends traditional enterprise IT positions. You will leverage AI-assisted development as your primary method, delivering production-quality software within days instead of months, while laying the groundwork for future team members. As you grow in this role, your primary contributions will focus on defining specifications, context, and reusable patterns to guide AI agents, while still engaging in hands-on coding for the most complex challenges.
Your Responsibilities
Designing and deploying internal tools, automation, and integrations that provide measurable benefits across Finance, Operations, HR, and other business functions.
Utilizing AI-assisted development as your standard workflow, which includes drafting specifications, creating prompts, reviewing AI-generated code, and iterating quickly.
Establishing foundational technical patterns such as coding standards, project conventions, reusable components, and context files that facilitate AI-driven development.
Creating and managing integrations between enterprise systems through APIs, middleware, and data pipelines.
Developing solutions across a versatile environment that combines cloud application platforms with internal GPU compute for AI workloads.
Collaborating with business teams during rapid prototyping sessions, then refining prototypes into production-ready systems complete with testing, monitoring, and documentation.
Setting up CI/CD pipelines, automated testing, and quality assurance measures for all solutions delivered.
Taking ownership of solutions from end to end, covering everything from specification and implementation to deployment and production support.
Role overview
The Performance Modeling Engineer II position at OpenAI centers on building and applying performance models to enhance the efficiency of advanced AI systems. Based in San Francisco, this role contributes to the reliability and speed of OpenAI’s technologies.
What you will do
Develop and implement performance models for AI systems
Collaborate with data scientists and engineers to refine performance metrics
Support the efficiency and rigorous standards of OpenAI’s technologies
Full-time|$124.9K/yr - $228.9K/yr|On-site|San Francisco
The Trade Desk is a leading global technology company dedicated to fostering a better, more open internet for everyone through principled, intelligent advertising. With the capability to handle over 1 trillion queries daily, our platform operates at an unparalleled scale. We pride ourselves on our award-winning culture, built on the foundations of trust, ownership, empathy, and collaboration. We appreciate the unique experiences and perspectives that every individual brings to The Trade Desk, and we are committed to creating inclusive spaces where everyone can express their authentic selves at work. If you are passionate about solving complex problems at scale and are eager to join a dynamic, globally-connected team where your contributions will significantly impact the media ecosystem, we invite you to explore why Fortune magazine consistently ranks The Trade Desk among the top small to medium-sized workplaces worldwide. As a Senior Software Engineer, you will have end-to-end ownership, allowing you to engage in various facets of designing, building, and delivering data-centric products for our stakeholders. At The Trade Desk, we focus on constructing the back-end infrastructure of our platform with an unwavering commitment to quality at scale. Whether developing components for our client-facing applications, crafting internal custom solutions for our teams, or building model pipelines for bidding optimizations, we ensure that the infrastructure, development processes, and tools empower us to execute efficiently. Our systems operate continuously, serving global traffic, and we collaborate in a highly cooperative environment while leveraging a diverse array of technologies. Our back-end developers tackle algorithmic, optimization, and scalability challenges across all our initiatives.
Full-time|On-site|San Francisco, CA; New York City, NY
Merge stands at the forefront of innovation, delivering cutting-edge tools and customer-centric integrations tailored for emerging LLMs, Fortune 500 corporations, and B2B SaaS enterprises. Our platform encompasses two primary offerings: Merge Unified, which allows organizations to seamlessly incorporate a multitude of integrations through a single API, and Merge Agent Handler, which equips AI agents with secure access to a wide array of third-party tools. Trusted by thousands, Merge's enterprise-level platform streamlines the entire integration lifecycle, ensuring robust authentication, security, monitoring, and maintenance. Our solutions empower companies to accelerate product development, enhance sales efficiency, minimize customer churn, and conserve engineering resources, enabling them to concentrate on their core offerings.
Key Responsibilities:
Lead discussions on roadmapping and architectural strategies.
Develop high-quality, production-ready, and maintainable code.
Steer Merge's most significant projects from conceptualization to deployment.
Support and empower your team, including engineers, designers, product managers, and business operators.
Ideal Candidate Profile:
7+ years of professional software engineering experience.
Proficiency with frameworks such as Django, Rails, or Spring.
In-depth understanding of SQL databases, preferably Postgres.
Projects You May Engage In:
Scaling Core Systems: Identify and resolve performance bottlenecks in our data syncing engine, lead large-scale migrations to more efficient technologies and storage solutions, and drive initiatives to enhance system throughput.
Product Innovation: Design adaptive syncing paradigms that facilitate advanced customization across model normalization, scheduling, data filtering, and API rate limit management.
New Revenue Streams: Collaborate on the development of features that open new avenues for revenue generation within the platform.
Feb 26, 2026