Volver al tablero de empleos

Confidencial

San Francisco

Software Engineer, ML Infrastructure

PresencialMidFull-time

Publicado 27 de enero de 2026

Oferta externa

Esta vacante proviene de una fuente externa. La descripción puede estar abreviada y algunos datos (salario, habilidades) pueden no estar disponibles. Regístrate como candidato para recibir la información completa.

Descripción del puesto

<p style="min-height:1.5em">Our mission is to automate coding. The first step in our journey is to build the best tool for professional programmers, using a combination of inventive research, design, and engineering. Our organization is very flat, and our team is small and talent dense. We particularly like people who are truth-seeking, passionate, and creative. We enjoy spirited debate, crazy ideas, and shipping code.</p><h1><strong>About the role</strong></h1><p style="min-height:1.5em">The ML Infrastructure team builds large-scale compute, storage, and software infrastructure to support Cursor’s work building the world’s best agentic coding model. We’re looking for strong engineers who are interested in building high-performance infrastructure and the software to support it. This role works closely with ML researchers and engineers to enable their work through improvements to our training framework, systems reliability/performance, and developer experience.</p><p style="min-height:1.5em"></p><h2><strong>What you’ll do</strong></h2><ul style="min-height:1.5em"><li><p style="min-height:1.5em">Collaborate with ML researchers to improve the throughput and reliability of training</p></li><li><p style="min-height:1.5em">Work with OEMs, cloud service providers, and others to plan and build cutting-edge GPU infrastructure</p></li><li><p style="min-height:1.5em">Improve the density and scalability of compute environments to enable increasingly large RL workloads</p></li><li><p style="min-height:1.5em">Create software and systems to automate building, monitoring, and running GPU clusters</p></li><li><p style="min-height:1.5em">Build workload scheduling and data movement systems to support Cursor’s growing training footprint</p></li></ul><h2><strong>You may be a fit if</strong></h2><ul style="min-height:1.5em"><li><p style="min-height:1.5em">A strong background in systems and infrastructure-focused software engineering, particularly in Python, Typescript, Rust, and Golang</p></li><li><p style="min-height:1.5em">Experience with distributed storage and networking infrastructure, particularly on Linux systems across cloud and bare metal environments</p></li><li><p style="min-height:1.5em">Exposure to large-scale systems and their unique challenges, ideally across thousands of nodes with significant resource footprints.</p></li><li><p style="min-height:1.5em">Production use of infrastructure-as-code and configuration management, across hosts and Kubernetes</p></li></ul><h2>Nice to have</h2><ul style="min-height:1.5em"><li><p style="min-height:1.5em">Operational exposure to Nvidia GPUs with Infiniband or RoCE, particularly with Blackwell and Hopper-class hardware</p></li><li><p style="min-height:1.5em">Exposure to Ray, Slurm, or other common compute and runtime schedulers</p></li></ul><p style="min-height:1.5em">#LI-DNI</p>