
ML Performance Engineer - GPU & Inference Optimization

Technology
Modal
New York, United States | 1 month ago | Until 4/25/2026

Job description

A leading AI infrastructure firm in New York is seeking experienced engineers to optimize ML systems at scale. The ideal candidate has more than 5 years of experience writing high-performance code, is familiar with torch and NVIDIA GPU architecture, and has ML performance engineering skills. The role offers the chance to contribute to significant projects on a fast-growing team across multiple locations, including NYC, San Francisco, and Stockholm.
