ML Performance Engineer - GPU & Inference Optimization
Technology
Modal
New York, United States
1 month ago
Until 4/25/2026
Job description
A leading AI infrastructure firm in New York is seeking experienced engineers to optimize ML systems at scale. The ideal candidate will have over 5 years of high-performance coding experience, familiarity with torch and NVIDIA GPU architecture, and skills in ML performance engineering. The role offers the opportunity to contribute to significant projects on a fast-growing team with locations including NYC, San Francisco, and Stockholm.