
Senior Software Development Engineer, Machine Learning

Technology
Company in the high-tech / hardware / software / cyber sector
Tel Aviv-Yafo, Israel | 3 days ago | until 22.7.2026
Full-time

Job description

The MLIL FLOW team is looking for a Senior Software Development Engineer to lead the design and delivery of systems software for our next-generation ML accelerator servers. We build production software to validate, initialize, monitor, and qualify these servers, from first silicon through fleet-scale deployment. We work on the physical systems that execute ML workloads: server bring-up, hardware diagnostics, interconnect validation, power/thermal monitoring, and fleet-scale operations are our bread and butter.

Key job responsibilities

  • Lead the architecture and implementation of hardware validation and diagnostic software for new ML acceleration platforms.
  • Drive technical direction for PCIe validation, power/thermal diagnostics, and stress-testing frameworks that run across manufacturing, vetting, and production environments.
  • Own subsystems end-to-end: from design through implementation, testing, deployment, and operational excellence at fleet scale.
  • Work with Hardware, Manufacturing, and EC2 teams to create coordinated software packages that enable both qualification and rapid deployment.
  • Debug and root-cause complex hardware/software interaction failures on first silicon and production fleet returns; drive root-cause analysis to closure.
  • Build and maintain data pipelines, dashboards, and monitoring systems for fleet health and performance benchmarking.
  • Mentor engineers, define best practices, drive design reviews, and raise the bar for the team.
  • Lead multiple development initiatives in parallel, balancing schedule, risk, and technical quality across a fast-moving hardware program.

Requirements

Basic Qualifications

  • Bachelor's degree or above in Computer Science, Computer Engineering, Electrical Engineering, or related fields.
  • At least 8 years of professional software development experience.

Preferred Qualifications

  • Experience with hardware bring-up, ASIC/FPGA validation, or manufacturing test development.
  • Proficiency in scripting languages (Python, Lua) for test automation and data analysis.
  • Track record of cross-team influence and delivering results through others.
  • Experience building data pipelines, ETL systems, or fleet-scale monitoring/dashboarding.
  • Demonstrated project-management experience leading multiple R&D initiatives in parallel.
  • Advantage: Experience with PCIe or high-speed interconnect validation and debugging.

This position is open to all candidates.
Keywords
OCaml, GNU parallel, Lua, Python, Stress Testing, Debugger, Parallel, Debugging
