Responsibilities About the Team The Inference Infrastructure team is the creator and open-source maintainer of AIBrix, a Kubernetes-native control plane for large-scale LLM inference. We are part of ByteDance’s Core Compute Infrastructure organization, responsible for designing and operating the pla
About Abridge Abridge was founded in 2018 with the mission of powering deeper understanding in healthcare. Our AI-powered platform was purpose-built for medical conversations, improving clinical documentation efficiencies while enabling clinicians to focus on what matters most—their patients. Our en
About Armadin Armadin is an AI-native cybersecurity company building the ultimate attacker: an autonomous, agentic AI system that continuously finds and eliminates exploitable risk before adversaries do. We are redefining cybersecurity for the era of AI-driven Hyperattacks, deploying a fleet of spec
About Cartesia Our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are. Today, not even the best models can continuously process and reason over a year-long stream of audio, video and text—1B text tokens, 10B audio tokens and 1T video tokens
About Rivian Rivian is on a mission to keep the world adventurous forever. This goes for the emissions-free Electric Adventure Vehicles we build, and the curious, courageous souls we seek to attract. As a company, we constantly challenge what’s possible, never simply accepting what has always bee
Tech Lead Software Engineer - AI Inference Infrastructure Location: San Jose Team: Infrastructure Employment Type: Regular Job Code: A201019 Responsibilities Design and build large-scale, container-based cluster management and orchestration systems with extreme performance, scala
A cutting-edge AI company in San Francisco seeks an Inference Engineering Manager to lead a team dedicated to building scalable AI infrastructure. This role involves developing APIs, architecting inference systems, and enhancing reliability. Candidates should have over 5 years of engineering experie
About the Company Models are what they eat. But a large portion of training compute is wasted training on data that is already learned, irrelevant, or even harmful, leading to worse models that cost more to train and deploy. At DatologyAI, we've built a state-of-the-art data curation suite to autom
A leading AI platform provider in New York seeks a Member of Technical Staff to develop and deploy large language models. You will ensure high availability and low latency for AI applications, working in a collaborative environment. Required skills include extensive engineering experience, knowledge
Senior ML Infrastructure Engineer, Inference Platform Remote type: Hybrid Locations: Sunnyvale, California, United States of America; Austin, Texas, United States of America; Mountain View, California, United States of America; War
Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental
Machine Learning Infrastructure Engineer at Abridge. Base pay range: $179,000 to $248,000 per year.
A dynamic AI company is seeking an Infrastructure Software Engineer in San Francisco to build and maintain components of an ML inference platform. The successful candidate will develop infrastructure components using Python and Go, manage Kubernetes deployments, and enhance monitoring systems for mo
Frequently asked questions about Inference Infrastructure
How much does an Inference Infrastructure engineer earn in the United States?
The estimated salary for Inference Infrastructure roles in the United States ranges from $59,000 to $94,000 USD per year, depending on experience and location.
How many Inference Infrastructure jobs are available?
There are currently 122 job offers for Inference Infrastructure in the United States listed on BeBee.
Which cities have the most Inference Infrastructure jobs?
The cities with the most Inference Infrastructure jobs in the United States are: San Francisco, Seattle, San Jose, Palo Alto, and New York.
How can I apply for an Inference Infrastructure job?
Sign up for free on BeBee, complete your professional profile, and apply directly to the Inference Infrastructure positions that interest you with one click.