About the team Applied Evals defines what good looks like for safe, advanced AI systems. We turn complex, high-value workflows into clear, reproducible signals that guide model training and product quality. Our work bridges frontier customers and models, ensuring improvements show up where users exp
Evals jobs in Berkeley
76 evals opportunities in Berkeley updated today
job offers found
· Page 3 / 4Role At Variance, we are teaching machines to make the hardest judgment calls at scale. That means building AI agents for the high-stakes gray area of risk investigations, fraud, and identity reviews. We’re a small, talent-dense team in San Francisco working on a problem at the edge of what AI syste
About the team The Frontier Evals & Environments team builds north star model environments to drive progress towards safe AGI/ASI. This team builds ambitious environments to measure and steer our models, and creates self‑improvement loops to steer our training, safety, and launch decisions. Some of
Machine Learning Engineer - LLM Evals + Observability Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI ag
About Glean: Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry's most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With o
Overview Workato transforms technology complexity into business opportunity. As the leader in enterprise orchestration, Workato helps businesses globally streamline operations by connecting data, processes, applications, and experiences. Its AI-powered platform enables teams to navigate complex work
About Edison Scientific builds and commercializes AI agents for science. Scientific discovery moves too slowly, and autonomous AI agents are how we intend to fix that. We're assembling a team of top researchers and engineers across AI and biology to build an AI scientist. Role We are seeking an ambi
About Us At LangChain, our mission is to make intelligent agents ubiquitous. We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. We began as widely adopted open-source tools and have grown to a
About Us: At LangChain, our mission is to make intelligent agents ubiquitous. We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. We began as widely adopted open-source tools and have grown to
About the team Applied Evals defines what good looks like for safe, advanced AI systems. We turn complex, high-value workflows into clear, reproducible signals that guide model training and product quality. Our work bridges frontier customers and models, ensuring improvements show up where users exp
Be Your Own Lab Prime Intellect builds the infrastructure that frontier AI labs build internally, and makes it available to everyone. Our platform, Lab, unifies environments, evaluations, sandboxes, and high-performance training into a single full-stack system for post-training at frontier scale, fr
Scale AI is seeking a technically rigorous and driven AI Research Engineer to join our Enterprise Evaluations team. This high-impact role is critical to our mission of delivering the industry's leading GenAI Evaluation Suite . You will be a hands‑on contributor to the core systems that ensure the sa
Be Your Own Lab Prime Intellect builds the infrastructure that frontier AI labs build internally, and makes it available to everyone. Our platform, Lab, unifies environments, evaluations, sandboxes, and high-performance training into a single full-stack system for post-training at frontier scale, fr
A technology company is seeking a Frontend Engineer to develop and enhance features on their enterprise platform. This role requires strong proficiency in React, JavaScript, and Typescript, and the ability to collaborate with designers and engineers. The company offers competitive compensation, incl
About Glean Glean is the Work AI platform that helps everyone work smarter with AI. What began as the industry’s most advanced enterprise search has evolved into a full-scale Work AI ecosystem, powering intelligent Search, an AI Assistant, and scalable AI agents on one secure, open platform. With ov
A global AI research and deployment company is hiring product-minded engineers in San Francisco to design evals for advanced AI systems. You will define evaluation signals, prototype solutions, and enhance model reliability while working closely with research and product teams. The ideal candidate h
About Us: At LangChain, our mission is to make intelligent agents ubiquitous. We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. We began as widely adopted open-source tools and have grown to
About LangChain At LangChain, our mission is to make intelligent agents ubiquitous. We help developers build mission-critical AI applications across the entire agent development lifecycle. Our open source frameworks — LangChain and LangGraph — see over 70+ million downloads per month. Developers rel
Senior Fullstack Engineer At LangChain, our mission is to make intelligent agents ubiquitous. We build the foundation for agent engineering in the real world, helping developers move from prototypes to production-ready AI agents that teams can rely on. We began as widely adopted open-source tools an
A tech firm in AI development is seeking a Senior Full Stack Engineer for their LangSmith product in San Francisco. The role involves leading technical architecture, mentoring junior members, and collaborating with cross-functional teams. Ideal candidates should have over 7 years of experience in so
Estimated salary for Evals in Berkeley
$40,000 – $63,000/year
Estimation confidence: Low
Estimate based on market data for Berkeley. Actual salaries may vary depending on experience, company, and area.
View full salary data →Other professions in Berkeley
Explore other job opportunities in Berkeley