This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Deep Learning Engineer, LLM Accuracy Evaluation in Switzerland .

Join a cutting-edge engineering team focused on advancing how next-generation AI models are evaluated and optimized. In this role, you will work at the intersection of deep learning research and scalable infrastructure, helping define new standards for assessing the accuracy and reliability of large language models, retrieval-augmented systems, and multimodal architectures. You will collaborate with global partners and open-source communities to bring high-performance AI models into production-ready environments.

With access to powerful computing resources and emerging technologies, you will contribute directly to shaping the future of AI systems. This position offers a fast-paced, innovation-driven environment where experimentation, technical excellence, and impact go hand in hand.

Accountabilities

Lead the development of advanced methodologies to evaluate the performance, accuracy, and robustness of deep learning models, including LLMs, RAG systems, and vision models

Collaborate with internal teams and external partners to optimize and deploy flagship AI models as high-performance inference services

Design, build, and maintain scalable tools, pipelines, and infrastructure supporting AI evaluation and benchmarking initiatives

Analyze and improve AI frameworks, libraries, and APIs to ensure alignment with best practices and performance standards

Conduct experiments and research to validate new evaluation techniques and contribute to continuous model improvement

Support cross-functional initiatives by translating complex technical findings into actionable insights

Requirements

Advanced degree (BS, MS, or PhD) in Computer Science, Artificial Intelligence, Applied Mathematics, or a related field, or equivalent experience

Extensive hands-on experience (10+ years) in AI development, particularly in NLP and large language models

Strong expertise in deep learning algorithms, mathematical modeling, and performance evaluation techniques

Proven ability in debugging, testing, and optimizing large-scale AI systems

Experience with inference and deployment technologies such as TensorRT, ONNX, or Triton is highly desirable

Familiarity with MLOps/DevOps practices, containerization (Docker), and Linux-based environments

Experience working with large-scale computing environments or HPC clusters is a plus

Excellent communication skills and ability to collaborate effectively in a fast-paced, global environment

Benefits

Opportunity to work on state-of-the-art AI technologies with access to cutting-edge hardware and infrastructure

Flexible and remote-friendly working environment within Spain

Exposure to global collaborations with leading AI researchers and engineers

Continuous learning and development opportunities in a rapidly evolving field

Competitive compensation package with performance-based incentives

Inclusive and diverse workplace culture that fosters innovation and creativity

Why Apply Through Jobgether?

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!

Why Apply Through Jobgether?

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

Deep Learning Engineer, LLM Accuracy Evaluation

Stellenbeschreibung

Accountabilities

Verwandt

Verwandt