Deep Learning Engineer, LLM Accuracy Evaluation
Job Description
This position is posted by Jobgether on behalf of a partner company. We are currently looking for a Deep Learning Engineer, LLM Accuracy Evaluation in Switzerland.
Join a cutting-edge engineering team focused on advancing how next-generation AI models are evaluated and optimized. In this role, you will work at the intersection of deep learning research and scalable infrastructure, helping define new standards for assessing the accuracy and reliability of large language models, retrieval-augmented systems, and multimodal architectures. You will collaborate with global partners and open-source communities to bring high-performance AI models into production-ready environments.
With access to powerful computing resources and emerging technologies, you will contribute directly to shaping the future of AI systems. This position offers a fast-paced, innovation-driven environment where experimentation, technical excellence, and impact go hand in hand.
Accountabilities
- Lead the development of advanced methodologies to evaluate the performance, accuracy, and robustness of deep learning models, including LLMs, RAG systems, and vision models
- Collaborate with internal teams and external partners to optimize and deploy flagship AI models as high-performance inference services
- Design, build, and maintain scalable tools, pipelines, and infrastructure supporting AI evaluation and benchmarking initiatives
- Analyze and improve AI frameworks, libraries, and APIs to ensure alignment with best practices and performance standards
- Conduct experiments and research to validate new evaluation techniques and contribute to continuous model improvement
- Support cross-functional initiatives by translating complex technical findings into actionable insights
Requirements
- Degree (BS, MS, or PhD) in Computer Science, Artificial Intelligence, Applied Mathematics, or a related field, or equivalent experience
- Extensive hands-on experience (10+ years) in AI development, particularly in NLP and large language models
- Strong expertise in deep learning algorithms, mathematical modeling, and performance evaluation techniques
- Proven ability in debugging, testing, and optimizing large-scale AI systems
- Experience with inference and deployment technologies such as TensorRT, ONNX, or Triton is highly desirable
- Familiarity with MLOps/DevOps practices, containerization (Docker), and Linux-based environments
- Experience working with large-scale computing environments or HPC clusters is a plus
- Excellent communication skills and ability to collaborate effectively in a fast-paced, global environment
Benefits
- Opportunity to work on state-of-the-art AI technologies with access to cutting-edge hardware and infrastructure
- Flexible and remote-friendly working environment within Spain
- Exposure to global collaborations with leading AI researchers and engineers
- Continuous learning and development opportunities in a rapidly evolving field
- Competitive compensation package with performance-based incentives
- Inclusive and diverse workplace culture that fosters innovation and creativity
Why Apply Through Jobgether?
We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.
We appreciate your interest and wish you the best!
Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.