Skip to main content

AI Engineer

Technology
DigiHyre
România, Româniaacum 1 luniPână la 29.05.2026
Normă întreagăHibrid

Descrierea postului

What You Will Do

Your responsibilities will include

  • Designing and building production-ready RAG pipelines — document ingestion, chunking strategies, embedding generation, vector storage, retrieval logic, and response generation using LLMs
  • Integrating LLM APIs into existing product backends — OpenAI, Anthropic, Mistral, or open-source models — with proper error handling, retry logic, streaming support, and cost awareness
  • Working with vector databases including Pinecone, Weaviate, Qdrant, or pgvector — managing indexes, optimizing retrieval, and ensuring search quality meets product requirements
  • Implementing evaluation frameworks to measure model output quality — building evals, tracking regressions, and creating feedback loops that improve system performance over time
  • Designing and maintaining AI-related APIs that integrate cleanly with the rest of the product stack — authentication, rate limiting, input validation, and output formatting all included
  • Building and maintaining LLM orchestration workflows using frameworks such as LangChain or LlamaIndex — chaining prompts, managing context windows, and handling multi-step reasoning tasks
  • Collaborating with backend and frontend engineers to integrate AI features into existing product surfaces — you understand the full stack well enough to work effectively across it

Requirements The following are non-negotiable for this role:

Technical

  • 2 to 5 years of overall engineering experience with at least 1 to 2 years working specifically with LLMs or AI systems in a production environment — not just experimentation or coursework
  • Strong Python skills — you write clean, maintainable, production-quality Python; you understand dependency management, async patterns, and testing
  • Hands-on experience with at least one major LLM provider — OpenAI, Anthropic, Mistral — and familiarity with open-source alternatives such as Llama or Falcon
  • Practical experience building RAG systems — you have designed the full pipeline, not just the retrieval step
  • Working knowledge of vector databases and embedding models — you have used them in production and understand their tradeoffs
  • Ability to evaluate model outputs systematically — you do not rely on vibes; you build evals, track metrics, and iterate based on data
  • Solid API design skills — AI features exist within larger systems and you understand how to design the interfaces that connect them cleanly
  • Experience with LangChain, LlamaIndex, or a comparable orchestration framework in a real project context

Availability & Setup

  • Full-time availability: 40 hours per week with no parallel full-time commitments
  • Minimum 4 hours of consistent daily overlap with Central European Time
  • Stable internet connection and a reliable, dedicated home office setup

Communication

  • English proficiency at B2 level or above — you communicate directly with a European client engineering team in meetings, written updates, and technical documentation
  • Able to explain AI system behavior, limitations, and tradeoffs clearly to both technical and non-technical stakeholders

Keywords
OrchestrationGNU parallelFalconPythonIteration

¿Te interesa este puesto?