Key Responsibilities

AI & LLM Systems

- Build end-to-end Retrieval-Augmented Generation (RAG) pipelines for context-aware AI responses.
- Implement and tune vLLM for efficient inference of large language models (LLMs).
- Collaborate with ML engineers to deploy transformer models (e.g., BERT, GPT