Diversity Nexus
Hi,
We have an immediate requirement for a Software Developer/Engineer (AI/ML Engineer or LLM Engineer) role with one of our direct clients. (US citizens and green card holders on W2 only; local candidates only.)
Software Developer/Engineer (Mid-Level experience)
Philadelphia, PA (Hybrid, minimum 3 days in the office)
Interview Schedule: 1st interview, 1-hour, in-person;
2nd interview, 1-hour, in-person
Consultant Requirements – On-Prem LLM & Vector DB Implementation
Hands-on experience deploying open-source LLMs such as Meta Llama 3 and Mistral / Mixtral in on-prem or private environments
Strong proficiency in Python for LLM inference, prompt engineering, and integration
Experience with CPU-based inference, model quantization, and performance tuning
Vector Databases & RAG
Practical experience with open-source vector databases such as Qdrant, Chroma, Milvus, or pgvector
Proven implementation of Retrieval-Augmented Generation (RAG) pipelines
Experience generating and managing embeddings and applying metadata filtering
Security & Governance
Understanding of data privacy, air-gapped deployments, and enterprise security requirements
Experience implementing access controls and audit logging
Experience with LangChain or LlamaIndex
Exposure to Rust, Go, or C++ for high-performance services
Familiarity with Docker and Kubernetes for on-prem deployments
Knowledge of inference frameworks (e.g., vLLM, Hugging Face Transformers)
Prior work in regulated or enterprise environments
Reference architecture and deployment guidance
Working prototype (LLM + vector DB + RAG)
Documentation and knowledge transfer to internal teams
If this opportunity aligns with your career interests, please share your updated resume along with your availability for a brief discussion to explore the role further.
Additionally, if you know of any professionals in your network who may be a good fit, we would greatly appreciate your referrals.
Thank you for your time and consideration. I look forward to your response.
Thanks & Best Regards,
4365 Route 1 South, Princeton, NJ 08540
eshwar venkatesh
Mid-Senior level