LEAD DATA SCIENTIST - Python
Job description
• Build and deploy multimodal ML models across:
  o Natural Language Processing (NLP)
  o Computer Vision (CV)
  o OCR and document understanding
• Develop robust pipelines for:
  o Text processing, entity extraction, and classification
  o Image tagging, moderation, and visual understanding
  o Speech-to-text and speaker-level analysis
• Implement Retrieval-Augmented Generation (RAG) pipelines with text and multimodal indexing.
• Strong proficiency in Python with PyTorch and/or TensorFlow.
• Hands-on experience with:
  o NLP, computer vision or speech models
• Working knowledge of LLM and orchestration frameworks:
  o LangChain, LlamaIndex or equivalent
• Experience with vector search and semantic retrieval:
  o FAISS, Pinecone, Weaviate, or Azure AI Search
• Solid understanding of Docker, Kubernetes, and CI/CD pipelines.

Python, Natural Language Processing (NLP), Kubernetes, Docker