HumiintAt Humiint, we partner with innovators and tech-forward companies who have great data but struggle to turn it into a deployed product.
Whether it's taking a prototype recommendation engine and scaling it to handle 10k requests per second, setting up an MLOps pipeline to retrain models automatically, or fine-tuning an open-source LLM for a specific vertical, the right Fractional Machine Learning Engineer bridges the gap between "it works on my laptop" and "it works in production"—without the overhead of a full-time AI Infrastructure Team.
We're looking to connect with builders and systems thinkers who know that the model is only 5% of the code.
If you are an expert at containerization, cloud inference, and optimizing latency, this is a great opportunity to apply your engineering rigor flexibly while building the intelligence layer for growing companies.
This role is ideal for MLOps practitioners and software engineers with a deep understanding of AI who want to focus on the architecture and deployment of machine learning systems on a part-time, project-based, or advisory basis.
Wrapping models in APIs and containerizing them for scalable production use.
Building end-to-end data pipelines that feed data into models for training and inference.
LLM Fine-Tuning:
Adapting foundation models using techniques like LoRA/QLoRA on custom datasets.
Configuring cloud environments to balance performance and cost.
reducing inference time through quantization, model distillation, or using specialized hardware.
Monitoring & Observability:
Setting up tools to detect "data drift" and ensure model accuracy doesn't degrade over time.
Setting up and managing vector stores for RAG applications.
Background as an ML Engineer, MLOps Engineer, or Backend Engineer with a heavy data focus.
Expert-level Python skills and familiarity with C++ or Go for performance-critical components.
Deep experience with PyTorch, TensorFlow, Hugging Face Transformers, and LangChain.
proficiency with the major cloud providers and their specific AI services.
A commitment to version control, testing, and CI/CD principles—treating ML assets like software code.
Pragmatism:The ability to choose the "boring," reliable solution over the newest, shiniest paper on arXiv when business stability is the goal.
Explaining infrastructure costs and technical trade-offs to non-technical stakeholders.
Ability to set your own pricing
Virtual / Work from home
At Humiint, we work with trusted professionals who bring structure, scale, and reliability to the intelligence they build. If you're a Fractional Machine Learning Engineer ready to turn experiments into engines, we'd love to hear from you.
¿Te interesa este puesto?