Responsibilities
- Operate and manage Kubernetes or OpenShift clusters for multinode orchestration
- Deploy and manage LLMs and other AI models for inference using Triton Inference Server or custom endpoints
- Automate CI/CD pipelines for model packaging, serving, retraining and rollback using GitLab CI o