Careers

AI/MLOps Engineer (with DevOps)

Job Description

  • Manage HGX nodes (OS, drivers, GPU allocation)
  • Set up and manage OpenShift/K8s clusters
  • Deploy models to inference servers (Triton, TensorRT, etc.)
  • Automate fine-tuning pipelines (PyTorch/TensorFlow)
  • Handle CI/CD for models (training -> serving)
  • Basic scripting (Python/Bash) for ops automation

Work Locations

UAE - Abu Dhabi

Responsibilities

  • Manage artifacts (model checkpoints, fine-tuned versions)
  • Validate fine-tuned models (accuracy, fairness, drift)
  • Monitor model behavior in production
  • Alert on anomalies
  • Manage model registry (track model versions, fine-tuning metadata)

Skills + Experience

  • Kubernetes (mandatory)
  • OpenShift (bonus)
  • DevOps (CI/CD)
  • Python
  • PyTorch/TensorFlow familiarity
  • Triton Inference Server or similar deployment tool
  • MLflow/Kubeflow
  • Understanding of AI model validation
  • Monitoring tools (Prometheus, Grafana)
  • Basic ML performance metrics
  • Good scripting skills (Python/Bash)