Contract type: Full-time
Job nature: On-site
Location: Makkah

Job Description

About the Role

TMC Middle East is seeking a Machine Learning Operations (MLOps) Engineer with expertise in Google Cloud Platform (GCP) for a full-time position. This role involves production ownership, focusing on operationalizing, monitoring, and scaling machine learning systems that handle live traffic and have defined service level objectives (SLOs).

Core Responsibilities

The successful candidate will be responsible for the end-to-end MLOps architecture on GCP, encompassing design, implementation, and ongoing operation. Key duties include:

  • Designing, building, and operating MLOps architecture on GCP.
  • Automating CI/CD pipelines for ML workflows, from code commit to canary deployments on Vertex AI Endpoints.
  • Implementing and managing production monitoring systems, including drift detection, alerting, and automated retraining triggers.
  • Leading incident response for production ML systems, including root cause analysis, rollback procedures, post-mortems, and preventative measures.

Essential Qualifications

To be considered for this role, candidates must meet the following non-negotiable requirements:

  • Proven experience deploying a production ML system at scale, with the ability to state its queries per second (QPS) and latency SLOs and to describe a failure mode that was resolved.
  • Recent (within the last 12-18 months) hands-on experience writing code against GCP services.
  • In-depth knowledge of Vertex AI, including pipelines, model registry, endpoints, and monitoring capabilities.
  • Experience with production incident resolution, including identifying root causes, implementing fixes, and conducting post-mortems.
  • Ability to whiteboard a recent GCP ML architecture without notes.
  • A minimum of 5 years of experience in ML Engineering, MLOps, or DevOps.
  • A minimum of 3 years of GCP experience (mandatory).

A GCP Professional Cloud DevOps Engineer certification is considered a strong advantage.

Undesired Experience

The following profiles are not sought for this position:

  • Vertex AI experience that lacks practical work on endpoints or monitoring configurations, or that has no quantifiable results.
  • Experience limited mainly to Proofs of Concept (POCs), without responsibility for production SLOs.
  • GCP expertise that is more than two years out of date.

Location and Work Type

This is a full-time position located in Saudi Arabia.


Job Requirements

  • Requires 2-5 years of experience
