Manager Site Reliability Engineer

Sana Commerce · Dubai

Hybrid: Dubai · Full Time · Engineering & Architecture
Posted Today

Job description

Responsibilities

  • Lead and build a global SRE team, setting objectives and guiding the team toward high reliability while balancing cost and performance SLAs
  • Collaborate with platform and product engineering teams to embed reliability and operational best practices into the software development lifecycle
  • Develop and implement SRE policies including service level objectives (SLOs), service level indicators (SLIs), and error budgets
  • Drive automation across operations to reduce toil, improve system performance, and ensure scalability
  • Oversee incident management, post-mortem analyses, and root cause investigations to prevent future outages
  • Facilitate capacity planning, scalability exercises, and disaster recovery testing to ensure business continuity
  • Mentor team members and foster a culture of continuous improvement and innovation

Requirements

  • 5+ years of experience in Site Reliability Engineering, with 2+ years in a leadership or management role
  • Proven, hands-on expertise in Microsoft Azure, including designing, deploying, and managing cloud-native infrastructure
  • Experience with Kubernetes and container orchestration
  • Deep understanding of network protocols, load balancing, and high availability configurations
  • Experience with Infrastructure as Code tools such as Terraform or Ansible
  • Proficiency in monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack)
  • Programming experience in PowerShell, C#, Python, Go, or Java
  • Excellent problem-solving skills and ability to tackle complex issues under pressure
  • Outstanding leadership qualities with a track record of developing high-performing teams

About the Company

Sana Commerce is a fast-growing B2B e-commerce SaaS platform that helps manufacturers, distributors, and wholesalers succeed by fostering lasting relationships with their customers. Founded in 2007, the company offers a hybrid working model and emphasizes continuous growth and innovation.

Skills & tools

Azure · Kubernetes · Terraform · Prometheus · Grafana · PowerShell · C# · SRE

What the team is looking for

Use this list as a quick fit check before you apply.

  1. 5+ years SRE experience
  2. 2+ years leadership experience
  3. Microsoft Azure expertise
  4. Kubernetes experience
  5. Infrastructure as Code
  6. Monitoring tools proficiency
  7. PowerShell or C# programming