
Senior Site Reliability Engineer (SRE)
Sleek
Completely RemoteFull TimeEngineering & Architecture
Posted Today
Job description
Responsibilities
- Architect, build, and scale next-generation infrastructure and AI-powered capabilities to support modern applications and advanced AI workloads.
- Design resilient cloud architectures and ensure platforms remain secure, scalable, and high-performing.
- Implement robust automation across CI/CD, infrastructure provisioning, and operations to reduce manual overhead.
- Integrate AI systems into production environments, including pipelines for model hosting, embeddings, and vector search.
- Strengthen observability through improved logging, monitoring, tracing, and alerting (Prometheus, OpenTelemetry).
- Lead incident response processes, define SLIs/SLOs, and improve on-call readiness.
- Mentor team members to elevate platform engineering and DevOps maturity across the organization.
Requirements
- 6+ years of progressive experience in Site Reliability Engineering (SRE).
- Deep expertise in multi-cloud environments (AWS, GCP, or Azure) including networking, compute, and storage.
- Extensive experience with container orchestration (Kubernetes, EKS, or ECS).
- Proficiency with Infrastructure as Code (Terraform, Pulumi, or CloudFormation).
- Experience with GitOps practices (ArgoCD, Flux) and modern deployment patterns (Blue/Green, Canary).
- Strong background in platform security, secrets management, and IAM.
- Familiarity with AI/ML infrastructure requirements (model inference, vector databases, or GPU workloads).
- Programming proficiency in Node.js, NestJS, or Python.
Preferred Qualifications
- Experience building self-service developer platforms to increase engineering velocity.
- Experience managing Multi-Cloud API Gateways and Edge Routing (Kong, Traefik, or Cloudflare).
- Hands-on experience with security hardening tools like Falco or eBPF.
Benefits
- Fully remote work environment with flexibility to manage your own schedule.
- Competitive market salaries and generous paid time off.
- Eligibility for employee share ownership plans.
- Opportunities for personal growth and autonomy within a fast-paced, AI-driven environment.
- Work for a certified B Corp committed to building a force for good.
About the Company
Sleek makes back-office operations easy for micro SMEs through proprietary software and AI. We provide automated corporate secretarial services, accounting, bookkeeping, and FinTech payment solutions. Launched in 2017, Sleek serves over 15,000 customers across Singapore, Hong Kong, Australia, and the UK, and is on a mission to streamline entrepreneurship through intelligent automation.
Skills & tools
AWSKubernetesTerraformPythonNode.jsDockerGitOpsPrometheus
What the team is looking for
Use this list as a quick fit check before you apply.
- 016+ years SRE experience
- 02Multi-cloud expertise (AWS, GCP, Azure)
- 03Container orchestration (Kubernetes, EKS, ECS)
- 04Infrastructure as Code (Terraform, Pulumi)
- 05GitOps practices (ArgoCD, Flux)
- 06Observability stacks (Prometheus, OpenTelemetry)
- 07AI/ML infrastructure familiarity
- 08Programming (Node.js, Python)

Sleek
Job details
- Work model
- Completely Remote
- Commitment
- Full Time
- Category
- Engineering & Architecture
- Posted
- Today