Blink Health

Senior Cloud Resilience Architect

Posted 10 hours ago

Employment Type

Full Time

Location

Completely Remote

Requirements

Cloud Infrastructure, Disaster Recovery, Multi-Region Architectures, Kubernetes, Infrastructure as Code, Automation, Technical Leadership

Job Description

Responsibilities

  • Evaluate and mature the organization’s disaster recovery posture, including recovery objectives (RTO/RPO), dependency mapping, and failure domain analysis across applications, data, and infrastructure.
  • Define, document, and establish disaster recovery standards and best practices across cloud infrastructure, platforms, and application architectures.
  • Partner with SRE, platform, security, and product engineering teams to design and implement resilient, fault-tolerant systems, progressing from backup-based recovery to multi-region and active-active architectures.
  • Lead the disaster recovery roadmap, balancing technical feasibility, cost, risk, and business priorities.
  • Design and recommend reference architectures for disaster recovery patterns, including pilot-light, warm standby, hot standby, and active-active.
  • Drive adoption of active-active disaster recovery for critical systems, including traffic management, data replication, consistency models, and automated failover.
  • Define and operationalize testing strategies for DR, including game days, chaos testing, and regular recovery exercises.
  • Establish clear documentation, runbooks, and escalation paths to ensure recoverability is well understood and not dependent on individuals.
  • Evaluate and recommend platform upgrades, cloud services, and tooling that improve resilience, recovery speed, and reliability.
  • Serve as a technical authority and advisor on disaster recovery and resilience for leadership and engineering teams.
  • Provide architectural guidance, design reviews, and mentorship to engineers implementing DR-related changes.
  • Partner with security and compliance teams to ensure DR strategies meet regulatory, audit, and data protection requirements.

Requirements

  • Bachelor’s or Master’s degree in Computer Science or equivalent practical experience.
  • 8+ years of experience in cloud infrastructure, platform engineering, SRE, or reliability-focused architecture roles.

Preferred Qualifications

  • Deep understanding of disaster recovery concepts including RTO/RPO, blast radius reduction, failure domains, and dependency isolation.
  • Proven experience designing and implementing multi-region and multi-availability zone architectures.
  • Hands-on experience moving systems toward active-active or highly available architectures.
  • Strong grasp of data replication strategies, consistency tradeoffs, and recovery patterns for databases and stateful systems.
  • Extensive experience with major cloud providers (AWS preferred, GCP/Azure acceptable).
  • Strong understanding of managed cloud services and their DR characteristics and limitations.
  • Experience with Kubernetes-based platforms, including regional failover, workload portability, and cluster recovery strategies.
  • Familiarity with global traffic management, DNS, load balancing, and service mesh patterns.
  • Experience designing and maintaining Infrastructure as Code using tools such as Terraform, Pulumi, CloudFormation, or Ansible.
  • Strong focus on automation for recovery workflows, failover testing, and environment provisioning.
  • Experience defining and running DR tests, game days, and failure simulations.
  • Strong documentation and communication skills, with the ability to translate complex technical risk into business impact.

About the Company

Blink Health is the fastest-growing healthcare technology company building products that make prescriptions accessible and affordable to everybody. Our two primary products – BlinkRx and Quick Save – remove traditional roadblocks within the current prescription supply chain, resulting in better access to critical medications and improved health outcomes for patients.

BlinkRx is the world’s first pharma-to-patient cloud that offers a digital concierge service for patients who are prescribed branded medications. Patients benefit from transparent low prices, free home delivery, and world-class support on this first-of-its-kind centralized platform. With BlinkRx, never again will a patient show up at the pharmacy only to discover that they can’t afford their medication, their doctor needs to fill out a form for them, or the pharmacy doesn’t have the medication in stock.

We are a highly collaborative team of builders and operators who invent new ways of working in an industry that historically has resisted innovation. Join us!
