Nowa
Head of SRE and Infrastructure
Brak informacji o wynagrodzeniu
C-Level / ManagerFull-time·Umowa o pracę
#351218·Dodano wczoraj·0
Źródło: Capital.comTech Stack / Keywords
DevOpsCloudAWSKubernetesTerraformHelmContinuous DeliveryArgoCD
Firma i stanowisko
We are a leading trading platform that is ambitiously expanding globally. Our top-rated products have won prestigious industry awards for cutting-edge technology and seamless client experience.
Wymagania
- Experience as Head of SRE, SRE Director, Infrastructure Director, Engineering Director, or similar senior leadership role in technology, fintech, or financial services.
- Strong background in SRE, DevOps, infrastructure engineering, cloud platforms, and operating complex, high-availability systems.
- Hands-on technical understanding of AWS, Kubernetes, Terraform, FluxCD/ArgoCD, CI/CD tools, monitoring and alerting systems, and infrastructure-as-code.
- Deep understanding of SRE principles including SLOs, SLIs, SLAs, error budgets, incident management, observability, automation, and resilience engineering.
- Experience managing managers with skills in hiring, mentoring, performance management, and building engineering culture.
- Ability to collaborate with various teams and translate technical details for non-technical stakeholders.
- Strong analytical skills using metrics and analytics to guide decisions.
- Pragmatic approach prioritizing outcomes over process to drive effective results.
Obowiązki
- Develop and execute the SRE and infrastructure strategy to support the organisation’s technology roadmap, product growth, and global expansion.
- Lead the evolution of DevOps and infrastructure capabilities into a mature SRE framework with documented SLOs, error budgets, and operating standards.
- Oversee design, automation, and optimisation of cloud infrastructure (AWS, Kubernetes/EKS, Terraform, Helm, infrastructure-as-code).
- Drive migration of on-premise workloads to cloud and build multi-cloud disaster recovery with on-premise backup.
- Own the GitOps platform end-to-end, consolidate FluxCD estate, evaluate and execute move to ArgoCD with progressive/canary delivery.
- Build and maintain a reliable, scalable platform for regulated, multi-jurisdiction trading.
- Define and enforce reliability standards (SLIs, SLOs, SLAs, error budgets).
- Own disaster recovery strategy including recovery-site selection, RTO/RPO targets, regular DR drills, and playbooks.
- Define and operate a single observability standard (metrics, logs, traces) including SLO instrumentation, golden signals, alerting hygiene, and on-call ergonomics.
- Improve incident response, post-incident analysis, and long-term prevention with clear escalation criteria, SLAs, change-quality gates, and DR readiness.
- Lead, hire, and develop SRE, DevOps, DBA, developer experience, and technical support teams.
- Foster engineering culture based on accountability, ownership, technical excellence, and continuous improvement.
- Partner with development, security, compliance, risk, release, and business teams to align infrastructure and reliability priorities with product delivery and regulatory obligations.
Oferta
- Competitive salary.
- Work-life harmony with hybrid work model.
- Generous annual leave policy.
- Employee referral program.
- Comprehensive health and pension benefits including medical insurance and pension plans.
- 30 extra days to work remotely from anywhere in the world (with some restrictions).
- Two additional paid volunteer days per year.
Płatny urlop
Opieka zdrowotna
Ubezpieczenie
Elastyczne godziny
Inne informacje
Our company has an Internal Reporting Procedure available from Human Resources. Violations may be reported under the terms specified therein.
Capital.com
23 aktywne oferty