We are seeking an experienced Site Reliability Engineering leader to join a high-growth SaaS organisation in a hybrid role that combines technical leadership with hands-on engineering. This is a key position for someone passionate about reliability, resilience, and running production systems at scale.
The successful candidate will lead and mentor the SRE team, set the technical direction for reliability engineering, and take end-to-end ownership of production systems. They will be accountable for availability, performance, and incident response, while working closely with Product and Engineering to define SLIs and establish meaningful SLOs that balance stability with delivery pace. They will champion a blameless culture, embedding robust incident management processes and driving continuous, systemic improvement.
Key skills and experience: