Salary to £60k + Company Options SchemeHybrid working (2-3days/weekacrosstheirVauxhalloffices&Londonbasedclients).NB: Please only apply if you are (ideally a UK National) able to achieve either SC or DV clearance.(SC) 5 years & (DV) 10 years residency in the UK with no more than a 3-month break outside the UK.OverviewThis company is a Workflow & AI Orchestration Specialist. They’re on a mission to modernise how public sector organisations manage casework, derive insight from data and deliver citizen services. Theyre growing fast and looking for bright, dynamic people to help build their business.RoleTheyre looking for a Site Reliability Engineer (SRE) to join their growing platform and delivery teams. Youll help design, build, and operate reliable, secure, and performant infrastructure that underpins critical public-sector services.Youll combine your background in DevOps, cloud engineering, and automation with a focus on reliability, observability, and scalability. Youll also work with event-driven technologies, identity and access management, and data platforms, ensuring our orchestration solutions are resilient, secure, and future-ready.This is an exciting opportunity for an engineer who thrives on solving complex infrastructure and automation challenges, automating everything, and shaping secure-by-design delivery environments.ResponsibilitiesDesign, build, and operate resilient cloud infrastructure (AWS & Azure preferred)Develop and maintain CI/CD pipelines using tools such as GitHub Actions, Jenkins, or GitLab CIImplement infrastructure-as-code using Terraform, Helm, and Kubernetes manifestsBuild and manage event-driven architectures using Apache Kafka or similar messaging systemsOperate and maintain databases and data platforms such as PostgreSQL, Elasticsearch, and MongoDBConfigure and support identity and access management (IdAM) solutions such as KeycloakMonitor system health, performance, and capacity using modern observability stacks (Prometheus, Grafana, ELK, OpenTelemetry)Champion DevSecOps practices, embedding security and compliance into every stage of deliveryAutomate deployment, scaling, and recovery processes to improve reliability and reduce manual toilSupport development teams with environment configuration, secrets management, and container orchestrationParticipate in incident management and root-cause analysis to continuously improve system resilienceContribute to internal platform frameworks, templates, and automation acceleratorsEssentialExperience in SRE, DevOps, or Cloud Infrastructure Engineering rolesStrong knowledge of AWS, Azure, or GCP environmentsHands-on experience with Kubernetes, Docker, and container orchestrationProven experience building and maintaining CI/CD pipelinesStrong experience with infrastructure-as-code (Terraform, Helm, or CloudFormation)Working knowledge of event-driven systems (Kafka, RabbitMQ, or similar)Experience managing and tuning databases (PostgreSQL, Elasticsearch, MongoDB)Familiarity with Keycloak or equivalent identity and access management (IdAM) platformsStrong scripting or automation experience (Python, Bash, or Go preferred)Experience with monitoring, logging, and alerting tools for observabilityUnderstanding of networking, DNS, load balancing, and security controlsCollaborative, pragmatic, and proactive problem solverMust be eligible and prepared to go through SC Security ClearanceDesirableExperience operating production workloads in the UK public sector or regulated environmentsFamiliarity with Camunda or other process orchestration platformsExperience implementing DevSecOps frameworks or security automation toolsUnderstanding of zero-trust networking and secure-by-design principlesFamiliarity with service mesh technologies (e.g. Istio, Linkerd)Exposure to agentic AI or workflow automation environmentsHolds UK security clearance (SC, DV, eDV)What They OfferA company option scheme gives their employees a stake in the organisation theyre buildingMeaningful missions, tackling high-impact public-sector problems that improve services millions rely on.Hybrid working, flexible working between their London office, client offices and your home location.Outputs over optics. They focus on what gets delivered, not hours online.Working shoulder-to-shoulder with founders. Learn from experienced practitioners in a nurturing, hands-on environment.Shape their organisation. your fingerprints will be on their company as it grows.Autonomy by default. They empower their people to do whats needed.Sociable, low-ego, dynamic engagements. No big-firm stiffness or needless bureaucracy.Professional Development. They invest in their people through training and certifications.Other StuffNB: Please only apply if you are (ideally a UK National) able to achieve either SC or DV clearance.(SC) 5 years & (DV) 10 years residency in the UK with no more than a 3-month break outside the UK.NB: for non-UK Citizens; we cannot accept applications from anyone requiring sponsorship (now or in the future) for UK permanent employment status. If you are utilising a work visa this must allow you to work in the UK unrestricted for at least the next 5 years.Profile 29 recruitment keywords: Site reliability engineer SRE DevOps Cloud Infrastructure AWS Azure GCP Kubernetes Docker container orchestration CI/CD Terraform Helm CloudFormation Kafka RabbitMQ PostgreSQL Elasticsearch MongoDB Keycloak identity access management IdAM Python Bash Go DV SC Security Clearance AI artificial intelligence London hybrid work from homeTPBN1_UKTJ