Make yourself visible and let companies apply to you.
Roles
Prometheus Jobs
Overview
Looking for top Prometheus jobs? Explore the latest Prometheus monitoring and alerting roles on Haystack, the leading IT job board. Whether you're a developer, DevOps engineer, or site reliability specialist, find your perfect Prometheus job today and advance your career in cloud-native infrastructure and observability. Start your search now!
Data Engineer - Contract - 9 Months
CBSbutler Holdings Limited trading as CBSbutler
Sheffield
In office
Mid - Senior
£390/day - £442/day
RECENTLY POSTED

Data Engineer (Contract)
9+ Month Contract based in Sheffield
£395 - £442 per day InsideIR35
BPSS clearance required - candidates must be eligible

My client is seeking a Data Engineer to design and operate large-scale telemetry and observability data pipelines within a modern OpenShift and Kafka ecosystem. This role is central to enabling proactive, Level 4 observability, delivering high-quality metrics, logs, and traces to support platform reliability, operational insight, and automation.

Responsibilities:

* Design, implement and maintain scalable data pipelines to ingest and process
OpenShift telemetry (metrics, logs, traces)
* Stream telemetry through Kafka (producers, topics, schemas) and build resilient
consumer services for enrichment and transformation
* Engineer multi-tenant observability data models, ensuring data lineage, quality
controls and SLAs across streaming layers
* Integrate processed telemetry into Splunk for dashboards, analytics, alerting and
operational insights
* Implement schema management and governance using Avro/Protobuf, including versioning
and compatibility strategies
* Build automated validation, replay and backfill mechanisms to ensure data
reliability and recovery
* Instrument services using OpenTelemetry, standardising tracing, metrics and
structured logging
* Apply LLMs to enhance observability, such as query assistance, anomaly summarisation
and runbook generation
* Collaborate with Platform, SRE and Application teams to align telemetry, alerts and
SLOs
* Ensure pipelines meet security, compliance and best-practice standards
* Produce clear documentation covering data flows, schemas, dashboards and operational
runbooks

Skills & Experience:
* Strong hands-on experience building streaming data pipelines with Kafka
(producers/consumers, schema registry, Kafka Connect, KSQL/KStreams)
* Experience with OpenShift / Kubernetes telemetry, including OpenTelemetry and
Prometheus
* Proven capability integrating telemetry into Splunk
(HEC, Universal Forwarders, sourcetypes, CIM, dashboards, alerting)
* Solid data engineering skills in Python (or similar) for ETL/ELT, enrichment and
validation

Please apply for immediate interview!

CBSbutler is operating and advertising as an Employment Agency for permanent positions and as an Employment Business for interim / contract / temporary positions. CBSbutler is an Equal Opportunities employer and we encourage applicants from all backgrounds

Cloud Engineer
ECS
Leeds
Hybrid
Mid - Senior
£400/day - £500/day
RECENTLY POSTED
+1

Cloud Engineer Observability/APM/CMM (Inside IR35)6-month contract
Location: Leeds (Hybrid, 1-2 days per week on-site)
Rate: £400-£500 per day (Inside IR35)

We are supporting a major IT service provider who are looking for a skilled Cloud Engineer specialising in Observability, APM, and Cloud Monitoring & Management (CMM).

You’ll focus on monitoring, performance, reliability, and visibility across cloud platforms, helping improve service health and customer experience.

Key responsibilities:

  • Design and implement cloud observability solutions
  • Build and operate APM and monitoring platforms
  • Implement logging, metrics, tracing, and alerting standards
  • Support incident management, root cause analysis, and service improvement
  • Optimise performance, availability, and reliability across cloud environments
  • Work closely with Cloud, DevOps, and Service teams

Key skills & experience:

  • Strong cloud engineering background (Azure and/or AWS)
  • Observability, APM, and CMM experience
  • Hands-on with monitoring tools (e.g. Azure Monitor, App Insights, Datadog, Dynatrace, New Relic, Grafana, Prometheus)
  • Experience designing alerting, dashboards, and SLOs
  • Understanding of ITIL, service operations, and MSP environments
  • Infrastructure as Code and automation experience beneficial

Further information available upon application.
Please contact or call 01676 545 393 for more information.

ECS Recruitment Group Ltd is acting as an Employment Business in relation to this vacancy.

Cloud DevOps Engineer
PSD Group
London
Hybrid
Mid - Senior
£60,000/day
RECENTLY POSTED
+2

AWS, DevOps, Kubernetes, GitOps, Terraform, Terragunt, Grafana, Prometheus, Loki

Were looking for aCloud DevOps Engineerto join a leading fintech company to help reduce friction in their deployment pipelines and to enable faster and more reliable change across their AWS cloud environments.

In this role, youll work closely with solution architects, product engineers, and global stakeholders to design, deploy, and operate secure, scalable AWS-based platforms. Youll be part of a global, diverse team, collaborating across Technology and Product to continuously improve cloud service operations.

This is a6-month fixed term contract,full-timerole based inLondon, with hybrid working.

What youll do:

  • Work with a solution architect, to design, implement, and maintain AWS solutions
  • Deploy and manage development changes with product engineering teams
  • Support and optimise Kubernetes (EKS) platforms, including application deployment and troubleshooting
  • Manage CI/CD pipelines and GitOps practices (Helm, Git, CodePipeline, CodeDeploy)
  • Build and maintain infrastructure using Terraform / Terragrunt
  • Configure and enhance observability tooling (Grafana, Prometheus, Loki, CloudWatch)
  • Develop automation scripts to improve cloud health, reliability, and security
  • Ensure cloud environments remain secure, compliant, and cost-effective
  • Participate in incident, problem, and change management activities

Skills & Experience:

Essential experience and skills-

  • Proven experience in a DevOps or Cloud Operations environment
  • Strong Kubernetes (EKS) experience
  • Solid AWS services knowledge (e.g. RDS, Lambda, IAM, API Gateway, networking and security services)
  • Infrastructure as Code experience (Terraform / Terragrunt)
  • CI/CD and Git-based workflows
  • Observability and monitoring tooling experience
  • ITIL Foundation certification
  • AWS Certified Cloud Practitioner (or equivalent)

Desirable-

  • AWS DevOps Engineer certification
  • Experience with DevSecOps practices
  • Mentoring or supporting junior engineers
  • Blue/green or multi-region deployment experience

If this role sounds of interest and you have the required skills please submit your cv for immediate reviewl.

*Full right to work in the UK required*

Senior Cloud Engineer
WRK DIGITAL LTD
Leeds
Hybrid
Senior
£60,000
RECENTLY POSTED
+2

??Location:Leeds, West Yorkshire (Hybrid)

??Employment Type:Permanent

??Status:Actively Hiring

?? £55,280 - £62,190+ Excellent Benefits

WRK digital are delighted to be partnered exclusively with a highly respected, UK-based organisation, shortlisting for aSenior Cloud Engineeron a permanent basis.

We are looking for an accomplished and innovative Senior Cloud Engineer to join a collaborative Cloud Platform team, helping to manage, develop, and optimise a large Azure cloud environment. You will play a key role in delivering secure, scalable and resilient infrastructure that underpins the organisations cloud strategy.

The ideal candidate will bring expertise across Microsoft Azure, Kubernetes, Terraform (IaC), CI/CD pipelines (GitHub Actions or similar), and cloud automation. You will provide technical leadership, support operational excellence, mentor junior engineers, and drive continuous service improvements. A proactive mindset, strong troubleshooting capability, and the ability to draw insights from performance data will be essential.

This opportunity suits someone who thrives in a modern cloud environment, enjoys solving complex engineering challenges, and values cross-team collaboration.

Core Responsibilities include:

  • Manage and optimise Azure cloud infrastructure, ensuring stability, performance, and reliability
  • Provide L2/L3 support for incidents, troubleshooting cloud-related issues
  • Lead Kubernetes cluster operations and support containerised workloads
  • Maintain and enhance Terraform Infrastructure as Code modules
  • Participate in CI/CD automation using GitHub Actions or similar tooling
  • Analyse observability data to produce insights and recommendations
  • Ensure effective monitoring using tools such as Prometheus, Grafana, Dynatrace, AppDynamics or Splunk
  • Drive automation across cloud operations following SRE and DevOps principles
  • Support junior engineers through mentoring and knowledge sharing
  • Collaborate with cross-functional teams to implement platform improvements
  • Maintain clear documentation for procedures, configuration and operational readiness

Minimum Requirements:

  • 24/7 on-call L2/L3 support experience
  • Experience with a major cloud platform (Azure preferred)
  • Familiarity with monitoring tools for troubleshooting and insight gathering
  • Hands-on experience with CI/CD pipelines (GitHub Actions or similar)
  • Strong experience with Terraform or similar IaC tooling
Site Reliability Engineer
CBSbutler Holdings Limited trading as CBSbutler
Milton Keynes
Hybrid
Mid - Senior
£390/day
RECENTLY POSTED
+7

Role Title: Site Reliability Engineer

Location: Milton Keynes/Hybrid (3 days on site)

Duration: 6 months contract

Rate: 390 per day inside ir35

Role Description:

Join a leading global IT consultancy and digital transformation organisation at the forefront of cloud, automation and secure platform engineering. We’re looking for a Kubernetes-first engineer who wants to own and evolve a modern, enterprise-scale platform spanning AWS, Azure and on-prem. This is a hands-on role with real influence over reliability, security and architecture.

Responsibilities:

  • Operate and enhance our Kubernetes platform across AWS, Azure, and on prem.
  • Lead incident response, problem management, and root cause analysis.
  • Deliver cluster lifecycle work: upgrades, patching, node pools, CNI/CSI, ingress, and Rancher operations.
  • Own observability, dashboards, alerting, and SLOs/SLIs.
  • Implement GitOps (Fleet) and reduce toil through automation and strong governance.
  • Apply secure API gateway and WAF patterns.
  • Work with distributed system patterns, including event brokers and asynchronous messaging.
  • Maintain security posture: CVE remediation, GRC controls, scanning pipelines.

Required Skills:

  • Deep knowledge of Kubernetes, Rancher, GitOps, Linux, and cloud networking.
  • Understanding of API gateway and WAF patterns.
  • Experience with distributed systems and event driven architectures.
  • Strong automation/scripting (Python, Go, Bash, PowerShell, .NET).
  • IaC:

o Terraform for foundational/bootstrap cluster provisioning.

o Crossplane as an orchestration layer (leveraging Terraform providers).

  • Ability to work securely within PCI DSS / GDPR patterns.
  • CI/CD: Concourse, GitHub Actions, Azure DevOps.
  • Observability: Grafana, Prometheus, Jaeger/Tempo, CloudWatch, Loki, OpenTelemetry.

Nice to Have:

  • AWS operational experience.
  • Service mesh (Istio/Kuma).
  • Hybrid cloud experience (AWS + Azure + on prem).
  • Payments or regulated industry background.

If you are interested in this role or wish to apply, please feel free to submit your CV.

Linux DevOps Engineer
Akkodis
Newcastle upon Tyne
Hybrid
Mid - Senior
£45,000 - £50,000
RECENTLY POSTED
+5

Akkodis are currently working in partnership with a leading service provider to recruit an experienced DevOps Engineer to join their leading cloud services team.

Please note this is a hybrid role where you will be required to attend the office 2 days a week.

The Role
As a DevOps Engineer you will be responsible for designing, building, and maintaining the infrastructure that powers our clients’ cutting-edge platforms. In this role, you will be instrumental in automating the development pipeline and ensuring the reliability, scalability, and security of services within telecommunications and a managed service provider (MSP) environment.

The Responsibilities
* CI/CD Pipeline Management: Design, implement, and manage continuous integration and continuous delivery (CI/CD) pipelines for all platforms, enabling rapid and reliable software releases.
* Infrastructure as Code (IaC): Develop and maintain cloud and on-premise infrastructure using IaC principles with tools like Terraform and Ansible.
* Containerization & Orchestration: Manage and scale containerized applications, ensuring high availability and efficient resource utilization in a multi-tenant environment.
* Automation & Scripting: Automate manual processes related to deployment, monitoring, and operations using Scripting languages such as Python, Bash, or Go.
* Monitoring & Logging: Implement and manage robust monitoring, logging, and alerting solutions (eg, Prometheus, Grafana, ELK Stack) to proactively identify and resolve system issues.
* Collaboration: Work closely with software developers, network engineers, and product managers to troubleshoot issues and optimize performance
* Security: Integrate security best practices (DevSecOps) into the development life cycle, including vulnerability scanning, static code analysis, and compliance checks.

The Requirements
* Hands-on experience in a DevOps, SRE, or similar role.
* Strong proficiency with at least one major cloud provider (AWS, Azure, or GCP).
* In-depth knowledge of container orchestration.
* Demonstrable experience with CI/CD tools like Jenkins, GitHub Actions, or Azure DevOps.
* Expertise in using tools like Terraform or Ansible.
* Proficiency in a Scripting language such as Python or Bash.
* Solid understanding of networking principles (TCP/IP, DNS, HTTP/S, Firewalls

If you are looking for an exciting new challenge to play a pivotal part in a market leading organisation please apply now.

Modis International Ltd acts as an employment agency for permanent recruitment and an employment business for the supply of temporary workers in the UK. Modis Europe Ltd provide a variety of international solutions that connect clients to the best talent in the world. For all positions based in Switzerland, Modis Europe Ltd works with its licensed Swiss partner Accurity GmbH to ensure that candidate applications are handled in accordance with Swiss law.

Both Modis International Ltd and Modis Europe Ltd are Equal Opportunities Employers.

By applying for this role your details will be submitted to Modis International Ltd and/or Modis Europe Ltd. Our Candidate Privacy Information Statement which explains how we will use your information is available on the Modis website.

Junior DevOps Engineer - Azure
Akkodis
Newcastle upon Tyne
Hybrid
Junior
£30,000 - £40,000
RECENTLY POSTED
+4

Akkodis are currently working in partnership with a leading service provider to recruit a Junior DevOps Engineer to join their growing cloud services teams.

Please note this is a hybrid role where you will be required to attend the office 2 days a week.

The Role
As a Junior DevOps Engineer you will be responsible for designing, building, and maintaining the infrastructure that powers our clients’ cutting-edge platforms. In this role, you will be instrumental in automating the development pipeline and ensuring the reliability, scalability, and security of services within telecommunications and a managed service provider (MSP) environment.

The Responsibilities
* CI/CD Pipeline Management: Design, implement, and manage continuous integration and continuous delivery (CI/CD) pipelines for all platforms, enabling rapid and reliable software releases.
* Infrastructure as Code (IaC): Develop and maintain cloud and on-premise infrastructure using IaC principles with tools like Terraform and Ansible.
* Containerization & Orchestration: Manage and scale containerized applications, ensuring high availability and efficient resource utilization in a multi-tenant environment.
* Automation & Scripting: Automate manual processes related to deployment, monitoring, and operations using Scripting languages such as Python, Bash, or Go.
* Monitoring & Logging: Implement and manage robust monitoring, logging, and alerting solutions (eg, Prometheus, Grafana, ELK Stack) to proactively identify and resolve system issues.
* Collaboration: Work closely with software developers, network engineers, and product managers to troubleshoot issues and optimize performance
* Security: Integrate security best practices (DevSecOps) into the development life cycle, including vulnerability scanning, static code analysis, and compliance checks.

The Requirements
* Hands-on experience in a DevOps, SRE, or similar role.
* Strong proficiency with at least one major cloud provider (AWS, Azure, or GCP).
* In-depth knowledge of container orchestration.
* Demonstrable experience with CI/CD tools like Jenkins, GitHub Actions, or Azure DevOps.
* Expertise in using tools like Terraform or Ansible.
* Proficiency in a Scripting language such as Python or Bash.
* Solid understanding of networking principles (TCP/IP, DNS, HTTP/S, Firewalls

If you are looking for an exciting new challenge to play a pivotal part in a market leading organisation please apply now.

Modis International Ltd acts as an employment agency for permanent recruitment and an employment business for the supply of temporary workers in the UK. Modis Europe Ltd provide a variety of international solutions that connect clients to the best talent in the world. For all positions based in Switzerland, Modis Europe Ltd works with its licensed Swiss partner Accurity GmbH to ensure that candidate applications are handled in accordance with Swiss law.

Both Modis International Ltd and Modis Europe Ltd are Equal Opportunities Employers.

By applying for this role your details will be submitted to Modis International Ltd and/or Modis Europe Ltd. Our Candidate Privacy Information Statement which explains how we will use your information is available on the Modis website.

Senior Site Reliability Engineer
EMBL-EBI
Saffron Walden
Hybrid
Senior
£75,000
RECENTLY POSTED
+7

Were seeking a skilled individual to join our Applications Group and contribute to the success of our applications portfolio, joining the team as a Site Reliability Engineer. Within the Applications Group, the Web Applications Platform Team is responsible for providing the platforms on which all EBI web services are hosted.

A couple of years ago, the web hosting service started shifting to a container based model, and our goal is to accelerate and consolidate this trend. Our web services are very popular among the scientific community, and the average monthly request count is over 3,000 million.

Working closely with the different IT groups like Infrastructure and Operations, this position will help designing, implementing and administering the future platform, on which all our scientific web services will be running on.

Duties & Responsibilities

In this role you will:

Responsible for building and maintaining the following environments:

  • Web Hosting platform based on Kubernetes, where users can deploy web applications along with the following eco-system:

    • Infrastructure and application monitoring based on Prometheus
    • Web analytics platform, currently based on ElasticSearch
    • CI/CD tools like Gitlab
  • Drive automation and change to simplify management, operations and increase efficiency

  • Ensure documentation is of standard

  • Drive SRE best practices

This position will contribute directly to the above mentioned projects and tasks, and will help the team move forward with the production automation. This position will also help and guide other team members with daily prioritisation of tasks.

You have (Requirements)

  • Bachelor’s degree or higher in computer science or a related discipline, or demonstrate equivalent experience. The role would be suitable for a Unix/Linux systems administrator with good web hosting, Kubernetes, and CI/CD understanding.
  • At least 3 years of experience in the design, implementation and operation of large scale web hosting platforms.
  • Experience managing public-facing production services
  • Experience working with Agile methodologies
  • 3 years of experience with automated deployment/configuration methods (e.g. Ansible, Puppet, Terraform)
  • Solid experience in Kubernetes deployment and administration in public or private cloud
  • Strong Linux administration skills, ideally with RHEL or a RHEL clone
  • Solid skills in automation tools like Jenkins, Rundeck, or similar.
  • Hands-on experience using Git in CI/CD and infrastructure-as-code workflows.
  • Solid skills in at least one programming language, ideally python
  • Experience with methodologies for infrastructure monitoring.
  • Solid interpersonal and written English communication skills
  • Proven ability to work well in a team, building positive relationships and sharing knowledge
  • Ability to plan and prioritise workloads

You might also have (Desirable)

  • Experience with cloud technologies, including Google or AWS certification
  • Experience with Web Security best practices (OWASP)

Behaviors we value in our team:

You will possess strong communication skills, with the ability to multiple priorities and deadlines In a collaborative and effective in multidisciplinary, international teams!

A technical expert in your area of expertise, willing to share knowledge and keep up with trends!

Apply now! Benefits and Contract Information

  • Financial incentives: depending on circumstances, monthly family/marriage allowance of £278 monthly child allowance of £336 per child. Non resident allowance up to £569 per month. Annual salary review, pension scheme, death benefit, long-term care, accident-at-work and unemployment insurances

  • Hybrid working arrangements

  • Private medical insurance for you and your immediate family (including all prescriptions and generous dental & optical cover)

  • Generous time off: 30 days annual leave per year, in addition to eight bank holidays

  • Relocation package including installation grant (as applicable)

  • Campus life: Free shuttle bus to and from work, on-site library, subsidised on-site gym and cafeteria, casual dress code, extensive sports and social club activities (on campus and remotely)

  • Family benefits: On-site nursery, child sick leave, generous parental leave, holiday clubs on campus and monthly family and child allowances

  • Contract duration: This position is a 3 year contract renewable up to 9 years

  • Salary: Monthly salary starting from£3,303 - £3,695after tax but excl. pension & insurances) + benefits (Total package will be dependent on family circumstances)

  • International applicants: We recruit internationally and successful candidates are offered visa exemptions. Read more on our page for international applicants.

  • Diversity and inclusion: At EMBL-EBI, we strongly believe that inclusive and diverse teams benefit from higher levels of innovation and creative thought. We encourage applications from women, LGBTQ+ and individuals from all nationalities.

  • Job location: This role is based in Hinxton, near Cambridge, UK. You will be required to relocate if you are based overseas and you will receive a generous relocation package to support you.

To apply, please submit a covering letter and CV via our online system. Applications will close on 9/03/2026

Senior Site Reliability Engineer
Stratospherec Ltd
London
Fully remote
Senior
£80,000 - £100,000
RECENTLY POSTED
+6

Senior DevOps Engineer / Senior Site Reliability Engineer

Fully Remote working for candidates based in the UK

Salary £80k to £100k (depending on experience) + Benefits

We are looking for a Senior DevOps Engineer that has strong C# code knowledge combined with strong knowledge of DevOps tools like Kubernetes (EKS or ideally AKS) and Azure or AWS Cloud platform. We are looking for a SRE or DevOps Engineer with a strong understanding of C# code combined with experience of monitoring tools like DataDog, Grafana and Prometheus to join a growing global Cloud Infrastructure team supporting SaaS products.

Our client are a Global Digital SaaS Software Company have a fantastic fully remote opportunity for an experienced Senior DevOps Engineer to join their UK Cloud Infrastructure team.

Site Reliability Engineers at this company are responsible for keeping the SaaS products running properly. Using concepts of software and systems engineering, they work to improve the reliability of all cloud systems while keeping levels of manual work low. DevOps are expected to be experienced in software engineering principals, operational discipline, and automation.

The Cloud and DevOps team work on a fully remote basis and work in conjunction with their US and Australian teams as well. This company are a market leader in Student community management software, this company’s unique SaaS platform is an essential platform in the life of millions of University students across the globe.

In this role, you will apply your Software Engineering experience to enhance system performance and reliability, as well as building internal systems and capabilities that eliminate manual work through automation. You’ll be joining our Platforms teams with globally-dispersed Site Reliability and Platform Engineers in a “follow the sun” model to operate our products on a multi-region cloud platform.

Role Responsibilities:

* Provide technical leadership and mentoring within the team through knowledge sharing sessions, pair programming, code reviews and solution design

* Identify and implement technical solutions to improve platform reliability, including the creation of mitigation strategies and operational playbooks.

* Implement and maintain monitoring/alerting/logging systems to identify and respond to incidents

* Ensure scalability and efficiency of cloud infrastructure and systems to handle traffic and data growth

* Conduct performance tests to identify and remediate bottlenecks

* Develop and maintain platform solutions, automate infrastructure provisioning, configuration, and management tasks using Infrastructure as Code.

* Monitor, review and tune databases to ensure high availability and performance

* Collaborate with product engineering teams to design/build fit-for-purpose and observable software

Required Skills and Experience:

* Proven experience in a SR DevOps / Site Reliability Engineering role

* Having strong code experience of C# or similar OO development language.

* Experience of supporting .Net applications as a SRE or DevOps Engineer

* Production experience operating containerization technologies - ideally with Kubernetes and/or Docker. Strong preference for AKS or EKS experience as well.

* Proficiency with one or more public cloud providers such as Azure, AWS or GCP

* Proficiency using Infrastructure as Code (IaC) tools such as Terraform (preferred), Ansible, or CloudFormation.

* Experience with monitoring, observability and logging tools such as DataDog, Prometheus, Grafana, or similar.

* Proven track record of maintaining highly-available and performant production environments.

* Ability to identify and implement effective mitigation strategies and operational playbooks.

Useful / Bonus Skills to have:

* Experience in CI/CD tooling: Azure DevOps/GitHub Actions, Octopus Deploy

* Relevant certifications in cloud platforms (e.g., Microsoft Certified: Azure Solutions Architect) and DevOps practices (e.g., Certified Kubernetes Administrator) are a plus

* Experience in database management/performance tuning, particularly MSSQL.

Employee benefits:

* Opportunity to be a part of a 30+ year well-established, high-performance SaaS company.

* Excellent Company Pension scheme and Life Insurance,

* Excellent holiday allowance.

* A supportive team environment with emphasis on learning and development opportunities

* Working with a team of caring, high-performing, and passionate people who have fun supporting our vision, innovation, and continuous improvement.

This Senior Site Reliability Engineer role is working for a market leading global software company and this job is part of a large program of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider.

Please apply with your CV to find out more.

e apply with your CV to find out more

SAS Viya/SAS Platform Engineer - SC Cleared
Careerwise
London
Fully remote
Mid - Senior
£650/day
RECENTLY POSTED
+3

Hiring: SAS Viya/SAS Platform Engineer - Only SC Cleared candidates

Location: United Kingdom, Remote role

Duration - 12 months+ Contract

Rate - Upto £650/Day inside IR35

We are currently seeking an experienced SAS Viya/SAS Platform Engineer to join a high-performing data & analytics team delivering enterprise-grade SAS environments.

If you’re passionate about SAS infrastructure, platform operations, and large-scale ETL/SAS migrations - we’d love to hear from you.

Role Overview

As a SAS Platform Engineer, you will be responsible for designing, implementing, maintaining, and optimizing SAS environments (including SAS Viya). You will play a key role in platform engineering, operational support, and migration initiatives.

Key Responsibilities

  • Administration & support of SAS Viya and SAS 9.x environments
  • SAS infrastructure design, build, and platform operations
  • Performance tuning, monitoring, and troubleshooting
  • ETL workflows and SAS migration projects (on-prem - cloud/version upgrades)
  • Deployment automation and CI/CD integration
  • Environment patching, upgrades, and security hardening
  • Collaboration with DevOps, Cloud, and Data Engineering teams

Required Experience

  • Strong experience with SAS Viya architecture & administration
  • Proven background as:
  • SAS Platform Engineer
  • SAS Infrastructure Engineer
  • Deep understanding of:
  • SAS Metadata, SAS Grid, SAS Compute
  • Linux system administration
  • Kubernetes (for Viya 4)
  • Cloud platforms (AWS/Azure/GCP preferred)
  • ETL processing & SAS Migration experience
  • Infrastructure-as-Code and automation exposure (Terraform, Ansible, etc.)

Nice to Have

  • Experience with containerized SAS deployments
  • DevOps tooling (Git, Jenkins, Azure DevOps)
  • Monitoring tools (Prometheus, Grafana)
  • Financial services/regulated environment exposure

Please share your CV at (see below)

PERM - London - Principle Software Engineer- Digital Assets - Golang, PostgreSQL
TrinIT Group
London
Hybrid
Senior - Leader
£140,000
RECENTLY POSTED
+3

TrinIT Talent are looking for an experienced Principle Software Engineer with deep Golang or Node.js experience to join our customers team based in London. This will be onsite 4 days a week and 1 day remote. This is a permanent position paying up to £140k + 25% bonus + benefits depending on experience.

Job Description:-

Our customer are looking to build out an elite team of Engineers in the Digital Asset/Blockchain space. They are building a team of Principle and Lead Engineers to provide the seniority and guidance for the engineering division.

As a Lead or Principle you will need extensive Leadership experience in a Software Engineering role specifically with Golang. Driving Technical roadmaps and improving operation efficiency rather than just individual development contributions.

Key skills:-

  • Golang/Node.js - must be a deep level expert in Golang
  • Backend expert
  • Distributes systems specialist
  • Long term vision for tech roadmap and architecture
  • Build and grow software engineering teams
  • Ability to Architect high-performance distributed systems
  • Data intensive Application experience and experience in modern databases PostgreSQL, Redis etc
  • DevSecOps mindset - building CI/CD pipelines, IAC and Observability
  • Building API’s into high throughput, data intense applications
  • Ideally have FS/Digital Asset background
  • Nice to have - Azure/GCP architecture, Docker, Kubernetes, Prometheus, DataDog, Jaeger, Data Governance

If you feel you have the right experience for this role, please get in touch by sending your CV in Word format to (see below)

TrinIT Talent will consider applications based only on skills and ability and will not discriminate on any grounds.

PERM - London - Principle Software Engineer- Digital Assets - Golang, PostgreSQL

PERM - London - Lead Software Engineer- Digital Assets - Golang, PostgreSQL
TrinIT Group
London
Hybrid
Senior
£120,000
RECENTLY POSTED
+2

TrinIT Talent are looking for an experienced Lead Software Engineer with deep Golang experience to join our customers team based in London. This will be onsite 4 days a week and 1 day remote. This is a permanent position paying up to £120k + 20% bonus + benefits depending on experience.

Job Description:-

Our customer are looking to build out an elite team of Engineers in the Digital Asset/Blockchain space. They are building a team of Principle and Lead Engineers to provide the seniority and guidance for the engineering division.

As a Lead or Principle you will need extensive Leadership experience in a Software Engineering role specifically with Golang. Driving Technical roadmaps and improving operation efficiency rather than just individual development contributions.

Key skills:-

  • Golang - must be a deep level expert in Golang
  • Ability to Architect high-performance distributed systems
  • Data intensive Application experience and experience in modern databases PostgreSQL, Redis etc
  • Lead teams of engineers and helped shape technology roadmaps for the team
  • DevSecOps mindset - building CI/CD pipelines, IAC and Observability
  • Building API’s into high throughput, data intense applications
  • Ideally have FS/Digital Asset background
  • Nice to have - Azure/GCP architecture, Docker, Kubernetes, Prometheus, DataDog, Jaeger, Data Governance

If you feel you have the right experience for this role, please get in touch by sending your CV in Word format to (see below)

TrinIT Talent will consider applications based only on skills and ability and will not discriminate on any grounds.

PERM - London - Lead Software Engineer- Digital Assets - Golang, PostgreSQL

Senior Software Development Engineer
Permax Recruitment Limited
London
Hybrid
Senior
£100,000
RECENTLY POSTED
+4

Permax Recruitment is working in partnership with a London based firm who are on the lookout for a Software Engineer. For nearly a century, our client has been building a firm as accountants, auditors, tax specialists and close advisors to clients operating in emerging markets, disrupting the status quo.

This has accelerated thanks to the blockchain. In 2017, a client asked to help with an ICO and they have been crypto pilled ever since, developing into what is currently the leading professional services firm on chain.

In 2023, they opened a new leg of the business to carve out a team dedicated to all things Web3, which is now over 80 strong and servicing near 600 digital asset clients globally. They partner with some of the industry’s most influential playerscryptocurrency exchanges, blockchain innovators, Web3 pioneers, and digital asset fundsoffering tailored audit, tax, and advisory services that keep pace with this fast-evolving landscape.

Senior Software Engineer (Cloud Infrastructure & DevOps)

Location: London (Three days in office, two days wfh)

Salary: Approx £100,000 + Bonus

While our team builds data pipelines and reporting tools that enable accountancy teams to work efficiently, this role focuses primarily on managing our AWS infrastructure, supporting the team with robust DevOps practices, and mentoring other developers. You’ll be the technical expert who ensures our systems are scalable, secure, and well-architected as we transition to microservices and ephemeral infrastructure.

Key Responsibilities

Cloud Infrastructure & DevOps (Primary Focus)

Own and manage our AWS infrastructure, acting as the team’s cloud platform expert

Be one of the leaders in the migration toward microservices and ephemeral architecture

Lead in infrastructure as code

Establish and maintain CI/CD pipelines for the team’s data and application projects

Lead the implementation of monitoring, logging, and alerting systems to ensure reliability in our solutions

Manage cloud security, IAM policies, and compliance requirements

Provide infrastructure support and guidance to team members working on data pipelines and applications

Troubleshoot infrastructure and deployment issues

Team Leadership & Mentorship

Mentor other developers on DevOps practices, cloud architecture, and infrastructure concepts, jointly with other senior members

Support and encourage team members in deploying and managing their data pipelines and applications

Conduct code and infrastructure reviews

Develop and share best practices for cloud-native development

Foster a collaborative learning environment within the team

Contribute to technical documentation

Collaboration & Technical Enablement

Enable the team to build and deploy data pipelines efficiently by providing templates and guidance on infrastructure

Work with colleagues to understand their infrastructure needs and provide solutions

Translate infrastructure requirements into scalable, maintainable solutions

Communicate technical concepts clearly to both technical and non-technical stakeholders

Collaborate with accountancy teams to ensure data platform reliability and performance

Technical

5+ years of software engineering experience with a significant cloud infrastructure focus

Understanding of networking, security, and cloud best practices

Hands-on experience with AWS services

Proficiency with infrastructure as code tools

Experience designing and implementing CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar)

Solid understanding of containerization and orchestration (Docker, Kubernetes, ECS)

Experience with monitoring and observability tools (CloudWatch, Datadog, Prometheus, or similar)

Proficiency in bash

Experience supporting development teams with infrastructure and deployment needs

Knowledge of microservices architecture and serverless patterns

Leadership

Experience working in teams outside the realm of Software Engineering

Demonstrated experience mentoring or managing junior Engineers

Strong communication skills with both technical and non-technical audiences

Ability to provide clear technical guidance and support

Pragmatic approach to balancing technical delivery with business needs

Desirable

Python experience for infrastructure automation and tooling

Familiarity with data pipeline infrastructure (supporting ETL workloads, data warehousing)

Experience with data governance and compliance requirements

Cloud cost and resource utilisation optimisation

Experience migrating from monolithic to microservice architectures

What We Offer

Opportunity to shape and own the technical infrastructure

Small, collaborative team environment where your expertise will have a direct impact

Opportunity to create and develop solutions that are new, be impactful

Dress for your diary.

Flexible working hours.

A technology enabled firm.

A Family environment, fantastic retention, hiring due to exceptional growth and internal promotions.

A Fixed profit-sharing bonus scheme payable to all staff.

Brand new central London office.

Plenty of socialising opportunities.

Free breakfast and fresh fruit provided daily.

Network SRE
SR2
London
In office
Senior
£500/day - £525/day
RECENTLY POSTED

Senior Network SRE (Contract)

Location: On-site 5 days a week in London
Rate: 500 per day
Contract: Inside IR35

We’re looking for a Senior Network Site Reliability Engineer (SRE) to join a global network operations team supporting large-scale, business-critical infrastructure. This is a hands-on, high-impact contract role where you’ll play a key part in keeping complex network environments reliable, scalable, and performant.

You’ll take ownership of major incidents, troubleshoot deep technical issues, and drive automation and observability across a multi-vendor estate.

The role:

  • Lead incident management for critical network issues, owning resolution during outages and high-pressure situations
  • Troubleshoot complex network problems across routing, switching, firewalling, and wireless environments
  • Provide technical leadership, setting direction and mentoring other engineers
  • Operate in a 24/7 environment, participating in a shift-based support model
  • Work across a multi-vendor estate, including Arista, Cisco, Cumulus, Palo Alto, Check Point, F5, Netscaler, Aruba, Mist, A10, Spectrum Ethernet, and InfiniBand
  • Support network security and segmentation, including VPN solutions such as GlobalProtect and AnyConnect
  • Drive automation and observability, using modern monitoring, alerting, and configuration tools
  • Contribute to innovation projects, including wireless design and AI cluster deployments

Requirements:

  • 10+ years ofhands-on experience in network engineering and operations
  • Deep expertise in routing, switching, firewalling, and wireless
  • Strong understanding of overlay and underlay networking
  • Solid experience in Linux/Unix environments
  • Proven ability to work independently and take technical ownership

Technical stack & tooling:

Network & Architecture (one required):

  • EVPN & Segment Routing
  • or significant MPLS expertise

Tools & Platforms:

  • NetBox / Nautobot
  • Prometheus / VictoriaMetrics
  • Salt
  • Grafana, Splunk, syslog
  • ServiceNow, BigPanda, ITMP
  • Ansible

Nice to have

  • Experience with InfiniBand and AI cluster deployments
  • Wireless design experience (Cisco, Mist, Aruba)
  • Strong background in network asset management

Please apply with a copy of your CV and Emma from SR2 will contact potential candidates regarding next steps.

Lead Full Stack Developer
HAYS
Swindon
Hybrid
Senior
£60,000
+15

Your new company

You’ll be joining a large, purpose-driven organisation where digital technology plays a critical role in delivering strategic objectives and engaging a wide audience through high quality digital experiences.The IT Digital function is fast-paced and collaborative, focused on enabling the best possible customer experience across public facing platforms. The role is based at a head office location, with hybrid working available. There is an expectation of regular on-site collaboration with the wider team.

Your new role

As a Lead Full Stack Developer, you’ll be responsible for building and maintaining a large-scale public-facing website along with its supporting applications and APIs. You’ll work in small, cross-functional agile teams, delivering high-quality solutions into production on a regular cadence.You’ll architect and implement new features and technologies, placing a strong emphasis on quality, accessibility and user experience. Working closely with Product Owners, engineers and UI/UX designers, you’ll help translate designs into robust, production-ready features while influencing engineering standards and best practices.
The role spans the full technology stack, including:

  • Front-end TypeScript / React applications
  • Java / Spring microservices and APIs
  • A range of databases and cloud-based infrastructure
  • CI/CD pipelines using modern DevOps tooling

You’ll collaborate with infrastructure and operations teams to build out new cloud environments, support live services and resolve production issues. Continuous improvement is core to the team’s culture, and you’ll be encouraged to stay current with industry trends and expand your technical skill set.

What you’ll need to succeed

You’ll have experience working within a highly collaborative, agile delivery environment and a strong background in developing and maintaining modern web-based products. You’ll take pride in building solutions that are performant, reliable, scalable and resilient.

Key experience includes:

  • Java 11+ enterprise development using Spring, REST and/or GraphQL APIs and microservices
  • Modern front-end frameworks such as React (JavaScript / TypeScript); experience with AngularJS or jQuery is advantageous
  • Build and deployment tools including Maven, Git and CI/CD pipelines
  • A strong testing mindset, including unit, functional and integration testing
  • Experience with Docker, Kubernetes and database technologies such as Elasticsearch, MongoDB or PostgreSQL
  • Cloud-native application design and architecture (e.g. containerised, serverless or event-driven systems)
  • Familiarity with BDD practices and monitoring tools such as Grafana, Prometheus or Kibana
  • Experience using collaborative tooling such as Jira, Bitbucket and Confluence

You’ll be someone who cares deeply about code quality, user experience and continuous improvement, and who enjoys working on complex systems that make a meaningful impact.

What you need to do now
If you’re interested in this role, click ‘apply now’ to forward an up-to-date copy of your CV, or call us now.
If this job isn’t quite right for you, but you are looking for a new position, please contact us for a confidential discussion about your career.

Hays Specialist Recruitment Limited acts as an employment agency for permanent recruitment and employment business for the supply of temporary workers. By applying for this job you accept the T&C’s, Privacy Policy and Disclaimers which can be found at hays.co.uk

Cloud Engineer
ECS Resource Group Ltd
Leeds
Hybrid
Mid - Senior
£400/day - £500/day
+1

Cloud Engineer Observability/APM/CMM (Inside IR35)6-month contract
Location: Leeds (Hybrid, 1-2 days per week on-site)
Rate: 400- 500 per day (Inside IR35)

We are supporting a major IT service provider who are looking for a skilled Cloud Engineer specialising in Observability, APM, and Cloud Monitoring & Management (CMM).

You’ll focus on monitoring, performance, reliability, and visibility across cloud platforms, helping improve service health and customer experience.

Key responsibilities:

  • Design and implement cloud observability solutions
  • Build and operate APM and monitoring platforms
  • Implement logging, metrics, tracing, and alerting standards
  • Support incident management, root cause analysis, and service improvement
  • Optimise performance, availability, and reliability across cloud environments
  • Work closely with Cloud, DevOps, and Service teams

Key skills & experience:

  • Strong cloud engineering background (Azure and/or AWS)
  • Observability, APM, and CMM experience
  • Hands-on with monitoring tools (e.g. Azure Monitor, App Insights, Datadog, Dynatrace, New Relic, Grafana, Prometheus)
  • Experience designing alerting, dashboards, and SLOs
  • Understanding of ITIL, service operations, and MSP environments
  • Infrastructure as Code and automation experience beneficial

Further information available upon application.
Please contact (url removed) or call (phone number removed) for more information.

ECS Recruitment Group Ltd is acting as an Employment Business in relation to this vacancy.

Site Reliability Engineer / SRE / Systems Engineer (AWDO-P14376)
AWD online
Manchester
Remote or hybrid
Mid - Senior
£70,000
+6

Site Reliability Engineer / SRE / Systems Engineer A fantastic opportunity for a Site Reliability Engineer / Systems Engineer to support highly available, scalable production systems within a fast-growing technology environment, working across cloud platforms, DevOps, networking and operational resilience. If you’ve also worked in the following roles, we’d also like to hear from you: DevOps Engineer, Operations Engineer, Cloud Engineer, Platform Engineer, Systems Engineer, Infrastructure Engineer, Production Engineer SALARY: up to £70,000 per annum (depending on experience) + Benefits LOCATION: Remote and Hybrid Working Options Available. You can either work remotely of if you prefer Hybrid working from home and the office in Altrincham, Greater Manchester, North West England JOB TYPE: Full-Time, Permanent JOB OVERVIEW We have a fantastic new job opportunity for a Site Reliability Engineer / Systems Engineer to join a growing technology team focused on delivering reliable, scalable and resilient platforms and services. As a Site Reliability Engineer/ Systems Engineer you will act as the vital link between operations, end users and backend development teams, ensuring system availability, performance optimisation and effective incident management across live environments. This Site Reliability Engineer/ Systems Engineer role offers the chance to work with modern cloud technologies, containerisation, observability tools and automation practices, while influencing long-term reliability improvements across business-critical systems. APPLY TODAY Ready to make your next career move? Apply Now for our Recruitment Team to review. DUTIES Your duties as the Site Reliability Engineer / Systems Engineer include: • Incident Triage and Ownership: Acting as first-line technical escalation for live production issues through to resolution or handover • System Monitoring and Availability: Maintaining high availability, performance and scalability of production platforms and services • Observability Implementation: Managing logging, monitoring, alerting and metrics to proactively identify and resolve issues • Reliability Improvements: Collaborating with development teams to translate operational insights into long-term platform resilience • Automation and Resilience: Supporting automation, incident response and continuous improvement practices • New Service Support: Ensuring new products and features are operable, reliable and scalable from day one • Cross-Team Collaboration: Working with network engineering, operations and support teams to diagnose service issues • Documentation and Reporting: Creating and maintaining runbooks, escalation guides and incident reports • Incident Prioritisation: Balancing customer impact with long-term system health and stability • Security and Compliance: Supporting compliance with security, availability and regulatory frameworks CANDIDATE REQUIREMENTS ESSENTIAL • Previous experience in a Site Reliability Engineer, DevOps Engineer, Systems Engineer or Operations Engineer role • Experience supporting production services at scale within a DevOps or SRE environment • Strong working knowledge of ISP-related networking concepts including DNS, DHCP, PPPoE, RADIUS and IPv4/IPv6 • Experience with observability tools such as Prometheus, Grafana, ELK or Splunk • Hands-on experience with containerisation and orchestration using Docker and Kubernetes • Cloud platform experience, ideally Google Cloud Platform, including automation and scaling practices • Strong Linux administration skills with scripting capability in Bash, Python or similar • Familiarity with CI/CD pipelines and source control tools such as GitHub Actions • Understanding of security frameworks and operational resilience best practices DESIRABLE • Experience within ISP, MSP or telecommunications environments • Familiarity with enterprise IT architectures including OSS and BSS systems • Knowledge of information security frameworks such as ISO27001, NIST or GDPR • Experience with infrastructure automation tools such as Terraform or Ansible BENEFITS • Smart casual dress code • Free access to gym facilities • Access to a financial wellbeing platform (on successful completion of probationary period) • Access to an employee assistance programme, Virtual GP and Elderly Care support (on successful completion of probationary period) • Access to cycle to work, childcare, and electric vehicle schemes after six months • Brand new office with excellent transport links • Supportive team culture, growth and career progression HOW TO APPLY To be considered for this job vacancy, please submit your CV to our Recruitment Team who will review your details. CV’s of Job Applicants meeting this requirement will be submitted to our Client for consideration. By submitting your job application to us you are hereby giving us your express consent to submit your details to our Client for this purpose. JOB REF: AWDO-P14376 Full-Time, Permanent Jobs, Careers and Vacancies. Find a new job and work in Altrincham, Greater Manchester, North West England. Multi-Job Board Advertising and CV Sourcing Recruitment Services provided by AWD online. AWD online specialise in sourcing candidates and advertising vacancies on multiple job boards for companies on a non-commission basis. AWD online operates as an employment agency. awd online | http://www.awdo.co.uk AWD-IN-SPJ

Platform Engineer
Better Days Recruitment Ltd
Horley
Hybrid
Mid - Senior
£45,000 - £55,000
+4

A new permanent opportunity to join a fast growing, successful energy, technology and data organisation as a Platform Engineer.

The role, located in Surrey is to enhance the software development process, platforms and environments all running on the Microsoft suite. Working across and with the development, testing and infrastructure teams to deliver high quality solutions and services across the business.

You will have solid experience at a mid-senior level in standardising development practices and tooling to establish frameworks wherever possible. Keeping ahead of best practices in Platform Engineer and to challenge and have an opinion. Elevate new DevOps technologies and to drive their adoption across the software engineering teams. Automate data capture and reporting against agreed KPIs . Document to-be processes and facilitate their adoption. Support the software testing function in creating and refreshing data in test environments. Mentoring of junior members of staff and deputising for the team leader as and when required.

You will be comfortable with scripting in a variety of languages and experienced with configuration and have come from a development background. Happy and confident to recommend standards and helping to adopt best practices along with reviewing of new technologies.

There is hybrid working on offer and the office centrally located with plenty of local parking and a short walk from a train station. There is a competitive salary and excellent company benefits on offer.

Skills/Experience/attributes:

  • Minimum of four-six years solid experience as a Platform Engineer or within as similar role is essential
  • Degree educated
  • Strong experience of configuration and scripting
  • Experience from a Development background
  • Experience of message queues (Rabit MQ)
  • Infrastructure as code experience (Teraform)
  • Server-side scripting, automation and batch processes - PowerShell and Bash are essential
  • Experience of Git version control and branching strategies
  • Observability (Grafana, Prometheus, Dynatrace or a similar tool set are essential
  • Administration and configuration of development management tools (Jira, Confluence and Azure DevOps Server)
  • Experience of the Microsoft tech stack for development (C#,.Net, VB.Net, NuGet).
  • Highly organised with a high attention for detail
  • Strive to get things right first time
  • Demonstrate a customer focused approach
  • Professional, confident and calm even in challenging situations
  • Strive to meet objectives and improve performance
  • Supportive and helpful team player
  • Articulate, professional with clear verbal and written communication skills
Page 1 of 3
Frequently asked questions
Our job board features a variety of Prometheus roles including Monitoring Engineer, DevOps Engineer, Site Reliability Engineer (SRE), and Cloud Infrastructure Specialist positions that require Prometheus expertise.Commonly required skills include proficiency with Prometheus for metrics collection and monitoring, experience with Grafana for dashboards, knowledge of alerting rules, familiarity with Kubernetes and cloud platforms, and strong Linux system administration skills.Yes, you can filter job listings by experience level such as entry-level, mid-level, and senior roles to find Prometheus positions that match your career stage.Absolutely. Many companies post remote or hybrid Prometheus jobs on our platform. You can easily filter your search results to find remote opportunities.New Prometheus job listings are added daily as companies continuously seek monitoring and DevOps professionals skilled with Prometheus.
Feedback
Contact