OpenTelemetry Jobs
Overview
Looking for the best OpenTelemetry jobs? Discover top IT positions specializing in OpenTelemetry on Haystack, your go-to job board for cutting-edge observability and monitoring careers. Whether you're an experienced engineer or a developer new to OpenTelemetry, find your next role with leading companies embracing cloud-native and distributed tracing technologies. Start your OpenTelemetry job search today and advance your career in this fast-growing field!
Site Reliability Engineer
CBSbutler Holdings Limited trading as CBSbutler
Milton Keynes
Hybrid
Mid - Senior
£390/day
RECENTLY POSTED

Role Title: Site Reliability Engineer

Location: Milton Keynes/Hybrid (3 days on site)

Duration: 6 months contract

Rate: £390 per day, inside IR35

Role Description:

Join a leading global IT consultancy and digital transformation organisation at the forefront of cloud, automation and secure platform engineering. We’re looking for a Kubernetes-first engineer who wants to own and evolve a modern, enterprise-scale platform spanning AWS, Azure and on-prem. This is a hands-on role with real influence over reliability, security and architecture.

Responsibilities:

  • Operate and enhance our Kubernetes platform across AWS, Azure, and on-prem.
  • Lead incident response, problem management, and root cause analysis.
  • Deliver cluster lifecycle work: upgrades, patching, node pools, CNI/CSI, ingress, and Rancher operations.
  • Own observability, dashboards, alerting, and SLOs/SLIs.
  • Implement GitOps (Fleet) and reduce toil through automation and strong governance.
  • Apply secure API gateway and WAF patterns.
  • Work with distributed system patterns, including event brokers and asynchronous messaging.
  • Maintain security posture: CVE remediation, GRC controls, scanning pipelines.

Required Skills:

  • Deep knowledge of Kubernetes, Rancher, GitOps, Linux, and cloud networking.
  • Understanding of API gateway and WAF patterns.
  • Experience with distributed systems and event-driven architectures.
  • Strong automation/scripting (Python, Go, Bash, PowerShell, .NET).
  • IaC:
      o Terraform for foundational/bootstrap cluster provisioning.
      o Crossplane as an orchestration layer (leveraging Terraform providers).
  • Ability to work securely within PCI DSS / GDPR patterns.
  • CI/CD: Concourse, GitHub Actions, Azure DevOps.
  • Observability: Grafana, Prometheus, Jaeger/Tempo, CloudWatch, Loki, OpenTelemetry.

Nice to Have:

  • AWS operational experience.
  • Service mesh (Istio/Kuma).
  • Hybrid cloud experience (AWS + Azure + on prem).
  • Payments or regulated industry background.

If you are interested in this role or wish to apply, please feel free to submit your CV.

Observability/Monitoring & Telemetry Consultant
Sanderson Government & Defence
Bristol
Hybrid
Mid - Senior
Private salary
RECENTLY POSTED

Location: Bristol (Hybrid - 3 days in office)
Employer: Specialist Data & Observability Consultancy

This consultancy helps organisations turn high-volume, noisy telemetry and log data into clear, decision-ready insight - feeding SIEM platforms, observability stacks, and data lakes. You’ll sit at the intersection of discovery, design, delivery, and operational excellence, helping clients solve real problems around data quality, detection efficacy, and operational resilience.

What You’ll Do

  1. Lead Discovery & Client Engagement

You’ll be the front-facing consultant responsible for understanding what clients actually need.
This includes running structured discovery workshops to map:

  • Data sources (platforms, agents, syslog, APIs, cloud-native feeds).
  • Event volumes, constraints, ownership, governance, and data lineage.
  • The real business question - e.g. reducing SIEM cost, improving detection, stabilising pipelines, or enhancing observability.

You then turn this into:

  • A clear view of current state & target state
  • A roadmap of recommended changes
  • A sprint backlog with a clear definition of done
  • Decision-grade outputs clients can act on immediately

  2. Translate Strategy into Technical Designs

You act as the bridge between leadership objectives and engineering realities.
This includes mapping:

  • Cost vs detection fidelity
  • Operational overhead vs automation opportunities
  • Log/metric/tracing design choices
  • Risk, resilience & failure modes in data flows

  3. Design Full Telemetry Pipelines

You’ll build end-to-end designs for telemetry pipelines across the stages:

  • Collection: agents, collectors, syslog, cloud-native logs, APIs
  • Routing: multi-destination delivery, buffering, retries, backpressure handling
  • Transformation: parsing, enrichment, filtering, PII masking/redaction
  • Standardisation: OpenTelemetry conventions; OCSF mapping where relevant
  • Quality: sampling, validation, acceptance criteria, rollback plans

This is a blend of observability engineering, security telemetry design, data engineering, and consultancy.

  4. Support Delivery & Deployment

You’ll work closely with engineers who deploy the pipelines you design - ensuring what you create is practical, scalable, and resilient.

You help shape:

  • Reusable design patterns
  • Deployment artefacts (config packs, templates, patterns)
  • Standardised service definitions that the team can use across clients

  5. Contribute to the Managed Service

The consultancy operates an “Operate” service - you help shape:

  • Onboarding patterns
  • Runbooks, health checks, platform boundaries
  • Minimum viable operate checklists
  • Upgrade/patch and maintenance approaches

This is ideal for someone who likes production-grade operational thinking as well as design.

What You Need

Choose Your Primary Lens (you only need one)

A) Security/SIEM Lens

You may have experience with:

  • SIEM concepts and event pipelines
  • Telemetry-to-use-case mapping for security detections
  • Threat modelling and detection life cycle management
  • Normalisation approaches including OCSF
  • Understanding how log quality impacts threat severity and efficacy

This role is great for:

  • Security engineers
  • Detection engineers
  • SIEM consultants
  • Cyber defence specialists

B) Observability/ITOps Lens

You might bring experience in:

  • Service decomposition and architecture thinking
  • Metrics, logs, tracing and correlation
  • SLIs/SLOs and reliability engineering principles
  • Incident/problem root-cause thinking
  • OpenTelemetry-first design approaches
  • Modern observability tools (APM, tracing, log pipelines, dashboards)

This is ideal for:

  • Observability engineers
  • SREs/Reliability engineers
  • Platform or ITOps engineers

Additional general skills

  • Ability to run workshops confidently and extract the right information
  • Strong documentation & communication skills
  • An interest in data quality, data flow design, and outcomes over technology
  • Ability to collaborate with engineers, architects, and leadership

Why Candidates Love This Role

  • Proper consulting: You lead real discovery, not just implement tickets.
  • Variety: Work across multiple platforms, environments, and client types.
  • High impact: Your designs directly reduce cost, risk, and operational headaches.
  • Modern tech: Heavy focus on OpenTelemetry, SIEM modernisation, cloud-native logs, and structured pipelines.
  • Intellectual challenge: This role blends strategy, design, engineering, and operational thinking - perfect for people who enjoy solving complex problems.
  • Career development: Exposure to both observability and security telemetry gives you long-term career flexibility.

Please reach out if you’d like to know more!

Reasonable Adjustments:

Respect and equality are core values to us. We are proud of the diverse and inclusive community we have built, and we welcome applications from people of all backgrounds and perspectives. Our success is driven by our people, united by the spirit of partnership to deliver the best resourcing solutions for our clients.

If you need any help or adjustments during the recruitment process for any reason, please let us know when you apply or talk to the recruiters directly so we can support you.

Technical Architect
RCRTR
Swansea
Hybrid
Senior - Leader
£470/day - £500/day

Immediate Start

Hybrid Working required - 2 days a week onsite

Role Purpose

The Technical Architect will provide end-to-end technical leadership across design, architecture, and delivery of modern cloud-based systems. The role focuses on AWS cloud architecture, Java-based services, API-led design, microservices patterns, and schema-driven approaches to ensure high-quality, scalable, resilient digital services.

You will work closely with engineering teams, product owners, and delivery leads to define technical direction, set standards, and ensure solutions are secure, maintainable, and aligned to organisational goals.

Key Responsibilities

Technical Architecture & Solution Design

Design end-to-end architectures for cloud-native services built on AWS.

Define and document high-level and detailed designs, architectural patterns, and technical specifications.

Architect Java-based services using frameworks such as Spring / Spring Boot.

Create robust, well-governed API architectures (RESTful, event-driven, synchronous/asynchronous).

Lead the design of distributed microservices, ensuring scalability, resilience, observability, and maintainability.

Apply schema-driven design principles (e.g., JSON Schema, OpenAPI, AsyncAPI) to ensure consistent and reusable domain models.

Cloud & Platform Engineering (AWS)

Select and integrate AWS services (EC2, Lambda, API Gateway, S3, DynamoDB, RDS, ECS/EKS, etc.) based on architectural requirements.

Ensure cloud solutions meet organisational standards for cost, performance, security, and operational readiness.

Promote Infrastructure-as-Code practices using Terraform / CloudFormation.

Guide adoption of serverless and container-based patterns where appropriate.

Leadership & Technical Governance

Provide technical leadership to engineers, developers, DevOps, and platform teams.

Lead architecture reviews, design assurance, and technical direction across multiple teams or projects.

Define and maintain engineering and architectural standards, patterns, and best practices.

Conduct option evaluations, risk assessments, and make informed technology recommendations.

API, Integration & Data Modelling

Design API ecosystems including versioning, discovery, governance, throttling, and security.

Define schemas, data contracts, and integration patterns for internal and external systems.

Promote consistent schema models across teams to enable loose coupling and robust interoperability.

Collaboration & Stakeholder Engagement

Work closely with product managers, delivery managers, business analysts, and engineering leads to translate requirements into technical designs.

Communicate complex technical concepts to technical and non-technical stakeholders.

Act as a key point of contact for architectural decision-making.

Security, Compliance & Quality Assurance

Ensure secure-by-design principles across all solutions.

Support threat modelling, risk assessments, and compliance with security and data protection requirements.

Drive non-functional requirement definition (performance, scalability, availability, resilience).

Support test strategy and quality engineering practices.

Essential Skills & Experience

Strong experience as a Technical Architect, Solutions Architect, or Senior Engineer with architecture responsibilities.

Proven track record designing cloud-based applications on AWS.

Strong hands-on background in Java, Spring Boot, and JVM-based service architectures.

Deep experience with APIs (REST, event-driven, messaging), microservices, and distributed system design.

Strong understanding of schema-driven design, using tools such as JSON Schema, OpenAPI, or AsyncAPI.

Solid understanding of cloud security, identity and access management, and operational best practices.

Experience working with Infrastructure-as-Code (Terraform, CloudFormation).

Excellent communication skills and the ability to influence technical decisions.

Desirable Skills

AWS certification (Architect Associate / Professional).

Experience with API Gateway, Lambda, DynamoDB, or Kubernetes-based workloads.

Experience with event-driven architecture (SNS/SQS, Kinesis, Kafka).

Experience with CI/CD pipelines, DevOps tooling, and automated delivery.

Experience with containerisation (Docker, ECS/EKS).

Familiarity with architectural frameworks (C4 model, ArchiMate, TOGAF).

Knowledge of observability tooling (CloudWatch, ELK, Grafana, OpenTelemetry).

Data Engineer - Contract - 9+ Months
CBSbutler Holdings Limited trading as CBSbutler
Sheffield
In office
Mid - Senior
£390/day - £442/day

Data Engineer (Contract)
9+ Month Contract based in Sheffield
£395 - £442 per day, Inside IR35
BPSS clearance required - candidates must be eligible

My client is seeking a Data Engineer to design and operate large-scale telemetry and observability data pipelines within a modern OpenShift and Kafka ecosystem. This role is central to enabling proactive, Level 4 observability, delivering high-quality metrics, logs, and traces to support platform reliability, operational insight, and automation.

Responsibilities:

* Design, implement and maintain scalable data pipelines to ingest and process OpenShift telemetry (metrics, logs, traces)
* Stream telemetry through Kafka (producers, topics, schemas) and build resilient consumer services for enrichment and transformation
* Engineer multi-tenant observability data models, ensuring data lineage, quality controls and SLAs across streaming layers
* Integrate processed telemetry into Splunk for dashboards, analytics, alerting and operational insights
* Implement schema management and governance using Avro/Protobuf, including versioning and compatibility strategies
* Build automated validation, replay and backfill mechanisms to ensure data reliability and recovery
* Instrument services using OpenTelemetry, standardising tracing, metrics and structured logging
* Apply LLMs to enhance observability, such as query assistance, anomaly summarisation and runbook generation
* Collaborate with Platform, SRE and Application teams to align telemetry, alerts and SLOs
* Ensure pipelines meet security, compliance and best-practice standards
* Produce clear documentation covering data flows, schemas, dashboards and operational runbooks

Skills & Experience:
* Strong hands-on experience building streaming data pipelines with Kafka (producers/consumers, schema registry, Kafka Connect, KSQL/KStreams)
* Experience with OpenShift / Kubernetes telemetry, including OpenTelemetry and Prometheus
* Proven capability integrating telemetry into Splunk (HEC, Universal Forwarders, sourcetypes, CIM, dashboards, alerting)
* Solid data engineering skills in Python (or similar) for ETL/ELT, enrichment and validation

Please apply for immediate interview!

CBSbutler is operating and advertising as an Employment Agency for permanent positions and as an Employment Business for interim / contract / temporary positions. CBSbutler is an Equal Opportunities employer and we encourage applicants from all backgrounds.

Site Reliability Engineer
Searchability Ltd
Wigan
Hybrid
Mid - Senior
£70,000

KEY POINTS
* Up to £70,000 salary
* Hybrid working with three days a week onsite in Greater Manchester
* Modern SRE environment with cloud-native tooling (AWS, Kubernetes, Terraform)
* High-availability digital platforms and performance-critical workloads

ABOUT THE CLIENT
We’re supporting a well-established UK organisation recognised for operating large-scale, high-availability digital services. With continued investment into platform reliability and engineering maturity, they’re looking to appoint an experienced Site Reliability Engineer to strengthen a growing SRE function.

THE BENEFITS
* Exposure to modern cloud-native tooling and reliability practices
* High-impact role supporting major digital events
* Strong engineering culture with collaboration across product, operations and platform teams

THE SITE RELIABILITY ENGINEER ROLE:
As a Site Reliability Engineer, you’ll ensure the reliability, performance and scalability of critical digital platforms. You’ll monitor production systems, refine SLAs/SLOs and error budgets, design scalable solutions, improve architecture through telemetry insights, and build dashboards that provide clear visibility of system health. You’ll also contribute to performance testing strategies and collaborate with engineering, operations and compliance teams to maintain high standards across the platform.

SITE RELIABILITY ENGINEER ESSENTIAL SKILLS
* Strong understanding of reliability engineering, scalable architectures and performance optimisation
* Experience with observability, debugging and incident response
* Proficiency in a programming language for automation and tooling (Go or .NET preferred)
* Cloud experience, ideally AWS, and knowledge of container orchestration (Kubernetes) and Infrastructure as Code (Terraform)
* Experience with monitoring and observability tools such as Grafana, Prometheus or OpenTelemetry
* Strong understanding of networking fundamentals and distributed systems
* Ability to collaborate effectively with engineering, operations and product teams

TO BE CONSIDERED:
Please apply through this advert.
For further information please call me on 01244 567 930 / 07833 460 873.
By applying for this role, you give express consent for us to process and submit (subject to required skills) your application to our client in conjunction with this vacancy only.

KEY SKILLS
SRE, Site Reliability Engineer, AWS, Kubernetes, Terraform, Observability, Performance, SLAs/SLOs, Monitoring, Automation, Go, .NET, Distributed Systems, Cloud-Native Engineering

Frequently asked questions
Haystack offers a wide range of OpenTelemetry jobs including roles such as OpenTelemetry engineers, observability specialists, instrumentation developers, and site reliability engineers with expertise in OpenTelemetry.

While formal certifications in OpenTelemetry are beneficial, most employers look for practical experience with OpenTelemetry tools, distributed tracing, and metrics collection. Relevant experience and demonstrated skills often matter more than certifications.

Yes, Haystack features remote, hybrid, and on-site OpenTelemetry job listings to accommodate different work preferences and global talent pools.

Highlight your experience with OpenTelemetry frameworks, distributed tracing, metrics, and logging. Include specific projects or outcomes demonstrating your expertise in observability and telemetry data analysis.

Yes, Haystack lists entry-level roles and internships where candidates can grow their skills in OpenTelemetry and observability technologies. Look for job descriptions mentioning junior or associate-level positions.