Make yourself visible and let companies apply to you.
Roles
Datadog Jobs in London
Overview
Looking for Datadog jobs in London? Explore the latest opportunities on Haystack, your go-to IT job board for top Datadog roles in the heart of the UK tech scene. Whether you're a monitoring expert or DevOps professional, find the perfect Datadog position in London to advance your career today.
Lead DevOps Engineer
Data Careers
London
Fully remote
Senior
£75,000
RECENTLY POSTED
+2

Permanent

Location: UK- Remote

Salary: £70,000 - £75,000 (+ benefits)

Skills: AWS, Terraform, CI/CD, Production SaaS experience

We are looking to recruit a Lead DevOps Engineer for a leading software company. This is a hands-on technical leadership role, ideal for someone who enjoys owning AWS infrastructure strategy while remaining close to engineering delivery.

You’ll play a key role in shaping platform standards, improving reliability, embedding security best practice, and driving automation across the organisation.

This is a fully remote UK based role.

The Role

Platform Architecture & Cloud Engineering

Own AWS multi-account infrastructure architecture (secure-by-design)

Define infrastructure standards across networking, IAM, logging and disaster recovery

Lead Infrastructure-as-Code strategy (Terraform preferred)

Ensure scalability, resilience and high availability across production environments

CI/CD & Release Automation

Design and optimise CI/CD pipelines

Improve deployment reliability and reduce rollback frequency

Standardise release processes across engineering teams

Implement progressive delivery practices

Reliability & Observability

Define and track SLIs/SLOs

Enhance monitoring, alerting and incident response processes

Lead post-incident reviews and root cause analysis

Drive reduction of operational toil

Security & Compliance

Embed DevSecOps controls into pipelines

Implement least-privilege IAM models

Support ISO 27001 and compliance evidence automation

FinOps & Cost Optimisation

Partner on cloud cost optimisation strategy

Improve tagging standards and cost allocation models

Implement rightsizing and automation policies

About You

5+ years’ experience in DevOps / Cloud Engineering

Strong AWS expertise (VPC, IAM, EC2, RDS, EKS, Lambda)

Proven Infrastructure-as-Code experience (Terraform preferred)

CI/CD tooling experience (GitHub Actions, GitLab CI, Jenkins)

Experience operating production SaaS environments

Strong observability tooling knowledge (Datadog, Prometheus, ELK etc.)

Incident management and root cause analysis experience

Experience in regulated or security-conscious environments is highly desirable

TPBN1_UKTJ

Lead Platform Engineer
Resident-Advisor
London
Hybrid
Senior
£85,000
RECENTLY POSTED
+10

Founded in 2001, Resident Advisor (RA) is one of the world’s longest-running music media brands and a cornerstone of the dance, electronic and DJ ecosystem. The site’s audience of over 7 million monthly users is drawn in by a combination of news, editorial, club listings and ticketing, RA-branded events at venues and festivals worldwide, original films and a weekly mix series that has run for 20 years.

Purpose of the role:

We’re looking for a Lead Platform Engineer passionate about electronic music to join our Core Platform team. You’ll lead a small, focused team helping scale our high-traffic infrastructure that handles massive spikes during on-sale events. Our tech stack includes Node.js/.NET microservices, PostgreSQL/MSSQL databases, ElasticSearch, Redis, and Kafka running on AWS EKS (Kubernetes), managed via Terraform with CI/CD pipelines and DataDog monitoring.

Your responsibilities include improving infrastructure performance and reliability, driving modernization and cost optimization, developing shared components (i.e. auth systems, GraphQL gateways), enhancing developer experience, maintaining E2E testing systems, and creating internal tooling. This is an opportunity to solve challenging scale problems while shaping the technical foundation that powers RA’s products for the electronic music community.

Key responsibilities

  • Managing and improving AWS infrastructure using infrastructure-as-code practices
  • Database maintenance and query optimization
  • Cloud cost optimization and resource efficiency
  • Researching, planning, and leading the evolution of backend infrastructure, transitioning from reactive maintenance to a proactive, future-proof platform roadmap
  • Providing functional and technical training to the broader development team, fostering a culture of shared knowledge and operational excellence
  • Writing application code to support infrastructure tooling and shared platform services
  • Leading, mentoring, and supporting a team of two site reliability engineers and one QA Engineer including regular 1:1s, goal setting, and performance feedback
  • Owning team capacity planning and workload prioritization, ensuring the right work gets done in the right order
  • Collaborating with the QA Engineer to define and uphold quality standards, integrating QA practices into the wider development team’s process
  • Acting as a technical escalation point for your team and a key voice in broader platform and architecture decisions

Required Skills:

  • Experience working with AWS
  • Experience with Kubernetes and containerized application deployment
  • Experience with infrastructure-as-code technologies (Terraform preferred)
  • Experience deploying and maintaining SQL-based databases (PostgreSQL or MS SQL Server)
  • Experience writing production-level application code
  • Experience building and maintaining CI/CD pipelines
  • Experience with git and version control workflows
  • Strong documentation writing skills and commitment to knowledge sharing
  • Proven experience leading or mentoring engineers in a formal or informal capacity
  • Strong interpersonal skills with openness to giving and receiving feedback
  • Fluent English communication skills (written and verbal)
  • Experience with DataDog or similar for monitoring and alerting

Desired Skills:

  • Experience with both PostgreSQL and MS SQL Server
  • Advanced SQL query optimization skills
  • Experience deploying and maintaining Kafka for event-driven architectures
  • Experience deploying and maintaining ETL pipelines
  • Deep understanding of Site Reliability Engineering principles and best practices
  • Experience with backend languages such as C# and Node.js
  • Experience with scripting languages (such as Bash, Python, PowerShell)
  • Experience managing high-traffic systems with significant load variations
  • Familiarity with QA methodologies and E2E testing frameworks, and an appreciation for how reliability and quality practices intersect

What we offer you:

  • Generous annual leave policies aimed at promoting work-life balance.
  • Flexibility in working arrangements, offering hybrid or remote work options based on role requirements and location.
  • Matching pension schemes and/or 401k.
  • Comprehensive staff wellbeing initiatives, featuring regular activities and workplace programmes to support mental and physical health. This includes discounted Classpass memberships, and custom-fitted earplugs.
  • Company-led social events, team lunches, and discounts on RA merch.
  • Paid annual volunteering allowance, encouraging contributions to community projects and charities you care about.
  • Regular company-wide Q&As with senior leadership, along with ongoing virtual educational training sessions for all staff.
  • A transparent internal company culture committed to diversity, equity, and inclusion (see:https://ra.co/about/diversity).
  • Active involvement in community projects within the electronic music community (see: https://ra.co/about/community).

More about RA:

As an independent company run by devoted dance music enthusiasts, our mission is to bring together the world’s electronic music communities. Our Global Contributor Network (GCN) and international teams help us establish connections with hyper-niche local scenes.

We became B-Corp certified in 2024, which counts us among businesses leading a global movement for an inclusive, equitable, and regenerative economy and part of a community that meets high social and environmental impact standards.

We especially welcome applicants from diverse backgrounds, abilities, ethnicities, experiences, gender identities, and sexual orientations. We aim for our team to reflect the communities we engage with. We ensure everyone is valued and respected by actively promoting equality, diversity, and inclusion in our workplace.

Our values:

  • Electronic music is art.
  • We celebrate the progressive values that underpin electronic music.
  • We advocate for a more inclusive and equitable electronic music community.
  • We honour the past, present and future of electronic music.
  • We use innovation to empower the community.
  • We choose honesty over gain and purpose over profit.
  • We’re always front left

This role is a full-time position, based in London. Please note: This position requires applicants to be based in London and to work from the office three days per week. The annual salary range for this role is £75,000 £85,000.This listing will be open for a maximum of two weeks from the 6th March.

Senior Site Reliability Engineer
McCabe & Barton
City of London
In office
Senior
£700/day
RECENTLY POSTED

Site Reliability Engineer

6 Months contract

Overview

We are looking for a proactive and technically strong SRE Engineer to join a leading Investment clients operations team on a contract basis. This role blends cloud engineering, automation, monitoring, and structured operational support.

You will play a key role in maintaining platform reliability, improving automation, strengthening observability, and ensuring operational best practice across our Microsoft Azure environment.

This position is ideal for someone who enjoys solving complex problems, automating repetitive tasks, and working within a well-structured ITIL-aligned operations environment.

Key Responsibilities

Design, develop, and maintain automation solutions using PowerShell to optimise workflows and improve operational efficiency

Monitor platform performance, availability, and security using tools such as Datadog

Support and maintain Microsoft Azure services, including governance, networking, and security configurations

Manage and support identity and endpoint services including:

Microsoft Intune

Microsoft Entra ID (Azure AD)

Active Directory (Group Policy, user lifecycle, device management, conditional access)

Participate in Incident and Change Management processes aligned to ITIL best practices

Investigate, diagnose, and resolve technical issues using structured troubleshooting methods

Contribute to continuous improvement of operational processes, documentation, and automation tooling

Work within defined SLAs, escalation paths, and change controls

Technical Skills & Experience

Essential

Strong PowerShell scripting experience for automation and infrastructure tooling

Hands-on experience with monitoring and observability platforms (preferably Datadog)

Solid working knowledge of Microsoft Azure services, governance, networking, and security

Practical experience with:

Microsoft Intune

Microsoft Entra ID (Azure AD)

Active Directory (Group Policy, identity lifecycle, device management)

Experience working in structured operations or support environments with defined SLAs and change management controls

Good understanding of ITIL principles, particularly Incident and Change Management

Desirable

Experience improving monitoring maturity and observability practices

Experience implementing automation to reduce operational overhead

Exposure to security best practices in cloud environments

TPBN1_UKTJ

Senior Golang Developer - Kubernetes - Financial Services
Rothstein Recruitment Ltd
London
In office
Senior
£130,000
RECENTLY POSTED
+11

Excellent opportunity opens for an experienced Developer strong on Golang with experience in AWS and Kubernetes to join a highly regarded Financial Services entity’s London office. You will act as the team lead and play a key role in building mission-critical financial applications that power trading, investment, and risk management systems across the firm.

If you are passionate about working in a dynamic, fast-paced environment and are eager to apply your technical expertise to the financial services industry, this is the role for you.

Key Responsibilities:

  • Design, develop, and maintain high-performance Back End services using GoLang to support financial applications and services, including trading platforms, investment systems, and risk management tools.
  • Build and deploy cloud-based solutions using Amazon Web Services (AWS), including services such as EC2, S3, RDS, DynamoDB, and Lambda to create scalable, reliable, and secure infrastructure.
  • Implement and manage containerized applications using Kubernetes, ensuring seamless orchestration, scaling, and resilience in a cloud environment.
  • Write clean, efficient, and well-documented code while following best practices for financial systems development, focusing on performance and security.
  • Collaborate with other development teams, business analysts, and stakeholders to define and refine requirements, and ensure that applications meet financial regulatory standards and business needs.
  • Optimize the performance of Back End services, ensuring low-latency responses and high availability, critical for financial services.
  • Implement CI/CD pipelines, automated testing, and monitoring systems to ensure the reliability and stability of production systems.
  • Proactively identify issues and bottlenecks in existing systems and propose solutions to improve the system’s performance and scalability.
  • Stay updated with new tools, technologies, and industry trends in cloud computing, containerization, and financial systems to continuously improve development practices and outcomes.

Ideal Skills:

  • Proven experience (2+ years) in GoLang Back End development, with a strong focus on performance optimization and building scalable systems for high-volume, high-frequency financial applications.
  • Strong experience working with Amazon Web Services (AWS), including EC2, S3, RDS, DynamoDB, Lambda, and other cloud-native technologies.
  • Hands-on experience with Kubernetes for deploying, managing, and scaling containerized applications in a cloud environment.
  • Solid understanding of financial systems and services, particularly in areas such as trading platforms, investment management, and risk analytics.
  • Experience in building microservices architectures and working with APIs (RESTful, gRPC, etc.) to integrate various systems.
  • Strong knowledge of containerization (Docker) and continuous integration/deployment (CI/CD) practices.
  • Experience with database systems (relational and NoSQL) and working with financial data.
  • Familiarity with DevOps practices and tools to streamline the development life cycle, such as infrastructure-as-code (eg, Terraform or CloudFormation).
  • Ability to troubleshoot and resolve issues in production environments, ensuring uptime and performance in high-pressure, mission-critical scenarios.
  • Excellent communication skills to collaborate effectively with cross-functional teams and stakeholders in a fast-paced financial environment.
  • Experience with serverless computing (AWS Lambda, etc.) to create efficient and scalable solutions.
  • Knowledge of financial industry regulations and standards, particularly around data security and privacy.
  • Familiarity with event-driven architectures or message queues (eg, Kafka, RabbitMQ) for Real Time data processing.
  • Experience with automated testing frameworks and continuous delivery tools like Jenkins, GitLab CI, or CircleCI.
  • Understanding of performance monitoring and observability tools such as CloudWatch, Prometheus, or Datadog.

Interested? Please Apply!

Golang Go AWS Kubernetes Terraform Bank Banking Finance Financial Services Crypto Blockchain Web3 Trading Exchange Digital Assets Hybrid Flexible Developer Software Engineer Backend Developer Golang Engineer Kafka Apache Kafka RabbitMQ AWS Lambda Cloud Platform

Senior Software Development Engineer
Permax Recruitment Limited
London
Hybrid
Senior
£100,000
+4

Permax Recruitment is working in partnership with a London based firm who are on the lookout for a Software Engineer. For nearly a century, our client has been building a firm as accountants, auditors, tax specialists and close advisors to clients operating in emerging markets, disrupting the status quo. This has accelerated thanks to the blockchain. In 2017, a client asked to help with an ICO and they have been crypto pilled ever since, developing into what is currently the leading professional services firm on chain. In 2023, they opened a new leg of the business to carve out a team dedicated to all things Web3, which is now over 80 strong and servicing near 600 digital asset clients globally. They partner with some of the industry's most influential playerscryptocurrency exchanges, blockchain innovators, Web3 pioneers, and digital asset fundsoffering tailored audit, tax, and advisory services that keep pace with this fast-evolving landscape. Senior Software Engineer (Cloud Infrastructure & DevOps) Location: London (Three days in office, two days wfh) Salary: Approx £100,000 + Bonus While our team builds data pipelines and reporting tools that enable accountancy teams to work efficiently, this role focuses primarily on managing our AWS infrastructure, supporting the team with robust DevOps practices, and mentoring other developers. You'll be the technical expert who ensures our systems are scalable, secure, and well-architected as we transition to microservices and ephemeral infrastructure. Key Responsibilities Cloud Infrastructure & DevOps (Primary Focus) Own and manage our AWS infrastructure, acting as the team's cloud platform expert Be one of the leaders in the migration toward microservices and ephemeral architecture Lead in infrastructure as code Establish and maintain CI/CD pipelines for the team's data and application projects Lead the implementation of monitoring, logging, and alerting systems to ensure reliability in our solutions Manage cloud security, IAM policies, and compliance requirements Provide infrastructure support and guidance to team members working on data pipelines and applications Troubleshoot infrastructure and deployment issues Team Leadership & Mentorship Mentor other developers on DevOps practices, cloud architecture, and infrastructure concepts, jointly with other senior members Support and encourage team members in deploying and managing their data pipelines and applications Conduct code and infrastructure reviews Develop and share best practices for cloud-native development Foster a collaborative learning environment within the team Contribute to technical documentation Collaboration & Technical Enablement Enable the team to build and deploy data pipelines efficiently by providing templates and guidance on infrastructure Work with colleagues to understand their infrastructure needs and provide solutions Translate infrastructure requirements into scalable, maintainable solutions Communicate technical concepts clearly to both technical and non-technical stakeholders Collaborate with accountancy teams to ensure data platform reliability and performance *Technical* 5+ years of software engineering experience with a significant cloud infrastructure focus Understanding of networking, security, and cloud best practices Hands-on experience with AWS services Proficiency with infrastructure as code tools Experience designing and implementing CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar) Solid understanding of containerization and orchestration (Docker, Kubernetes, ECS) Experience with monitoring and observability tools (CloudWatch, Datadog, Prometheus, or similar) Proficiency in bash Experience supporting development teams with infrastructure and deployment needs Knowledge of microservices architecture and serverless patterns *Leadership* Experience working in teams outside the realm of Software Engineering Demonstrated experience mentoring or managing junior Engineers Strong communication skills with both technical and non-technical audiences Ability to provide clear technical guidance and support Pragmatic approach to balancing technical delivery with business needs Desirable Python experience for infrastructure automation and tooling Familiarity with data pipeline infrastructure (supporting ETL workloads, data warehousing) Experience with data governance and compliance requirements Cloud cost and resource utilisation optimisation Experience migrating from monolithic to microservice architectures What We Offer Opportunity to shape and own the technical infrastructure Small, collaborative team environment where your expertise will have a direct impact Opportunity to create and develop solutions that are new, be impactful Dress for your diary. Flexible working hours. A technology enabled firm. A Family environment, fantastic retention, hiring due to exceptional growth and internal promotions. A Fixed profit-sharing bonus scheme payable to all staff. Brand new central London office. Plenty of socialising opportunities. Free breakfast and fresh fruit provided daily.

Forward Deployed Engineer (B2)
DCV Technologies
London
Hybrid
Senior
Private salary
+8

Position: Forward Deployed Engineer (B2)
Location: London, UK (Hybrid)
Permanent position

Job Requirements

  • Experience building GenAI applications, including RAG, multi-agent systems, fine-tuning, etc., with tools such as LangChain, LangGraph etc.
  • Clear understanding of Model Context Protocols, A2A Protocols, Agent Developeer Kit and working experience with LLMs
  • Expertise in deploying production grade GenAI solutions, including evaluation and optimizations; Machine Learning deployments on AWS, Azure or GCP
  • Extensive hands-on data science experience, leveraging machine learning and data science tools (i.e., pandas, scikit-learn, PyTorch, etc.)
  • Experience with DevOps tools: Kubernetes, Docker, Terraform, CI/CD Pipelines, GitHub/GitLab, GitOps, GitHub Actions, Jira, Jenkins, CircleCI, Datadog, Slack.
  • Graduate degree in a quantitative discipline (Computer Science, Engineering, Statistics, Operations Research, etc.) or equivalent practical experience
  • Experience communicating and/or teaching technical concepts to non-technical and technical audiences alike
  • 5+ years of engineering and technical deployment experience in a customer-facing set up
  • Should scoped and delivered complex systems in rapid and ambiguous environments
  • Delivered production-grade code across frontend and backend using Python, JavaScript, or similar stacks
  • Understand how AI model behaviour affects product experience
  • Communicate clearly with engineers, product teams, and customer stakeholders
  • Flag risks early and seek attention as per the severity

Key responsibilities:

  • Help clients integrate and adopt the offerings; demonstrate the impact / outcomes the offerings commited such as KPI improvements and help client succeed
  • Embed within the client landscape, understand their domain and co-develop solutions with the core product engineering teams
  • Own technical delivery across multiple deployments from prototype to stable release
  • Build bespoke AI and transformative agentic AI solutions
  • Technical debugging and root cause analysis
  • Rapid prototyping
  • Implement and administer best practices
Technical Lead - TypeScript / Node.js
Adria Solutions Ltd
Multiple locations
Remote or hybrid
Senior
£60,000 - £80,000
+7

As a Technical Lead, you ll work directly with the founder to shape the technical vision and execution of a high-growth startup backed by a larger, established group. You ll lead a talented engineering squad, drive architectural decisions, and deliver scalable backend systems that support thousands of users-all while influencing the strategic direction of the product.

This is a hands-on leadership role where you ll combine deep technical expertise with mentorship, strategic thinking, and cross-functional collaboration in a fast-moving startup environment.

The Role Technical Strategy & Leadership

  • Work alongside the founder to define and evolve the technical roadmap
  • Lead architectural design and critical technology decisions
  • Champion engineering best practices, code quality, and technical excellence
  • Mentor, coach, and grow engineers at all levels
  • Foster a culture of ownership, collaboration, and innovation

Backend Architecture & Development

  • Architect, build, and scale backend services and RESTful APIs
  • Design, optimise, and maintain MongoDB database solutions
  • Improve system reliability, performance, and scalability
  • Resolve complex production issues with robust solutions
  • Implement microservices and modern backend patterns

Collaboration & Delivery

  • Partner with Product, Design, and the founder to deliver high-impact outcomes
  • Participate in hiring and technical interviews to grow the engineering team
  • Drive improvements in processes, CI/CD pipelines, and DevOps practices
  • Ensure effective planning, estimation, and execution across initiatives

Core Technical Expertise

You ll thrive in this role if you have strong experience with:

  • MongoDB (data modelling, indexing, aggregation pipelines, performance tuning)
  • Node.js and the wider JavaScript/TypeScript ecosystem
  • Express.js and/or Fastify
  • RESTful API design and distributed systems
  • AWS or similar cloud platforms
  • Microservices architecture
  • Testing frameworks (Vitest, Jest, Mocha, etc.)
  • CI/CD pipelines and DevOps best practices
  • GitHub workflows
  • Observability and monitoring tools (e.g., Datadog)
  • Docker and Kubernetes

Tech Stack

You ll be working with:

  • Node.js, JavaScript/TypeScript
  • Express.js, Fastify
  • MongoDB
  • AWS
  • Vue.js, Nuxt.js

Nice to Have AI Experience

Not essential, but highly desirable:

  • Integrating AI/ML models into production systems
  • Working with LLM APIs (OpenAI, Anthropic, etc.)
  • Prompt engineering
  • AI workflow tools (LangChain, Flowise)
  • Building internal AI-powered automation tools

Why This Role is Exciting

  • Work directly with the founder on a startup backed by a larger, established group
  • Influence the technical and product direction from the ground up
  • Hands-on leadership in a fast-moving, high-impact environment

Benefits

We genuinely invest in our people. You ll enjoy:

  • Remote-first working (offices in London and Manchester)
  • Flexible working hours
  • 25 days annual leave + 8 bank holidays + 2 Christmas shutdown days
  • Option to purchase an additional week of leave

Interested? Please Click Apply Now! Technical Lead - TypeScript / Node.js

Page 1 of 1
Frequently asked questions
Our job board features a wide range of Datadog-related roles in London, including Site Reliability Engineer, DevOps Engineer, Cloud Infrastructure Engineer, and Monitoring Specialist positions.
Most Datadog jobs in London require familiarity with the Datadog platform and monitoring tools, but some entry-level positions are available where you can grow your expertise on the job.
To increase your chances, highlight your experience with Datadog integrations, monitoring, and alerting. Certifications and hands-on projects involving cloud infrastructure and observability tools are also highly valued.
Job locations vary by employer. Many companies now offer hybrid or fully remote options, but some roles in London require on-site presence. Each job listing includes specific details about the work arrangement.
We update our job board daily to provide the latest Datadog job opportunities in London, ensuring you have access to the newest and most relevant openings.