Lead MLOps Engineer - London - Permanent
?? London, UK
(If you like the sound of this role and want to relocate - the Client is willing to help facilitate this move!)
This is a high-impact role within a fast-growing AI and robotics organisation focused on building advanced, scalable intelligent systems for real-world industrial applications. The position owns the machine learning infrastructure and MLOps foundations as products, platforms, and teams scale.
You will play a key role in transforming machine learning prototypes into reliable production systems, defining pragmatic engineering standards, and enabling fast, safe delivery of ML-powered capabilities. The role combines hands-on engineering, architectural ownership, and close collaboration with engineering and product teams.
Key Responsibilities
Required Experience & Skills
What’s Offered
If you are interested - please apply directly!
Randstad Technologies Ltd is a leading specialist recruitment business for the IT & Engineering industries. Please note that due to a high level of applications, we can only respond to applicants whose skills & qualifications are suitable for this position. No terminology in this advert is intended to discriminate against any of the protected characteristics that fall under the Equality Act 2010. For the purposes of the Conduct Regulations 2003, when advertising permanent vacancies we are acting as an Employment Agency, and when advertising temporary/contract vacancies we are acting as an Employment Business.
Excellent opportunity opens for an experienced Developer strong on Golang with experience in AWS and Kubernetes to join a highly regarded Financial Services entity’s London office. You will act as the team lead and play a key role in building mission-critical financial applications that power trading, investment, and risk management systems across the firm.
If you are passionate about working in a dynamic, fast-paced environment and are eager to apply your technical expertise to the financial services industry, this is the role for you.
Key Responsibilities:
Ideal Skills:
Interested? Please Apply!
Golang Go AWS Kubernetes Terraform Bank Banking Finance Financial Services Crypto Blockchain Web3 Trading Exchange Digital Assets Hybrid Flexible Developer Software Engineer Backend Developer Golang Engineer Kafka Apache Kafka RabbitMQ AWS Lambda Cloud Platform
Lynx are working with a leading consultancy who partner with fast-moving engineering teams who build and run their businesses in the cloud. They need pragmatic, code-literate security specialists.
The Role
They’re looking for a hands-on AWS Security Architect who lives and breathes AWS. You’ll dissect designs, model attack paths, and show engineering teams what good really looks like. Depending on the engagement, you might run a threat model, assess CI/CD pipelines, learn a vendor DSL for a PoC, or build internal tooling. They don’t expect you to know everything - just to be curious, practical, and willing to dive in.
What You’ll Do
What You Bring
Must-Haves
Nice-to-Haves
Lynx are looking for a Cloud Security Engineer who can design, automate, and enforce cloud controls at scale. If you enjoy building policy-as-code frameworks, enabling shift-left security, and strengthening cloud governance across complex environments, this role is for you.
The Role
You’ll own the design and implementation of organization-wide cloud controls across AWS and Azure. You’ll work closely with DevOps, Security, Risk, and Compliance teams to embed secure-by-default practices and ensure continuous adherence to security and regulatory requirements. This is a hands-on engineering role where you’ll build automation, develop policy frameworks, and help teams remediate issues efficiently.
Key Responsibilities
Experience Requirements
Role Overview
Role / Job Title: OpenShift SRE SME
Work Location: London / Sheffield
Mode of Working: Hybrid
Office Requirement (if hybrid): 10 days a month
The Role
We are seeking a skilled OpenShift Site Reliability Engineer (SRE) to join our team.
You need to have good hands-on experience on SRE with OpenShift virtualization and Kubernetes.
Your Responsibilities
In this role, you will be responsible for:
Ensuring the reliability, availability, and performance of our OpenShift-based virtual/container platforms and services with a focus on automation.
Work and collaborate across teams, such as Applications, Hardware, and Network. Develop secure service architecture using cloud-native technologies.
Develop systems, primarily in Shell scripting, YAML, Ruby, Python and Go language, to prevent outages through automatic scanning and remediation. Establish and enforce SRE best practices through platform constraints and high-fidelity system modeling. Participate in an on-call rotation.
Your Profile Essential Skills / Knowledge / Experience
Hands-on experience with OpenShift virtualization and Kubernetes administration.
Understanding of distributed systems and common distributed system failure domains. Experience managing a production service with RedHat, Windows and ESXi.
Strong knowledge of Linux systems and networking.
Experience with monitoring, logging, alerting & observability tools (e.g., Otel, Prometheus, Grafana, Slunk etc.).
Proficiency in scripting languages Python, Shell, Go Lang, Terraform etc.
Familiarity with CI/CD tools (e.g., Jenkins, GitLab CI).
Understanding containerization (Docker) and microservices architecture.
Ansible configuration management and deployment.
Desirable Skills / Knowledge / Experience / Personal Attributes
Proven experience in compute, OpenShift, Kubernetes, hypervisors, storage, Windows, networks and Linux.
Work with industry groups and vendors outside of HSBC to establish and maintain HSBC’s involvement and influence.
Accountability for the control and compliance of the engineering process.
Promote innovation and adoption of cutting-edge specialist technologies and practices within the domain.
Promote development of engineers through coaching and mentoring.
Consult as required in other areas to assist and provide a different perspective to programmes or projects that require it.
Good problem-solving and communication skills.
Open Shift Architect Job Description Role Overview
Role / Job Title: OpenShift Architect
Work Location: London / Sheffield
Mode of Working: Hybrid
Office Requirement (if hybrid): 10 days a month
The Role
We are seeking an experienced OpenShift Architecture and Migration Design Specialist to lead the design, planning, and execution of OpenShift architecture and migration strategies.
You need to have expertise in designing robust, scalable, and secure OpenShift environments, as well as creating and implementing migration plans for transitioning workloads and applications to OpenShift. Experience with VMware and Pure Storage is essential to ensure seamless integration with existing infrastructure.
Your Responsibilities 1. Architecture Design
Design the target architecture for OpenShift, including cluster topology, networking, and storage solutions.
Define and implement best practices for OpenShift cluster setup, including multi-zone and multi-region deployments.
Ensure the architecture supports high availability, fault tolerance, and disaster recovery.
Assess existing infrastructure, applications, and workloads to determine migration readiness.
Develop detailed migration plans, including strategies for containerization, workload transfer, and data migration.
Implement migration processes, ensuring minimal downtime and disruption to business operations.
Identify and mitigate risks associated with the migration process.
Design and implement OpenShift solutions that integrate seamlessly with VMware virtualized environments.
Leverage VMware tools (e.g., vSphere, vCenter, NSX) to optimize OpenShift deployments.
Configure and manage Pure Storage solutions (e.g., Flash Array, Flash Blade) to provide high-performance, scalable storage for OpenShift workloads.
Ensure compatibility and performance optimization between OpenShift, VMware, and Pure Storage.
Design and implement CI/CD pipelines tailored for the OpenShift environment.
Integrate DevOps workflows with OpenShift-native tools and third-party solutions.
Automate deployment, scaling, and monitoring processes to streamline application delivery.
Ensure the architecture and migration plans are scalable to meet future growth and workload demands.
Implement security best practices, including role-based access control (RBAC), network policies, and encryption.
Conduct regular security assessments and audits to maintain compliance with organizational standards.
Work closely with development, DevOps, and operations teams to align architecture and migration plans with business needs.
Provide detailed documentation of the architecture, migration strategies, workflows, and configurations.
Offer technical guidance and training to teams on OpenShift architecture, migration, and best practices.
Your Profile Essential Skills / Knowledge / Experience
Strong experience in designing and implementing OpenShift architectures and migration strategies.
In-depth knowledge of Kubernetes, containerization, and orchestration.
Expertise in VMware tools and technologies (e.g., vSphere, vCenter, NSX).
Hands-on experience with Pure Storage solutions (e.g., FlashArray, FlashBlade).
Expertise in networking concepts (e.g., ingress, load balancing, DNS) and storage solutions (e.g., persistent volumes, dynamic provisioning).
Hands-on experience with CI/CD tools (e.g., Jenkins, Github, ArgoCD) and DevOps workflows.
Strong understanding of high availability, scalability, and security principles in cloud-native environments.
Proven experience in workload and application migration to OpenShift or similar platforms.
Proficiency in scripting and automation (e.g., Bash, Python, Ansible, Terraform).
Excellent problem-solving and communication skills.
Desirable Skills / Knowledge / Experience / Personal Attributes
OpenShift certifications (e.g., Red Hat Certified Specialist in OpenShift Administration).
Experience with multi-cluster and hybrid cloud OpenShift deployments.
Familiarity with monitoring and logging tools (e.g., oTel, Grafana, Splunk stack).
Knowledge of OpenShift Operators and Helm charts.
Experience with large-scale migration projects.
Join our team here at StepStone and youll be responsible for providing technical leadership to all engineering and development areas across your domain, developing and executing an engineering strategy aligned to the portfolio and global tech strategies.
Working in the Design and Platform Performance domain, you willlead the technology side of web and mobile analytics for a platform with over 50 million visits each month, deliver the best user experiences and personalisation for talent and talent seekers. We built in-house large scale tracking platform, powered by Tealium, Kafka and Adobe Analytics, and successfully rolled it out to 10 of our brands. Along with the tracking we are owners of our AB testing tool Optimizely, our in-house Design System Genesys and Frontend Framework, which when put together powers the product development at scale using a data driven approach.
You will play a vital role as we reimagine the labour market to make it work for everybody.
Your responsibilities:
Qualifications
Additional Information
Were a community here that cares as much about your life outside work as how you feel when youre with us. Because your job shouldnt take over your life, it should enrich it. Here are some of the benefits we offer:
Our commitment
Equal opportunities are important to us. We believe that diversity and inclusion at The Stepstone Group are critical to our success as a global company, so we want to recruit, develop, and keep the best talent. We encourage applications from everyone, regardless of background, gender identity, sexual orientation, disability status, ethnicity, belief, age, family or parental status, and any other characteristic.
As a global business we further our DEI and sustainability progress by working with national and international bodies and are proud to have been recognised for our work - both locally and internationally, including:
GenAI Full Stack Engineer - Managing Consultant
Salary: £80,000 - £88,000 pa + £8,000 Bonus plus benefits, perks and healthcare options
Job Type: Permanent - Hybrid / 2 x days per week - Travel to client site
Base Locations: London, Manchester, Newcastle, Glasgow
Overview:
We’re looking for a GenAI Full Stack Engineer who is passionate about solving real-world challenges through technology. You’ll work closely with senior stakeholders both internally and within key clients to create GenAI strategies that translate business issues into relevant technical solutions and competitive propositions that are scalable, secure, and sustainable.
Your Role:
Your skills and experience:
To be successfully appointed to this role, it is a requirement to obtain Security Check (SC) clearance.
Based at client locations, working remotely, or based in our Godalming or Milton Keynes offices.
Salary up to £65k plus company benefits.
About Us
Triad Group Plc is an award-winning digital, data, and solutions consultancy with over 35 years’ experience primarily serving the UK public sector and central government. We deliver high-quality solutions that make a real difference to users, citizens and consumers.
At Triad, collaboration thrives, knowledge is shared, and every voice matters. Our close-knit, supportive culture ensures you’re valued from day one. Whether working with cutting-edge technology or shaping strategy for national-scale projects, you’ll be trusted, challenged, and empowered to grow.
We nurture learning through communities of practice and encourage creativity, autonomy, and innovation. If you’re passionate about solving meaningful problems with smart and passionate people, Triad could be the place for you.
Glassdoor score of 4.7
96% of our staff would recommend Triad to a friend
100% CEO approval
See for yourself some of the work that makes us all so proud:
Helping law enforcement with secure intelligence systems that keep the UK safe
Supporting the UK’s national meteorological service in leveraging supercomputers for next-level weather forecasting
Assisting a UK government department responsible for consumer product safety with systems to track unsafe products
Powering systems that help the government monitor and reduce greenhouse gas emissions from commercial transport
Role Summary
Triad is seeking a Senior Data Engineer to play a key role in delivering high-quality data solutions across a range of client assignments, primarily within the UK public sector. You will design, build, and optimise cloud-based data platforms, working closely with multidisciplinary teams to understand data requirements and deliver scalable, reliable, and secure data pipelines. This role offers the opportunity to shape data architecture, influence technical decisions, and contribute to meaningful, data-driven outcomes.
Key Responsibilities
Design, develop, and maintain scalable data pipelines to extract, transform, and load (ETL) data into cloud-based data platforms, primarily AWS.
Create and manage data models that support efficient storage, retrieval, and analysis of data.
Utilise AWS services such as S3, EC2, Glue, Aurora, Redshift, DynamoDB and Lambda to architect and maintain cloud data solutions.
Maintain modular Terraform based IaC for reliable provisioning of AWS infrastructure.
Develop, optimise and maintain robust data pipelines using Apache Airflow.
Implement data transformation processes using Python to clean, preprocess, and enrich data for analytical use.
Collaborate with data analysts, data scientists, developers, and other stakeholders to understand and integrate data requirements.
Monitor, optimise, and tune data pipelines to ensure performance, reliability, and scalability.
Identify data quality issues and implement data validation and cleansing processes.
Maintain clear and comprehensive documentation covering data pipelines, models, and best practices.
Work within a continuous integration environment with automated builds, deployments, and testing.
Skills and Experience
Qualifications & Certifications
Triad’s Commitment to You
As a growing and ambitious company, Triad prioritises your development and well-being:
What Our Colleagues Have to Say
Please see for yourself on Glassdoor and our “Day in the Life” videos at the bottom of our Careers Page.
Our Selection Process
After applying for the role, our in-house talent team will contact you to discuss Triad and the position. If shortlisted, you will be invited for:
We aim to complete interviews and progress candidates to offer stage within 2-3 weeks of the initial conversation.
Other Information
If this role is of interest to you or you would like further information, please contact Ryan Jordanand submit your application now.
Triad is an equal opportunities employer and welcomes applications from all suitably qualified people regardless of sex, race, disability, age, sexual orientation, gender reassignment, religion, or belief. We are proud that our recruitment process is inclusive and accessible to disabled people who meet the minimum criteria for any role. Triad is a signatory to the Tech Talent Charter and a Disability Confident Leader.
DevOps Engineer - Public Sector (SC Cleared) Rate: £500-£650 per day Inside IR35 Contract Length: 6 months - strong likelihood of extension Clearance: Active SC Clearance required Eligibility: British passport holder The Role We are looking for an SC Cleared DevOps Engineer to support a public sector digital programme. The role focuses on improving and maintaining Salesforce delivery pipelines, helping teams release changes reliably and consistently across multiple environments. You will be responsible for setting up and supporting CI/CD processes, automating deployments, and working closely with delivery teams within a secure environment. Key Responsibilities Build and maintain Salesforce CI/CD pipelines Support automated build, test, and deployment processes Assist with environment setup and configuration Improve release management and deployment reliability Work with development teams to support Salesforce delivery best practices Contribute to secure and compliant deployment processes Required Experience Salesforce platform experience Hands-on DevOps or release engineering experience CI/CD tooling experience (GitLab preferred) Cloud experience (AWS) Infrastructure as Code experience (Terraform) Experience supporting Salesforce application delivery Additional Information 6-month contract - though this is part of a multi year contract so there’s a very strong likelihood the contract will be extended SC Clearance must be active and transferable Public sector or regulated environment experience desirable
Network AutomationEngineer Location: London (Hybrid, 3 days per week on-site) Duration: 6 months Rate: Negotiable (DOE) Overview An industry-leading global organisation is driving the automation and standardisation of its network infrastructure and services. Theyre looking for a Network Automation Engineer with strong hands-on experience in Ansible, Python, and network orchestration to help design, build, and integrate automation solutions across a range of enterprise platforms. This role blends network engineering, DevOps, and systems integration ideal for someone comfortable writing automation playbooks and APIs while understanding routing, firewalls, and connectivity fundamentals. Key Responsibilities Design, build, and maintain automation solutions for network and infrastructure systems. Develop Ansible playbooks and reusable automation modules for configuration, orchestration, and provisioning. Configure and integrate Cisco NSO and related orchestration tools to streamline network operations. Build and maintain API-based integrations with systems such as ServiceNow, NetBox, and GitHub. Drive network configuration standardisation using YANG models for routers and switches. Implement and maintain CI/CD pipelines (GitHub Actions / Azure DevOps) for network code delivery. Troubleshoot and optimise automation workflows across hybrid (on-prem and cloud) environments. Collaborate with architects, service owners, and developers to translate requirements into scalable automated solutions. Maintain documentation for designs, processes, and operational procedures. Promote automation, standardisation, and best practices across global teams. Core Technologies Automation & Orchestration: Ansible, Cisco NSO, Terraform, GitOps (GitHub / GitLab / Azure DevOps) Networking & Modelling: YANG, RESTCONF, NETCONF, XML, JSON, Jinja2, NetBox Cloud & Platform: Azure, AWS, RedHat Enterprise Linux, VMware, OpenStack Scripting & Languages: Python, Bash, YAML (Go / PowerShell desirable) CI/CD & Tooling: GitHub Actions, Azure Pipelines, Jenkins, Terraform, ServiceNow Monitoring & Observability: Grafana, Prometheus (desirable) Experience & Requirements Proven experience in network engineering or network automation (WAN, LAN, routing, switching, DNS, DHCP). Hands-on experience developing automation using Ansible and Python . Familiarity with Cisco NSO and YANG modelling or similar orchestration tools. Understanding of network APIs (REST/NETCONF) and data formats ( XML/JSON ). Experience integrating automation into CI/CD pipelines using Git-based workflows. Comfortable working in Linux-based environments (RedHat, CentOS, Ubuntu). Knowledge of cloud networking (Azure / AWS VPCs, Transit Gateways, VPNs). Excellent documentation, troubleshooting, and stakeholder communication skills. Certifications such as Cisco CCNP / DevNet Professional preferred; ITIL desirable. What Youll Work On Standardising and automating global network configurations and connectivity services. Integrating network provisioning with ServiceNow and NetBox. Building the automation layer for hybrid cloud networks (Azure / AWS / on-prem). Creating reusable playbooks, templates, and design artefacts for future deployments. Supporting the organisations transition toward infrastructure-as-code and self-service provisioning . Ideal Candidate Youre an experienced Network Automation Engineer , Network DevOps Engineer , or DevNet Specialist with hands-on expertise in both traditional networking and modern automation practices. Youre fluent in Python , think in APIs , and take pride in transforming manual configurations into efficient, repeatable code. If you’re interested and keen to find out more, please apply now with your updated CV and reach out to Tom Johnson at Certain Advantage - Ref: 79585 TPBN1_UKTJ
Senior DevOps Engineer / Senior Site Reliability Engineer
Fully Remote working for candidates based in the UK Salary to £90k + Benefits
We are looking for a Senior DevOps Engineer that has strong C# code knowledge combined with strong knowledge of DevOps tools like Kubernetes (EKS or AKS) and Azure or AWS Cloud platforms. We are looking for a DevOps Engineer with a strong understanding of C# code combined with experience of monitoring tools like DataDog, Grafana and Prometheus to join a growing global Cloud Infrastructure team supporting SaaS products.
Our client are a Global Digital SaaS Software Company have a fantastic fully remote opportunity for an experienced Senior DevOps Engineer to join their UK Cloud Infrastructure team.
Site Reliability Engineers at this company are responsible for keeping the SaaS products running properly. Using concepts of software and systems engineering, they work to improve the reliability of all cloud systems while keeping levels of manual work low. DevOps are expected to be experienced in software engineering principals, operational discipline, and automation.
The Cloud and DevOps team work on a fully remote basis and work in conjunction with their US and Australian teams as well. This company are a market leader in Student community management software, this company s unique SaaS platform is an essential platform in the life of millions of University students across the globe.
In this role, you will apply your Software Engineering experience to enhance system performance and reliability, as well as building internal systems and capabilities that eliminate manual work through automation. You’ll be joining our Platforms teams with globally-dispersed Site Reliability and Platform Engineers in a “follow the sun” model to operate our products on a multi-region cloud platform.
Role Responsibilities:
Required Skills and Experience:
Useful / Bonus Skills to have:
Employee benefits:
This Senior Site Reliability Engineer role is working for a market leading global software company and this job is part of a large program of change and improvement in their Cloud SaaS products over the coming years. If you are looking for an interesting SRE role with a forward-thinking global organisation, then this would be a tremendous career opportunity to consider.
Please apply with your CV to find out more.
Epsom, Surrey, KT17
£70,000 - £80,000 plus a bonus, generous pension and lots more
We are working with a highly respected financial services organisation, seeking an experienced Cloud Infrastructure Manager to lead the design, delivery and ongoing management of their cloud and hybrid infrastructure estate.
This is a pivotal leadership role, combining hands-on technical expertise with people management and strategic planning. You will take ownership of the full cloud lifecycle across Azure, ensuring resilience, security, performance and cost-effectiveness, while developing and mentoring a high-performing infrastructure team.
The Cloud Infrastructure Manager Role:
You will be responsible for the planning, build, operation and lifecycle management of cloud infrastructure and related services, with a strong focus on Azure. Key responsibilities include:
Skills & Experience Required:
Essential:
Desirable:
Why Apply?
This is a rare opportunity to step into a highly influential role where you will shape cloud strategy, modernise infrastructure and build a best-in-class cloud operations function. You ll work with cutting-edge Microsoft technologies, lead a talented technical team and play a key part in the organisation s digital transformation journey.
If you are a technically strong Cloud Infrastructure Manager who enjoys balancing strategy, delivery and people leadership, this role offers genuine scope, challenge and progression.
Integral Recruitment are acting as an employment agency in regard to this advertisement.
Northwest - Hybrid
Up to £100,000
VIQU are seeking a Principal Data Engineer to join a leading social enterprise that reinvests profits to create thriving, sustainable communities. Following a full transition to a 100% cloud-based data platform, this role will play a key part in shaping and leading the organisation’s data engineering capability, with a strong focus on technical leadership, platform design and mentoring engineers within a Google Cloud environment.
Key Responsibilities of the Principal Data Engineer:
Key Requirements of the Principal Data Engineer:
Apply now to speak with VIQU IT in confidence. Or reach out to Katie Dark via the VIQU IT website.
Do you know someone great? We’ll thank you with up to £1,000 if your referral is successful (terms apply).
Principal Data Engineer (GCP)
Northwest - Hybrid
Up to £100,000
Data Platform Engineer – London
(AWS, Apache Spark, AWS Glue, Iceberg, S3, RDS, Redshift, Kafka/MSK, Python, Terraform, Ansible, CI/CD, Jenkins, GitLab, Snowflake, Databricks)
Working with an established FinTech client in London who is looking for a Data Platform Engineer to play a key role in defining, building, and evolving their enterprise Data Lakehouse platform during an exciting period of growth. You’ll work closely with Platform Engineering and Application Engineering teams, taking ownership of the infrastructure, patterns, standards,and tooling used to build and operate data products across the business.
The role focuses on ensuring the data platform is resilient,secure, reliable, and cost-effective within an AWS environment. You’ll be responsible for how the platform is operated, maintained, monitored, and extended, with a strong emphasis on observability, fault prevention, and early fault detection across AWS data services.
Automation is central to the way this team works. You’ll design and maintain Infrastructure as Code and Configuration as Code solutions, supported by CI/CD pipelines, to ensure consistent, repeatable deployments and strong governance. You’ll also enhance data lake integration testing, security measures, monitoring, SLAs, and operational metrics.
Working for a tech driven organisation in a collaborative environment, for an organisation that values engineering that values engineering best practises! This client Is offering this role on hybrid basis, looking to be in the office few times per month.
For more information, please get in touch
Ready to automate, innovate and make a real impact?
Do you want to build reliable, scalable systems that make a real difference to millions of people? Do you want to work at a certified B Corp with an inclusive and learning culture as part of a diverse team of great people?
If so, Opencast could be the place for you. We’re a growing tech consultancy that creates user-centred solutions with purpose for our clients in government, healthcare and purpose-driven businesses. Working in DevOps, you’ll build and maintain the systems that let teams respond quickly to needs. You’llwork on high-impact projects, helping to build reliable, scalable systems. We align work to industry-recognised roles such as Build & Release Engineer (BRE), Platform Engineer (PLE) or Site Reliability Engineer (SRE) and, where needed, to more specialised areas such as security, cloud hosting, infrastructure or networking.
What’s life like as a DevOps consultant at Opencast?
At Opencast, we love to keep things simple, and we love automation.
It’s not the same as your standard Devops Consultant job. Depending on your client’s needs, you’ll take on a range of roles. These will include:
We care about building things right. We believe in good devops practices and keeping things simple. We want to support you as much as we can on interiorising our approach to devops engineering and you supporting others.
Salary:
Some of the benefits our offer includes:
Where you’ll work:
We include you:
Interview: