Job Description
We're looking for a Lead Cloud Site Reliability Engineer (SRE) with valid SC Clearance and strong expertise in Azure, Kubernetes, Terraform, and GitHub to lead large-scale projects and mentor a growing team.
Key Responsibilities
- Lead SRE activities for large-scale cloud projects, providing technical guidance to engineers.
- Deliver solutions across VMs and Kubernetes, ensuring efficient deployment, scaling, and management.
- Implement CI/CD pipelines using GitHub Actions or similar tools.
- Design and manage Infrastructure as Code (IaC) using Terraform (preferred), Ansible, Jenkins, etc.
- Assess networking requirements and design secure solutions (load balancing, firewalls, routing).
- Troubleshoot and resolve complex cloud infrastructure and application issues.
- Mentor junior engineers and promote knowledge sharing within the team.
- Collaborate with stakeholders, vendors, and cross-functional teams (Cyber Security, Testing, Application).
- Support cloud migration initiatives using frameworks like CAF, AzureRM, and Google Cloud.
- Represent the team during project delivery and ensure adherence to change control processes.
- Participate in the 24/7 on-call support rota and provide occasional support for previous adoption work.
What We're Looking For
Strong DevOps background with an automation-first mindset
Expertise in Azure, Kubernetes, Terraform, GitHub
Experience in cloud migration and networking solutions
Ability to lead projects and communicate effectively
Familiarity with change control processes
Nice to Have
Cloud certifications (Azure, GCP, etc.)
Experience with Multi-Tenant solutions
Passion for continuous learning and innovation