Senior TechOps Engineer
Senior TechOps Engineer
Overview
We are seeking a Senior TechOps Engineer to join a high-performing Site Reliability and Technical Operations team. The role involves working closely with engineering partners to drive initiatives from design through implementation. You will support highly available multi-region Kubernetes (AWS EKS) environments that underpin mission-critical workloads.
This is a hands-on role where you will help shape cloud infrastructure strategy, ensure operational reliability, and promote DevOps and automation practices across the organization.
Key Responsibilities
Manage and maintain production Kubernetes workloads, preferably on AWS EKS.
Build and deploy Docker images and Helm charts; migrate applications from other orchestration solutions to Kubernetes.
Develop, maintain, and optimize CI/CD pipelines using Jenkins or similar tools.
Monitor and maintain cloud infrastructure using observability tools such as Datadog, Splunk, and CloudWatch.
Maintain Unix/Linux systems and implement shell/Python scripts for automation.
Deploy and manage AWS resources across multiple accounts and regions, including IAM and federated access.
Implement logging, monitoring, and alerting capabilities to ensure uptime and performance.
Work independently and collaboratively across teams to troubleshoot issues and drive operational improvements.
Promote automation, DevOps practices, and infrastructure-as-code (Terraform preferred) across projects.
Mentor and provide technical leadership to other engineers in cloud operations, CI/CD, and SRE practices.
Configure resilient infrastructure across multiple availability zones and regions.
Required Skills & Experience
5+ years of hands-on experience in AWS production environments.
Proficiency with Kubernetes, ideally on AWS EKS, and experience with Docker containerization.
Experience with CI/CD tools (Jenkins, Git) and declarative pipelines.
Strong knowledge of Unix/Linux operating systems and shell scripting.
Hands-on experience with monitoring tools: Datadog, Splunk, CloudWatch.
Experience creating and deploying Helm charts and libraries.
Understanding of cloud security principles, IAM, and federated access.
Ability to communicate effectively at all levels and collaborate across multiple teams.
Strong automation mindset and problem-solving skills.
Programming experience (Python preferred).
Experience with infrastructure-as-code (Terraform preferred).
Preferred / Plus Skills
Experience with Apache or Confluent Kafka.
Exposure to agile development processes and Kanban methodology.
Knowledge of CDN providers (e.g., Akamai).
Previous experience leading cloud operations or SRE teams.
Experience with multi-region, highly available infrastructure.
Key Attributes
Hands-on, self-motivated, and able to work independently.
Passionate about automation and DevOps practices.
Strong analytical, organizational, and communication skills.
Comfortable in dynamic, fast-paced, and evolving technical environments.
Similar Jobs
Search Jobs
Match my CV
We take the hard work out of finding you a new job. Simply upload your CV (or call us) and we’ll get hunting for you!