We are seeking a Container Operations Engineer to manage and optimize Kubernetes environments, ensuring the security and smooth operation of our containerized applications. The role focuses on cluster upgrades, security patches, vulnerability remediation, and resource optimization, while also emphasizing learning, collaboration, and best practices in Kubernetes and infrastructure management. You will be responsible for performing level 1 and level 2 support tasks and contributing to the ongoing improvement of our Kubernetes operations.
Responsibilities
- Cluster Management and automation:
- Monitor and maintain the health of Kubernetes clusters to ensure optimal performance and availability.
- Assist in routine maintenance tasks, including cluster upgrades, control plane upgrades, and security-focused updates for operational and client node groups.
- Participate in triaging and resolving issues related to Kubernetes infrastructure, networking, storage, and security.
- Perform routine AWS module upgrades, manage SSL certificate rotations, secrets management, and Terraform state refresh/updates.
- Learn and apply Kubernetes best practices and automation techniques to improve operational efficiency.
- Security and Vulnerability Management:
- Conduct image vulnerability remediation, ensuring secure deployments by rolling out newer Docker images after remediation (75+ images).
- Address Kyverno policy violations on customer workloads and remediate security concerns.
- Perform ArgoCD version upgrades to maintain secure GitOps workflows and Helm chart upgrades for operational security.
- Collaboration & Support:
- Work closely with development teams to understand and support containerized applications, ensuring seamless integration and deployment.
- Assist in implementing infrastructure as code (IaC) using tools like Terraform for managing Kubernetes clusters and automating operational tasks.
- Collaborate on the rollout of new Helm chart updates across multiple environments, ensuring smooth transitions.
- Perform routine AWS module upgrades, manage SSL certificate rotations, secrets management, and Terraform state refresh/updates.
- Documentation
- Contribute to internal documentation and knowledge-sharing initiatives to enhance team capabilities and collaboration.
Requirements
- Basic understanding of Kubernetes concepts and containerization technologies (Docker, containerd)
- Familiarity with atleast one of the container orchestration platforms like Kubernetes (EKS, AKS, GKE)
- Knowledge of scripting languages (e.g., Bash, Python)
- Exposure to infrastructure as code tools (e.g., Terraform, Ansible) is a plus
- Fundamental understanding of cloud platforms (AWS, Azure, GCP)
- Strong problem-solving and troubleshooting mindset
- Excellent communication and documentation skills
- Ability to work in a collaborative team environment
- Eagerness to learn and grow in the field of Kubernetes and cloud-native technologies
- Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)