Experience: 4+ yearsLocation: Bengaluru/On SiteEmployment Type: Full-timeAbout the RoleWe are seeking an experienced Site Reliability Engineer (SRE) with a strong background in DevOps technologies and cloud infrastructure. The ideal candidate will have hands-on experience with Kubernetes, Helm charts, and AWS, along with a solid understanding of CI/CD pipelines, automation, and scripting to ensure reliability, scalability, and performance of our systems.Responsibilities- Design, deploy, and manage scalable infrastructure using Kubernetes and Helm. - Implement and maintain CI/CD pipelines using tools like ArgoCD, GitHub Actions, or similar. - Manage and optimize AWS environments; experience with Azure and GCP is a strong plus. - Proactively monitor system performance and reliability, ensuring high availability and quick issue resolution. - Automate infrastructure and operational tasks through robust scripting. - Work closely with development and operations teams to improve system stability and deployment workflows. - Contribute to the reliability and performance of application builds; knowledge of Make build system is beneficial. - Develop tools and workflows using Python and other scripting languages.Required Skills- 4+ years of professional experience in SRE or DevOps roles. - Hands-on experience with Kubernetes, Helm, and AWS. - Strong programming/scripting skills (preferably Python and Shell). - Sound understanding of CI/CD tools and practices. - Practical experience with infrastructure as code (IaC) and automation tools. - Deep understanding of system monitoring, observability, and incident management practices.Skills in the below areas would be a big plus:- Knowledge of ArgoCD, GitHub Actions, and Make build systems. - Experience with container registry management and image pipelines.
Job Title
System Reliability Engineer