Job Title


Site Reliability Engineer (Junior)


Company : CES


Location : Dehradun


Created : 2025-07-24


Job Type : Full Time


Job Description

We’re looking for a highly skilled Site Reliability Engineer to help us build, manage, and scale modern infrastructure systems for high-availability applications. If you’re passionate about automation, cloud platforms, and solving tough operational challenges, we would love to hear from you.

Key Skills and Competencies:

- 3+ years of extensive experience with Infrastructure as Code (IaC) and Desired State Configuration (DSC) tools like Terraform, CDK, and Chef
- Experience in packaging, deploying, and managing containerized workloads on Docker and Kubernetes
- Expertise in managing AWS infrastructure at scale – EC2, S3, ELB, Lambda, Route 53, ECS, SQS, CloudWatch
- Prior experience working in DevOps or SRE environments
- Strong automation/scripting skills using PowerShell, Ruby, Go, Python, and Bash
- Hands-on experience with monitoring and reporting tools – ELK Stack, Dynatrace, New Relic, Nagios
- Experience with IIS management, performance monitoring, and troubleshooting
- Background in web farm management for high-traffic SaaS applications
- Strong problem-solving and root-cause analysis skills
- Experience working with .NET application architectures – caching, content delivery, high availability, load balancing
- Familiarity with CI/CD pipelines and tools – TeamCity, Octopus Deploy, GitHub, Jenkins, Codefresh, etc.

Responsibilities:

- Drive initiatives to improve platform scalability and operational efficiency
- Lead standardization efforts across engineering and infrastructure teams
- Identify opportunities to improve and automate deployments, visibility, and management
- Apply cloud security best practices to ensure infrastructure safety
- Provide full-stack diagnostics and resolve complex infrastructure issues
- Track performance metrics and make data-backed improvement decisions
- Proactively suggest infrastructure or process changes for system reliability
- Ensure disaster recovery readiness and implement high availability systems
- Build support workflows and assist with incident response
- Own and improve the customer experience through system reliability and uptime

Personal Attributes:

- Passionate about learning and applying new technologies
- A strong collaborator who believes in team success
- Excellent communicator – verbal, written, and virtual
- High integrity and commitment to ethical standards
- Self-motivated, driven, and detail-oriented
- Able to work independently on short-term projects