Skip to Main Content

Job Title


Site Reliability Engineer


Company : Datum Technologies Group


Location : Chennai, Tamil Nadu


Created : 2025-12-17


Job Type : Full Time


Job Description

Job Title: Site Reliability Engineer (SRE) – AWSExperience: 8+ yearsLocation: Chennai / MumbaiWork Mode: HybridKey Skills: AWS, Terraform, Kubernetes, Docker, Grafana, Prometheus, DatadogJob Summary:We are looking for a skilled Site Reliability Engineer (SRE) with strong AWS experience and a solid background in DevOps, automation, observability, and large-scale distributed systems.Responsibilities:• Manage and optimize cloud infrastructure using AWS IaaS.• Implement SRE practices to enhance reliability, performance, and SDLC efficiency.• Build and maintain CI/CD pipelines (Jenkins, GitLab, Terraform).• Work with containers and orchestration (Docker, ECS, Kubernetes).• Troubleshoot performance, networking, and distributed system issues.• Drive DevOps and QA best practices across teams.• Implement observability: SLI/SLO, Error Budgets, monitoring, logging, tracing, alerting.• Lead incident resolution and perform RCA.• Automate tasks using Python/Bash/PowerShell.• Collaborate effectively with cross-functional teams with minimal supervision.Qualifications:• Strong AWS cloud experience• Proven DevOps & SRE implementation skills• Good understanding of Linux, networking, and distributed systems• Hands-on experience with observability tools• Strong scripting and automation expertise• Excellent communication and teamwork skills