Role:Site Reliability Engineer (SRE) – Core IT Infrastructure Location: Ahmedabad, Gujarat, India Work mode:On-site (full Time) Experience:6+ year'sKey ResponsibilitiesInfrastructure Reliability & Operations • Design, implement, and maintainhighly available and fault-tolerant infrastructure • Ensure reliability, performance, scalability, and security ofcore IT systems • Monitor system health, capacity, and performance using proactive observability practices • Lead incident response, root cause analysis (RCA), and post-incident reviewsAutomation & SRE Development • Develop and maintainautomation tools, scripts, and frameworksto reduce manual operations • ApplyInfrastructure as Code (IaC)principles using tools such as Terraform, Ansible, or CloudFormation • Build self-healing systems and automate repetitive operational tasks • Improve deployment pipelines and operational workflows through engineering solutionsDevOps & Platform Engineering • Collaborate with DevOps, development, and security teams to support CI/CD pipelines • Enable seamless application deployments with minimal downtime • Support containerized and orchestration platforms (Docker, Kubernetes, OpenShift) • Implement best practices for configuration management and environment consistencyMonitoring, Observability & Performance • Design and maintain monitoring, logging, and alerting systems • Define and trackSLIs, SLOs, and SLAs • Optimize system performance, capacity planning, and cost efficiency • Enhance observability using tools such as Prometheus, Grafana, ELK, Datadog, or similarSecurity & Compliance • Implement infrastructure security best practices • Collaborate with security teams on vulnerability management and compliance requirements • Ensure secure access, identity management, and audit readiness⸻Required Skills & QualificationsTechnical Skills • Strong experience inLinux/Unix system administration • Proficiency inprogramming/scripting(Python, Go, Bash, Shell, or similar) • Experience withcloud platforms(AWS, Azure, or GCP) • Hands-on experience withcontainerization and orchestration • Knowledge of networking concepts (DNS, TCP/IP, load balancing, firewalls) • Experience with monitoring, logging, and alerting tools
Job Title
Site Reliability Engineer (SRE) – Core IT Infrastructure