Company Description
ResourceDekho is a global provider of business solutions, IT services, and resource outsourcing, enabling organisations to achieve digital transformation. Our offerings include infrastructure and cloud management, monitoring and logging solutions, software development, and web development. We specialize in delivering tailored services that enhance operational efficiency, performance, and seamless integration. With a focus on innovation, ResourceDekho empowers businesses to realize their full potential in the digital landscape. Explore our solutions and join us in shaping the future of digital transformation.

Role Description
We are hiring skilled DevOps / Linux & Cloud Engineers with at least 2 years of hands-on production experience to support our nightly operations and critical infrastructure. The ideal candidate should have strong expertise in Linux systems, cloud platforms, Kubernetes-based environments, containerization, automation, deployments, monitoring, troubleshooting production workloads, and networking fundamentals.

Key Responsibilities
Manage, monitor, and support Linux-based production and staging environments.
Handle deployments, environment maintenance, patching, and system updates.
Operate and troubleshoot production Kubernetes (K8s) clusters, containerised workloads, and cloud infrastructure.
Deploy, manage, and troubleshoot Docker containers, including Docker networking concepts.
Monitor system health using monitoring and observability tools and respond to production incidents.
Configure, maintain, and optimise CI/CD pipelines for automated deployments.
Ensure high availability, reliability, and performance of services in production.
Work closely with development teams to resolve application, infrastructure, and deployment issues.
Maintain documentation for production configurations, deployments, and operational procedures.
Troubleshoot networking issues related to applications, servers, containers, and cloud environments in production.

Required Skills
2+ years of production handling experience in DevOps / Linux / Cloud environments.
Strong knowledge of Linux administration (Ubuntu / CentOS / RHEL).
Hands-on experience with cloud platforms (AWS / Azure / GCP).
Kubernetes (K8s) experience: deploying, monitoring, and troubleshooting workloads (EKS/AKS/GKE preferred).
Docker & container networking experience (bridge/overlay networking, ingress, load balancing concepts).
Hands-on experience with monitoring & logging tools (Prometheus, Grafana, CloudWatch, ELK, or similar).
Experience with MongoDB operations (deployment, basic administration, backups, monitoring) in production or containerized environments.
Hands-on experience with TCP/IP protocols and networking fundamentals in a DevOps production environment (DNS, routing, subnetting, load balancing, firewall rules, security groups, etc.).
Good understanding of networking fundamentals, monitoring, and log analysis.
Proficiency with Git, version control, and scripting (Shell / Python preferred).

Nice to Have
Experience with Infrastructure as Code tools (Terraform, Ansible, CloudFormation).
Experience with advanced Kubernetes concepts (Helm, scaling, upgrades, security).
Exposure to SRE practices, alerting, and incident response.

Note: Immediate joiners preferred.

Job Title
Site Reliability Engineer