Skip to Main Content

Job Title


Site Reliability Engineer


Company : Viridium.AI


Location : Kottayam, Kerala


Created : 2026-03-19


Job Type : Full Time


Job Description

Company DescriptionAt Viridium.AI, We are driven by the dual opportunity to build an amazing company and make a positive impact on the world. We are building a Material Intelligence platform. Our mission is to help manufacturers swiftly and profitably identify and phase out hazardous materials, such as forever chemicals, from their products. By leveraging cutting-edge AI technology, we aim to set the standard for applying the power of AI to solve problems otherwise impossible for humans. Our AI design principles are rooted in responsible AI, adhering to the laws of physics and nature, designed to unveil insights beyond human analytical capabilities, to automate routine tasks, offering ease of walking on an escalator and hence making meeting subsequent challenges easier and more cost effective.What this role isWe’re building cloud systems that have to work—reliably, at scale, without babysitting.We need a hands-on DevOps / SRE who can own infrastructure end-to-end on Azure, automate aggressively, and keep production stable.What you’ll doDesign and run Azure infrastructure (VNet, App Service, Storage, Key Vault, PostgreSQL, ACR)Build everything as code using TerraformSet up and maintain CI/CD pipelines (GitHub Actions / Azure DevOps)Deploy and manage apps using Docker (AKS is a plus)Implement zero-downtime deploymentsOwn monitoring & alerts (Azure Monitor, App Insights, Grafana)Troubleshoot production issues and fix root causes—not patch symptomsLock down systems with RBAC, Key Vault, and Azure ADHandle networking basics (DNS, subnets, private endpoints, firewalls)What you should haveStrong, real-world experience with Microsoft AzureSolid grip on Terraform and CI/CDExperience running production systems (not just deploying them)Good understanding of containers, networking, and system designComfort with Linux/Windows and web servers (Nginx/IIS)Experience5–15 years in DevOps / SRE / Cloud EngineeringYou’ve owned uptime before—and know what breaks in productionWhat mattersYou automate firstYou take ownershipYou build systems that don’t fail silentlyIf this sounds like you, you’ll fit right in.