• 6 years of strong SRE experience along with knowledge of the Core Azure Service, IoT/ Event Hub, Databricks• Must have 3 years of experience with Kubernetes and docker• Implement and manage monitoring (ELK), alerting, and logging systems to ensure proactive identification and resolution of issues• Engage and contribute towards System Monitoring, Incident management, performance tuning and fault finding• Must have Python, Powershell scripting experience or any other scripting language• Must have effective communication with excellent logic and problem-solving skills and a drive to make a difference• Good to have experience with AI/ML Ops, Release Management, CI/CD using tools such as GitHub, Blackduck Hub, Coverity, Container Signing with good understanding on Software configuration Management• Ability to understand and communicate customer issues• Experience in development and supporting enterprise applicationsGood written and verbal communication skills with the ability to document and communicate technical information to IT professionals
Job Title
Site Reliability Engineer