About Inspire Brands:Inspire Brands is disrupting the restaurant industry through digital transformation and operational efficiencies. The company’s technology hub, Inspire Brands Hyderabad Support Center, India, will lead technology innovation and product development for the organization and its portfolio of distinct brands. The Inspire Brands Hyderabad Support Center will focus on developing new capabilities in data science, data analytics, eCommerce, automation, cloud computing, and information security to accelerate the company’s business strategy. Inspire Brands Hyderabad Support Center will also host an innovation lab and collaborate with start-ups to develop solutions for productivity optimization, workforce management, loyalty management, payments systems, and more.Network Operations Center (NOC) Analyst has a responsibility for all technology aspects of Inspire Brands and will act as the primary technology expert for the NOC team This role detects and resolve the issues in production before it impacts to the large scale. NOC analyst ensures that Inspire Digital Platform (IDP) services have reliability, uptime appropriate to user’s needs and a fast rate of improvement. This role is responsible for ensuring an exceptional customer experience, establishing service level agreements that align with business model.ResponsibilityTechnical:Develop and Monitor various monitoring dashboards to detect problems related to application, infrastructure, and potential security incidents.Provide primary operational support for multiple large, distributed software applications.Employ deep troubleshooting skills to improve the availability, performance, and security to ensure services are designed with 24/7 availability and operational readiness and rigor.Perform in depth postmortem on production incidents, to assess effective business impact and for Engineering to learn from these.Create dashboards and alerts for Monitoring the platform, define key metrics and service level indicators and ensure relevant metric data is collected to create actionable alerts for the responsible Teams.Willing to participate in the 24/7 on call rotation.Automate toil, by building software and automation for seamless application deployment and third-party tool integration.Analyzing the major incidents and working towards a permanent fix in collaboration with other teams.Establish and publish regular KPI’s/metrics useful for measuring performance, SLA, stability, and customer satisfaction.Ability to Perform integrations between the components and monitoring tools.Leading the efforts of Hypercare events internally from NOC Team.Responsible for Incident logging, accurately tracks and documents all incidents.Provide recommendations and feedback in the process review sessions.KNOWLEDGE, SKILLS AND ABILITIES:Education:4-year degree in computer science, Information Technology, or related fieldExperience:Minium 5 years of experience as a production support role supporting large scale SAAS Production B2C or B2B Cloud Platforms.Hands-on problem-solving and troubleshootingKnowledge and skills (general and technical):Minium 5 years of experience as a production support role supporting large scale SAAS Production B2C or B2B Cloud Platforms.Good understanding on applications developed using Java, TypeScript, python.Hands on Azure Cloud experience particularly with AKS, API management, Service Bus, Azure Functions. Proficiency in monitoring, APM and profiling tools Splunk, Prometheus, Grafana.Working experience with containers, Kubernetes and Helm. Functional knowledge of Cloud Network, Firewalls, Ingress and Egress controllers, Service Mesh and Good understanding of ArgoCD, GitLab, CICD, Terraform, Infrastructure as Code.Strong communication skills and ability to explain technical concepts clearlyA willingness to dive into understanding, debugging, and improving any layer of the stackTechnical Skills:Cloud Provider: AzureCore Services: Elasticpool, SQL, Application Gateway, API Management (APIM), Key Vaults, AKS (Azure Kubernetes Service), VMSS (Virtual Machine Scale Sets), VMStorage accountsMessaging and Events: EventHub, EventGrid, Azure Service Bus (Namespaces, Queues, Topics)Containerization and Orchestration:Kubernetes (K8s): For container orchestrationDocker: For containerizationMonitoring and Observability (O11Y):New Relic / Splunk/ AWS Cloud WatchGood to have certifications (Not Mandatory):AZ-900 (Microsoft Certified: Azure Fundamentals)AZ-104: (Microsoft Certified: Azure Administrator Associate)
Job Title
Network Operations Center (NOC) Support Analyst [T500-17202]