About Signzy Signzy is an AI-powered RPA platform for financial services. No matter how complex your workflow or operational complexity, Signzy can completely automate your back-operations decision-making process into a real-time API. This is possible due to a combination of Nebula - Our no-code AI model builder and our Fintech API Marketplace of over 200+ APIs. Today we work with over 90+ FIs globally including the 4 largest banks in India and a Top 3 acquiring Bank in the US. Globally we have a strong partnership with MasterCard and offices in New York and Dubai to serve our customers in the 2 geographies. Our Product team of 120+ people is building a global AI product out of Bangalore.Job Location: Bangalore and DubaiWorking at Signzy At Signzy we breathe software and exploit the latest technologies to create the most amazing products. We comprise a tech-savvy team and are backed by investors who are enthusiastic about creating solutions using technology.This is an invitation to be a part of the future!Responsibilities Design, deploy, and operate reliable and scalable systems across cloud and Kubernetes environments.Automate infrastructure provisioning, deployments, and operational workflows.Build and maintain tools for deployment, monitoring, and system operations.Monitor system health and performance, and proactively identify areas for improvement.Troubleshoot and resolve issues across development, test, and production environments.Participate in incident response, root cause analysis, and reliability improvements.Collaborate with engineering teams to improve system operability and deployment safety.Support and operate large-scale systems, including data-intensive or AI-driven workloads.Requirements 2 - 6 years of experience managing and operatingproduction infrastructure and servicesin cloud environments such as AWS, Azure, or GCP.Strong hands-on experience withLinux systemsin production environments.Experience working withcontainerized workloads and Kubernetesin real-world scenarios.Working knowledge ofInfrastructure as Codetools such asTerraform, Terragrunt, or Crossplane .Experience designing and maintainingCI/CD pipelinesusing tools such asGitHub Actions, GitLab CI, Jenkins, Azure DevOps, or similar .Familiarity withGitOps principles and toolssuch asArgo CD or Flux .Solid understanding ofcloud networking concepts , load balancing, and service connectivity.Experience withmonitoring, logging, and alerting systemssuch asPrometheus, Grafana, ELK/EFK, Datadog, or equivalent .Proficiency in at least onescripting or programming language(e.g., Bash, Python).Experience working withrelational databases ; exposure to NoSQL or data platforms is a plus.Experience participating inon-call rotations , responding to production incidents, and performing root cause analysis.Understanding ofSLIs, SLOs, and error budgets , and how they are used to guide reliability and operational decisions.Strong problem-solving skills and the ability to debug complex production issues.Good verbal and written communication skills, especially during incidents and technical discussions.Nice to Have Experience operating systems at scale or in high-availability environments.Exposure to on-prem or hybrid infrastructure.Experience supporting data platforms, analytics, or AI/ML workloads.What We Value A strong sense ofownershipand responsibility for production systems.A focus onautomation, reliability, and operational simplicity .The ability to balance speed, stability, and long-term maintainability.Curiosity and willingness to continuously improve systems and processes.
Job Title
Site Reliability Engineer