Job Title


Engineering Manager - Reliability/Platform Engineering


Company : RecRoots


Location : Bengaluru, Karnataka


Created : 2025-07-23


Job Type : Full Time


Job Description

Our mission is to create transformative, innovative, and personalized experiences for millions of customers all across the world. We want customers to have an amazing experience wherever and whenever they choose: mobile, web, and through partners and third parties.

About the team

The group operates, orchestrates, and optimises managed cloud infrastructure. The Cloud capabilities are provided on platform instances that are privately owned and centrally managed. These platform instances, and the workloads running on them, are hosted both in datacenters (“on-premises”) and on public cloud infrastructure (AWS).

The Cloud platform has three primary internal customer-facing verticals: virtualisation, containerisation, and serverless, corresponding to the three types of workloads it supports.

At the highest level, the Cloud drives three primary business outcomes:
● Agility in provisioning and using cloud infrastructure.
● Efficiency in cost and utilisation of cloud infrastructure, as well as toil reduction for developers and engineers.
● Trust in the safety, reliability, and performance of our cloud infrastructure.

Key Job Responsibilities and Duties :

As an Engineering Manager - Site Reliability, you will lead Site Reliability and Software Engineers based in Bengaluru, working closely with their counterparts to drive the adoption, operation, orchestration, securing, and optimisation of Cloud platforms.

You will contribute to and communicate our vision and mission in close collaboration with your Engineering Manager colleagues. Additionally, you will play a key role in planning and delivering capabilities that contribute to objectives and initiatives at the department level.

As a role model for the organisation’s values, you will demonstrate a customer-centric mindset in your operations and decision-making processes.

You can't achieve this on your own. Because Site Reliability Engineering treats the reliability of software systems as a software engineering problem, we expect you to hire and manage a team of software engineers who optimise systems, rather than a team of system operators.

You will be responsible for growing and coaching engineers, unlocking their creativity, and inspiring them to build the best solutions. You are comfortable with ambiguity, yet you excel at learning and driving clarity. You take end-to-end ownership of your area and embrace iteration, believing that failure, and failing fast, is a key part of building great tech. You will work to break down silos, collaborating closely with product leaders and engineering leaders across the organisation to ensure alignment with our vision.

Your responsibilities will include:

People Leadership
● Inspire and empower multiple multi-functional product teams
● Directly lead Engineers in multiple teams
● Nurture, grow and develop engineering talent in the team

Technology, Craft & Delivery
● Technical Incident Management
● Building software applications and ensuring an “everything as code” mindset
● Automation and toil reduction
● Monitoring and Alerting improvements
● Continuous Quality and Process Improvement

Architecture & Product Strategy
● Thought partner for Product to define, shape and deliver the roadmap
● Stakeholder engagement and management
● Architectural Guidance
● Drive innovation in your own team

What You'll Bring
● 10+ years of hands-on experience in software and site reliability engineering, with at least 2 years of strong people management and global stakeholder management experience within the technology sector
● Demonstrated ability to lead and manage a team of engineers in a fast-paced and complex environment
● Strong people leadership skills and experience dealing with sophisticated people issues
● Bachelor’s degree in Computer Science, Computer or Electrical Engineering, Mathematics, or a related field or, as an equivalent, 10 years of progressively responsible experience in the specialty
● Strong programming skills and knowledge of Kubernetes internals
● Strong business and technical vision
● A deep understanding of, and proven record of, shipping platforms at scale
● A deep understanding of reliability engineering and software development best practices, and a hands-on track record of developing and shipping software and platforms at scale
● Strong interpersonal skills
● Strong work ethic; self-directed and resourceful
● Solution-oriented and results-driven
● Proactive, flexible and capable of working independently as well as in a team
● Excellent verbal and written communication skills
● Analytical skills and a data-driven mentality