Skip to Main Content

Job Title


Senior Manager, Systems Operations


Company : Joyent


Location : Mountain View, CA


Created : 2024-05-04


Job Type : Full Time


Job Description

Joyent powers the global cloud infrastructure and developer platform providing back-end services for Samsung's billions of devices. Joyent's data center footprint is within 100ms latency to 70% of the world's population, while our multi-cloud, Kubernetes-based developer platform extends our reach to additional resource regions. We're operating at hyperscale to power workloads that bring capability and delight to Samsung's employees and customers.Is this your next job Read the full description below to find out, and do not hesitate to make an application.Job Summary:Joyent is actively looking for a hands-on and dynamic Sr Manager to join our diverse team. The System Operations Team is responsible for the hosts and services supporting Samsung Private Cloud's (SPC) customer-facing products. We are looking for a Sr. Manager (Mountain View CA) who shares and practices our values: open communication, transparency, taking ownership, and a high level of craftsmanship.As the Senior Manager, you will be leading the Systems Operations team architecting tools to create and maintain cloud infrastructure, automate the management of complex service-oriented applications, databases, and other tools, and develop frameworks to ensure the SPC's stability and scalability.We are looking for a self-starter who can help shape the systems operations team and bring it to the next level. You are someone who lives and breathes SLIs and SLOs for products and services. You enjoy solving deep technical problems as much as you enjoy mentoring your team to do the same and working cross-functionally throughout the organization to grow our collective skills.In this role, you will be a hands-on leader, leading and inspiring a diverse, globally remote team by driving change through providing technical and leadership guidance and removing blockers to achieve goals. Under your leadership, your team will partner with developers to continuously improve performance, reliability, and cost efficiencies, not to mention you will play a crucial role in shaping the engineering and company culture at Joyent.Job Responsibilities:Build Automation: Design, build, and support SPC's cloud infrastructure, leveraging automation and infrastructure-as-code;Develop and execute a strategic roadmap for cloud infrastructure, aligning with business objectives and growth initiatives.Assess cloud technologies, tools, and services to pinpoint and implement avenues for expansion, enhancement, and streamlining.Define standards, best practices, and policies for cloud infrastructure management, ensuring compliance with security and regulatory requirements.Partner with product developers to build services according to modern design patternsMonitoring and Incident Management:Successfully design and implement SLI and SLO for supported servicesImplement robust monitoring and alerting systems to proactively detect and respond to infrastructure issues and performance bottlenecks.Define and maintain incident response procedures and oversee the resolution of critical incidents, coordinating cross-functional teams to minimize downtime and impact on business operations.Security and Compliance:Collaborate with security teams to implement security best practices and controls in cloud infrastructure, ensuring compliance with industry standards and regulations.Proactively conduct regular security assessments and audits, addressing vulnerabilities and implementing remediation measures as necessary.Build tools to empower self-service for SPC development teams, bolster platform scalability and availability, and improve security posture in service to SPC's customersStakeholder Engagement:Partner with key stakeholders, including software development teams, product managers, and business leaders, to understand requirements and prioritize initiatives.Communicate effectively with senior management and executive leadership, providing updates on project status, risks, and opportunities.Develop and maintain strong relationships with engineers, managers, customers, and other colleagues based on trust, empathy, and technical expertiseLeadership and Team Management:Lead and mentor platform engineers, providing guidance, support, and professional development opportunities.Foster a culture of collaboration, customer focus, innovation, and ownership within the team, promoting a shared vision and alignment with organizational goals.Set clear objectives and performance expectations, conducting regular meetings and providing constructive feedback to team members.Skills & CompetenciesStrategic Planning: Capability to develop and communicate a strategic vision and roadmap for initiatives, aligning them with business goals and objectives. This involves proactively identifying opportunities for process improvements, automation, and innovation to enhance productivity and efficiency within the remote team.Remote Management: Competence in remote team management, including task assignment, resource allocation, maximizing productivity and performance, keeping team focused, performance evaluation, and conflict resolution. This involves leveraging remote collaboration tools and platforms to monitor progress, track metrics, and ensure accountability within the team.Technical Proficiency: Proficiency in DevOps methodology and principles, practices, and tools to effectively guide and support the team in implementing continuous integration, continuous delivery, and infrastructure as code practices. This includes staying updated with emerging technologies and industry trends relevant to DevOps.Problem-Solving Ability: Strong problem-solving skills to identify and address challenges encountered by remote teams, such as communication gaps, technical issues, or workflow bottlenecks. This includes a proactive approach to troubleshooting and a willingness to seek input from team members to find solutions collaboratively.Empathy and Emotional Intelligence: Understand and empathize with remote team members' perspectives, experiences, and challenges. This includes fostering a supportive and inclusive remote work culture, promoting work-life balance, and addressing individual concerns or well-being issues.Ownership: Take ownership of the projects within Systems Operations, ensuring excellence in execution and accountability for results. Foster a sense of responsibility and pride in delivering high-quality workInnovation: Drive innovation by proposing and implementing creative solutions to challenges. Stay abreast of industry trends and technologies, bringing fresh ideas to the tableCustomer focus: Understand and prioritize customer needs, striving to exceed expectations in every interaction. Collaborate with cross-functional teams to ensure the delivery of customer-centric solutionsTeamwork/Collaboration: Ability to collaborate effectively with team members across different time zones and locations. This includes participating in virtual meetings, sharing documents and code repositories, and providing timely feedback on colleagues' work. Drive change within the organization while maintaining positive morale.Education & ExperiencePrevious hands-on experience in building an DevOps/SRE team with a minimum of 8 years of related experience with a Bachelor's degree or equivalent experience.Minimum of 5 years of experience in a leadership roleProficient in designing DevOps solutions while managing highly available cloud infrastructure and services (to include multi cloud and Kubernetes),Deep understanding of monitoring, logging, and observability platforms, and a passion for SLI and SLO best practicesExperience managing a production infrastructure/Software including 24/7 on callFamiliarity with Amazon Web Services, Google Cloud Platform, Terraform, Helm, Vault, and AnsibleExperience in creating and working with containers and leveraging container orchestration tools such as Kubernetes or NomadExperience developing CI/CD workflowsExperience in managing remote teams and demonstrating ability to lead by influenceStrong development experience in Go, Python, Bash, and/or other programming languagesCloud Certification is a plusJoyent is committed to employing a diverse workforce and providing Equal Employment Opportunities for all individuals regardless of race, color, religion, gender, age, national origin, marital status, sexual orientation, gender identity, status as a protected veteran, genetic information, status as a qualified individual with a disability, or any other characteristic protected by law.Compensation and BenefitsCompensation for this position will vary among specific regions due to geographical differentials in the labor market, and actual pay will be determined considering factors such as relevant skills, experience, and comparison to other employees in the role. Therefore, the annual base compensation range for this role (depending on the geographical location) is expected to be between $173000 to $240000.Regular full-time employees (salaried or hourly) have access to benefits including Medical, Dental, Vision, Life Insurance, 401(k), Employee Purchase Program, Vacation and Sick leave, electronic reimbursement and many more. In addition, regular full-time employees (salaried or hourly) are eligible for bonus compensation based on individual, department, and company performance.