Compensation: $50,000 USDRole OverviewThis position is a key member of the platform operations group, responsible for second‑line support, DevOps tooling, observability, automation, and system reliability. The role focuses on maintaining production and pre‑production environments, resolving complex technical issues, enhancing operational processes, and collaborating with engineering teams to drive stability and efficiency.The environment is fast‑paced and globally distributed, with team members across multiple regions. Individuals who thrive in dynamic, high‑ownership settings are encouraged to apply.Key ResponsibilitiesMonitor platform health, respond to system alerts, and provide first‑ and second‑line operational support.Conduct in‑depth troubleshooting for incidents affecting platform performance, stability, or integrations.Enhance automation for deployment, maintenance, and operational workflows.Manage application environments across DevOps and observability platforms.Follow established runbooks and escalation procedures to ensure service continuity.Work closely with development and L3 engineering teams to address platform‑level issues and drive improvements.Participate in on‑call rotations, lead recovery efforts during major incidents, and prepare detailed root‑cause analysis reports.Improve system resilience by contributing to post‑incident reviews and implementing corrective actions.Create and maintain documentation, dashboards, and operational playbooks to support training and readiness.Support regular maintenance activities, feature rollouts, and security‑related updates.Required Qualifications1. Technical SkillsPractical experience with containers and orchestration tools such as Docker and Kubernetes, including Helm, configuration management, and secret management.Experience designing or operating CI/CD pipelines (e.g., Jenkins, GitLab CI, Azure DevOps) and knowledge of artifact repositories.Proficiency with monitoring and observability tools such as Prometheus and Grafana.Hands‑on experience with centralized logging solutions (e.g., ELK Stack, Splunk) and basic tracing tools (e.g., AppDynamics, Dynatrace).Strong scripting abilities using Bash or Python for automation and operational tasks.Familiarity with Infrastructure as Code tools such as Terraform or Ansible.Understanding of cloud environments (AWS, Azure, GCP) and modern cloud‑native architectures.Experience with incident management systems and ITIL‑aligned processes.General knowledge of Linux system operations and fundamental networking concepts.2. Domain & MethodologyUnderstanding of business support systems (BSS), operational support systems (OSS), CRM workflows, and order‑processing capabilities within telecom or similar industries.Knowledge of SRE concepts such as SLIs, SLOs, SLAs, and error budgets.Experience working with hybrid or multi‑cloud deployments, especially Kubernetes‑based environments.Familiarity with cost optimization and capacity planning in cloud infrastructure.Understanding of IT governance and security frameworks (e.g., ITIL, ISO standards).3. Professional CompetenciesStrong analytical thinking and problem‑solving abilities, especially in time‑sensitive situations.Ability to communicate clearly with both technical and non‑technical stakeholders.Adaptability to new technologies, tools, and business processes.Customer‑centric approach to operational support and service quality.4. Experience & EducationBachelor’s degree in Computer Science, Information Technology, or equivalent technical discipline.Typically 4–7 years of experience in an L2 engineering, technical support, or operations role.Experience with ITSM or ticketing tools such as ServiceNow or Jira.Preferred QualificationsExperience driving proactive improvements and automation initiatives.Strong organizational and coordination skills.Familiarity with compliance, security standards, and operational best practices.Experience managing budgets or resource planning.Motivation for continuous learning and professional development.
Job Title
BSS DevOps & L2 Platform Operations Engineer (Telecom)