Grafana Labs is a remotefirst, opensource powerhouse. With more than 20M users worldwide, Grafana powers dashboards used by NASA, Microsoft, eBay, JPMorgan Chase and many more. Grafana Labs also helps over 3000 companies manage their observability strategies with the Grafana LGTM Stack, which can be run fully managed with Grafana Cloud or selfmanaged with the Grafana Enterprise Stack, featuring scalable metrics (GrafanaMimir), logs (GrafanaLoki) and traces (GrafanaTempo). Were scaling fast while staying true to our opensource legacy, global collaborative culture, and passion for meaningful work. Our team thrives in an innovationdriven environment where transparency, autonomy and trust fuel everything we do. Wed love you to raise your hand for what could be a truly careerdefining opportunity, even if you dont meet every requirement. This is a remote opportunity and we would be interested in applicants based in Canada, EST timezones at this time. Staff Software Engineer Grafana Cloud Observability, Kubernetes Monitoring The Opportunity: Grafana Cloud is our composable observability platform that integrates metrics, logs and traces with Grafana. It allows customers to leverage the best opensource observability software including Prometheus, Mimir, Loki and Tempo without the overhead of installing, maintaining and scaling their own stack. The Observability department focuses on enabling developers to understand the health and performance of their applications and infrastructure in any environment. We build and maintain the backend for opinionated applications such as Cloud Provider Observability, Database Observability and Kubernetes Monitoring, including dashboards, alerts, documentation and infrastructure, while working closely with other teams to ensure seamless experiences. We also strive to incorporate OSS contributions into our work by contributing to projects such as Alloy, Prometheus, OpenTelemetry and Beyla. The Observability department provides a core building block for customers using Grafana Cloud. What Youll Be Doing: In this role, you will bring your passion for observability and software engineering expertise to help us take our infrastructure monitoring capabilities within Grafana Cloud to the next level. Responsibilities include: Designing and implementing highquality, scalable integrations for various infrastructure components, applications and data ingestion pipelines. Creating middleware components and libraries that simplify development and maintenance of observability solutions. Representing Grafana Labs in opensource forums, working groups and events when necessary. Working with product teams, designers and documentation to develop features that align with wider product strategy and customer needs. Leading the technical direction and vision of the team, contributing to strategic discussions and future development of observability solutions. Collaborating with Sales, Product and Support teams to deliver a holistic product experience. Taking ownership of the services you run by deploying welltested, clean code. Embracing our opensource culture and contributing to projects that may not directly fall within your teams scope. As an entirely remote organization, we provide guidance and meet regularly using video calls, so an independent attitude, good communication skills and transparency are a must. We invest heavily in developer productivity. You can use modern AI coding assistants as part of your daily workflow, backed by a companyfunded usage budget so you can iterate quickly without unnecessary friction. We encourage pragmatic AIassisted development: faster prototyping, test generation, refactors, documentation and incident followupsalways paired with strong code review and quality standards. Youll also have access to frontier models. What Makes You a Great Fit: Passion for observability and eagerness to share knowledge through documentation and blog posts. Love to engage with customers and help them out. Excellent communication skills. Relevant opensource experience, ideally in the observability domain. Willingness to become an active member of the OpenTelemetry and Prometheus communities. Curiosity and a desire to learn new programming languages and frameworks, set up examples and figure out how things work. Good understanding of typical production environments; ideally you have been responsible for operating production services and organizing oncall. Active mentorship of other team members, identifying areas for focus and improvement. Requirements: Strong 8+ years of experience with at least one major programming language (Python, .NET, Java, Go, Rust, etc.). Demonstrated experience operating highscale production systems on Kubernetes, including oncall participation, incident response and postmortem practices. Familiarity with observability tooling (e.g., Grafana). Strong understanding of timeseries data, metrics cardinality challenges, and cost/performance tradeoffs in observability systems. Handson technical leadership experiencesetting technical direction, leading project teams, influencing architectural decisions beyond your immediate team. Deep understanding of distributed systems concepts: scalability, consistency, high availability and failure modes. Experience writing clean, maintainable, robust and performant software. Experience delivering projects from start to finish in a selfdriven manner. Excellent problemsolving and debugging skills. Strong mentoring and leadership skills. Bonus Points For: Operating or scaling Prometheus in highcardinality, multitenant environments. Working with OpenTelemetry Collector pipelines or similar telemetry ingestion systems. Certified Kubernetes Administrator (CKA)/Certified Kubernetes Application Developer (CKAD) or other CNCF certifications. Developing Kubernetes operators, controllers or custom resources. Strong understanding of metrics collection, visualization and alerting concepts. Contributing to or maintaining opensource projects with evidence of successful pull requests and community collaboration. Designing and building observability backends for various systems and applications. Compensation & Rewards: In Canada, the compensation range for this role is CAD186,368223,642. Actual compensation may vary based on level, experience and skillset, as assessed throughout the interview process. All roles include Restricted Stock Units (RSUs), giving every team member ownership in Grafana Labs success. Compensation ranges are country specific. If you are applying from a different location than Canada, your recruiter will discuss your specific markets defined pay range and benefits at the beginning of the process. Why Youll Thrive at Grafana Labs: 100% Remote, Global Culture Bring talent from around the world into a collaborative ecosystem. Scaling Organization Tackle meaningful work in a highgrowth, everevolving environment. Transparent Communication Expect open decisionmaking and regular companywide updates. InnovationDriven Autonomy and support to ship great work and try new things. Open Source Roots Built on communitydriven values that shape how we work. Empowered Teams High trust, low ego culture that values outcomes over optics. Career Growth Pathways Defined opportunities to grow and develop your career. Approachable Leadership Transparent execs who are involved, visible and human. Passionate People Join a team of smart, supportive folks who care deeply about what they do. InPerson Onboarding Learn all about what we do and how we do it from day1. Balance is Key Global annual leave policy of 30days per annum, with 3 days reserved for Grafana Shutdown Days to allow the team to disconnect. We will comply with local legislation where applicable. Equal Opportunity Employer: We will recruit, train, compensate and promote regardless of race, religion, color, national origin, gender, disability, age, veteran status and all other characteristics that make us different and unique. We believe that equality and diversity build a strong organization and were working hard to make sure thats the foundation of our organization as we grow. Grafana Labs may utilize AI tools in its recruitment process to assist in matching information provided in CVs to job postings. The recruitment team will continue to review inbound CVs manually to identify alignment with current openings. For information about how your personal data is used once youve applied to a job, check out our privacy policy. #J-18808-Ljbffr
Job Title
Staff Software Engineer - Grafana Cloud Observability, Kubernetes Monitoring | C