Position OverviewWe are looking for a Datadog Observability Engineer with strong experience implementing Datadog at theapplication layer . The role focuses on instrumenting business applications, enabling distributed tracing, improving application performance visibility, correlating logs/metrics, and providing real-time insights into user journeys, errors, latency, and reliability. The ideal candidate will collaborate closely withbackend/frontend developers, QA, product, and DevOps , helping teams build high-quality observability into their services and customer experience.Primary ResponsibilitiesImplement and manageDatadog APM, Real User Monitoring (RUM), Distributed Tracing, Service Monitoring, and Application Logsacross multiple applications. Instrument application code withDatadog libraries, OpenTelemetry, or native integrationsto capture business KPIs and performance metrics. Configuresynthetic tests, error tracking, and frontend performance dashboardsto monitor user experience and critical paths. Create meaningful dashboards for: latency and throughput endpoint/API performance error rates and exceptions RUM user behavior and UX performance SLA/SLO trends at the application level Lead the creation ofalerting strategies based on real application behavior , including anomaly detection, latency spikes, and error bursts. Correlate logs, metrics, and trace data to performroot-cause analysis of application failures and performance degradation . Work with development teams to: define observability requirements early in development integrate monitoring into CI/CD and test environments improve tagging, business context, and trace spans Conduct application performance reviews and identify opportunities for: response-time improvement database or API bottlenecks code-level optimizations Train developers and QA onhow to use Datadog tools for debugging, troubleshooting, and performance testing . Recommend improvements to observability maturity and documentation.Required SkillsHands-on experience with: Datadog APM Datadog Logs RUM (Real User Monitoring) Service Maps Distributed Tracing Synthetic Monitoring Strong application debugging and performance analysis experience, using trace/span data. Proficiency instrumenting apps in at least one modern programming language: Node.js, Java, Python, Go, Ruby, .NET, etc. Solid understanding of: HTTP APIs microservices queues/event-driven flows frontend performance basics Comfortable working with developers and QA to embed observability.Preferred SkillsFamiliarity withOpenTelemetryand custom instrumentation practices. Experience withdatabases, caching, async messaging , and how to measure them via tracing. Ability to derivebusiness KPIs from monitoring data(conversion impacts, latency cost, UX issues). Exposure toCI/CD integrationand automated observability testing.
Job Title
Datadog Application Observability Engineer