Role Overview

As a Senior Platform Engineer, you will help design, build, and evolve Compass Digital Labs' AI platform for LLM-powered applications and agentic systems. You will create secure, scalable, production-grade capabilities for orchestration, retrieval, tool integration, evaluation, observability, and governance. Working closely with platform, data, product, security, and engineering teams, you will enable AI copilots, operational automation, and intelligent customer experiences across our digital ecosystem.

Key Responsibilities

- Design and operate the core platform capabilities that power LLM applications, copilots, and agentic workflows across multiple environments.
- Architect single-agent and multi-agent execution patterns, including tool calling, workflow routing, state management, and human-in-the-loop checkpoints.
- Build and maintain a secure integration layer that connects models to internal APIs, data products, and enterprise systems using patterns such as Model Context Protocol (MCP), OpenAPI-defined tools, and event-driven services.
- Develop retrieval and knowledge capabilities that support grounded responses, including document ingestion, chunking, embeddings, vector search, metadata filtering, reranking, and source attribution.
- Establish evaluation frameworks and regression tests for response quality, task success, reliability, and safety; use offline and online evals to continuously improve production performance.
- Implement guardrails and governance controls for identity-aware access, PII handling, content safety, prompt and tool security, auditability, and compliance.
- Create end-to-end observability for prompts, tool invocations, agent traces, latency, failure analysis, and token or cost usage to support debugging and production operations.
- Automate platform provisioning and deployment using Terraform, containers, CI/CD, and cloud-native services.
- Optimize model selection, throughput, latency, resilience, and cost efficiency across AI workloads.
- Collaborate with data and ML teams to expose governed structured and unstructured data to AI applications in a safe, reusable way.
- Help define reusable standards, platform patterns, and engineering best practices for building reliable AI and agent-based systems at scale.

Qualifications

- Proven experience building or operating production AI/LLM platforms, developer platforms, or complex distributed systems.
- Strong hands-on experience with Python and API or service development; experience with TypeScript, Go, or Java is a plus.
- Experience designing agentic systems or advanced LLM applications that use tool calling, workflow orchestration, retrieval-augmented generation (RAG), and state management.
- Familiarity with modern agent frameworks and platforms such as OpenAI Agents SDK, Amazon Bedrock Agents, LangGraph, or similar tooling.
- Strong understanding of vector search, embeddings, knowledge base design, ranking/reranking, and grounded generation.
- Experience with AWS and modern platform infrastructure, including containers, serverless services, Kubernetes, networking, and IAM.
- Experience with Terraform or similar Infrastructure-as-Code tools and strong CI/CD automation practices.
- Understanding of evaluation, prompt testing, offline benchmarks, and release guardrails for AI systems.
- Hands-on experience with observability tooling for logs, metrics, tracing, and incident response.
- Strong grasp of security, privacy, and governance for AI systems, including secrets management, RBAC, data protection, and responsible AI controls.
- Ability to work cross-functionally with product, data, ML, and platform teams and translate emerging AI capabilities into reliable platform services.
- Bachelor's degree or equivalent in Computer Science, Engineering, or a related field.

Nice to Have

- Experience building internal developer platforms or self-service tooling for AI teams.
- Experience with real-time inference, streaming workflows, or event-driven architectures.
- Familiarity with data platform concepts such as dbt, Spark, Apache Iceberg, or data product design.
- Background in hospitality, retail, or large-scale enterprise environments.

Position Details

- Position Title: Senior Platform Engineer
- Salary: 130,000.00 - 160,000.00
- Perks: Bonus Eligibility, 4 Weeks Vacation, RRSP and more!
- Employment Type: Full Time, Permanent
- Hybrid: 3 Times a Week - Toronto/Mississauga
- Remote: Canada Only

In accordance with provincial legislation and our commitment to transparent hiring practices, the compensation range for this position is provided. Final compensation will be determined based on qualifications, experience, and internal equity. Canadian work experience is not required. Please note that artificial intelligence tools are utilized in the applicant screening process.

Job Title
Senior Platform Engineer, AI Platform