Job Title
Generative AI Cloud Operations Engineer - Evinova

Location
Toronto, ON

At AstraZeneca, we pride ourselves on crafting a collaborative culture that champions knowledge-sharing, ambitious thinking, and innovation, ultimately providing employees with the opportunity to work across teams, functions, and even the globe.

We are part of a new Healthtech business, Evinova, a fully-owned subsidiary of AstraZeneca Group. Evinova delivers market-leading digital health solutions that are science-based and evidence-led. We are building a new Healthtech business that combines deep scientific expertise with digital and artificial intelligence to serve a wider healthcare community.

Introduction to Role
The Machine Learning and Artificial Intelligence Operations team (ML/AI Ops) is a newly formed platform team that will spearhead the design, creation, and operational excellence of our LLM-based agent deployments, multi-agent orchestration, and conversational AI systems pipelines. This team is responsible for the design, implementation, deployment, health, and performance of all LLM-based applications, managing ML/AI and cloud resources, and automating operations through infrastructure-as-code and CI/CD pipelines.

As a Generative AI Cloud Operations Engineer for clinical trial design, planning, and operational optimization, you will lead the development and management of AI operations systems for our trial management and optimization SaaS product. You will collaborate closely with AI Engineers to transition projects from research into production-grade AI capabilities and optimize model deployment, governance, and infrastructure performance.

Accountabilities

Operational Excellence
Drive the creation of proactive capability and process enhancements that ensure enduring value creation and analytic compounding interest.
Design and implement resilient cloud generative AI agent operational capabilities to maximize system "-abilities" (Learnability, Flexibility, Extensibility, Interoperability, Scalability).
Drive precision and systemic cost efficiency, optimized system performance, and risk mitigation with a data-driven strategy, comprehensive analytics, and predictive capabilities at the tree-and-forest level of our generative AI-based systems, workloads, and processes.

ML/AI Cloud Operations and Engineering
Develop and manage GenAI Ops systems for clinical trial design, planning, and operational optimization.
Integrate LLM proxies/routers, including LiteLLM Proxy/Router or other solutions.
Ensure proper RAG pipeline optimization and scaling.
Integrate token usage, latency, response quality, and hallucination detection tools at a platform level.
Partner closely with AI Engineers and data scientists to shepherd projects from embryonic research stages into production-grade agentic generative AI capabilities.
Leverage and teach modern tools, libraries, frameworks, and best practices to design, validate, deploy, and monitor generative AI agents in production (including LangChain, LangGraph, Google ADK, Langfuse, DSPy, Arize Phoenix, Pinecone, Weaviate, Splunk, Grafana, Prometheus, X-Ray).
Enhance system scalability, reliability, and performance through effective infrastructure and process management.
Ensure that any prediction we make is backed by deep exploratory data analysis and evidence, and is interpretable, explainable, safe, and actionable.
Leverage Vertex AI, Azure Foundry, OpenAI, Anthropic, and other foundation model platforms to provide reliable and stable access to LLMs.

Personal Attributes
Customer-obsessed and passionate about building products that solve real-world problems.
Highly organized and detail-oriented, with the ability to manage multiple initiatives and deadlines.
Collaborative and inclusive, fostering a positive team culture where creativity and innovation thrive.
Know when to ask for help and when to help others proactively.

Essential Skills/Experience
HS Diploma or GED.
Minimum of 2 years deploying and maintaining generative AI agents or GenAI-based workflows/applications in production.
Deep understanding of challenges in deploying generative AI applications and agents.
Closely follows frontier developments in generative AI and GenAI tooling, techniques, and technologies.
Deep understanding of the data science lifecycle (DSLC) and ability to shepherd data science projects from inception to production within the platform architecture.
Expert in evals tools for LLMs such as Arize Phoenix, Langfuse, Braintrust, Freeplay, or similar.
Expert in CDK for Python and/or TypeScript.
Strong software engineering abilities in Python/TypeScript.
Expert in AWS services and containerization technologies like Docker and Kubernetes.
Experience deploying GenAI agents using frameworks such as LangChain, LangGraph, LlamaIndex, Google ADK, or Strands Agents.
Ability to collaborate effectively with engineering, design, product, and science teams.
Strong written and verbal communication skills for reporting and documentation.
Proven track record of deploying algorithms and machine learning models into production environments.
Demonstrated ability to work closely with cross-functional teams, particularly data scientists.

Great People want to work with us! Find out why:
GTAA Top Employer Award for 10 years.
Top 100 Employers Award.
Canada's Most Admired Corporate Culture.
Learn more about working with us in Canada.
View our YouTube channel.

Why Evinova?
Evinova is a global healthtech business, part of the AstraZeneca group. Our goal is to accelerate the delivery of life-changing medicines, improve the design and delivery of clinical trials for better patient experiences and outcomes, and think more holistically about patient care before, during, and after treatment.
By bringing our solutions to the wider life sciences community, we can build more unified approaches, simplify workloads, and benefit patients broadly. Join us on our journey to build a new kind of healthtech business that resets expectations of what a biopharmaceutical company can be.

Compensation & Benefits
Annual base salary ranges from $114,622.40 to $150,441.90. The base pay offered will vary depending on multiple individualized factors, including your skills and experience.
Permanent positions offer an annual variable pay bonus, equity-based long-term incentive program (if applicable), a competitive flex-benefits & retirement savings program, 4 weeks paid vacation, and annual personal days.
Fixed-term contract/temporary positions offer a contract benefits program.

We are using AI as part of the recruitment process. This advertisement relates to a current vacancy.