Job Title


Senior Machine Learning Engineer (Reinforcement Learning)


Company: Datatonic


Location: Toronto, Ontario


Created: 2026-03-10


Job Type: Full Time


Job Description

Shape the Future of AI & Data with Us

At Datatonic, we are Google Cloud's premier partner in AI, driving transformation for world-class businesses. We push the boundaries of technology with expertise in machine learning, data engineering, and analytics on Google Cloud Platform. By partnering with us, clients future-proof their operations, unlock actionable insights, and stay ahead of the curve in a rapidly evolving world.

Your Mission

As a Reinforcement Learning focused Senior Machine Learning Engineer, you'll engineer beautiful code in Python and take pride in what you produce. You'll be an advocate of high-quality engineering and best practice in production software as well as rapid prototypes. Though it is a hands-on technical role, we are particularly interested in candidates who want to lead projects and play an active role in client discussions. Your responsibilities will involve building trusted relationships with prospects, finding creative ways to use machine learning to solve problems, scoping projects, and overseeing the delivery of engagements.

What You'll Do

- Develop solutions with reinforcement learning at a large scale.
- Design and deploy RL solutions, from data selection and model training through to productionisation.
- Translate requirements: interpret vague requirements and develop models to solve real-world problems.
- Data science: conduct ML experiments using programming languages with machine learning libraries.
- Optimisation: optimise ML/RL solutions for performance and scalability.
- Custom code: implement tailored ML/RL code to meet specific needs.
- RL architecture design: create reinforcement learning architectures using Google Cloud tools and services. (Bonus!)
- Data engineering: ensure efficient data flow between databases and backend systems. (Bonus!)
- MLOps: automate ML workflows, focusing on testing, reproducibility, and feature/metadata storage. (Bonus!)
- Engineering software for production: build and deploy production-grade software for machine learning and data-driven solutions.

What You'll Bring

- Multiple years of experience as a machine learning engineer specifically using reinforcement learning.
- Prior work on designing and implementing RL algorithms on real-world projects (using non-dummy data).
- Experience with data requirements for RL algorithms (quantity, type and schemas).
- A strong understanding of the training procedure and timelines for RL.
- Experience with selecting and adapting existing RL models for novel solutions (e.g., SAC, DQN, PPO).
- Familiarity with developing RL algorithms using open-source ML libraries (preferably Python-based, e.g., PyTorch or TensorFlow).
- Ideally, experience with distributed RL libraries (e.g., Ray RLlib).
- Experience with RL in conjunction with a computer vision application or using computer vision data.
- Proficiency in Python as a backend language, capable of delivering production-ready code in well-tested CI/CD pipelines.

Bonus Points If You Have

- Cloud expertise: familiarity with cloud platforms such as Google Cloud, AWS, or Azure.
- Software engineering: hands-on experience with foundational software engineering practices.
- ML integration: familiarity with exposing machine learning components through web services or wrappers (e.g., Flask in Python).
- Soft skills: strong communication and presentation skills to effectively convey technical concepts.
- Scale-up experience.
- Cloud certifications (Google Cloud Professional Machine Learning Engineer, AWS Solution Architect, etc.).

What's in It for You

- 20 days of paid vacation per calendar year.
- Public holidays for your province of residence.
- 5 wellness days (sickness, personal time, mental health).
- 5 lifestyle days (religious events, volunteer day, sick day).
- Matching group retirement savings plan after 3 months.
- Competitive group insurance plan on Day 1, individual premium paid 100%.
- Virtual medicine and family assistance program, 100% employer-paid.
- Home office budget (we are % remote!):
  - CAD$70/month for internet/phone expenses.
  - CAD$1,500 every 3 years for tech accessories and office equipment (monitor, keyboard, mouse, desk, etc.), starting on Day 1.
- Company-supplied MacBook Pro or Air.
- CAD$400/year for books, relevant app subscriptions, or an e-reader.
- Opportunities for paid certifications.
- Opportunities for professional and personal learning through Udemy Business.
- Regular company offsites and meetups.

Why Datatonic

Datatonic is a UK-based company with an Americas division located in Canada. The Canadian team operates remotely, with members distributed across North and South America. This role is open to candidates located anywhere in Canada.

Join us to work alongside AI enthusiasts and data experts who are shaping tomorrow. At Datatonic, innovation isn't just encouraged; it's embedded in everything we do. If you're ready to inspire change and deliver value at the forefront of data and AI, we'd love to hear from you!

Are you ready to make an impact? Apply now and take your career to the next level.