Job Title


Research Engineer


Company : Deloitte Canada


Location : Toronto, Ontario


Created : 2025-12-17


Job Type : Full Time


Job Description

Research Engineer

Location: Toronto, ON, CA, M5C 3G7
Job Type: Permanent
Work Model: Hybrid
Reference code: 130069
Primary Location: Toronto, ON
All Available Locations: Toronto, ON

Our Purpose

At Deloitte, our Purpose is to make an impact that matters. We exist to inspire and help our people, organizations, communities, and countries to thrive by building a better future. Our work underpins a prosperous society where people can find meaning and opportunity. It builds consumer and business confidence, empowers organizations to find imaginative ways of deploying capital, enables fair, trusted, and functioning social and economic institutions, and allows our friends, families, and communities to enjoy the quality of life that comes with a sustainable future. And as the largest 100% Canadian-owned and operated professional services firm in our country, we are proud to work alongside our clients to make a positive impact for all Canadians.

- Have many careers in one Firm.
- Enjoy flexible, proactive, and practical benefits that foster a culture of wellbeing and connectedness.
- Learn from deep subject matter experts through mentoring and on-the-job coaching.

We are looking for a passionate AI Researcher to join our team. You will work at the intersection of cutting-edge AI research and product engineering: designing, evaluating, and deploying generative AI (GenAI) systems that are both reliable and impactful. This role blends fundamental research, model evaluation, and practical software engineering to push forward the next generation of intelligent applications.

What will your typical day look like?

Responsibilities:
- Collaborate with product managers, engineers, and stakeholders to design AI-driven solutions that meet technical and business requirements.
- Research, prototype, and develop generative AI applications by combining non-deterministic LLMs with deterministic software engineering techniques.
- Build evaluation frameworks and benchmarks to measure model quality, reliability, and business impact.
- Generate regular reports on model accuracy, drift, and performance.
- Debug, optimize, and enhance GenAI applications using prompt engineering, reinforcement learning, fine-tuning, and software engineering best practices.
- Train and fine-tune large language models using Hugging Face Transformers.
- Apply reinforcement learning fine-tuning techniques using Hugging Face TRL (Transformer Reinforcement Learning).
- Manage training workflows with experiment tracking tools and distributed training accelerators (DeepSpeed, Accelerate, FSDP).
- Run and optimize multi-GPU training and inference, leveraging vLLM for high-throughput, low-latency serving.
- Contribute to the design of scalable MLOps/DevOps pipelines for model deployment, monitoring, and continuous training.
- Ensure compliance with data privacy, security, and responsible AI guidelines when handling training or test datasets.
- Stay current with emerging research in LLMs, RLHF/RLAIF, multimodal AI, and generative models; apply findings to improve our systems.
- Author technical documentation and contribute to publications, patents, or open-source projects where applicable.

About the team

Deloitte AI and Data, Deloitte's Artificial Intelligence (AI) practice, is composed of AI/ML experts with hands-on experience in developing and deploying AI/ML solutions to create competitive advantage for Canadian businesses as part of their overall Data and AI/ML transformation journey. The AI and Data Data Science team works together with Canadian businesses to envision and craft solutions that drive automation, optimization, efficiency, and many new opportunities, while being mindful of driving responsible and transparent AI. We strive to empower our clients' organizations to become data- and insight-driven, with an AI/ML-first mindset, to produce tangible business outcomes.
Enough about us, let's talk about you

You are someone with these required skills, experience and qualifications:
- 3+ years of experience in machine learning engineering, data engineering, or applied research (industry or academic).
- Strong programming skills in Python and experience with frameworks such as PyTorch, TensorFlow, or JAX.
- Hands-on experience with Hugging Face Transformers for pretraining, fine-tuning, or inference.
- Experience with Hugging Face TRL for reinforcement learning fine-tuning (e.g., PPO, DPO, GRPO, RLAIF).
- Practical experience managing multi-GPU training and distributed training at scale using DeepSpeed, Accelerate, or FSDP.
- Experience running inference on large models using vLLM or similar optimized serving frameworks.
- Familiarity with experiment tracking and reproducibility tools (e.g., W&B, MLflow).
- Knowledge of MLOps practices including continuous training, continuous monitoring, and model lifecycle management.
- Experience with GenAI frameworks such as LangChain, AutoGen (A2A), or MCP.
- Demonstrated ability to write clean, maintainable, production-ready code.
- Experience building or supporting cloud-based AI systems (GCP, AWS, or Azure; certifications preferred).
- Strong grasp of reinforcement learning, NLP, and/or generative modeling (transformers, diffusion, RAG, etc.).
- Track record of research contributions (papers, patents, open-source projects) is a plus.

Preferred Tech Stack
- Transformers (Hugging Face) for model training, fine-tuning, and inference.
- TRL (Transformer Reinforcement Learning) for RL-based fine-tuning (PPO, DPO, GRPO, RLAIF).
- DeepSpeed, Accelerate, or FSDP for multi-GPU and distributed training.
- vLLM for optimized inference and serving of large models.
- Weights & Biases (W&B) or MLflow for experiment tracking and reproducibility.
- LangChain, AutoGen (A2A), or MCP for GenAI application development.
- PyTorch as the primary deep learning framework.
It would be great for you to have some of these nice-to-haves as well:
- Experience with reinforcement learning from human/AI feedback (RLHF/RLAIF).
- Contributions to open-source AI frameworks.
- Familiarity with scaling laws, evaluation metrics, and benchmarking large models.
- Interest in pushing the boundaries of trustworthy, explainable, and safe AI.

Total Rewards

The salary range for this position is $72,000 - $138,000, and individuals may be eligible to participate in our bonus program. Our Total Rewards Package extends well beyond traditional compensation and benefit programs and is designed to recognize employee contributions, encourage personal wellness, and support firm growth. Along with a competitive base salary and variable pay opportunities, we offer a wide array of initiatives that differentiate us as a people-first organization, including $4,000 per year for mental health support benefits, a $1,300 flexible benefit spending account, firm-wide closures known as 'Deloitte Days', dedicated days of learning (known as Development and Innovation Days), flexible work arrangements and a hybrid work structure.

Our promise to our people: Deloitte is where potential comes to life.

Be yourself, and more. We are a group of talented people who want to learn, gain experience, and develop skills. Wherever you are in your career, we want you to advance.

You shape how we make impact. Diverse perspectives and life experiences make us better. Whoever you are and wherever you're from, we want you to feel like you belong here. We provide flexible working options to support you and how you can contribute.

Be the leader you want to be. Some guide teams, some change culture, some build essential expertise. We offer opportunities and experiences that support your continuing growth as a leader.

Have as many careers as you want. We are uniquely able to offer you new challenges and roles, and prepare you for them.
We bring together people with unique experiences and talents, and we are the place to develop a lasting network of friends, peers, and mentors.

The next step is yours.

At Deloitte, we are all about doing business inclusively, and that starts with having diverse colleagues of all abilities. Deloitte encourages applications from all qualified candidates who represent the full diversity of communities across Canada, including people with disabilities, Indigenous communities, and the Black community, in support of living our values, creating a culture of Diversity, Equity and Inclusion, and our commitment to our AccessAbility Action Plan, Reconciliation Action Plan and the BlackNorth Initiative.

We encourage you to connect with us at [email protected] if you require an accommodation for the recruitment process (including alternate formats of materials, accessible meeting rooms or other accommodations) or [email protected] for any questions relating to careers for Indigenous peoples at Deloitte (First Nations, Inuit, Métis).

By applying to this job you will be assessed against the Deloitte Global Talent Standards. We've designed these standards to provide our clients with a consistent and exceptional Deloitte experience globally.

Deloitte Canada has 20 offices with representation across most of the country. We acknowledge that Deloitte offices stand on traditional, treaty, and unceded territories in what is now known as Canada. We recognize that Indigenous Peoples have been the caretakers of this land since time immemorial, nurturing its resources and preserving its natural beauty. We acknowledge this land is still home to many First Nations, Inuit, and Métis Peoples, who continue to maintain their deep connection to the land and its sacred teachings.
We humbly acknowledge that we are all Treaty people, and we commit to fostering a relationship of respect, collaboration, and stewardship with Indigenous communities in our shared goal of reconciliation and environmental sustainability.