Skip to Main Content

Job Title


Data Scientist Agentic RAG & LLM (Databricks / Azure / AWS) - Australia & New Ze


Company : Rhino Partners Pte Ltd


Location : Sydney, New South Wales


Created : 2026-03-21


Job Type : Full Time


Job Description

Location: Australia & New Zealand (candidates must have valid working rights in either country)Position OverviewWe are seeking a highly skilled Data Scientist with strong expertise in Databricks, Azure, and AWS, specializing in Agentic Retrieval-Augmented Generation (RAG) and Large Language Models (LLMs). The role focuses on designing and productionizing intelligent AI/ML systems with scalable, cloud-native deployments, CI/CD pipelines, and MLOps best practices.The ideal candidate is handson, solutionoriented, and experienced in building and deploying advanced AI systems across multiple cloud platforms.Key ResponsibilitiesDesign and implement Agentic RAG pipelines using Databricks Vector Search, MLflow, Unity Catalog, integrated with Azure Cognitive Search and AWS OpenSearch.Develop agentbased workflows using LangChain, LangGraph, LlamaIndex, and other toolaugmented reasoning frameworks.Finetune, evaluate, and deploy LLMs (OpenAI, Anthropic, MosaicML, Hugging Face, Llama) for enterprise applications.Build CI/CD pipelines for ML & GenAI workloads, including:Automated build/test/deploy workflows (Azure DevOps, GitHub Actions, Jenkins, AWS CodePipeline).MLflow model registry integration with production/staging environments.InfrastructureasCode (IaC) using Terraform, Bicep, or CloudFormation for reproducible deployments.Implement MLOps best practices: experiment tracking, versioning, continuous evaluation, automated retraining pipelines.Ensure data governance, compliance, and security for sensitive datasets across Azure and AWS.Collaborate with engineering and product teams to integrate ETL/ELT pipelines in Azure Data Factory, Synapse, AWS S3, Redshift, Glue.Deploy and monitor models with online evaluation pipelines (MLflow Evaluate, DeepEval, custom scorers such as faithfulness, retrieval recall).Provide technical mentorship on GenAI architecture, CI/CD, and productiongrade LLM deployments.Required Skills & QualificationsBachelor''s or Master''s degree in Data Science, Computer Science, AI/ML, or related fields (PhD optional, not mandatory).4+ years of professional experience delivering ML/AI or data science solutions, including cloudnative deployments.Strong expertise with the Databricks ecosystem: Spark (PySpark/Scala), Delta Lake, Unity Catalog, MLflow, Vector Search.Handson experience with CI/CD pipelines for ML and GenAI:Azure DevOps, GitHub Actions, or Jenkins.Automated testing for ML pipelines.Model promotion workflows (dev staging prod).Proficiency in Python, SQL, distributed data processing, and cloudnative ML frameworks.Deep experience with Azure ML, Data Factory, Synapse, Data Lake and AWS SageMaker, Glue, S3, Redshift.Strong knowledge of LLM orchestration frameworks (LangChain, LangGraph, LlamaIndex).Solid understanding of LLM & RAG evaluation metrics (faithfulness, tokenF1, citation@k).Must have valid working rights in Australia or New Zealand.Preferred QualificationsExperience deploying multiagent LLM systems in production.Familiarity with InfrastructureasCode (Terraform, Bicep, CloudFormation) for CI/CD automation.Handson experience with containerization and orchestration (Docker, Kubernetes, AKS, EKS).Contributions to opensource GenAI/LLM projects or published research. #J-18808-Ljbffr