Skip to Main Content

Job Title


Lead/Senior Data Scientist


Company : DataWeave


Location : Kannur, Kerala


Created : 2025-12-19


Job Type : Full Time


Job Description

About Us DataWeave is a SaaS-based digital commerce analytics platform that empowers retailers with competitive intelligence and equips consumer brands with digital shelf analytics globally. Using proprietary AI technology, DataWeave analyzes over 500+ billion data points across 400,000+ brands, 4,000+ websites, and 20+ industry verticals. Our clients include Nordstrom, Overstock, The Home Depot, Mars, Bush Brothers, Mondelez, Pernod Ricard, and more. We are a globally distributed team of 220+ engineers, product managers, and eCommerce experts with technology offices in Bangalore.What We OfferOpportunities to work on cutting-edge AI research in NLP, Computer Vision, and Large Language Models (LLMs)Immediate impact on product and business decisions in the retail/eCommerce domainEnd-to-end ownership of projects from ideation to deploymentCulture of openness, collaboration, and mentorshipFlexible work environment and continuous learning opportunitiesCompetitive rewards and fast-paced career growthRole Overview The Lead / Sr Data Scientist will drive AI innovation and solve complex business problems in the retail domain. The role involves developing production-ready ML/DL models, leading research initiatives, mentoring junior team members, and ensuring AI solutions align with product strategy.ResponsibilitiesBuild robust ML models using state-of-the-art architectures for NLP, Computer Vision, and Deep LearningSolve complex retail problems such as product matching, attribute extraction, and price optimizationOptimize models for scalability, efficiency, and deployment with MLOps best practicesTake end-to-end ownership of AI projects, from research to productionMentor and guide junior team members, fostering a culture of innovation and collaborationCollaborate with cross-functional teams to translate business problems into AI solutionsRequired QualificationsBachelor’s degree in Computer Science, Data Science, Mathematics, or a related field6+ years of hands-on experience in AI/ML development (3+ years acceptable with exceptional expertise in GenAI/LLMs)Expert-level Python proficiency with experience in PyTorch or TensorFlowStrong experience in Generative AI, LLMs, vision-language models, and multimodal systemsHands-on experience with NLP and CV libraries: SpaCy, NLTK, HuggingFace Transformers, OpenCVExperience in model training, fine-tuning, quantization, evaluation, and deployment of transformer-based models (BERT, GPT, T5, LLaMA, etc.)Familiarity with model optimization and scalability techniques (quantization, distillation, pruning, ONNX, TensorRT-LLM, DeepSpeed, etc.)Strong understanding of LLM ecosystems including OpenAI, Anthropic, Meta, Google, Mistral, AWS BedrockProven ability to lead projects and mentor teams in a high-velocity product environmentPreferred / Good to HaveMaster’s or PhD in Computer Science, AI/ML, Applied Math, or related fieldsExperience in startups or high-growth environments with ownership mindsetBuilding full MLOps pipelines (MLFlow, Kubeflow, Airflow, SageMaker, Vertex AI)LLM fine-tuning and parameter-efficient training (PEFT: LoRA, QLoRA, DoRA, Adapters, etc.)Experience with LangChain, LangGraph, LlamaIndex, and multi-agent workflowsBuilding Retrieval-Augmented Generation (RAG) pipelines using vector DBs like Pinecone, Chroma, Qdrant, Weaviate, or FAISSPractical experience in evaluating LLM applications using Ragas, DeepEval, Promptfoo, or custom frameworksKnowledge of modern research in Transformer optimizations, self-supervised learning, agentic AI, and efficient training frameworksContributions to open-source ML/AI projects, publications, or active participation in research communities