Job Description

About UsDataWeave is a SaaS-based digital commerce analytics platform that empowers retailers with competitive intelligence and equips consumer brands with digital shelf analytics globally. Using proprietary AI technology, DataWeave analyzes over 500+ billion data points across 400,000+ brands, 4,000+ websites, and 20+ industry verticals. Our clients include Nordstrom, Overstock, The Home Depot, Mars, Bush Brothers, Mondelez, Pernod Ricard, and more. We are a globally distributed team of 220+ engineers, product managers, and eCommerce experts with technology offices in Bangalore.What We Offer- Opportunities to work on cutting-edge AI research in NLP, Computer Vision, and Large Language Models (LLMs) - Immediate impact on product and business decisions in the retail/eCommerce domain - End-to-end ownership of projects from ideation to deployment - Culture of openness, collaboration, and mentorship - Flexible work environment and continuous learning opportunities - Competitive rewards and fast-paced career growthRole OverviewThe Lead / Sr Data Scientist will drive AI innovation and solve complex business problems in the retail domain. The role involves developing production-ready ML/DL models, leading research initiatives, mentoring junior team members, and ensuring AI solutions align with product strategy.Responsibilities- Build robust ML models using state-of-the-art architectures for NLP, Computer Vision, and Deep Learning - Solve complex retail problems such as product matching, attribute extraction, and price optimization - Optimize models for scalability, efficiency, and deployment with MLOps best practices - Take end-to-end ownership of AI projects, from research to production - Mentor and guide junior team members, fostering a culture of innovation and collaboration - Collaborate with cross-functional teams to translate business problems into AI solutionsRequired Qualifications- Bachelor’s degree in Computer Science, Data Science, Mathematics, or a related field - 6+ years of hands-on experience in AI/ML development (3+ years acceptable with exceptional expertise in GenAI/LLMs) - Expert-level Python proficiency with experience in PyTorch or TensorFlow - Strong experience in Generative AI, LLMs, vision-language models, and multimodal systems - Hands-on experience with NLP and CV libraries: SpaCy, NLTK, HuggingFace Transformers, OpenCV - Experience in model training, fine-tuning, quantization, evaluation, and deployment of transformer-based models (BERT, GPT, T5, LLaMA, etc.) - Familiarity with model optimization and scalability techniques (quantization, distillation, pruning, ONNX, TensorRT-LLM, DeepSpeed, etc.) - Strong understanding of LLM ecosystems including OpenAI, Anthropic, Meta, Google, Mistral, AWS Bedrock - Proven ability to lead projects and mentor teams in a high-velocity product environmentPreferred / Good to Have- Master’s or PhD in Computer Science, AI/ML, Applied Math, or related fields - Experience in startups or high-growth environments with ownership mindset - Building full MLOps pipelines (MLFlow, Kubeflow, Airflow, SageMaker, Vertex AI) - LLM fine-tuning and parameter-efficient training (PEFT: LoRA, QLoRA, DoRA, Adapters, etc.) - Experience with LangChain, LangGraph, LlamaIndex, and multi-agent workflows - Building Retrieval-Augmented Generation (RAG) pipelines using vector DBs like Pinecone, Chroma, Qdrant, Weaviate, or FAISS - Practical experience in evaluating LLM applications using Ragas, DeepEval, Promptfoo, or custom frameworks - Knowledge of modern research in Transformer optimizations, self-supervised learning, agentic AI, and efficient training frameworks - Contributions to open-source ML/AI projects, publications, or active participation in research communities

Job Title

Company : DataWeave

Location : Bengaluru, Karnataka

Created : 2025-12-19

Job Type : Full Time