We are seeking a highly skilled Senior AI Consultant with a deep technical background in Large Language Models (LLMs) and Generative AI architecture. this will be a remote opportunity with few hours overlap with PST hours(4PM IST to 12AM IST).The ideal candidate possesses a rigorous analytical mindset typical of an IIT alumnus and has hands-on experience navigating the complexities of model fine-tuning, latency optimization, and enterprise-grade security.Design and implement end-to-end Retrieval-Augmented Generation (RAG) systems, including advanced indexing, query expansion, and re-ranking strategies.Expertly manage vector databases and optimize embedding models for domain-specific retrieval.Proactively identify and mitigate hallucinations using grounding techniques, self-correction loops, and verification frameworks.Execute advanced alignment techniques, demonstrating a clear understanding of the trade-offs between Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Proximal Policy Optimization (PPO).Evaluate and deploy various LLMs (e.g., GPT-4, Claude, Llama 3, Mistral) based on specific use cases and performance requirements.Develop both Informational Agents (search and synthesis) and Actionable Agents (tool-use and API execution).Implement the Model Context Protocol (MCP) to enable seamless integration between LLMs and external data sources or tools.Architect solutions that handle sensitive data with strict adherence to PII/PHI masking, encryption, and governance standards.Build and maintain CI/CD pipelines specialized for Machine Learning, focusing on model versioning, automated evaluation (evals), and data lineage (distinct from traditional App CI/CD).Optimize for Latency and Cost: Implement quantization, caching, and prompt engineering strategies to reduce token consumption and improve response times.Experience & QualificationsEducation: Degree from a premier Indian Institute of Technology (IIT).Technical Breadth: Proven track record of taking AI requirements from /"vague concept/" to /"technical specification/" and production deployment.Communication: Ability to articulate complex AI trade-offs to stakeholders and lead /"Day-in-the-Life/" operational excellence within an agile environment.Tools: Proficiency in Python, LangChain/LlamaIndex, PyTorch/TensorFlow, and cloud AI platforms (AWS Bedrock, Azure AI, or GCP Vertex).
Job Title
Senior AI Consultant (LLM & Generative AI Specialist)