Job Title


AI/ML Engineer (DLMs, Embeddings, Fine-Tuning)


Company : B4B IT SOLUTIONS PVT LTD


Location : Hyderabad, Telangana


Created : 2025-12-18


Job Type : Full Time


Job Description

Company Overview for Client

This role is being recruited on behalf of a stealth-stage DeepTech AI company that is transforming a major, multibillion-dollar industry via proprietary AI.

Company Summary

We are representing a highly ambitious stealth DeepTech AI company operating across multiple international markets. Their core strength is a proprietary Deep Learning Model (DLM), fine-tuned on massive, domain-specific data to deliver performance superior to general-purpose foundation models. The company focuses on solving critical operational inefficiencies by providing AI-first automation, secure multi-platform tooling, and multilingual support. They are scaling rapidly toward significant ARR targets with strong unit economics and are positioned to capture a large market opportunity.

AI/ML Engineer (DLMs, Embeddings, Fine-Tuning)

Job Title: AI/ML Engineer
Employment Type: Full-Time
Location: Hyderabad, India
Experience: 5+ years in a senior role

Role Summary

You will be the core technical driver behind the client's competitive moat: the proprietary Deep Learning Model (DLM). This is a hands-on, highly technical role focused on achieving and maintaining domain-specific accuracy and cost-efficiency that generic models cannot replicate. You will own the end-to-end lifecycle of models, from data ingestion and fine-tuning to deployment and optimization, ensuring the platform delivers industry-leading accuracy (targeting 95% on complex domain benchmarks) while operating at a highly competitive cost advantage.

Key Responsibilities

- Proprietary Model Fine-Tuning: Lead fine-tuning and customization of Large Language Models (LLMs) or similar Deep Learning Models using techniques such as LoRA/PEFT for domain-specific performance.
- Design and manage high-volume data ingestion and cleaning pipelines; implement automated QA checks and coordinate expert corpus review.
- Develop, manage, and optimize embeddings generation and vector search workflows; integrate with vector databases (e.g., Qdrant or similar) to enable accurate Retrieval-Augmented Generation (RAG).
- Cost & Performance Moat: Optimize model inference for maximum throughput and minimal cost-per-query, specifically targeting operations that are up to 1000x cheaper than large commercial general-purpose models, using techniques like distillation and quantization.
- Implement and refine core AI algorithms for specialized tasks such as predictive insights and automated content extraction.
- Collaborate on deploying and monitoring AI microservices in production with a focus on scalability, reliability, and observability.

Required Technical Skills

- 5+ years of experience in AI/ML with deep specialization in LLMs, NLP, and fine-tuning techniques.
- Persona: Must be a highly autonomous, senior individual contributor with a commitment to engineering and optimizing proprietary, deep-tech IP.
- Expert-level Python and strong proficiency with PyTorch or TensorFlow.
- Practical experience with vector databases, embeddings pipelines, and RAG architectures.
- Demonstrated ability to optimize models for production deployment, including GPU resource management, quantization, and distillation.
- Strong knowledge of data cleaning methodologies and advanced ML algorithms.

What We Offer

- Competitive salary aligned with the Hyderabad startup market.
- Continuous learning support, including conference allowance and learning resources.
- Collaborative, lean team culture where decisions move fast and contributions are visible.
- Opportunity for rapid career growth as the company scales and secures funding.

How to Apply

Please apply via LinkedIn Easy Apply or email with the following:

- Updated CV highlighting relevant LLM, embeddings, and fine-tuning work.
- Short cover note (2–3 paragraphs) describing a recent project where you improved model accuracy or cost-efficiency; include the approach, tools, and measurable outcome.
- Links to project artifacts (GitHub, Colab notebooks, papers, demo videos) if available.

Candidates who pass initial screening will be asked to answer 2–3 short technical questions and provide a concise case write-up or code sample demonstrating relevant experience.

Join Us and Make an Impact

By joining this team, you will contribute directly to a platform driving significant efficiency gains and reducing friction across large industry workflows. The client seeks professionals ready to commit, innovate, and scale as they pursue market leadership and industry transformation.

#DeepTech #AIMLEngineer #LLM #Quantization #RAG #HyderabadJobs #Startup