Job Description

Position Summary We're seeking a Senior NLP Engineer with ahackermindset to revolutionize our information extraction engine for financial/legal data processing. This is aproduct-driven rolefocused on delivering production-ready solutions whereaccuracy and speed of output are paramount . Our systems arecustomer-facing with Human-in-the-Loop (HITL) workflows , requiring optimization for seamless human-AI collaboration. You'll have complete ownership from problem analysis to production deployment, leveraging any LLM, technique, or creative approach that delivers maximum accuracy in minimum time while ensuring optimal user experience. Key Responsibilities Core Product-Driven AI Work Enhance Information Extraction Engine : Redesign and optimize our current system using state-of-the-art LLMs for financial/legal document processing with focus on production accuracy and speed Accuracy Optimization : Achieve highest possible extraction accuracy through any means necessary—fine-tuning, prompt engineering, ensemble methods, or hybrid approaches LLM Integration : Implement and experiment with various LLMs (GPT, Claude, Llama, Gemini, etc.) to find optimal solutions for production use Creative Problem Solving : Think like ahacker —if traditional ML doesn't work, try prompt engineering, RAG, few-shot learning, or completely novel approaches LLM Development : Lead the pre-training and fine-tuning of large language models (LLMs) to optimize performance for specific financial/legal use cases Information Retrieval, Extraction & Classification : Develop and implement techniques for retrieving, extracting, classifying, and ranking large-scale financial/legal datasets using advanced algorithms and vector databases Production-First Mindset : Every solution must be production-ready with measurable accuracy and performance metrics Time-to-Accuracy Optimization : Balance between achieving high accuracy and delivering results within acceptable time constraints Human-in-the-Loop (HITL) Workflow Optimization HITL System Design : Build AI systems optimized for human review, correction, and validation workflows Confidence Scoring & Routing : Develop intelligent routing systems that send low-confidence extractions to human reviewers while auto-approving high-confidence results Interactive Correction Interfaces : Design systems that learn from human corrections and feedback in real-time Workflow Efficiency : Optimize human review processes to minimize time spent while maximizing accuracy improvements Active Learning Integration : Implement systems that strategically request human input on the most valuable data points Feedback Loop Optimization : Build mechanisms to continuously improve AI performance based on human corrections and preferences User Experience Design : Ensure seamless handoffs between AI processing and human review stages Engineering Responsibilities Third-Party LLM API Integration : Seamlessly integrate and manage multiple LLM APIs (OpenAI, Anthropic, Google, etc.) with fallback mechanisms API Development & Management : Build robust APIs for information extraction services with proper error handling and monitoring System Evaluation & Benchmarking : Develop comprehensive evaluation frameworks to measure accuracy, latency, and cost across different LLM approaches Performance Engineering : Optimize systems for low-latency, high-throughput information extraction Integration Architecture : Build seamless integrations with existing financial/legal data workflows and customer systems End-to-End Product Delivery Complete Product Ownership : Take problems from customer requirements through development, testing, and production deployment Quality Assurance : Ensure production systems maintain consistent accuracy and reliability standards Rapid Iteration : Quickly implement user feedback and production improvements Startup Engineering Mindset Engineering-First Approach : Prioritize engineering solutions that work in production over theoretical research Pragmatic Decision Making : Choose solutions based on production requirements, not academic interest Resource Optimization : Balance accuracy, speed, and cost for optimal business outcomes Scalability Focus : Build systems that can handle increasing data volumes and customer demands Integration Expertise : Excel at connecting different systems, APIs, and data sourcesRequired Qualifications Experience & Background 4-8 years of Applied AI/ML experiencewithproduction deploymentsandcustomer-facing products HITL System Experience : Hands-on experience building Human-in-the-Loop AI systems and workflows Production LLM Experience : Hands-on experience integrating and productionizing LLM solutions, not just research Information Extraction in Production : Proven track record of deploying IE systems that serve real business needs API Integration Expertise : Experience working with third-party APIs, handling rate limits, errors, and failovers Technical Skills Programming : Expert in Python with strong software engineering practices Production LLM Techniques : Pre-training and fine-tuning of large language models API integration and management (OpenAI, Anthropic, Google, etc.) Prompt engineering for production accuracy Model evaluation and benchmarking Cost optimization and performance tuning Information Retrieval : Vector databases, ranking algorithms, search systems Engineering Frameworks : FastAPI, Flask, Docker, Kubernetes, cloud services Quality Assurance : Multi-stage review processes, validation workflows, error detection Critical Product Mindset Customer-Centric : Focus on optimizing the end-user experience in human-AI collaborative workflows Workflow Optimization : Understand and improve human work patterns and efficiency Production-First : Every solution must work reliably in customer-facing environments User Trust : Build systems that maintain and enhance user confidence in AI outputs Iterative Improvement : Design systems that get better through human interaction Quality Obsession : Maintain high standards for both AI accuracy and user experienceDaily activities include: Monitoring production systems and addressing any issues Analyzing user feedback and correction patterns from support tickets Code reviews and preparing for weekly releases Collaborating with product and internal teams Incremental improvements to existing workflows Testing and evaluating changes through internal metrics before weekly deployment Planning and scoping work for upcoming weekly releasesEngineering scenarios you'll handle: API rate limiting and failover logic when OpenAI is down Building custom evaluation metrics for financial/legal document accuracy Optimizing prompt costs while maintaining extraction quality Debugging production issues affecting customer workflows Integrating with customer systems and data formats Success Metrics Production Accuracy : Measurable improvement in extraction precision/recall in live systems Automation Rate : Percentage of extractions that can be auto-approved without human review Cost Efficiency : Optimal balance of accuracy, speed, and LLM API costs Time to Production : Speed from problem identification to deployed solution This Role is NOT For You If: You prefer research over production implementation You want to publish papers rather than ship products You need perfect data or requirements before starting You're more interested in novel algorithms than customer problems You avoid the "messy" work of production systems and integrations This Role IS Perfect For You If: You love seeing your AI solutions used by real customers You get excited about optimizing production systems for accuracy and speed You enjoy the engineering challenges of integrating multiple LLM APIs You measure success by customer outcomes, not research metrics You thrive on solving practical problems with whatever works best You take pride in building reliable, scalable production systems How to Apply: Please send your resume and portfolio to Who We Are- Alphastream.ai envisions a dynamicfuture for the financial world, where innovation is propelled by state-of-the-art AI technology and enriched by a profound understanding of credit and fixed- income research. Our mission is to empowerasset managers, researchfirms, hedge funds,banks, and investors with smarter, faster, and curated data. We provide accurate, timely information, analytics, and tools across simple to complex financial and non-financial data, enhancing decision- making. With a focus on bonds, loans,financials and sustainability, we offer near real-time data via APIs and PaaS (Platform as a Service) solutions that act as the bridge between our offerings and seamless workflow integration. To learn more about us:What we offer "At Alphastream.ai we offer a dynamic and inclusive workplace where your kills are valued and your career can flourish.Enjoy competitive compensation, a comprehensive benefits package, and opportunities for professional growth. Immerse yourself in an innovative work environment, maintain a healthywork-life balance, and contribute to a diverse and inclusive culture. Join us to work with cutting-edge technology, and be part ofa team that recognizes and rewards your achievements, all while fosteringa fun and engaging workplace culture." Disclaimer- Alphastream.ai is an equal opportunity employer. We work to provide a supportive and inclusive environment where all individuals can maximize their full potential. Our skilled and creative workforce is comprised of individuals drawn from a broad cross section of all communities in which we operate and who reflect a variety of backgrounds, talents, perspectives, and experiences. Our strong commitment to a cultureof inclusion is evident through our constant focus on recruiting, developing, and advancing individuals based on their skills and talents.

Job Title

Company : alphastream.ai

Location : Bengaluru, Karnataka

Created : 2026-02-23

Job Type : Full Time