We are seeking a Machine Learning Engineer to join our team in building the Legal Intelligence runtime stack. This role focuses on developing real-time speech recognition, retrieval, and summarization systems optimized for low-bandwidth, resource-constrained environments. As an early team member, you'll play a crucial role in making our ML models fast, lightweight, and deployable across thousands of Indian courtrooms, from remote district courts to the Supreme Court.

Responsibilities
Design and optimize speech and text processing pipelines, with a focus on Indic languages.
Implement compiler-aware workflows to reduce latency, memory usage, and energy consumption.
Collaborate closely with the founding team to enhance model performance for seamless operation by judges and stenographers.
Identify and implement innovative solutions to optimize machine learning models for various hardware architectures, including CPUs and GPUs.
Work in close collaboration with cross-functional partners in design, backend, and frontend functions.
Solve complex problems related to model efficiency and scalability.
Build cost-effective and scalable systems that can operate efficiently in resource-constrained environments.

Qualifications
Proven experience in designing and optimizing ML models for real-time applications.
Strong proficiency in compiler-aware workflows and performance optimization techniques.
Experience with speech and text processing, particularly in Indic languages.
Familiarity with deploying ML models on various hardware architectures, including CPUs and GPUs.
Excellent problem-solving skills and the ability to work in a collaborative, cross-functional environment.
Bachelor's or Master's degree in Computer Science, Engineering, or a related field.

Job Title
ML Engineer - Runtime