Key Responsibilities
Optimize transformer-based voice inference for ultra-low latency
Fine-tune models for emotion understanding and synthesis
Profile and reduce bottlenecks in streaming ML pipelines
Design and build SDKs for voice integration in consumer apps
Collaborate with founders on architecture and customer feedback
Own the end-to-end ML system, from model design to infra deployment

Tech Stack
PyTorch, CUDA, vLLM, SGLang, Streaming, Docker, Kubernetes

Why Join?
Work alongside founders in a hardcore small team (4 people)
Massive early traction with top consumer AI pipelines
Foundational role owning ML infra from day one
Equity up to 2% + competitive compensation
High-impact consumer product reshaping voice interaction

Interview Process
Screening with CTO (30 min)
Tech interview with co-founder + take-home ML task
3-day paid work trial

Job Title
Founding ML Engineer