Build the future of Conversational AI with ConnexAI. As a Speech Data Engineer, your work will power the data behind real-time speech systems used by millions worldwide, ensuring our AI learns from clean, accurate, and reliable datasets. By curating and engineering the voice data that fuels our models, youll help shape products that transform how people and businesses communicate.Youll be part of the team that manages and scales our massive speech corpora, builds automated pipelines for cleaning and validation, and works closely with annotation and Machine Learning teams to keep our models at the cutting edge.This is a pivotal moment for ConnexAI, as we expand our Automatic Speech Recognition capabilities and push the boundaries of conversational intelligence. Join us, and be part of the team setting the industry standard.Any additional information you require for this job can be found in the below text Make sure to read thoroughly, then apply.Core ResponsibilitiesOrganise and maintain extremely large-scale speech datasets, ensuring they are versioned correctly, catalogued, and easy to retrieve.Build automated pipelines using Python, Bash, AWS, Docker, and automation tools to clean and validate speech data, identifying duplicates, corrupted files, or inconsistencies.Coordinate with the annotation team to manage labelling workflows and ensure high-quality, consistent annotations across datasets.Set up and monitor automated model evaluation pipelines, tracking metrics such as Word Error Rate (WER) and Character Error Rate (CER) to provide actionable feedback to engineering and data science teams.Prepare specialised datasets for experimental or production models, supporting the development of cutting-edge ASR and conversational AI systems.Ensure all datasets are securely backed up, recoverable, and well-documented for use across teams.Collaborate closely with data scientists, ML engineers, and product teams to identify opportunities to improve data quality and model performance.Interview Process30-minute video call with the team leadTake-home technical exercise 90-minute face-to-face interviewAbout ConnexAIConnexAI is an award-winning Conversational AI platform. Designed by a world-class engineering team, ConnexAI's technology enables organisations to maximise profitability, increase revenue and take productivity to new levels.ConnexAI provides cutting-edge, enterprise-grade AI applications, including AI Agent, AI Guru, AI Analytics, ASR, AI Voice, and AI Quality.We value growth, both for our products and our people. As we scale, there will be clear opportunities to progress into senior data engineering, ML engineering, or data science roles. Our high retention rate reflects our inclusive, supportive, and empowering environment.
Job Title
Data Engineer (Speech)