Company Description
DeepLLMData specializes in driving advancements in AI by providing expertly curated human data to power frontier models. With expertise in Coding SFT (Supervised Fine-Tuning) and Reinforcement Learning from Human Feedback (RLHF), the company collaborates with domain-specific experts across STEM, Mathematics, PhD fields, and more. DeepLLMData also offers image and video annotation services, along with data provisioning for diverse machine learning applications. Our team’s mission is to support AI excellence globally.

Role Description
This is a contract-based, remote role for an expert in training Large Language Models (LLMs) with a specialization in C/C++ programming and a minimum of 3 years of professional experience. The primary responsibilities include curating and annotating high-quality coding data, assessing and enhancing LLM performance on coding tasks, collaborating with a team of AI researchers, and contributing to model training through techniques such as supervised fine-tuning and RLHF. The ideal candidate will also assist in evaluating outputs and providing insights to improve the DS

Qualifications
Advanced proficiency in C and C++ programming with at least 3 years of professional experience.
Strong understanding of machine learning concepts, supervised fine-tuning, and reinforcement learning from human feedback (RLHF).
Experience in data annotation, curation, and preprocessing for AI model training is a plus.
Background in software engineering or AI development, with a focus on large-scale language models or coding-related tasks.
Excellent problem-solving skills, attention to detail, and ability to work independently in a remote setting.
Bachelor’s or advanced degree in Computer Science, Software Engineering, or a related technical field is required; a Master’s or PhD is a plus.

Job Title
LLM Model Training for Coding: C/C++ Language, 3 Years