Skip to Main Content

Job Title


Research Intern - Multimodal AI - India


Company : Whissle


Location : Tirunelveli, Tamil nadu


Created : 2025-05-15


Job Type : Full Time


Job Description

About WhissleWhissle is an independent AI company focused on novel research in multi‑modal AI to provide cost‑effective real‑time solutions for audio, video, and touch‑enabled applications. We operate on three core pillars—academia, open‑source initiatives, and industry collaborations. Our models, bots, and APIs address a wide range of use cases, delivering efficient AI technology at scale.Research Intern – Multimodal AI (Remote, India)We’re looking for a research-minded intern to join our team and contribute to ongoing projects in real-time audio-visual understanding. This role is ideal for students from top engineering and science institutes in India (such as IITs, IIIT-Hyderabad, IISc, etc.) who are passionate about cutting-edge AI research and are excited to work in a remote, globally distributed team.You will collaborate with Whissle researchers on mutually aligned project areas in multimodal AI, and have the opportunity to co-author research papers, contribute to open-source releases, and work toward production-grade AI systems.A typical research project includes:Project scoping and literature reviewModel development, experimentation, and evaluationDeliverables: demos, papers, and real-world applicationsResponsibilitiesContribute to research in multimodal AI—including areas such as audio-visual ASR, large language models (LLMs), and TTS.Prototype and test models using modern ML frameworks (e.g., PyTorch, TensorFlow).Analyze multimodal datasets (audio, video, text, touch) and extract meaningful insights.Co-author technical papers, blog posts, and internal documentation.Present research findings in team meetings, academic forums, or open-source communities.Required QualificationsCurrently enrolled in a Bachelor’s, Master’s, or PhD program in Computer Science, Electrical Engineering, Computational Linguistics, Cognitive Science, or a related field at a recognized Indian university (IITs, IIITs, IISc, BITS, etc.).Experience with AI/ML research and interest in multimodal learning.Proficient in Python and comfortable with deep learning libraries (e.g., PyTorch, TensorFlow).Ability to deliver well-scoped research outcomes within timelines.Strong problem-solving, critical thinking, and communication skills.Comfortable working independently in a remote, asynchronous setup.Preferred QualificationsPublications or preprints in machine learning, signal processing, or related areas.Experience with multimodal learning techniques such as audio-visual fusion, cross-modal retrieval, or sensory integration.Familiarity with distributed training, performance optimization, and large-scale evaluation.Prior contributions to open-source ML tools or libraries.LogisticsLocation: Fully remote (with overlap in Indian Standard Time)Duration: 3 months (flexible hours; part-time option available)Compensation: Open-source collaboration with access to Whissle's research infra, stipend after 1 month.Growth Path: High-performing interns may be invited to join Whissle Research as full-time collaborators or fellowsWhissle is an equal-opportunity research organization. We believe that diverse teams lead to stronger ideas and more impactful innovation. We welcome applications from individuals of all backgrounds and identities.