
Job Title


AI/ML Model Compression & Quantization Engineer


Company : I Machines, Inc.


Location : Moncton, New Brunswick


Created : 2026-05-07


Job Type : Full Time


Job Description

About Our Company

We're a fast-paced, fabless semiconductor startup redefining the boundaries of AI through cutting-edge, scalable AI-infused multipurpose compute architecture. Our mission is to deliver scalable, efficient, and intelligent silicon solutions for the next generation of edge AI, robotics, autonomous systems, and mobile devices. Our leadership team brings together decades of experience in semiconductor innovation, spanning chip architecture, system design, and global business operations. The team includes pioneers behind several generations of groundbreaking compute architectures; experts in software-hardware co-design, SoC, and AI development, with hundreds of patents in our portfolio; and leaders of multi-billion-dollar business units at top-tier technology companies.

Position Overview

This is a great opportunity to join a highly skilled AI/ML Software team working at the intersection of HW/SW co-design. In this role, you will be responsible for designing and executing end-to-end model compression pipelines, including sensitivity analysis, quantization, pruning, and hybrid optimization techniques across large-scale transformer architectures.

Key Responsibilities and Duties

- Build and own the end-to-end compression pipeline
  - Baseline benchmarking and instrumentation
  - Sensitivity analysis
  - Transformation mapping (quantization, sparsity, low-rank)
- Implement layerwise sensitivity scoring frameworks
- Design and apply quantization strategies
  - PTQ, QAT, mixed-precision quantization
  - INT8, INT4, FP8, FP4 exploration
- Develop mixed-precision policies
  - Per-layer/tensor precision assignment
  - Dynamic range calibration and scaling strategies
- Implement and evaluate pruning techniques
- Apply hybrid compression methods
  - Sparse + quantized pipelines
  - Low-rank decomposition
- Run post-transformation recovery
  - QAT, LoRA-based recovery, distillation
- Benchmark across:
  - Accuracy degradation
  - Latency / throughput
  - Memory footprint
- Optimize for iMachine Architecture

Qualifications and Skills

Successful candidates should possess the following qualifications and skills:

Required Qualifications (You must possess these qualifications to be considered for the position)

- Bachelor of Science degree in Electrical Engineering, Computer Science, Computer Engineering, or related field
- 1+ year of experience with PyTorch / JAX / TensorFlow
- Understanding of:
  - Transformer architectures (LLM, VLM, VLA)
  - Numerical precision and quantization theory
- Hands-on experience with:
  - TensorRT, ONNX Runtime, or similar inference stacks
- Familiarity with:
  - Sparse representations (CSR, COO, RLC)
  - Low-rank approximation methods (SVD, factorization)
- Ability to analyze:
  - Activation distributions
  - Gradient statistics
  - Numerical stability issues

Preferred Qualifications

- MS or PhD in Electrical Engineering, Computer Engineering, Computer Science, or related field
- Experience with:
  - FP8 / FP4 pipelines
  - Hardware-aware optimization
- Prior work on:
  - Multimodal models (vision-language, robotics policies)
- Knowledge of:
  - Compiler stacks (TVM, Triton, XLA)

Expectations

- Deliver production-ready compressed models with minimal accuracy loss
- Achieve quantifiable performance gains (latency, memory, throughput)
- Provide clear layerwise transformation justifications
- Build reusable tooling and automation pipelines
- Iterate quickly using data-driven decision making

Why Join Us

- Get in early at a breakthrough deep-tech startup reshaping AI compute
- Work closely with industry innovators and experienced leaders where your work will have a direct impact on the success of the company
- Be part of a mission-driven team building foundational technology for the future
- We balance sharp execution with continuous innovation to push the boundaries
- Competitive compensation, equity, and growth opportunities

Benefits and Perks

At I Machines, Inc., we offer competitive salaries and a comprehensive benefits package, including:

- Health, dental, and vision insurance
- Retirement savings plans
- Paid time off and holidays
- Professional development opportunities
- Flexible schedule

Equal Opportunity Employer

I Machines, Inc. is an equal opportunity employer and does not discriminate based on race, color, religion, gender, national origin, age, disability, or any other legally protected status. All qualified applicants will be considered for employment.