Skip to Main Content

Job Title


Sr. Systems Design Engineer - Data Center GPU


Company : Advanced Micro Devices


Location : Markham, Ontario


Created : 2026-03-07


Job Type : Full Time


Job Description

WHAT YOU DO AT AMD CHANGES EVERYTHING At AMD, our mission is to build great products that accelerate nextgeneration computing experiencesfrom AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, youll discover the real differentiator is our culture. We push the limits of innovation to solve the worlds most important challengesstriving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond. Together, we advance your career. THE ROLE: We are looking for a dynamic, energetic Senior Systems Design Engineer to join our growing Data Center GPU team. Our team fosters and encourages continuous technical innovation to showcase successes as well as facilitate continuous career development. In this role, will w ork closely with the automation, infrastructure, and validation teams to ensure scalability and reliability. You will also d ocument processes, best practices, and provide training for internal teams. As a key contributor to the success of AMDs product, you will be part of a leading team to drive and improve AMDs abilities to deliver the highest quality, industry leading technologies to market. THE PERSON: As a Systems Design Engineer, you will drive balanced, scalable, and automated solutions. In this high visibility position, your software systems engineering expertise will be necessary towards p roduct development, definition, and root cause resolution. You will have s trong problem-solving and debugging skills, e xcellent communication and collaboration abilitiesm and the a bility to work in fast-paced, cross-functional environments. KEY RESPONSIBILITIES: Containerization & Image Management Design, build, and maintain Docker images optimized for ML/AI workloads. Implement multistage builds , image hardening, and vulnerability scanning. Manage Docker registries (e.g., Harbor) and enforce retention policies for largescale deployments. Automation & Orchestration Develop and maintain Pythonbased automation scripts for Conductor workflows. Implement CI/CD pipelines for automated container builds and workload deployment. Integrate orchestration frameworks (Conductor, Kubernetes, Slurm) for multinode workload execution. ML/AI Workload Enablement Enable training and inference workloads using frameworks like PyTorch, TensorFlow, VLLM . Optimize distributed training and inference across multinode clusters using MPI and RDMA. Collaborate with app experts to benchmark and tune performance for AI/HPC workloads. Infrastructure & Performance Integrate ROCm stack and GPU resource management into containerized environments. Troubleshoot latency, networking, and storage bottlenecks for atscale workloads. Implement monitoring and logging for containerized ML workloads. PREFERRED EXPERIENCE: Strong proficiency in Python and automation frameworks. Handson experience with Docker and container orchestration (Kubernetes, Podman). Familiarity with CI/CD tools (Jenkins, GitHub Actions) and infrastructureascode (Terraform, Ansible). Knowledge of ML frameworks (PyTorch, TensorFlow) and GPU acceleration (ROCm, CUDA). Understanding of networking concepts (RDMA, MPI) for distributed workloads. Prior experience enabling ML/AI workloads in production or HPC environments. Exposure to orchestration platforms like Conductor or similar workflow engines. ACADEMIC CREDENTIALS: Bachelors or Masters degree in electrical or computer engineering, minimum 57 years relevant experience LOCATION: Markham, ON Benefits offered are described: AMD benefits at a glance. AMD does not accept unsolicited resumes from headhunters, recruitment agencies, or feebased recruitment services. AMD and its subsidiaries are equal opportunity, inclusive employers and will consider all applicants without regard to age, ancestry, color, marital status, medical condition, mental or physical disability, national origin, race, religion, political and/or thirdparty affiliation, sex, pregnancy, sexual orientation, gender identity, military or veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants needs under the respective laws throughout all stages of the recruitment and selection process. #J-18808-Ljbffr