A company is looking for a Senior DGX Cloud AI Infrastructure Software Engineer. Key Responsibilities Develop infrastructure software and tools for large-scale pre-training, post-training, and inference Optimize tools and libraries to enhance infrastructure efficiency and resiliency Co-design and implement APIs for integration with resiliency stacks Required Qualifications Minimum of 8+ years of experience in developing software infrastructure for large scale AI systems Bachelor's degree or higher in Computer Science or a related technical field (or equivalent experience) Experience with observability platforms for monitoring and logging Proven track record in building and scaling large-scale distributed systems Proficiency in programming languages such as Python, C/C++, and scripting languages
Job Title
Senior AI Infrastructure Software Engineer