Job description Location:Remote Type:Full-Time Experience Level:Senior (Minimum5+ years of relevant experience ) Industry:Cloud Infrastructure, AI/ML Ops, KubernetesAbout the Role Were looking for aSenior OpenShift Platform Engineerwith5+ years of hands-on experiencein OpenShift and Kubernetes. In this strategic role, you'll lead the design and implementation ofMLOps / LLMOps systemson OpenShift, mentor engineers, and help scale secure, high-performance AI infrastructure in collaboration with cross-functional teams.Key Responsibilities Platform Leadership Architect, install, upgrade, and manage OpenShift clusters, both on bare metal and VMware. Lead the deployment of MLOps / LLMOps workflows in OpenShift AI environments. Implement production-grade solutions for model deployment, monitoring, and validation pipelines. Infrastructure Excellence Set up robust monitoring(Prometheus, Thanos, Grafana) , logging, and backup strategies. Drive improvements in scalability, performance, and reliability of containerized platforms. Ensure secure and efficient configurations for RBAC, networking, and persistent storage (NetApp preferred). Collaboration & Communication Translate customer and product requirements into technical solutions. Lead architectural and code reviews across distributed engineering teams. Mentor team members and promote a high standard of engineering practices. Incident Management & Governance Own and lead root cause analysis (RCA) and post-mortem follow-ups. Define and enforce platform standards, technical governance, and compliance practices.Must-Have Qualifications Minimum 5 years of experiencein OpenShift cluster installation, management, and lifecycle operations. Proven experience designinghighly available and scalable systemsin enterprise environments. Hands-on experience withbare metal or VMware-based OpenShift deployments . Deep understanding ofKubernetes/OpenShift security, RBAC, networking, and persistent storage (NetApp preferred) . Expertise in setting upmonitoring, logging, and backup solutions . Proficiency withCI/CD DevOps toolssuch asGitLabandArgoCD . Solid experience withobservability toolslikePrometheus, Thanos, and Grafana . Strong communication and presentation skills to engage both technical and non-technical stakeholders. Ability to juggle multiple projects and deliver with minimal oversight.Bonus Skills (Highly Desirable) Experience withOpenShift AIand deployment ofMLOps/LLMOpsworkflows. Familiarity withOpenShift VirtualizationorKubeVirt . Open-source contributions in Kubernetes/MLOps communities. Experience leading customer-facing technical discussions and workshops.
Job Title
Senior OpenShift Platform Engineer (MLOps, LLMOps)