About the RoleWe are seeking a talented and experienced IT Engineer / Architect with a strong focus on site reliability engineering responsibilities to join our team. As a key member of our team, you will be responsible for ensuring the reliability, scalability, and performance of our infrastructure and applications, with a specific focus on the architecture design and implementation.ResponsibilitiesDesign, build, and maintain the architecture of our cloud-based infrastructure to ensure high availability, scalability, and security for our medical device applications, including but not limited to Ignition, PostgreSQL, HiveMQ, Qlik, Confluent Kafka, and Tanzu.Collaborate with cross-functional teams to develop and implement best practices for container orchestration and management, with a specific focus on Kubernetes.Develop and maintain CI/CD pipelines to automate the deployment and testing of applications and infrastructure changes, utilizing tools such as Tanzu, Confluent Kafka, and others as needed.Manage and maintain the repository of infrastructure as code, ensuring proper version control and documentation, with a specific focus on the specified applications.Monitor and analyze system performance, identifying and resolving potential issues to ensure optimal reliability and performance for the specified applications.Lead efforts to implement disaster recovery and business continuity plans for critical systems and applications, including those utilizing Ignition, PostgreSQL, HiveMQ, Qlik, Confluent Kafka, and Tanzu.Strong Linux Experience: Proficient in administering Linux systems (e.g., Ubuntu, CentOS, RHEL, Debian) in production environments.Strong knowledge of Linux internals including system calls, process management, networking, and filesystems.Experience with system monitoring and performance tuning on Linux servers.DevOps: Implements GitOps workflows for Kubernetes using declarative infrastructure in Git.Manages manifests, Helm charts, or Kustomize in version control.Automates reconciliation between Git and clusters for consistent deployments.Monitors and troubleshoots GitOps deployment issues, enforcing drift detection with Git-centric tools. QualificationsBachelor’s degree in computer science, Engineering, or a related field.Proven experience in designing and implementing architecture for cloud-based infrastructure, preferably in the medical device or healthcare industry, with expertise in the specified applications.Strong expertise in Kubernetes and other container orchestration technologies, with experience in managing the specified applications.Experience with infrastructure as code tools such as Terraform, Ansible, or CloudFormation, with a focus on the specified applications.Proficiency in developing and maintaining CI/CD pipelines using tools such as Tanzu, Confluent Kafka, and others as needed.Solid understanding of networking, security, and monitoring concepts in a cloud environment, with a focus on the specified applications.Experience working in Global / Multisite deployments of new architecture, change control for new requests as well as support in cases of issues. Required SkillsDrive for ResultsInterpersonal RelationshipsAdaptability Preferred SkillsPrevious Medical Devices or Pharma experienceCertified AWS Solution ArchitectCertified Kubernetes (CKS or CKA)ITILv4Equal Opportunity StatementWe are committed to diversity and inclusivity in our hiring practices.
Job Title
IT Site Reliability Engineer / Architect