Skip to Main Content

Job Title


Lead Platform Engineer


Company : Shipsy


Location : Guwahati, Assam


Created : 2025-05-31


Job Type : Full Time


Job Description

We’re looking for a Senior Platform Engineer to help scale and harden our platform that powers mission-critical logistics solutions. You’ll be at the core of our infrastructure evolution—driving reliability, security, and developer experience across our environments.This is a high-impact, hands-on engineering role with ownership over key initiatives including infrastructure automation, observability, platform security, and multi-tenant compute setup.Key Responsibilities• Infrastructure as Code: Manage and evolve our AWS-based infrastructure (ECS, RDS, Redshift, S3/MinIO, Elasticsearch) using Terraform.• Platform Reliability: Improve and maintain a reliable, secure platform running hundreds of services. You’ll own key parts of our ECS orchestration, network policies, and IAM boundaries.• Observability: Drive observability standards across services using New Relic, consolidated test reports, and enhanced logs/traces.• CI/CD & Developer Experience: Optimize pipelines, introduce best practices for retries, parallelism, and test reliability. Own platform tooling that helps developers ship confidently.• Multi-Tenancy & Sandboxes: Lead the design and implementation of dedicated sandbox environments to support isolated tenant workloads and testing.• Security & Access Controls: Reinforce least-privilege principles (e.g., no-root access), manage privilege escalation workflows via AWS Connect, and monitor compliance.• Collaboration: Work closely with Backend, Security, and QA teams to ensure the platform supports their goals with speed and safety.• Mentorship: Guide mid-level engineers through platform patterns, tooling, and infra reliability techniques.What You Bring• 5+ years of experience in infrastructure/platform/backend engineering.• Deep understanding of AWS services: ECS (Fargate), RDS (PostgreSQL), S3, IAM, CloudWatch.• Experience with Terraform or similar IaC tooling.• Strong experience with observability: logs, metrics, traces, alerting (e.g., New Relic, ELK).• Solid experience with Docker and container orchestration principles.• Proficiency in scripting languages like Python, Bash, or Go.• Comfort working in secure, production-grade environments with compliance boundaries (e.g., no-root, restricted shells).• Experience owning uptime, performance, and deployment reliability across services.Nice to Have• Experience with MinIO, Redshift tuning, and Elasticsearch scaling.• Knowledge of CI/CD systems like GitHub Actions or Jenkins.• Exposure to secure transport systems or logistics-heavy platforms.• Familiarity with flaky test analysis, sandbox isolation strategies, and test reliability tooling.What Success Looks Like• Platform SLAs improve across uptime, latency, and incident resolution.• Infrastructure is fully codified, version-controlled, and reproducible.• Teams experience faster feedback cycles and more predictable deploys.• Sandbox environments are scalable, secure, and isolated by tenant/workload.• Observability and debug tooling are accessible and insightful across the stack.