Lead Data EngineerLocation & Work Model Location: Remote Work Model: Full-time Timings :8AM IST to 5 PM ISTJob Summary We are seeking a highly experiencedSenior Cloud Data Engineerwith 10+ years of hands-on experience in designing, building, and managing large-scale data platforms and pipelines in cloud environments. The ideal candidate will have strong expertise inAWS, Snowflake, PySpark, and ETL frameworks , along with proven leadership experience in delivering scalable, secure, and high-performance data solutions. This role requires close collaboration with business stakeholders, cross-functional teams, and mentoring junior engineers while driving data-driven decision-making.Key Responsibilities Design, develop, and maintainscalable cloud-based data pipelinesusing AWS services such asS3, EMR, Glue, Lambda, Kinesis , and Snowflake. Lead end-to-enddata ingestion, transformation, and orchestrationworkflows usingPySpark, Apache Airflow, Kafka, and ETL tools . Architect and optimizedata warehousing solutionson Snowflake, Hive, Redshift, and HDFS for large-volume and high-performance analytics. Manage and executeon-prem to cloud data migrations , ensuring minimal downtime, cost optimization, and improved scalability. Implement and enforcedata governance, security, audit logging, and compliancestandards across data platforms. Develop and optimizePySpark jobsfor data ingestion into Snowflake, Hive, and HBase tables. Monitor and tune system performance, including query optimization, error handling, and recovery mechanisms. Collaborate with BI and analytics teams to support reporting solutions usingPower BI and Tableau . Lead and mentor a team of data engineers, conducting code reviews and promoting best practices. Work in anAgile/Scrum environment , participating in sprint planning, POCs, and continuous improvement initiatives. Coordinate with cross-functional teams and stakeholders to deliver business-aligned data solutions.Required Skills & Experience 10–12+ yearsof experience in Data Engineering with strong exposure to cloud-based data platforms. Strong hands-on experience withAWS (EMR, Glue, Lambda, S3, Kinesis, ECS) . Expertise inSnowflake (development and administration) . Advanced proficiency inPython and PySpark , including data structures and distributed processing. Solid experience withETL toolssuch as Informatica PowerCenter, Informatica BDM/BDE, Alteryx, and DBT. Strong knowledge ofBig Data technologies : Hadoop, Hive, HDFS, HBase, Spark. Experience withreal-time data processingusing Kafka, ActiveMQ, and Spark Streaming. Proficiency inSQL and databases : Oracle, Hive, Snowflake, Redshift, Netezza, Sybase. Hands-on experience withjob scheduling tools : Airflow, Control-M, Autosys, Tidal, Cron. Experience inperformance optimization, data validation, audit logging, and error handling . Exposure toBanking, Investment, or Financial Services domainsis a strong plus.Preferred Qualifications AWS Certified Solutions Architect – Associate AWS Certified Developer – Associate Experience working withBI toolssuch as Power BI and Tableau Exposure toDatabricksand modern analytics platforms Strong stakeholder management and leadership skillsIdeal Candidate Profile The ideal candidate is ahands-on technical leaderwho can balance architecture, development, and team leadership. You should be comfortable working in fast-paced environments, handling complex data challenges, and delivering reliable, business-ready data solutions at scale.
Job Title
Lead Data Engineer