Data Engineer — Supply Chain Platform
You will be part of the core engineering team building a next-gen Intelligence & Optimization platform. This role involves architecting scalable data pipelines, designing database schemas/models, enabling analytics, and ensuring data availability and reliability across our ecosystem. You will work closely with product, backend teams to deliver high-impact data solutions.
Data Architecture & Modelling
Design and maintain scalable database schemas and data models optimized for high-volume supply chain data.
Evaluate and recommend database technologies (SQL, NoSQL, distributed storage) based on workload patterns.
Drive ingestion, transformation, and storage strategy for real-time and batch data flows.
Data Engineering & Pipelines
Experience in performing complex queries in SQL ( RDBMS )
Build and maintain ETL/ELT pipelines for structured and semi-structured data.
Develop automation scripts for data workflows, backups, and disaster recovery.
Work on large-scale distributed systems involving Spark and Kafka.
Analytics Enablement
Experience in reporting, dashboards, and insights.
Optimize queries for performance and accuracy across multi-source datasets.
Database Ownership (DBA responsibilities)
Ensure availability, replication, backup, and DR setups.
Performance tuning, index & partition management, schema evolution.
Databases / Data Warehouse / Data Lake:
Postgres, MySQL, Snowflake, AWS Redshift, Cassandra and other NoSQL stores.
Big Data / Streaming:
Apache Spark, Kafka.
Scripting / Automation:
Airflow, Python, Shell Scripts, DataBricks
Cloud:
AWS (S3, Athena, Glue, etc. is a plus).
Build real-time data streams from various supply chain systems.
Enable predictive analytics by ensuring clean and reliable data pipelines.