Data Engineer
On-site
i2b Technologies
Startup
Product & Service
B2B
₹ 10-25 Lacs PA
Pre-seed
Information technology
Hyderabad, Telangana, India
Post Status: Active
Permanent
186 applications
Experience: 4-7 Years
Skills
Snowflake
Apache Kafka
Spark (framework)
AWS
Databricks
PostgreSQL
Data Engineering
Python
Apache Airflow
Data Analytics
Posted 10 days ago

About the job

Data Engineer — Supply Chain Platform


You will be part of the core engineering team building a next-gen Intelligence & Optimization platform. This role involves architecting scalable data pipelines, designing database schemas and models, enabling analytics, and ensuring data availability and reliability across our ecosystem. You will work closely with the product and backend teams to deliver high-impact data solutions.

Key Responsibilities

Data Architecture & Modelling

  • Design and maintain scalable database schemas and data models optimized for high-volume supply chain data.

  • Evaluate and recommend database technologies (SQL, NoSQL, distributed storage) based on workload patterns.

  • Drive ingestion, transformation, and storage strategy for real-time and batch data flows.

Data Engineering & Pipelines

  • Write complex SQL queries against relational databases (RDBMS).

  • Build and maintain ETL/ELT pipelines for structured and semi-structured data.

  • Develop automation scripts for data workflows, backups, and disaster recovery.

  • Work on large-scale distributed systems involving Spark and Kafka.

Analytics Enablement

  • Support reporting, dashboards, and insights.

  • Optimize queries for performance and accuracy across multi-source datasets.

Database Ownership (DBA responsibilities)

  • Ensure database availability and maintain replication, backup, and disaster recovery (DR) setups.

  • Handle performance tuning, index and partition management, and schema evolution.

Tech Stack

  • Databases / Data Warehouse / Data Lake:
    PostgreSQL, MySQL, Snowflake, Amazon Redshift, Cassandra, and other NoSQL stores.

  • Big Data / Streaming:
    Apache Spark, Kafka.

  • Scripting / Automation:
    Airflow, Python, shell scripts, Databricks.

  • Cloud:
    AWS (experience with S3, Athena, Glue, etc. is a plus).

What You’ll Work On

  • Build real-time data streams from various supply chain systems.

  • Enable predictive analytics by ensuring clean and reliable data pipelines.