Industrial Big Data Platform
Built end-to-end data pipeline handling 30M+ daily data points for factory OEE/yield analysis.
Lead Data Engineer
Jan 2022 - Present
HadoopKafkaSparkSpring Cloud Alibaba
Overview
A comprehensive industrial IoT platform designed to ingest, process, and visualize real-time production data from over 500 factory sensors. The system provides factory managers with instant visibility into OEE (Overall Equipment Effectiveness) and predictive maintenance alerts.
Challenges
- Handling high-velocity data ingestion (30M+ points/day) without data loss.
- Unifying disparate data formats from legacy PLC systems.
- Reducing query latency for real-time dashboards.
Results
- Reduced data processing latency from 15 minutes to < 3 seconds.
- Identified bottlenecks increasing production line efficiency by 12%.
- Unified data governance across 3 factory sites.