Online Retail Operation

This project focused on creating a unified and reliable view of online retail operations. The objective was to design an automated data pipeline that integrates sales, inventory, and customer data into one trusted source, enabling real-time analytics and ensuring compliance with data protection standards.
Sales, customer, and inventory data were stored in separate spreadsheets, APIs, and legacy databases, resulting in inconsistency and poor integration.
Duplicate orders, missing identifiers, and inconsistent time formats made it difficult to produce accurate metrics and reconcile data across systems.
Customer records lacked proper anonymisation and retention policies, creating potential non-compliance with GDPR requirements.
Used Microsoft Fabric Dataflows to ingest and transform multi-source data. Standardised structures, fixed nulls, deduplicated records.
Built a Lakehouse schema with region-based access and hashed personal identifiers. Each transformation step logged for full auditability.
Created a central fact table linked to product, region, and customer dimensions. Developed 30+ reusable DAX measures.
Four-page Power BI dashboard with drill-through analytics for sales performance, profitability, cohort trends, and product segmentation.