Dimensional Modeling in Data Warehousing
Dr. Mohsin Dar
Assistant Professor
Cloud & Software Operations Cluster | SOCS
University of Petroleum and Energy Studies (UPES)
Database Systems - MTech First Semester
A star schema is a database organizational structure where a central fact table is surrounded by denormalized dimension tables, resembling a star shape.
A snowflake schema is an extension of the star schema where dimension tables are normalized into multiple related tables, creating a snowflake-like pattern.
| Aspect | Star Schema | Snowflake Schema |
|---|---|---|
| Query Performance | ✓ Faster (fewer joins) | ✗ Slower (more joins) |
| Storage Space | ✗ More space (redundancy) | ✓ Less space (normalized) |
| Query Complexity | ✓ Simple queries | ✗ Complex queries |
| Data Integrity | ✗ Lower (redundancy) | ✓ Higher (normalized) |
| Maintenance | ✗ More updates needed | ✓ Easier maintenance |
| ETL Complexity | ✓ Simpler ETL process | ✗ Complex ETL process |
A retail company wants to analyze sales data across products, customers, time, and stores.
Same retail company with normalized dimension tables to reduce redundancy.
Both star and snowflake schemas have their place in data warehousing. The choice depends on your specific requirements for query performance, storage efficiency, and data complexity. Understanding these schemas is crucial for designing effective data warehouses that meet your organization's analytical needs.