Dataware Housing
· Define a data warehouse
· Identify data warehouse use cases
. List the benefits of a data warehouse
What is a data warehouse?
A data warehouse is a system that aggregates data from one or more sources into a single consistent data store to support data analytics
Data warehouse analytics
Data warehouse systems support:
- Data Mining
- Artificial Intelligence
- Machine learning
- Front-end reporting
- OLAP Online analytical processing
Where are data warehouses hosted?
Traditional data warehouses hosted on-premises within enterprise datacenters initially hosted on mainframes, and then on Unix, Windows and Linux systems
Where are data warehouses hosted?
2000’s
Growth of larger data volumes, emergence of data ware housing appliances (DWAs), consisting of specialized hardware with pre-integrated software
2010-present
Adoption of Cloud Data Warehouses, providing eliminating hardware purchases, as a scalable service pay-as-you-go service
Who uses data warehouses?
Practically every industry
- e-Commerce
- Transportation
- Medical
- Government
- Banking and fin-tech
- Social media
Retail and e-commerce
. Analyze and report on sales performance
· Create machine learning assisted shopping recommendations
Transportation
· Optimizes routes, travel times, equipment needs & staffing requirements
Healthcare
· Apply AI to patient data to assist with diagnosis and treatment
Banking and fin-tech
· Evaluate risks, detect fraud and cross-sell services
Social media
. Measure customer sentiment and project product sales
Governments
· Analyze and evaluate citizen-focused programs and assist with policy change decisions
Benefits of a data warehouse
Centralizes data from disparate sources
· Creates a single source of truth
· Leverages all the data while enhancing speed to access
· Facilitates smarter decisions using BI
What are the benefits of a data warehouse?
Data warehouses enable organizations to centralize data from disparate data sources, such as transactional systems, operational databases, and flat files.
Data integration, removing bad data, eliminating duplicates, and standardizing data create
A single source of the truth that results in better data quality for analysis.
A single source of truth empowers users to leverage all the company's data and access
that data more efficiently.
In addition, separating database operations from data analytics generally improves data
access performance, leading to faster business insights.
Large-scale BI functions such as data mining, artificial intelligence, and machine
learning tools facilitate smarter decisions by data professionals and business leaders.
These capabilities build on each other to give organizations the means and opportunity
to realize competitive advantages and gains.
OLTP (Online Transaction Processing) and OLAP (Online Analytical Processing) are two different database systems used for different purposes in business operations.
How OLTP & OLAP Work Together
- OLTP Systems (Transactional) Collect Data First
· A customer purchases a product online (OLTP records the transaction).
· A bank processes a withdrawal and updates account balance in real-time.
- OLAP Systems (Analytical) Use This Data for Insights
· The e-commerce business analyzes customer purchase patterns to predict demand.
· A bank runs fraud detection models based on past transactions.
|
Feature |
OLTP (Transactional
System) |
OLAP (Analytical System) |
|
Purpose |
Handles real-time
transactions |
Analyzes and processes
large datasets |
|
Data Type |
Current, real-time data |
Historical, aggregated
data |
|
Operations |
Read, write, update,
delete (CRUD) |
Complex queries,
reporting, data mining |
|
Speed |
Fast, optimized for
frequent transactions |
Slower, optimized for
deep analysis |
|
Users |
Customers, cashiers,
employees |
Business analysts,
executives, data scientists |
|
Example |
ATM withdrawals,
e-commerce orders, retail sales |
Sales trend analysis,
market research, financial forecasting |
|
Database Type |
Relational databases
(e.g., MySQL, PostgreSQL) |
Data warehouses (e.g.,
Amazon Redshift, Google BigQuery) |


Comments
Post a Comment