a pool of data produced to support decision making - a repository of current and historical data
data warehouse
a physical repository where relational data are specially organized to provide enterprise-wide, cleansed data in a standardized format
data warehouse
a collections of integrated, subject-oriented databases designed to support DSS functions, where each unit of data is non-volatile (doesn't change) and relevant to some moment in time
data mart
a smaller subset of a data warehouse that is focused on a particular subject or department typically consisting of a single subject area
dependant data mart
a subset that is created directly from a data warehouse - ensures that the end user is viewing the same version of the data that is accessed by all other data warehouse users
independent data mart
a small data warehouse designed for a strategic business unit or a department - an alternative to high cost data warehouses
data marts
operational data stores
enterprise data warehouses (EDW)
three main types of data warehouses
operational data stores - don't do analytics in an operational database
provides a fairly recent form of customer information - an interim staging area for a data warehouse contaning only very recent information
oper marts
a data mart created when operational data needs to be analyzed multidimentionally
enterprise data warehouses (EDW)
a large-scale data warehouse that is used across the enterprise for decision support - to provide data for many types of DSS
metadata
describe the structure of and some meaning of the data
syntactic - describe the syntax of data
structural - describe the structure of data
semantic - describe the meaning of data
three types of metadata
data access
data federation - integration
change capture
three major processes in data integration
data integration
comprises three major processes: data access, data federation, & change capture
enterprice application integration (EAI) - (mostly "the cloud" today)
a technology that provides a vehicle for pushing data from source systems into a data warehouse (shares applications - not data as its focus)
enterprise information integration - (also used in streaming and real-time data analysis)
an evolving tool space that promises real-time data integration from a variety of sources
extraction, transformation, and load (ETL)
process that consumes 70% of time in a data-centric process and consists of reading data from one or more database, converting the extracted data and putting it into the data warehouse
alert systems (as opposed to periodic reporting systems)
systems that monitor the data flowing into the warehouse and inform all key people who have a need to know as soon as a critical event occurs
hard benefits
benefits to an organization that can be expressed in monetary terms
traditional warehouse
type of data warehouse used for strategic decisions only and uses highly restrictive reporting to confirm or check existing processes & patterns
active data warehouse
type of warehouse used for both strategic & tactical decisions - allowing a high number of users - and allows for flexible ac hoc reporting