-
enables
companies like R.R. Donnelley to eliminate outdated, incomplete or incorrectly
formatted data.
- Master data management
- (MDM)
-
•Collection of related files containing
records on people, places, or things.
•Prior to digital databases, business
used file cabinets with paper files.
Database
-
•Generalized category representing
person, place, thing on which we store and maintain information
•E.g., SUPPLIER, PART
Entity
-
•Specific characteristics of each
entity:
•SUPPLIER name, address
•PART description, unit price, supplier
Attributes
-
•Organize data into two-dimensional
tables (relations) with columns and rows.
Relational database
-
(columns) store data representing an attribute
Fields
-
uniquely identifies each record
Key field
-
•One field in each table
•Cannot be duplicated
•Provides unique identifier for all
information in any row
Primary key
-
•Used to clarify table relationships in a relational
database
- •Entity-relationship
- diagram
-
•Process of streamlining complex groups of data to:
•Minimize redundant data elements.
•Minimize awkward many-to-many relationships.
•Increase stability and flexibility.
•Normalization
-
•Used by relational databases to ensure
that relationships between coupled tables remain consistent.
•E.g., when one table has a foreign key that points to another
table, you may not add a record to the table with foreign key unless there is a
corresponding record in the linked table.
- •Referential integrity
- rules
-
•Specify structure of content of
database.
Data definition capabilities
-
•Automated or manual file storing
definitions of data elements and their characteristics.
Data dictionary
-
•Structured query language (SQL)
•Microsoft Access query-building tools
- •Data manipulation
- language
-
•Database that stores current and
historical data that may be of interest to decision makers
•Consolidates and standardizes data from
many systems, operational and transactional databases
•Data can be accessed but not altered
Data warehouse
-
•Subset of data warehouses that is
highly focused and isolated for a specific population of users
Data mart
-
tools
for consolidating, analyzing, and providing access to large amounts of data to
improve decision making
Business intelligence
-
•Supports multidimensional data
analysis, enabling users to view the same data in different ways using multiple
dimensions
Online Analytical Processing (OLAP)
-
•Finds hidden patterns and relationships in large databases
and infers rules from them to predict future behavior
Data Mining
-
•Uses data mining techniques, historical data, and
assumptions about future conditions to predict outcomes of events, such as the
probability a customer will respond to an offer or purchase a specific product
Predictive analysis
-
•Unstructured data (mostly text files) accounts for 80
percent of an organization’s useful information.
allows
businesses to extract key elements from, discover patterns in, and summarize
large unstructured data sets.
Text Mining
-
•Discovery and analysis of useful patterns and
information from the Web
•Content mining, structure mining, usage mining
•Web Mining
-
•States organization’s rules for
organizing, managing, storing, sharing information
•Information policy
-
•Responsible for specific policies and
procedures through which data can be managed as a resource
•Data administration
-
Database design and management group responsible for
defining and organizing the structure and content of the database, and
maintaining the database
-
major
obstacle to successful customer relationship management
Poor data quality
-
•caused by
•Redundant and
inconsistent data produced by multiple systems
•Data input errors
Data quality problems
-
structured
survey of the accuracy and completeness of data
Data quality audit
-
detects
and corrects incorrect, incomplete, improperly formatted, and redundant data
Data cleansing
|
|