Data Management Foundations Study Guide C175
With Full Elaborations
Why databases? - ·Helps organizations use data / information as a resource to make
decisions
File based structure (Prior to BDMS) - o Attempt to computerize old manual filing system
o Was easy to retrieve small amounts of information from
o As data grew, it became difficult to locate and retrieve information
o Retrieval speed depended on file structure and how data was organized within the
structure
File types that utilized file based structure - Flat files, heap files, Index files, hashed files
Flat files - no internal hierarchy and data is unstructured
Heap files - unsorted set of records uniquely identified by record ID
-allows heap files to be inserted or deleted using ID
index files - · File that stores a list of Look Up Field values from data file that includes the
location (address) in the data file of the corresponding record
· Smaller than the entire record, usually fits in main memory for quick look up
· Lists keywords and phrases that are in a particular file
· Becomes a pointer to the location in the file where the information can be found when
stored with keywords and phrases
hashed files - · Uses hash functions to decide where records should be placed on a disk
· Allows for faster data look up without the use of an index file
data - raw facts not processed to reveal meaning. Requires transformation to better
understand.
information - o Produced by Processing Raw Data to reveal its meaning
o Key to good decision making and business survival is accurate, relevant, and timely
information
knowledge - o Information and facts about a specific subject
m m
, o Implies familiarity awareness and understanding of Information
o Key Characteristic is that New Knowledge can be derived from Old Knowledge
data management - o A discipline that focuses on the proper generation, storage, and
retrieval of data
role of the DBMS - o An intermediary between user and database
o Receives all application requests and translates them into complex operations
advantages of DBMS - -Improves data sharing
-improves data security
-better data integraiton
-minimize data inconsistency
improvs Data access
-improves decision making
-increases End-User Productivity
Data quality - A comprehensive approach to ensuring the accuracy, validity, and timelines
of data.
data inconsistency - · exists when different versions of the same data appear in different
places
operation database - o Supports and reflects critical day-to-day operations that must be
recorded accurately and immediately
· Transactions (Sales, Payments or Purchases)
o Also known as Online Transaction Processing Database (OLTP), Transactional Database o
Production Database
analytical database - o Stores historical data and business metrics for tactical or strategic
decision making
End users can perform advanced analysis of data using sophisticated tools
main components: data warehouse, online analytical processing (OLAP)
data warehouse - · Specialized data that stores data in format optimized for decision
support
· Contains historical data obtained from operational databases as well as data from other
external sources
m m
With Full Elaborations
Why databases? - ·Helps organizations use data / information as a resource to make
decisions
File based structure (Prior to BDMS) - o Attempt to computerize old manual filing system
o Was easy to retrieve small amounts of information from
o As data grew, it became difficult to locate and retrieve information
o Retrieval speed depended on file structure and how data was organized within the
structure
File types that utilized file based structure - Flat files, heap files, Index files, hashed files
Flat files - no internal hierarchy and data is unstructured
Heap files - unsorted set of records uniquely identified by record ID
-allows heap files to be inserted or deleted using ID
index files - · File that stores a list of Look Up Field values from data file that includes the
location (address) in the data file of the corresponding record
· Smaller than the entire record, usually fits in main memory for quick look up
· Lists keywords and phrases that are in a particular file
· Becomes a pointer to the location in the file where the information can be found when
stored with keywords and phrases
hashed files - · Uses hash functions to decide where records should be placed on a disk
· Allows for faster data look up without the use of an index file
data - raw facts not processed to reveal meaning. Requires transformation to better
understand.
information - o Produced by Processing Raw Data to reveal its meaning
o Key to good decision making and business survival is accurate, relevant, and timely
information
knowledge - o Information and facts about a specific subject
m m
, o Implies familiarity awareness and understanding of Information
o Key Characteristic is that New Knowledge can be derived from Old Knowledge
data management - o A discipline that focuses on the proper generation, storage, and
retrieval of data
role of the DBMS - o An intermediary between user and database
o Receives all application requests and translates them into complex operations
advantages of DBMS - -Improves data sharing
-improves data security
-better data integraiton
-minimize data inconsistency
improvs Data access
-improves decision making
-increases End-User Productivity
Data quality - A comprehensive approach to ensuring the accuracy, validity, and timelines
of data.
data inconsistency - · exists when different versions of the same data appear in different
places
operation database - o Supports and reflects critical day-to-day operations that must be
recorded accurately and immediately
· Transactions (Sales, Payments or Purchases)
o Also known as Online Transaction Processing Database (OLTP), Transactional Database o
Production Database
analytical database - o Stores historical data and business metrics for tactical or strategic
decision making
End users can perform advanced analysis of data using sophisticated tools
main components: data warehouse, online analytical processing (OLAP)
data warehouse - · Specialized data that stores data in format optimized for decision
support
· Contains historical data obtained from operational databases as well as data from other
external sources
m m