0% found this document useful (0 votes)
615 views11 pages

Module 3 - Data and Database Management

The document discusses data, databases, and database management systems. It defines data as individual items stored in a database and a database as a collection of related data that can be accessed and organized. The document outlines different types of databases including centralized, distributed, relational, and NoSQL databases. It provides examples and properties of each type.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
0% found this document useful (0 votes)
615 views11 pages

Module 3 - Data and Database Management

The document discusses data, databases, and database management systems. It defines data as individual items stored in a database and a database as a collection of related data that can be accessed and organized. The document outlines different types of databases including centralized, distributed, relational, and NoSQL databases. It provides examples and properties of each type.
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
Download as pdf or txt
Download as pdf or txt
You are on page 1/ 11

MODULE 3 – DATA AND DATABASE MANAGEMENT

Good business decisions rely on access to accurate data. To facilitate this,


organizations need a system for recording, updating, and tracking data on a regular
basis. Once collected, data can be retrieved and presented in formats useable to
managers and employees of the organization through computerized databases.
Various database elements can be linked through database management systems. In
this module, we shall discuss the basic concepts on data, database and database
management system. We will also talk about the functions and components of a
database management system

After completing this course, you should be able to:


1. Discuss the basic concepts on data, database, and database management
system; and
2. Describe the different types of database
3. Explain the functions and components of a database and a database
management system
4. Utilize a database management system

WHAT IS DATA?
 For every database system, data served as the heart of each system
 refers to all the single items that are stored in a database, either individually or
as a set

WHAT IS A DATABASE?
 It is defined as any collection of electronic records that can be processed to
produce useful information. This record can be accessed, modified, managed,
controlled and organized to perform various data-processing operations.
 A collection of related data stored in a manner that enables information to be
retrieved as needed

Module 3- Data and Database Management Page 1 of 11


TYPES OF DATABASES
There are various types of databases used for storing different varieties of data:
a. Centralized database.
o The information (data) is stored at a centralized location and the users
from different locations can access this data. This type of database
contains application procedures that help the users to access the data
even from a remote location.
Advantages of Centralized Database
 It has decreased the risk of data management, i.e., manipulation
of data will not affect the core data.
 Data consistency is maintained as it manages data in a central
repository.
 It provides better data quality, which enables organizations to
establish data standards.
 It is less costly because fewer vendors are required to handle the
data sets.
Disadvantages of Centralized Database
 The size of the centralized database is large, which increases the
response time for fetching the data.
 It is not easy to update such an extensive database system.
 If any server failure occurs, entire data will be lost, which could
be a huge loss.
b. Distributed database
o Unlike a centralized database system, in distributed systems, data is
distributed among different database systems of an organization. These
database systems are connected via communication links. Such links
help the end-users to access the data easily.
o Examples of the distributed database are Apache Cassandra, HBase,
Ignite, etc.
o There are two kinds of distributed database.

Module 3- Data and Database Management Page 2 of 11


 Homogenous DDB- databases which have same underlying
hardware and run over same operating systems and application
procedures
 Heterogeneous DDB - the operating systems, underlying
hardware as well as application procedures can be different at
various sites of a DDB
Advantages of Distributed Database
 Modular development is possible in a distributed database, i.e.,
the system can be expanded by including new computers and
connecting them to the distributed system.
 One server failure will not affect the entire data set.

c. Relational Database
o This database is based on the relational data model, which stores data
in the form of rows (tuple) and columns (attributes), and together forms
a table (relation).
o A relational database uses SQL for storing, manipulating, as well as
maintaining the data. E.F. Codd invented the database in 1970. Each
table in the database carries a key that makes the data unique from
others.
o Examples: MySQL, Microsoft SQL Server, Oracle, etc

Properties of Relational Database


There are following four commonly known properties of a
relational model known as ACID properties, where:
Atomicity: This ensures the data operation will complete either
with success or with failure. It follows the 'all or nothing' strategy.
For example, a transaction will either be committed or will abort.
Consistency: If we perform any operation over the data, its
value before and after the operation should be preserved. For
example, the account balance before and after the transaction should
be correct, i.e., it should remain conserved.

Module 3- Data and Database Management Page 3 of 11


Isolation: There can be concurrent users for accessing data
at the same time from the database. Thus, isolation between the
data should remain isolated. For example, when multiple transactions
occur at the same time, one transaction effects should not be visible
to the other transactions in the database.
Durability: It ensures that once it completes the operation
and commits the data, data changes should remain permanent.
d. NoSQL Database
o Non-SQL/Not Only SQL is a type of database that is used for storing a
wide range of data sets. It is not a relational database as it stores data
not only in tabular form but in several different ways. It came into
existence when the demand for building modern applications increased.
o NoSQL presented a wide variety of database technologies in response to
the demands that can store structured, semi-structured, unstructured
and polymorphic data.
o The concept of NoSQL databases became popular with Internet giants
like Google, Facebook, Amazon, etc. who deal with huge volumes of
data. The system response time becomes slow when you use RDBMS for
massive volumes of data.

Four Types of NoSQL databases


 Key-value storage: It is the simplest type of database
storage where it stores every single item as a key (or
attribute name) holding its value, together. Redis,
Dynamo, Riak are some examples of key-value store
DataBases. They are all based on Amazon's Dynamo
paper.
o Applications: Useful for storage of Comments and
Session information. Pinterest uses Redis to store

lists of users, followers, unfollowers, boards .

 Document-oriented Database: A type of database


used to store data as JSON-like document. It helps

Module 3- Data and Database Management Page 4 of 11


developers in storing data by using the same document-
model format as used in the application code. It does not
support relations or joins. Amazon SimpleDB, CouchDB,
MongoDB, Riak, Lotus Notes, MongoDB, are popular
Document originated DBMS systems.
o Applications: SEGA uses MongoDB for handling 11

million in-game accounts built on MongoDB.

 Graph Databases: It is used for storing vast amounts of


data in a graph-like structure. Nodes and relationships are
the essential constituents of graph databases. A node
represents an entity. A relationship represents how two
nodes are associated. Most commonly, social networking
websites use the graph database. Neo4J, Infinite Graph,
OrientDB, FlockDB are some popular graph-based
databases.

 Wide-column stores: It is similar to the data


represented in relational databases. Here, data is stored in
large columns together, instead of storing in rows. HBase,
Cassandra, HBase, Hypertable are examples of column
based database.
o Applications: Spotify uses Cassandra to store user
profile attributes and metadata.

Advantages of NoSQL Database


 It enables good productivity in the application development as
it is not required to store data in a structured format.
 It is a better option for managing and handling large data sets.
 It provides high scalability.
 Users can quickly access data from the database through key-
value.

Module 3- Data and Database Management Page 5 of 11


e. Cloud Database
o A type of database where data is stored in a virtual environment and
executes over the cloud computing platform. It provides users with
various cloud computing services (SaaS, PaaS, IaaS, etc.) for accessing
the database.
o There are numerous cloud platforms, but the best options are:
1. Amazon Web Services(AWS)
2. Microsoft Azure
3. Kamatera
4. PhonixNAP
5. ScienceSoft
6. Google Cloud SQL

f. Object-oriented Databases
o The type of database that uses the object-based data model approach
for storing data in the database system. The data is represented and
stored as objects which are similar to the objects used in the object-
oriented programming language

g. Hierarchical Databases
o It is the type of database that stores data in the form of parent-children
relationship nodes. Here, it organizes data in a tree-like structure.
o Data get stored in the form of records that are connected via links. Each
child record in the tree will contain only one parent. On the other hand,
each parent record can have multiple child records.

h. Network Databases
o It is the database that typically follows the network data model. Here,
the representation of data is in the form of nodes connected via links
between them. Unlike the hierarchical database, it allows each record to
have multiple children and parent nodes to form a generalized graph
structure.

Module 3- Data and Database Management Page 6 of 11


i. Personal Database
o Collecting and storing data on the user's system defines a Personal
Database. This database is basically designed for a single user.
Advantages of Personal Database
 It is simple and easy to handle.
 It occupies less storage space as it is small in size.

j. Operational Database
o The type of database which creates and updates the database in real-
time. It is basically designed for executing and handling the daily data
operations in several businesses. For example, an organization uses
operational databases for managing per day transactions.

k. Enterprise Database:
o Large organizations or enterprises use this database for managing a
massive amount of data. It helps organizations to increase and improve
their efficiency. Such a database allows simultaneous access to users.
Advantages of Enterprise Database:
 Multi processes are supportable over the Enterprise database.
 It allows executing parallel queries on the system.

WHAT IS A DATABASE MANAGEMENT SYSTEM (DBMS)?


o Database management system (DBMS) is a software which is used to manage
the database. DBMS provides an interface to perform various operations like
database creation, storing data in it, updating data, creating a table in the
database and a lot more.
o It provides protection and security to the database. In the case of multiple
users, it also maintains data consistency.
o Some well-known DBMSs are Microsoft SQL Server, Microsoft Access, Oracle,
SAP, and others.

Module 3- Data and Database Management Page 7 of 11


FUNCTIONS OF A DATABASE MANAGEMENT SYSTEM

• The DBMS can provide a convenient and efficient interface for storing,
retrieving, and updating data, and for extracting useful information from the
database, and a clear and logical view of the process that manipulates the data.
• Data Independence: DBMS maintains the segregation between the program
and the data.
• Concurrency Control. This refers to a process to ensure that data is updated
correctly and appropriately when more than one person is accessing the data
• Recovery Services. This refers to mechanisms for recovery of data that is
apparently lost due to system crashes
• Utility Services: An initialization and maintenance operations on a database.
• Manipulation of data. It tells us that we can CRUD (create, retrieve, update,
and delete) our data within the DBMS.
• Database definition. Metadata describes the contents of your database. This is
a data that describes another data in terms of its constraints, size, and the data
type of the tables or the databases that are contained within your DBMS.
• Data Processing. This allows the user to perform different types of operations
such as string manipulation, date operations and so on. You can also
manipulate the data based on an event or what they call ‘triggers’ which is
based on the event or the conditions of the event.
• Data sharing. This means that within a database, privileges are provided to
different types of users. Different types of users can access the same database
but the same users can have different privileges.

COMPONENTS OF DBMS
DBMS have several components, each performing very significant tasks in the
database management system environment. Below is a list of components within the
database and its environment.

a. Software
This is the set of programs used to control and manage the overall database.
This includes the DBMS software itself, the Operating System, the network

Module 3- Data and Database Management Page 8 of 11


software being used to share the data among users, and the application
programs used to access data in the DBMS.
b. Hardware
Consists of a set of physical electronic devices such as computers, I/O devices,
storage devices, etc., this provides the interface between computers and the
real world systems.
c. Data
DBMS exists to collect, store, process and access data, the most important
component. The database contains both the actual or operational data and the
metadata.
d. Procedures
These are the instructions and rules that assist on how to use the DBMS, and
in designing and running the database, using documented procedures, to guide
the users that operate and manage it.
e. Database Access Language………………………………………………………
This is used to access the data to and from the database, to enter new data,
update existing data, or retrieve required data from databases. The user writes
a set of appropriate commands in a database access language, submits these
to the DBMS, which then processes the data and generates and displays a set
of results into a user readable form.
f. Query Processor,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
This transforms the user queries into a series of low level instructions. This
reads the online user’s query and translates it into an efficient series of
operations in a form capable of being sent to the run time data manager for
execution.
g. Run Time Database Manager…………………………………………………..
Sometimes referred to as the database control system, this is the central
software component of the DBMS that interfaces with user-submitted
application programs and queries, and handles database access at run time. Its
function is to convert operations in user’s queries. It provides control to
maintain the consistency, integrity and security of the data.

Module 3- Data and Database Management Page 9 of 11


h. Data Manager………………………………………………………………………..
Also called the cache manger, this is responsible for handling of data in the
database, providing a recovery to the system that allows it to recover the data
after a failure.
i. Database Engine……………………………………………………………………
The core service for storing, processing, and securing data, this provides
controlled access and rapid transaction processing to address the requirements
of the most demanding data consuming applications. It is often used to create
relational databases for online transaction processing or online analytical
processing data.
j. Data Dictionary ……………………………………………………………………………………
This is a reserved space within a database used to store information about the
database itself. A data dictionary is a set of read-only table and views,
containing the different information about the data used in the enterprise to
ensure that database representation of the data follow one standard as defined
in the dictionary.
k. Report Writer………………………………………………………………………..
Also referred to as the report generator, it is a program that extracts
information from one or more files and presents the information in a specified
format. Most report writers allow the user to select records that meet certain
conditions and to display selected fields in rows and columns, or also format
the data into different charts.

ACTIVITY

1. Go to this link:
https://www.quackit.com/mongodb/tutorial/about_mongodb.cfm
which is about MongoDB Tutorial. Read and practice how to do the following
tasks:
a. How to Install MongoDB
b. Creating a Database
c. Creating a Collection

Module 3- Data and Database Management Page 10 of 11


d. Create a Document
e. Query a Collection
f. Projection Queries
g. Limit Query Results
h. Sort Query Results
i. Create a relationship
j. Update a Document
k. Delete a Document
l. Drop a Collection
m. Drop a Database

STUDY QUESTIONS

Directions. Answer the following questions.


1. Differentiate a database from a database management system.
2. What do you think are the advantages of having a database management
system in an organization?
3. Give some applications that utilizes the different types of databases.

REFERENCES

Guru99. (n.d.). NoSQL Tutorial: Learn NoSQL Features, Types, What is, Advantages .
Retrieved from Guru99: https://www.guru99.com/nosql-tutorial.html
Javapoint. (2020). Types of Database. Retrieved from Javapoint:
https://www.javatpoint.com/types-of-databases
Almodiel, M., & Garcia, P. G. (2018). Fundamentals of Business Analytics: A Business
Analytics Course. University of the Philippines Open University.

Module 3- Data and Database Management Page 11 of 11

You might also like