Chapter Three: Data Base and Big Data
Chapter Three: Data Base and Big Data
Chapter Three: Data Base and Big Data
Relatively large
A modern database system is a complex software system whose task is to manage a large, complex
collection of data.
Databases touch all aspects of our lives
Types of Databases and Database Applications
Traditional applications:
Numeric and textual databases
More recent applications:
Multimedia databases
Geographic Information Systems (GIS)
Biological and genome databases
Data warehouses
Mobile databases
Real-time and active databases
Recent Developments (1)
Social Networks started capturing a lot of information about people
and about communications among people-posts, tweets, photos,
videos in systems such as:
- Facebook
- Twitter
- Linked-In
All of the above constitutes data
Search Engines, Google, Bing, Yahoo: collect their own repository of
web pages for searching purposes
What is “big data”?
"Big data are high-volume, high-velocity, and/or high-variety information
assets that require new forms of processing to enable enhanced decision
making, insight discovery and process optimization” (Gartner 2012)
Veracity: refers to the trustworthiness of the data
Value: will data lead to the discovery of a critical causal effect
Bottom line: Any data that exceeds our current capability of processing
can be regarded as “big”
Complicated (intelligent) analysis of data may make a small data “appear” to be
“big”
Impact of Databases and Database Technology
Businesses: Banking, Insurance, Retail, Transportation, Healthcare,
Manufacturing
Service industries: Financial, Real-estate, Legal, Electronic
Commerce, Small businesses
Education : Resources for content and Delivery
More recently: Social Networks, Environmental and Scientific
Applications, Medicine and Genetics
Personalized applications: based on smart mobile devices
A simplified architecture for a database system
What a DBMS Facilitates
Define a particular database in terms of its data types, structures, and constraints
Construct or load the initial database contents on a secondary storage medium
Manipulating the database:
Retrieval: Querying, generating reports
Modification: Insertions, deletions and updates to its content
Accessing the database through Web applications
Processing and sharing by a set of concurrent users and application programs – yet,
keeping all data valid and consistent
Other DBMS Functionalities
DBMS may additionally provide:
Protection or security measures to prevent unauthorized access
“Active” processing to take internal actions on data
Presentation and visualization of data
Maintenance of the database and associated programs over the lifetime of
the database application
Application Programs and DBMS
Applications interact with a database by generating
- Queries: that access different parts of data and formulate the result of a
request
- Transactions: that may read some data and “update” certain values or
generate new data and store that in the database
Example of a Database (with a Conceptual Data Model)
Data abstraction:
A data model is used to hide storage details and present the users with a
conceptual view of the database.
Programs refer to the data model constructs rather than data storage details
Support of multiple views of the data:
Each user may see a different view of the database, which describes only
the data of interest to that user.
Main Characteristics of the Database Approach (continued)