04SQL and Advanced SQL
04SQL and Advanced SQL
2. Advanced SQL
1. User Defined Data Types 2. Sequences 3. Advanced Functions
SQL
1. SQL Getting Started 2. Database Objects 3. Data Types 4. Data Definition Language, Constraints & Rules 5. Data Manipulation Language 6. Indexes 7. Views 8. Identity Columns
What is SQL ?
Standard language of relational database access is SQL (Structured Query Language). Designed for accessing tabular data. Three major categories..
DDL (Data Definition Language) - Used to create, modify, or drop database objects DML (Data Manipulation Language) - Used to select, insert, update, or delete database data (records) DCL (Data Control Language) - Used to provide data object access control
Data Types
Each column in DB2 table must be associated with a data type. The data type indicates kind of data that is valid for column. There are two major categories of data types in DB2...
Built-in data types User-defined data types
SQL
1. SQL Getting Started 2. Database Objects 3. Data Types 4. Data Definition Language, Constraints & Rules 5. Data Manipulation Language 6. Indexes 7. Views 8. Identity Columns
Tables
Is an unordered set of rows. Rows consist of columns. Each column is based on a data type. There are three types of tables:
Permanent (base) tables Temporary (declared) tables Temporary (derived) tables
Schema
Schemas are database objects used in DB2 to logically group other database objects. Most database objects are named using a two-part naming convention (SCHEMA_NAME.OBJECT_NAME ). When an object is created without specifying a schema, object will be associated with an implicit schema using the authorization ID. When an object is referenced in an SQL statement, it is also implicitly qualified with authorization ID of issuer (dynamic SQL) if no schema name is specified in SQL statement.
Schema
The CURRENT SCHEMA special register contains default qualifier to be used for unqualified objects referenced for dynamic SQL statements issued from within a specific DB2 connection. Value can be modified by user with SET CURRENT SCHEMA statement. Can use the QUALIFIER option of BIND command to define default qualifier at bind time. NOTE: for temporary tables 'session' qualifier is used
10
Tablespaces
Logical layers between database and tables stored in that database. Table spaces are created within a database, and tables are created within table spaces. DB2 supports two kinds of table spaces:
System Managed Space (SMS) Database Managed Space (DMS)
11
Views
Virtual tables derived from one or more tables or views Can be used interchangeably with tables when retrieving data. When changes are made to data through a view, the data is changed in underlying table itself. Views do not contain real data. Created to limit access to sensitive data while allowing more general access to other data. Can be deletable, updatable, insertable, and read-only.
12
Indexes
Physical objects that are associated with individual tables. Can not be defined on a view. Can define multiple indexes for a single table. Used for two primary reasons:
Ensure uniqueness of data values Improve SQL query performance
13
Indexes
Can be created on computed columns ...
So that optimizer can save computation time by using index instead of doing calculations.
Indexes are maintained automatically by DB2 as data is inserted, updated, and deleted.
14
Indexes
Can be defined ... In ascending or descending order,
Is unique or nonunique, and On a single column or multiple columns, To support both forward and reverse scans.
15
Buffer Pools
Are database objects used to cache data pages in memory. Once a data page is placed in a buffer pool, physical I/O access to disk can be avoided. Buffer pools can be assigned to cache only a particular table space, if required. Every database will have atleast one bufferpool
16
Transactions
Is a sequence of SQL statements that execute as a single operation. Term 'unit of work' is synonymous with the term 'transaction'. Starts implicitly with first executable SQL statement in a program. Ends when either an explicit or implicit COMMIT or ROLLBACK statement is encountered. SAVEPOINTs are like the bookmarks in a transaction, used to control the scope of ROLLBACK statement. SAVEPOINT does not end the current transaction.
17
SQL
1. SQL Getting Started 2. Database Objects 3. Data Types 4. Data Definition Language, Constraints & Rules 5. Data Manipulation Language 6. Indexes 7. Views 8. Identity Columns
18
Data Types
Categories of DB2 supplied datatypes:
Numeric String (Binary, Single Byte, Double Byte) Datetime
19
20
Uses least amount of storage in database for each value. Data value range for a SMALLINT is -32768 to 32767. Precision for a SMALLINT is 5 digits (to the left of the decimal). Two bytes of database storage are used for each SMALLINT column value.
21
Takes twice as much storage as a SMALLINT but has a greater range of possible values. Range value for INTEGER data type is -2,147,483,648 to 2,147,483,647 . Precision for INTEGER is 10 digits to the left of the decimal. Four bytes of database storage are used for each INTEGER column value.
22
Is available for supporting 64-bit integers. Range is -9,223,372,036,854,775,808 to +9,223,372,036,854,775,807 . Eight bytes of database storage are used for each BIGINT column value.
NOTE: As platforms include native support for 64 bit integers, processing of large numbers with BIGINT is more efficient than processing with DECIMAL and more precise than DOUBLE or REAL .
23
Is used for numbers with fractional and whole parts. DECIMAL data is stored in a packed format. Precision and scale must be provided
NOTE: The precision is the total number of digits (range from 1 to 31), and the scale is the Number of digits in the fractional part of the number.
24
Terms NUMERIC,NUM,DECIMAL, and DEC can all be used to declare a decimal/numeric column. If a decimal data type is to be used in a C program, host variable must be declared as a double.
NOTE: A DECIMAL number takes up p/2 + 1 bytes of storage, where p is the precision used. For example, DEC(8,2) would take up 5 bytes of storage (8/2 + 1), whereas DEC(7,2) would take up only 4 bytes (truncate the division of p/2).
25
27
28
Used to store character data with a varying length. Maximum length of a LONG VARCHAR column is 32,700.
Notes: LONG VARCHAR data types are similar to CLOB data types. The FOR BIT DATA clause can be used following character string column definition. During data exchange, data is treated and compared as binary (bit) data.
29
Are varying-length SBCS (single-byte character set) or MBCS (multibyte character set) character strings. Used to store greater than 32KB of text. Maximum size for each CLOB column is 2GB (gigabytes).
30
31
32
33
35
SQL
1. SQL Getting Started 2. Database Objects 3. Data Types 4. Data Definition Language, Constraints & Rules 5. Data Manipulation Language 6. Indexes 7. Views 8. Identity Columns
36
37
CREATE Statement
Table Index Schema View User-defined function User-defined data type Buffer pool Stored procedures Trigger Alias Method Transform Nickname Sequence Table space
38
DECLARE Statement
It is used to create temporary tables that are used only during a session. The only object that can be DECLARED is a table DECLARE GLOBAL TEMPORARY TABLE T1 LIKE TRANSACTIONS ON COMMIT PRESERVE ROWS NOT LOGGED IN SESSIONTEMP; Note:
'LIKE' clause defines a table with same column names, data types and nullability characteristics of each of the columns of specified table.
39
DROP Statement
To delete objects from database. Can drop any object created with CREATE <database object > and DECLARE <table> statements.
40
ALTER Statement
Table Table space Database partition Procedure Function Nickname Sequence Type View Method User mapping Buffer pool
41
Important...
Index can not be altered.
It has to be dropped and re-created.
Every time a DDL statement (except for DECLARE statement) is issued, catalog is updated.
Update includes a creation or modification timestamp and authorization ID of the user issuing the statement.
42
Constraints
Three types of constraints: Unique constraint Ensures unique values of a key in a table. Referential integrity Enforces referential constraints on insert, update, delete operations. Are imparted using insert rules, delete rules and update rules. Table check constraint Verifies that changed data does not violate conditions specified when a table was created or altered.
43
INSERT Rules
INSERT rule is implicit when a foreign key is specified. A row can be inserted at any time into a parent table without any action being taken in dependent table. A row cannot be inserted into dependent table unless there is a row in parent table with a parent key value equal to foreign key value of row being inserted, unless foreign key value is null. If an INSERT operation fails for one row during an attempt to insert more than one row, all rows inserted by the statement are removed from the database.
44
DELETE Rules
RESTRICT Prevents any row in parent table from being deleted if any dependent rows are found. NO ACTION Enforces the presence of a parent row for every child after all the referential constraints are applied. This is the default. The difference between NO ACTION and RESTRICT is based on when constraint is enforced.
45
DELETE Rules
CASCADE Implies that deleting a row in parent table automatically deletes any related rows in dependent table. SET NULL Ensures that deletion of a row in parent table sets values of foreign key in any dependent row to null (if nullable). Other parts of row are unchanged.
46
UPDATE Rules
RESTRICT Update for parent key will be rejected if a row in dependent table matches original values of key. NO ACTION Update operation for parent key will be rejected if any row in dependent table does not have a corresponding parent key when update statement is completed . This is the default.
47
Table-Check Constraint
Will enforce data integrity at table level. Once defined for a table, every UPDATE, INSERT statement will involve checking the constraint. If constraint is violated, row will not be inserted or updated. Can be defined at table creation time or later using ALTER TABLE statement.
48
49
50
51
52
SQL
1. SQL Getting Started 2. Database Objects 3. Data Types 4. Data Definition Language, Constraints & Rules 5. Data Manipulation Language 6. Indexes 7. Views 8. Identity Columns
53
54
Retrieving Data
SELECT * FROM EMPLOYEE SELECT EMPNO, FNAME, LNAME FROM EMPLOYEE ORDER BY FNAME SELECT EMPNO, FNAME, LNAME FROM EMPLOYEE WHERE EMPNO = 150 SELECT * FROM EMPLOYEE FETCH FIRST 5 ROWS ONLY SELECT EMPNO, FNAME, LNAME FROM EMPLOYEE WHERE MIN(SALARY) < 1000
55
56
SAMP_PROJECT Table
NAME Haas Thompson Walker Lutz PROJ AD3100 PL2100 MA2112 MA2111
57
SAMP_STAFF Table
NAME Hass Thompson Lucchessi Nicholls JOB PRES MANAGER SALESREP ANALYST
58
59
Outer Join:
Are a concatenation of the inner join and rows from the left table, right table, or both tables that are missing from the inner join.
NOTE:
When you perform an outer join on two tables, you arbitrarily assign one table as the left table and the other one as the right table.
61
62
OR
SELECT SAMP_PROJECT.NAME, SAMP_PROJECT.PROJ, SAMP_STAFF.NAME, SAMP_STAFF.JOB FROM SAMP_PROJECT INNER JOIN SAMP_STAFF ON SAMP_STAFF.NAME = SAMP_PROJECT.NAME
63
64
NAME JOB ---------- -------------------Haas PRES Lucchessi SALESREP Nicholls ANALYST Thompson MANAGER
Column functions
Also known as vector functions. Work on a group of rows to provide a result. The group is specified using a fullselect and optionally grouped using GROUP BY clause.
68
Column Functions:
SELECT MAX(smallint(length)) FROM test SELECT AVG(noseats) FROM test_center SELECT country, AVG(noseats) FROM test_center GROUP BY country
69
70
71
72
73
74
Subqueries
Subqueries can be used in IN clause to specify search arguments for SQL statement. SELECT DISTINCT name,phone FROM test_center tc, test_taken tt WHERE tc.tcid=tt.tcid AND number IN (SELECT number FROM test WHERE name LIKE 'DB2%') The subquery used in this example is known as an uncorrelated subquery.
An uncorrelated subquery is one where the values retrieved by the subquery are not directly related to the rows processed by the outer SELECT
75
Subqueries
A correlated subquery is a query in which subquery references values of the outer SELECT SELECT tc.name, count(*) FROM test_center tc,test t WHERE tc.tcid IN (SELECT tcid FROM test_taken tt WHERE tt.number=t.number) GROUP BY tc.name The WHERE clause in the subquery references a table in the outer FROM clause.
76
Quantified Predicates
Used to compare a value or values with a collection of values. 'SOME', 'ANY', 'ALL' SELECT c.cid, lname, fname FROM db2cert.candidate c WHERE cid = SOME (SELECT tt.cid FROM db2cert.test_taken tt WHERE c.cid = tt.cid)
77
Case Expression
SELECT fname,lname, CASE WHEN integer(SCORE) < 65 THEN 'Not Passed' WHEN integer(SCORE) <=90 THEN 'Passed' ELSE 'Excellent' END FROM candidate c, test_taken tt WHERE c.cid=tt.cid AND char(number)='500'
78
Set Operators
Union (UNION operator), Intersection (INTERSECT operator), Difference (EXCEPT operator).
79
UNION operator combines results of two or more separate queries into a single result.
SELECT number,'Minimum:', MIN(integer(score)) FROM test_taken GROUP BY number UNION SELECT number,'Maximum:', MAX(integer(score)) FROM test_taken GROUP BY number
80
81
82
Data Modifications
Inserting Rows Updating Rows Removing Data
83
Inserting Rows
INSERT INTO test (number,name,type,cut_score,length,totaltaken,totalpassed) VALUES ('508','DB2 Data Propagation','P',NULL,90,0,0) INSERT INTO test VALUES ('508','DB2 Data Propagation','P',DEFAULT,90,79,11) INSERT INTO test_taken (CID,TCID,NUMBER,DATE_TAKEN,SEAT_NO) VALUES ('888','TR01','500','2000-06-04','1'), ('888','TR01','501','2000-07-11','2'), ('888','TR01','502','2000-11-08','1')
84
Updating Rows
UPDATE EMPLOYEE SET salary = salary + 1000 WHERE deptid = 10 AND age > 24 UPDATE test_taken SET (date_taken,tcid) = (SELECT current date,tcid FROM test_center WHERE substr(city,1,7)='Toronto' AND country='Canada') WHERE CHAR(cid)= '888' AND number=test_id('500')
85
Removing Data
DELETE FROM candidate WHERE hphone IS NULL AND wphone IS NULL DELETE FROM candidate WHERE cid IN (SELECT cid FROM test_taken WHERE MONTH(date_taken)=2)
86
SQL
1. SQL Getting Started 2. Database Objects 3. Data Types 4. Data Definition Language, Constraints & Rules 5. Data Manipulation Language 6. Indexes 7. Views 8. Identity Columns
87
Index
An index is a list of locations of rows sorted by contents of one or more specified columns. Indexes contain a pointer, known as a record id (RID), to physical location of rows in table. These are two main purposes for creating indexes:
To ensure uniqueness of values To improve query performance
88
Unique Index
Guarantees uniqueness of data values in one or more columns. Uniqueness is also checked during execution of CREATE INDEX statement.
If table already contains rows with duplicate key values, index is not created.
89
90
Nulls, when it comes to indexing, are treated as equal to all other nulls. Null can not be inserted twice if the column is a key of a unique-index because it violates uniqueness rule for index.
91
92
SQL
1. SQL Getting Started 2. Database Objects 3. Data Types 4. Data Definition Language, Constraints & Rules 5. Data Manipulation Language 6. Index 7. Views 8. Identity Columns
93
Views
Are logical tables which are derived from one or more base tables or views. Can be used interchangably with base tables when retrieving the data. Does not contain real data. View can be ....
Updatable Deletable Insertable Read-only.
NOTE: constraints defined on base table are independents of operations that can be performed using a view.
94
95
Deletable View
A view meet all rules listed below to be considered a deletable view...
Each FROM clause of outer fullselect identifies only one base table, deletable view , deletable nested table expression, or deletable common table expression. The outer fullselect doesn use VALUES clause. t The outer fullselect doesn use GROUP BY or HAVING clauses. t The outer fullselect doesn include column functions in its select list. t The outer fullselect doesn use set operations (UNION,EXCEPT, or t INTERSECT) with exception of UNION ALL . The base tables in operands of a UNION ALL must not be same table, and each operand must be deletable. The select list of outer fullselect does not include DISTINCT .
96
Updatable View
Is a special case of deletable view A deletable view becomes a updatable view when at lease one of its columns is updatable. Column of a view is updatable when...
View is deletable Column resolves to a column of a base table All the corresponding columns of UNION ALL have exactly matching data types (including length or precision and scale).
97
Insertable Views
Allows you to insert rows using view definition. A view is insertable when.....
All of its columns are updatable.
98
Read-Only Views
Is a non-deletable view. A view can be read-only if it does not comply with at least one of the rules of the deletable views. CREATE VIEW view1 (name,work_phone,home_phone) AS SELECT DISTINCT fname,wphone,hphone FROM candidate c, test_taken tt WHERE c.cid = tt.cid
99
Inoperative Views
Is no longer available for SQL statements. A view becomes inoperative if...
Privilege upon which view definition is dependent, is revoked. Object upon which view definition is dependent, is dropped View, upon which this view definition is dependent, becomes inoperative.
100
101
SQL
1. SQL Getting Started 2. Database Objects 3. Data Types 4. Data Definition Language, Constraints & Rules 5. Data Manipulation Language 6. Indexes 7. Views 8. Identity Columns
102
Identity Columns
A numeric column in a table for which DB2 automatically generates a unique numeric value for each row that is inserted into the table A table may have a single column that is defined with the identity attribute Intended to be used for generating unique primary key values
Examples: order number, employee number, stock number
103
CREATE TABLE EMPLOYEES (empno INTEGER GENERATED BY DEFAULT AS IDENTITY (START WITH 1000 INCREMENT BY 1 CACHE 10), name CHAR(20));
104
Notes:
Identity column Must be numeric: SMALLINT, INTEGER, BIGINT, DECIMAL with precision 0 Implicitly NOT NULL Identity column does not imply a unique column. If uniqueness is required must create a unique index or primary key on the column. START WITH n INCREMENT BY m n specifies the first value for the column, can be positive or negative, default is 1 m specifies the interval between consecutive values of the column, can be positive or negative resulting in ascending or descending sequence CACHE or NO CACHE Specifies whether to keep some pre-allocated values in memory for faster access Performance option - reduces synchronous I/O to the log when values are generated When a database is deactivated (normally or by failure), all cached sequence values are lost
105
--->inserts 103,frame
INSERT INTO inventory (description) VALUES ('frame'); --->inserts 104,frame COMMIT; SELECT * FROM inventory order by partno; 100 door 101 hinge 102 handle 104 frame 200 window
107
108
IDENTITY_VAL_LOCAL()
(SET AUTOCOMMIT OFF) CONNECT TO sample; CREATE TABLE T1 ( C1 INTEGER GENERATED ALWAYS AS IDENTITY, C2 INTEGER ) ; INSERT INTO T1 (C2) VALUES (5); INSERT INTO T1 (C2) VALUES (6); SELECT * FROM T1;
C1 ----1 2
C2 ----5 6
VALUES IDENTITY_VAL_LOCAL( );
IDENTITY_VAL_LOCAL( ) -------------------2
COMMIT;
109
Notes:
Identitymissing: when input data file does not contain any values for the identity column. Example: a table tab1 has C1 identity, C2, C3, C4 columns. Input file contains data for columns C2, C3 and C4 only. To invoke load: load from data.del of del replace into tab1 (C2,C3,C4) ....is equivalent to... load from data.del of del modified by identitymissing replace into table1 Identityignore: even though the input data file contains data for the identity column, the data should be ignored and identity values should be generated for each row. Identityoverride: used for loading user-supplied values into a table having a GENERATED ALWAYS identity column. Useful when migrating data from another database.
110
Advanced SQL
1. User Defined Data Types 2. Sequences 3. Advanced Functions
111
112
113
114
Creation of this user-defined data type will result in creation of following casting functions: CHAR(PHONENO): translates data values from the PHONENO data type to base data type CHAR PHONENO(CHAR): translates data values from base data type CHAR to PHONENO data type
115
116
117
Important Functions...
CREATE FUNCTION SUMDOLLAR(US_DOLLAR) RETURNS US_DOLLAR SOURCE SYSIBM.SUM(DECIMAL()) SELECT SUMDOLLAR(TOTAL) FROM US_SALES CREATE FUNCTION DOLLAR_AVG(US_DOLLAR) RETURNS DECIMAL SOURCE SYSIBM.AVG(DECIMAL()) SELECT DOLLAR_AVG(TOTAL) FROM US_SALES
118
Important Functions
SELECT DOLLAR_AVG(TOTAL) - 10 FROM US_SALES SELECT ((SELECT DOLLAR_AVG(TOTAL) FROM US_SALES WHERE YEAR = 1987)- (SELECT DOLLAR_AVG(TOTAL) FROM US_SALES WHERE YEAR=1986)) FROM US_SALES
119
120
121
Reference Columns
In typed table definition, columns can be defined as reference columns to another typed table.
Referenced typed table is called a target table.
A reference column holds values that correspond to OID values of target table and clearly identify rows in target tables. Data type of a reference column is REFERENCE , the same type as OID in target table. Reference column is similar to foreign key; however, evaluation, like a foreign key, is not performed for operations such as insert, update, or delete.
123
Reference Columns
CREATE TYPE DEPT_T AS (NAME CHAR(40), LOCATION CHAR(20)) REF USING INTEGER MODE DB2SQL CREATE TYPE EMP_T UNDER PERSON_T AS (SALARY INTEGER, DEPTREF REF(DEPT_T)) MODE DB2SQL Deptref REF(Dept_t) means that this attribute DEPTREF of EMP_T type is reference type and target of reference is a row of table whose row type is DEPT_T or its sub-type. CREATE TABLE DEPT OF DEPT_T (REF IS Oid USER GENERATED) CREATE TABLE EMP OF EMP_T UNDER PERSON INHERIT SELECT PRIVILEGES (DEPTREF WITH OPTIONS SCOPE Dept) DEPTREF WITH OPTIONS SCOPE DEPT means that values in column DEPTREF are pointing to rows in table DEPT or values in any sub-tables of DEPT table.
124
125
126
127
Advanced SQL
1. User Defined Data Types 2. Sequences 3. Advanced Functions
128
Sequences
SEQUENCE object lets DBA or developer create a value that gets incremented under programmer control and can be used across many different tables. CREATE SEQUENCE CUSTOMER_NO AS INTEGER
By default this sequence number starts at one and increments by one at a time and is of an INTEGER datatype.
NEXTVAL :
Function generates next value for sequence which can then be used for subsequent SQL statements INSERT INTO CUSTOMERS VALUE (NEXTVAL FOR CUSTOMER_NO, 'Prakash', ...)
129
Sequences
PREVVAL:
Will only return last value generated by that application. Function can be used multiple times within application.
130
Advanced SQL
1. User Defined Data Types 2. Sequences 3. Advanced Functions
131
Advance Functions
Trigonometric Functions
COS , SIN , TAN , COT , TANH , COSH , SINH , ATAH , ATAN2
Math Functions
INTEGER , FLOOR , CEILING , TRUNC , SQRT , LN , EXP
String Functions
RTRIM,LTRIM,INSTR,TRANSLATE,REPEAT,REPLACE,CONCAT, SUBSTR, LENGTH , LOWER/LCASE , UPPER/UCASE
Statistical Functions
CORRELATION , STDDEV , VARIANCE
Date Functions
DATE , TIME , TO_CHAR , TO_DATE , DAYNAME , DAYOFWEEK
Logic Functions
COALESCE , NULLIF
Speciality Functions
MQPUBLISH , MQREAD , ENCRYPT , REC2XML
132
Summary
SQL
SQL Getting Started Database Objects Data Types, Constraints, Rules Data Definition Language Data Manipulation Language Indexes Views Identity Columns
Advanced SQL
User Defined Data Types Sequences Advanced Functions
133