SQL Server Note
SQL Server Note
Triggers:
Type of triggers
1. After Triggers
2. Instead of Triggers
Triggers.sql
GO
Q.
Q.
Join selects the data from two or more tables based on one condition, while
union selects the data from two tables based on two different conditions. (Join
is used to establish a conditional selection for two different tables, while
union is used to select similar data based on different conditions.)
Join selects columns from 2 or more tables, while union selects rows.
Union combines the result of two select statements and generates one result
set and then eliminates the duplicate rows from that result set.
Q.
Union All selects all the rows from all the select statements whereas Union
eliminates the duplicate rows.
What are the new features in SQL Server 2008?
Q.
-=
*=
/=
%=
Enhanced convert function: - SQL Server 2008 has enhanced the convert
function when you do the conversion between binary and hexadecimal.
New Date and Time data types: New Date and Time functions: The Merge statement: Filtered Indexes
Change Data Capture
What is filtered index in SQL Server 2008?
Filtered Index is a new feature in SQL SERVER 2008. Filtered Index is used
to index a portion of rows in a table, that means it applies filter on INDEX
which improves query performance, reduce index maintenance costs, and
reduce index storage costs compared with full-table indexes.
When we see an Index created with some WHERE clause then that is actually
a FILTERED INDEX.
Q.
Full-text search refers to the functionality in SQL Server that supports full-text
queries against character-based data. These types of queries can include words and
phrases as well as multiple forms of a word or phrase. To support full-text queries,
full-text indexes must be implemented on the columns referenced in the query. The
columns can be configured with character data types (such as char and varchar) or
with binary data types (such as varbinary and image). A full-text index is made up
of word tokens that are derived from the text being indexed. For example, if the
indexed text contains the phrase tables can include indexes, the full-text index
would contain four tokens: tables, can, include, and indexes. Because the
list of tokens can be easily searched, full-text queries can quickly locate the
necessary records.
Q.
What is the difference between User Defined Function and Stored
Procedure?
Q.
UDF have its own advantage and usage but in this article we will see the limitation
of UDF. Things UDF cannot do and why Stored Procedure are considered as more
flexible then UDFs. Stored Procedure are more flexibility then User Defined
Functions(UDF).
As I said earlier this article is written to show Limitations of UDF. I use UDF for many
reasons, the main reason I use it I can do repetitive task in SELECT statement as
well as modularizing my frequently used code.
Q.
Q.
Serializable
REPEATABLE READS (phantom reads): - Phantom reads occur when an
insert or delete action is performed against a row that belongs to a range of
rows being read by a transaction
READ COMMITTED: - This is the default isolation level in SQL Server. When
it's used, SQL Server will use shared locks while reading data. It ensures that
a physically corrupt data will not be read and will never read data that
another application has changed and not yet committed, but it does not
ensure that the data will not be changed before the end of the transaction.
READ UNCOMMITTED (dirty reads):- When it's used, SQL Server not issue
shared locks while reading data. So, you can read an uncommitted
transaction that might get rolled back later. This isolation level is also called
dirty read. This is the lowest isolation level. It ensures only that a physically
corrupt data will not be read.
SNAPSHOT isolation
Dirty Reads are reading the UnCommitted Data, while Phantom reads are
repeatable reads.
NOLOCK hint can also cause the DIRTY READS.
The SNAPSHOT isolation level prevents phantom reads. READ UNCOMMITTED, READ
COMMITTED and REPEATABLE READ all allow phantom reads. Aside from the SNAPSHOT
isolation level, the SERIALIZABLE isolation level also prevents phantom reads.
Phantom reads occur when an insert or delete action is performed against a row that
belongs to a range of rows being read by a transaction. The transaction's first read of the
range of rows shows a row that no longer exists in the second or succeeding read as a result
of a deletion transaction. Similarly, the transaction's second or succeeding read shows a
row that did not exist in the original read as the result of an insertion by a different
transaction.
Q.
Except returns all the distinct rows from left hand side table which are not present in
right hand side table, where as Not In returns all the rows from left hand side table
that are not present in right hand side table.
Except returns the Distinct rows, where as Not In doesnt do so.
Q.
System Databases
Master database holds information for all the databases on the SQL Server
instance.
Msdb database stores information regarding database backups, SQL Agent
Information, DTS package, SQL Server jobs and some replication information such
as for log shipping.
Tempdb holds temporary objects such as global and local temporary tables and
stored procedures.
Model database is a template database used in creation of any new user database
created in the instance.
Resource database
Q.
Q.
2.
ALTER INDEX
3.
UPDATE_STATISTICS
Q.
Q.
Both primary key and unique key enforces the uniqueness of the column on
which they are defined.
By default Primary Key creates a Clustered Index on the column, where as
unique key creates a non-clustered index by default.
Primary key doesnt allow NULLs, where as Unique key allows the NULL value.
Primary Key creates a Clustered Index, but it doesnt mean that creating clustered
index creates a Primary Key.
Q.
Sr.
No.
Table Variables
Temp Table
Performance Differences
1.
2.
3.
4.
5.
6.
7.
Syntactical Differences
8.
9.
10.
Q.
You can define indexes on temporary tables. In many cases, these indexes can improve the performance of queries
that use tempdb. The optimizer uses these indexes just like indexes on ordinary user tables. The only requirements
are:
The table must contain data when the index is created. If you create the temporary table and create the
index on an empty table, Adaptive Server does not create column statistics such as histograms and densities. If
you insert data rows after creating the index, the optimizer has incomplete statistics.
The index must exist while the query using it is optimized. You cannot create an index and then use it in a
query in the same batch or procedure.
The optimizer may choose a suboptimal plan if rows have been added or deleted since the index was
created or since update statistics was run.
Providing an index for the optimizer can greatly increase performance, especially in complex procedures that create
temporary tables and then perform numerous operations on them.
No. "Indexes cannot be created explicitly on table variables, and no statistics are kept
on table variables." You can implicitly create a unique clustered index by marking a
column as a primary key.
Q.
You can only add indexes on table variables by adding PRIMARY KEY
and UNIQUE constraints when you declare them. But that can take you a
long way.
A hint is that if you plan to put lots of data into your table variable,
consider a temp table instead. Temp tables has statistics, and if you
will use the table as input to a query, SQL Server will have more
information to work from.
Q.
Q.
--Nth Highest
SELECT * FROM Employee E1
WHERE (N-1) = (SELECT COUNT(DISTINCT(E2.Salary)) FROM Employee E2
WHERE E2.Salary > E1.Salary)
--Nth Lowest
SELECT * FROM Employee E1
WHERE (N-1) = (SELECT COUNT(DISTINCT(E2.Salary)) FROM Employee E2
WHERE E2.Salary < E1.Salary)
select * from Employee
select * from (select Employee_ID, Salary, DENSE_RANK()
over (order by salary desc) as rankid from Employee) t
where rankid = N
Q.
Sr. No.
Truncate
Delete
5
6
DELETE is a DML
command
DELETE command can be
used with or without
where clause.
Q.
What is normalization?
First normal form (1NF) sets the very basic rules for an organized database:
Create separate tables for each group of related data and identify each
row with a unique column or set of columns (the primary key).
Q.
What is a view?
[SELECT Statement]
DROP VIEW SAMPLEVIEW
Types of Views:
System Views
o
Catalog View
Simple View
Complex View
Indexed View:
An indexed view is the same thing as a normal view, however the critical difference
is that an indexed view actually allows you to create a clustered index on it,
effectively working around the "one clustered index per table" limitation.
Indexed views require a more rigid definition of what they will return. For this
reason, we cannot use a wildcard (*) in the underlying query. Each column must be
individually called out; also, each table named in the query must be in 2 part dot
notation (dbo.tablename), referencing both the schema of the table and the table
name itself, and the view must be declared with schemabinding enabled for it.
Q.
/* CROSS JOIN */
SELECT t1.*,t2.*
FROM Table1 t1
CROSS JOIN Table2 t2
GO
Self Join:
Q.
Q.
2.
HAVING is typically used with GROUP BY clause, when GROUP BY is not
used HAVING behave likes a WHERE clause.
Q.
Q.
What is QUOTED_IDENTIFIER?
Q.
Q.
what is required before CTE and what all restrictions are there for CTE?
Q.
Q.
Order by clause cannot be used in sub queries unless you are using Top or For
XML in sub query.
Q.
Q.
Q.
Yes, SQL Server internally adds a four length value to make it unique.
--Create Non-Unique Clustered Index
CREATE CLUSTERED INDEX [IX_Employee] ON [dbo].[Employee]
(
[id] ASC
)
GO
--Create Unique Clustered Index
CREATE UNIQUE CLUSTERED INDEX [IX_Employee] ON [dbo].[Employee]
(
[id] ASC
)
GO
Q.
OUTPUT clause has accesses to inserted and deleted tables (virtual tables)
just like triggers. OUTPUT clause can be used to return values to client clause.
OUTPUT clause can be used with INSERT, UPDATE, or DELETE to identify the
actual rows affected by these statements.
OUTPUT clause can generate table variable, a permanent table, or temporary
table. Even though, @@Identity will still work in SQL Server 2005
INTO TESTDATA
inserted.ID, inserted.NAME INTO @DATA
1, 'A'
2, 'B'
3, 'C'
Example 2: OUTPUT clause with insert statement, without using any table
variable or temporary table.
CREATE TABLE TESTDATA
(
ID INT,
NAME VARCHAR(50)
)
INSERT
OUTPUT
SELECT
UNION
SELECT
UNION
INTO TESTDATA
inserted.ID, inserted.NAME
1, 'A'
2, 'B'
SELECT 3, 'C'
DROP TABLE TESTDATA
GO
INTO TESTDATA
1, 'A'
2, 'B'
3, 'C'
UPDATE TESTDATA
SET NAME = 'Z'
OUTPUT inserted.ID, inserted.NAME, deleted.ID, deleted.NAME INTO @DATA
WHERE ID IN (1, 2, 3)
SELECT * FROM @DATA
SELECT * FROM TESTDATA
DROP TABLE TESTDATA
GO
INTO TESTDATA
1, 'A'
2, 'B'
3, 'C'
DELETE TESTDATA
OUTPUT deleted.ID, deleted.NAME INTO @DATA
WHERE ID IN (3)
SELECT * FROM @DATA
SELECT * FROM TESTDATA
DROP TABLE TESTDATA
GO
Data Types:
SQL Server provides both datatypes to store character information. For the most
part the two datatypes are identical in how you would work with them within SQL
Server or from an application. The difference is that nvarchar is used to store
unicode data, which is used to store multilingual data in your database tables.
Other languages have an extended set of character codes that need to be saved
and this datatype allows for this extension. If your database will not be storing
multilingual data you should use the varchar datatype instead. The reason for this is
that nvarchar takes twice as much space as varchar, this is because of the need to
store the extended character codes for other languages.
Excel always stores the data in Unicode format.
Datatype
Min
Max
Storage
Bigint
-2^63
2^63 1
8 bytes
Int
-2,147,483,648
2,147,483,647
4 bytes
Smallint
-32,768
32,767
2 bytes
Tinyint
255
1 bytes
Bit
1 bytes
Decimal
-10^38+1
10^38-1
Precision 1-9 = 5
bytes, precision
10-19 = 9 bytes,
precision 20-28 =
13 bytes, precision
29-38 = 17 bytes
Float
-1.79E + 308
1.79E + 308
4 bytes when
precision is less
than 25 and 8
bytes when
precision is 25
through 53
DateTime
1753-01-01
00:00:00.000
9999-12-31
23:59:59.997
8 bytes
Char
0 chars
8000 chars
Defined Width
Varchar
0 chars
8000 chars
2 bytes + no. of
characters
Varchar(max)
0 chars
2^31 chars
2 bytes + no. of
characters
Text
4 bytes + no. of
characters
Notes
NChar
0 chars
4000 chars
Nvarchar
0 chars
4000 chars
declare
declare
declare
declare
Defined width * 2
@test varchar(2)
@first varchar(4)
@second varchar(4)
@third varchar(3)
Output:
te
test
tes
Ranking Functions:
select * from (
select firstname, surname, salary, dense_rank() over (order by salary desc)
rankid from Employee) rs
where rankid < 3
Q.
CTE are not replacement of the Temp Table or Temp Variable Table
Usually the execution plan of the CTE like sub-query but there are cases when
it can be different as well
Advantages of CTE:
Recursive CTE:
USE AdventureWorks
GO
;WITH Emp_CTE AS (
SELECT EmployeeID, ContactID, LoginID, ManagerID, Title, BirthDate
FROM HumanResources.Employee
WHERE ManagerID IS NULL
UNION ALL
SELECT e.EmployeeID, e.ContactID, e.LoginID, e.ManagerID, e.Title,
e.BirthDate
FROM HumanResources.Employee e
INNER JOIN Emp_CTE ecte
ON ecte.EmployeeID = e.ManagerID
)
SELECT *
FROM Emp_CTE
GO
Multiple CTE:
/* Method 1 */
;WITH CTE1 AS (SELECT 1 AS Col1),
CTE2 AS (SELECT 2 AS Col2)
SELECT CTE1.Col1,CTE2.Col2
FROM CTE1
CROSS JOIN CTE2
GO
/* Method 2 */
;WITH CTE1 AS (SELECT 1 AS Col1),
CTE2 AS (SELECT COL1+1 AS Col2 FROM CTE1)
SELECT CTE1.Col1,CTE2.Col2
FROM CTE1
CROSS JOIN CTE2
GO
Now if your CTE goes beyond 5th recursion it will throw an error and stop executing.
If you put MAXRECURSION value too low it may be possible before your desire result
is accomplished and will throw an error.
For example if you change MAXRECURSION to value 3. It will throw following error.
The statement terminated. The maximum recursion 3 has been exhausted
before statement completion.
In summary MAXRECUSION is good way to protect your CTE to go into infinite loop.
Set Operators:
UNION
UNION ALL
INTERSECT: The SQL INTERSECT operator takes the results of two queries
and returns only rows that appear in both result sets. For purposes of
duplicate removal the INTERSECT operator does not distinguish between
NULLs. The INTERSECT operator removes duplicate rows from the final result
set. The INTERSECT ALL operator does not remove duplicate rows from the
final result set.
INTERSECT ALL
EXCEPT
TOP Clause:
We can update or delete data from a table by using the TOP clause.
We can pass variable to change the number of the rows on the fly.
declare @n int
set @n=3
But we know that there is another row with the same managerid values. Now
let us include the TIES option in the query.
select top 10 percent with ties
*
from employee order by managerid desc
select left('abcd', 2)
--ab
select right('abcd', 2)
--cd
APPLY Clause:
We can also consider CROSS APPLY clause as INNER APPLY as it will use outer (main)
query as an input of sub-query or function and will return the result set. In CROSS
APPLY we will be getting full set of left side query (main query) and its
corresponding value from right side query or function, if it is not available in right
side query or function, it will return NULL.
CROSS APPLY clause works like an inner apply whereas OUTER APPLY works like a
Left apply.
SELECT p.ProductID, p.Name, p.ProductNumber
, ca.Quantity, ca.LocationID
FROM
Production.Product AS p
CROSS/OUTER APPLY
(SELECT TOP(10) * FROM Production.ProductInventory
WHERE ProductID = p.ProductID) ca
WHERE
p.ProductID in (1,2,3,531,706)
ORDER BY
p.ProductID ASC
SELECT p.ProductID, p.Name, p.ProductNumber
, pmi.Quantity, pmi.LocationID
FROM
Production.Product AS p
CROSS APPLY
dbo.fn_GetMax_ProductItem(p.ProductID, 2) AS pmi
WHERE
p.ProductID in (1,2,3,531,706)
ORDER BY
p.ProductID ASC
@@IDENTITY returns the last identity value generated for any table in the
current session, across all scopes.
SCOPE_IDENTITY returns the last identity value generated for any table in the
current session and the current scope.
USE AdventureWorks;
GO
IF OBJECT_ID(N't6', N'U') IS NOT NULL
DROP TABLE t6;
GO
IF OBJECT_ID(N't7', N'U') IS NOT NULL
DROP TABLE t7;
GO
CREATE TABLE t6(id int IDENTITY);
CREATE TABLE t7(id int IDENTITY(100,1));
GO
CREATE TRIGGER t6ins ON t6 FOR INSERT
AS
BEGIN
INSERT t7 DEFAULT VALUES
END;
GO
--End of trigger definition
SELECT id FROM t6;
--id is empty.
SELECT id FROM t7;
--ID is empty.
--Do the following in Session 1
INSERT t6 DEFAULT VALUES;
SELECT @@IDENTITY;
/*Returns the value 100. This was inserted by the trigger.*/
SELECT SCOPE_IDENTITY();
/* Returns the value 1. This was inserted by the
INSERT statement two statements before this query.*/
SELECT IDENT_CURRENT('t7');
/* Returns value inserted into t7, that is in the trigger.*/
SELECT IDENT_CURRENT('t6');
/* Returns value inserted into t6. This was the INSERT statement four
statements before this query.*/
PIVOT:
USE AdventureWorks
GO
select distinct con.FirstName + ' ' + con.LastName CustomerName, p.Name,
s.LineTotal
into #temp1
from [Sales].[SalesOrderDetail] s
join Sales.SalesOrderHeader sh
on sh.SalesOrderID = s.SalesOrderID
join Sales.Customer c
on c.CustomerID = sh.CustomerID
join Sales.Individual i
on i.CustomerID = c.CustomerID
join Person.Contact con
on con.ContactID = i.ContactID
join Production.Product p
on p.ProductID = s.ProductID
select distinct name
into #temp2
from #temp1
order by 1
declare @sql nvarchar(max),
@col nvarchar(max)
select @col = coalesce(@col, '') + QUOTENAME(name) + ','
from (
select distinct name
from #temp2
) x
set @col = LEFT(@col, len(@col) - 1)
set @sql = N'SELECT CustomerName, $COL$ FROM #TEMP1
PIVOT (SUM(LineTotal) FOR NAME IN ($COL$)) AS PVT';
SET @sql = REPLACE(@SQL, '$COL$', @COL)
UNPIVOT:
USE AdventureWorks
GO
select distinct con.FirstName + ' ' + con.LastName CustomerName, p.Name,
s.LineTotal
into #temp1
from [Sales].[SalesOrderDetail] s
join Sales.SalesOrderHeader sh
on sh.SalesOrderID = s.SalesOrderID
join Sales.Customer c
on c.CustomerID = sh.CustomerID
join Sales.Individual i
on i.CustomerID = c.CustomerID
join Person.Contact con
on con.ContactID = i.ContactID
join Production.Product p
on p.ProductID = s.ProductID
select distinct name
into #temp2
from #temp1
order by 1
declare @sql nvarchar(max),
@col nvarchar(max)
select @col = coalesce(@col, '') + QUOTENAME(name) + ','
from (
select distinct name
from #temp2
) x
set @col = LEFT(@col, len(@col) - 1)
set @sql = N'SELECT CustomerName, NAME, LineTotal FROM (SELECT CustomerName,
$COL$ FROM #TEMP1
PIVOT (SUM(LineTotal) FOR NAME IN ($COL$)) AS PVT) AS P
UNPIVOT (LineTotal FOR NAME IN ($COL$)) as unpvt';
SET @sql = REPLACE(@SQL, '$COL$', @COL)
EXEC sp_executesql @sql;
drop table #temp1
drop table #temp2
Will a dataset converted to Pivot when revert to UnPivot the new dataset is similar
to the original dataset.
Yes, but in some cases it might not. It depends on the logic we have used
generating the Pivot. If during pivot we dont have any filter while merging
the data, we will not be able to revert it.
GROUP BY WITH ROLLUP/WITH CUBE in SQL Server
select con.FirstName + ' ' + con.LastName CustomerName, p.Name Items
, convert(float, sum(s.LineTotal)) Total
from [Sales].[SalesOrderDetail] s
join Sales.SalesOrderHeader sh on sh.SalesOrderID = s.SalesOrderID
join Sales.Customer c on c.CustomerID = sh.CustomerID
join Sales.Individual i on i.CustomerID = c.CustomerID
join Person.Contact con on con.ContactID = i.ContactID
join Production.Product p on p.ProductID = s.ProductID
where con.ContactID in (16067, 15414)--, 16761, 4708)
group by con.FirstName + ' ' + con.LastName, p.Name
order by 1
Normal Group By statement will provide the above result set. If we want below
result then we need to use ROLLUP or CUBE clause.
select case when grouping(con.FirstName + ' ' + con.LastName) = 1
then 'All Customers'
else con.FirstName + ' ' + con.LastName
end CustomerName
, case when grouping(p.Name) = 1
then 'All Items'
else p.Name
end Items
, convert(float, sum(s.LineTotal))
from [Sales].[SalesOrderDetail] s
join Sales.SalesOrderHeader sh on sh.SalesOrderID = s.SalesOrderID
join Sales.Customer c on c.CustomerID = sh.CustomerID
join Sales.Individual i on i.CustomerID = c.CustomerID
join Person.Contact con on con.ContactID = i.ContactID
join Production.Product p on p.ProductID = s.ProductID
where con.ContactID in (16067, 15414)
group by con.FirstName + ' ' + con.LastName, p.Name
with rollup
order by 1
INSERT, UPDATE, and DELETE statements can reference the sparse columns
by name. SPARSE column can work as one XML column as well.
SPARSE column can take advantage of filtered Indexes, where data are filled
in the row.
SPARSE column saves lots of database space when there are zero or null
values in database.
SPARSE column can not have default value or rule or computed column.
Clustered index or a unique primary key index can not be applied SPARSE
column. SPARSE column can not be part of clustered index key.
Table containing SPARSE column can have maximum size of 8018 bytes
instead of regular 8060 bytes.
A table operation which involves SPARSE column takes performance hit over
regular column.
Datepart:
D
Day
DD
DayOfYear
DY
Y
DW
W
WeekDay
- WeekDay
- WeekDay
- WeekDay
HH
HOUR
M
MM
MONTH
MCS
- MicroSeconds
MicroSecond - MicroSeconds
MI
Minute
N
- Minutes
- Minutes
- Minute
MilliSecond - MilliSeconds
MS
- MilliSeconds
NANOSECOND
- NanoSeconds
NS
- NanoSeconds
Q
QQ
Quarter
S
Second
SS
- Second
- Second
- Second
Week
WK
WW
YEAR
- Year
YY
- Year
YYYY
- Year
Index Scan retrieves all the rows from the table. Index Seek retrieves selective rows
from the table.
Index Scan (table scan):
Since a scan touches every row in the table whether or not it qualifies, the cost is
proportional to the total number of rows in the table. Thus, a scan is an efficient
strategy if the table is small or if most of the rows qualify for the predicate.
Index Seek:
Since a seek only touches rows that qualify and pages that contain these qualifying
rows, the cost is proportional to the number of qualifying rows and pages rather
than to the total number of rows in the table.
Index Scan is nothing but scanning on the data pages from the first page to the last
page. If there is an index on a table, and if the query is touching a larger amount of
data, which means the query is retrieving more than 50 percent or 90 percent of the
data, and then optimizer would just scan all the data pages to retrieve the data
rows. If there is no index, then you might see a Table Scan (Index Scan) in the
execution plan.
Index seeks are generally preferred for the highly selective queries. What that
means is that the query is just requesting a fewer number of rows or just retrieving
the other 10 (some documents says 15 percent) of the rows of the table.
In general query optimizer tries to use an Index Seek which means that optimizer
has found a useful index to retrieve recordset. But if it is not able to do so either
because there is no index or no useful indexes on the table then SQL Server has to
scan all the records that satisfy query condition.
Type of Constraints:
Not NULL: CREATE TABLE Persons
(
P_Id int NOT NULL,
LastName varchar(255) NOT NULL,
FirstName varchar(255),
Address varchar(255),
City varchar(255)
)
Unique Key: -
Primary Key: -
Foreign Key: -
Check: -
Default:-