Query parameterization

Post on 26-May-2015

118 views 0 download

Tags:

description

Sql server

Transcript of Query parameterization

Query Tuning

Sql Server Query Parameterization

INDEX

Adhoc Query Predicates Order Execution Plan Query Optimizer Parameter Sniffing Indexes Statistics Database Engine Tuning Advisor

Adhoc Query

Any non-Parameterized queries are called addhoc queries. For example :

SELECT MsgID, Severity FROM SqlMessage WHERE MsgID = 100

In sql server if we execute a sql query it goes through two steps just like any other programming languages:

Compilation Execution

Properties Of Addhoc Queries Case sensitive Space sensitive Parameter sensitive 

Sql server treats two same sql queries of different parameters as a two different sql statements. For example:

SELECT MsgID, Severity FROM SqlMessage WHERE MsgID = 1 SELECT MsgID, Severity FROM SqlMessage WHERE MsgID = 2

Effect Of Faulty C# Code

Sql server has took extra n * (Compilation time) ms to display records

Extra time to insert records in cached plans.

Sql server has to frequently fire a job to delete the cached plan since it will reach the max limit very soon.

It will not only decrease the performance of this sql query but all sql queries of other application since this faulty code will force to delete cached query plans of other sql statements.

Predicates Order

Does order of predicates in WHERE clause matters?

WHERE vcLanguage = 'English' AND ntAge = 12

WHERE ntAge = 12 AND vcLanguage = 'English'

Execution Plan

Query Optimizer

Query Optimizer

The query optimizer in SQL Server is cost-based. It includes:

Cost for using different resources (CPU and IO) Total execution time It determines the cost by using:   Cardinality: The total number of rows processed at each

level of a query plan with the help of histograms , predicates and constraint

Cost model of the algorithm: To perform various operations like sorting, searching, comparisons etc.

Parameter Sniffing

Sql server generates execution paln according to the first parameter

This execution plan may bad for other parameters.

 Solution:

Create multiples stored procedures. Use optimizer for query hints.

What Is An Index ?

Index is a way to organize data to make searching, sorting and grouping fasters 

we need indexing when :

WHERE, ON, HAVING clause (Searching) ORDER BY clause (Sorting) GROUP BY clause (Grouping) etc.

Table Scan

RollNo Name Country Age

101 Greg UK 23

102 Sachin India 21

103 Akaram Pakistan 22

107 Miyabi China 18

108 Marry Russia 27

109 Scott USA 31

110 Benazir Banglades 17

111 Miyabi Japan 24

112 Rahul India 27

113 Nicolus France 19

SELECT * FROM Student WHERE RollNo = 111

Time complexity of table scan is : O(n)

Types Of Index

Table without any index is called Heap

There are two type of index:

Clustered index Non-Clustered index

Clustered Index

When we create a clustered index on any table physical organization of table is changed.

Now data of table is stored as a balanced tree(B tree).

CREATE UNIQUE [CLUSTERED] INDEX <Name>

ON <ObjectName>(

<ColumnName>  [ASC | DESC ] [ ,...n ]

)

Types Of Scanning

Table scan: It is very slow can and it is used only if table has not any clustered index.

Index scan: It is also slow scan. It is used when table has clustered index and either in WHERE clause non-key columns are present or query has not been covered (will discuss later) or both.

Index Seek: It is very fast. Our goal is to achieve this.

Clustered Index

If we create table with primary key, sql server automatically creates clustered index on that table

A table can have only one clustered index .

Physical order of rows of table is same as logical order of key columns of clustered index.

Terms Of Execution Plan

Predicate: It is condition in WHERE clause which is either non-key column or column which has not been covered.

Object: It is name of source from where it getting the data. It can be name of table, Clustered index or non-clustered index

Output list: It is name of the columns which is getting from object.

Seek Predicate: It is condition in WHERE clause which is either key column or fully covered.

Non-Clustered Index

It is logical organization of data of table. A non-clustered index can be of two types.

Heap Based on clustered index.

If table has clustered index then leaf node of non-clustered index keeps the key columns of clustered index.

If the table has not any clustered index then leaf node of non-clustered index keeps RID which unique of each row of table.

Based On Clustered Index

Based On Heap

Covering Of Queries We can specify maximum 16 column names.

Sum of size of the columns cannot be more than 900 bytes.

All columns must belong to same table.

Data type of columns cannot be ntext, text, varchar (max), nvarchar (max), varbinary (max), xml, or image

It cannot be non-deterministic computed column.

Statistics Analysis

The query optimizer uses statistics to create query plans that improve query performance

A correct statistics will lead to high-quality query plan.

The query optimizer determines when statistics might be out-of-date by counting the number of data modifications since the last statistics update and comparing the number of modifications to a threshold.

Auto Create Statistics

Default setting of auto create statistics is ON.

It creates when: Clustered and non clustered Index is created Select query is executed.

Auto create and updates applies strictly to single-column statistics.

Why Query 2 Is Performing Better If we perform following operations on field of any

table in query predicate:

Using any system function or user defined function

Scalar operation like addition, multiplication etc. Type casting

In this situation sql server query optimizer is not able to estimate correct cardinality using statistics.

To Improve Cardinality

If possible, simplify expressions with constants in them. If possible, don't perform any operation on the any field

of a table in WHERE Clause, ON Clause, HAVING Clause

Don't use local variables in WHERE Clause, ON Clause, HAVING Clause. 

If there is any cross relationship among fields or there is a complex expression in a field in a query predicates, it is better to create a computed column and then create a non-clustered index on it.

To Improve Cardinality

If possible, don't update the value of parameters of a function or stored procedure before using in sql statement

Use OPTIMIZE FOR clause when you want to optimize a sql query on the basis of specific parameter value.

If you want to update the value parameter of a stored procedure or a function create a similar procedure or function and execute it form base procedure or function by passing the updated value as a parameter. 

Create user defined multi column statistics if query predicates have more than one fields of a table.

SQL Server Tools

Sql query profiler Database Tuning Advisor Client Statistics Resource Governor Data Collections

THANK YOU

Ritesh KumarSkype: mfs_riteshBlog: http://exacthelp.blogspot.com/