Recently we had really nice meetup gathering here in Auckland. Was a beautiful evening, there were 20 people talking about SQL server, was perfect. One of the topics was “Merge Join Operator”
For this demo I used SQL Server 2014 Developer Edition.
As a database I used StackOverflow, you can download this DB from Brent Ozar web page, just follow the instructions
How to Download the Stack Overflow Database via BitTorrent
In this example I will use two tables Badges (25 million rows) and Users (8 million rows) and I will join them on Userid (Badges) and Id (Users) columns. We have just clustered indexes on those two tables. I just want to show you one query example with Merge Join and Sort operator.
So lets try following query
We are returning 98599 rows with this query.
When we check statistics we can see that query creating so-called Worktable
(98599 row(s) affected)
Table ‘Worktable’. Scan count 0, logical reads 0, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table ‘Badges’. Scan count 1, logical reads 155202, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table ‘Users’. Scan count 1, logical reads 120104, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Reason for that is that our SQL Server thinks that when we have duplicated values in our outer table and then SQL server create so-called Worktable in the tempdb.
Lets now check the Execution Plan
You can see here that we have Merge Join Operator in our execution plan. Query optimizer thinks that for this number of rows (this case 98599) Merge Join is the better option then using Nested Loop or Hash join operator, even if we need sort operator to perform Merge Join in this query.
Merge join needs sorted input to perform efficiently as possible. Our join columns are Badges.UserID and Users.Id
As I mentioned at the beginning of the post we do not have any indexes on tables, except clustered index.
Table Badges is sorted by clustered index PK_Badges_Id, column Id
Table Users is sorted by clustered index PK_Users_Id, column Id
You noticed that we have sort operator in our execution plan. Merge join needs sorted input to perform efficiently as possible, because of that we have the sort operator on our Badges table sorted by UserId column since that is our join column.
In some cases this sort operator can be really expensive so let’s remove it from our execution plan.
To accomplish this we need to add an index which will be sorted by column we are using as a join predicate (UserID in Badges Table) and in this case we need to include column we have in the WHERE clause as well. This way we will presort the data before joining the tables.
If we run this query again we will get following execution plan
(98599 row(s) affected)
Table ‘Badges’. Scan count 1, logical reads 323, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table ‘Users’. Scan count 1, logical reads 120102, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
We have an index seek on newly created Non Clustered index and at the same time we have the lower number of logical reads than in previous query (323<155202).
If we add another index on User table to remove that Clustered Index Scan.
We will get following execution plan
(98599 row(s) affected)
Table ‘Badges’. Scan count 1, logical reads 323, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
Table ‘Users’. Scan count 1, logical reads 32140, physical reads 0, read-ahead reads 0, lob logical reads 0, lob physical reads 0, lob read-ahead reads 0.
We still have Scan but it is Non clustered index scan, size of the index is much smaller then clustered one.
If you run following query you will get Index Size and Page Number for the particular index
We have index scan on user table using newly created non-clustered index NCI_UsersTAB_Id. Size of the non clustered index NCI_UsersTAB_Id and page number behind it is smaller than in case using clustered index scan and for that reason index scan operation is faster, less logical reads 32000 instead of 120102.
In this example, I tried to keep my execution plan to continue using MERGE JOIN operator without forcing it. More additional explanations and examples about MERGE JOIN you can find on following MSDN article.
In one of the previous examples, we saw what bad estimates and excessive memory grant can do with our query plan. But what if we have underestimated number of rows? Here is an example.
Leave a Reply