Jim Dehner 

 Tableau Visionary and Ambassador

Tableau How to’s,  Use Cases, and Forums Questions

Hey #DataFam

Let’s Talk

Open for

Post Links

So Just how does the Order of Operations affect your calculations?

Tableau, like most systems, has an order or sequence in which filters are applied and calculations are executed.  The process starts at the top and proceeds to the bottom in a single pass.  Calculations executed lower in the order depend on all the steps that precede them.


 
 

Steps 1 and 2 – Extract and Data Source Filters


First just a quick work on the first 2 filters – Extract and Data Source work as data is loaded from the source and limit the amount of data available workbook. 

The remaining filters are executed in the context of a worksheet and limit the data that is included in the table that underlays that sheet – filters on a worksheet can be applied to other sheets to limit the underlying table on those sheets also.


Step 3 – Context Filters:


Placing a dimension "In Context" will apply the filter before any calculations are executed, the Top N is determined or Sets are formed. – The data associated with the filter is not on the worksheet's underlying table and is therefore not available to the remainder of the order of operation.

Just open the drop-down and select Add to Context

 

Step 4 Sets, Fixed LOD and Top N

I'm going to use this simple table in the next few examples to show the effect of context filters

 
 
 
After adding Total Cost =  (Price*Quantity) and an LOD: { Fixed Type :sum(Total Cost)}
 
A text chart like this would result with no filters applied – and the LOD value for Fruit is 5.75
 
 
 
 
 
Now see the Context filter effect on the LOD – 
 
Starting without the Produce Name in Context, Banana has been filtered out but the LOD value remains unchanged – the LOD is calculated before the dimension filter is applied
 
 
 
 
 
 
Place Produce Name in Context 
 
Banana is filtered out in the Context filter before the LOD is calculated so the count of values in Fruit is now just 2 and the LOD total is reduced to 3.75 – 
 
 
 
Sets are affected by using a context filter also.  The set will contain the Top 3 Produce Names
 
 
 
 
With no filters applied and nothing in the Context filter,  Apple, Banana, and Green Beans are the Top 3
 
 
 
 
 
If we filter out Banana without placing Produce Name in Context only Apple and Green Beans are in the result – 2 Produce Names – Not 3?  
 
The Set is determined on the entire data set – before the Dimension filter is applied – Later, after the Set is formed Banana is filtered out 
 
But if Produce Name is placed in Context and Banana is filtered out before the set is formed 3 values are returned Apple, Green Beans, and Spinach (Remember Banana is no longer in the data table for the worksheet)
 
 
 
 
 
Let's see what happens with a Top N Filter
 
Start with a filter on Produce Name set for Top N =3
 
 
 
 
Create a viz and add a filter on Type – with no filters applied and nothing in Context.
 
The Top 3 are Apple, Banana and Green Beans 
 
 
 
 
Filter out Vegetable from Type but do not place in Context
 
 
 
And only 2 values are returned Apple and Banana – the three Top N values were determined before the Dimension filter is applied – and included 1 vegetable – Green Beans.
 
Later when the Dimension filter filters out  Vegetable there are only the 2 Fruit values remaining in the Top N
 
But when Vegetables are filtered out in Context, the  Top N are determined only from Fruit and 3 values are returned – Apples, Bananas, and Grapes – 
 
 
 
 

Step 5 – Dimension Filters

 
Dimension filters come in a discrete and a continuous versions – The discrete dimension filter should feel familiar if you have ever used a filter on a spreadsheet. The process is the same – just select the dimension values to include in the view –  
 
Using the Superstore data set that came with Tableau 
 
Start with a simple bar chart of sales by year and segment – no filters applied 
 
 
 
 
Apply a filter on Segment and the segment is removed from the view – 
 
 
 
The dimension does not need to be on the rows or the columns in the view – the data will be filtered from the data table and the view adjusted
 
 
 
 
Dimension filters can also be applied to continuous data – Here a range filter with start and stop dates to a date field  
 
 
Or as a relative filter – like the last 6 Quarter shown below
 
 
 
Dimension filters are applied to the underlying table for the worksheet after the context filter and set, top N, and Fixed  LOD calculation noted earlier.   After they have been applied the table has been set and further calculations or filters are only applied to values in the table.
 

Step 6 – Data Blending 

 
The topic is much too broad for a full discussion here but understand where Blending occurs – After Fixed LOD's and dimension filters have been applied. Blending results in data that is aggregated at the level of the link (relationship) among the data sets – but further dimension filtering or use of LOD's across the blended data sets is not available.
 
 
Step 7 – Include and Exclude LOD's
 
Include and Exclude are the less frequently used pair of LOD expressions –
 
Based on their position in the Order of Operation they operate on the data table that results after the application of all Dimensional and context filters — 
 
So if the table is what do they do
 
Include and Exclude refer to the visible portion of the table that is in the Viz – Include will use dimension in the table NOT visible in the viz into the calculation  – Exclude will not use dimensions visible in the view — 
 
Let's see how they work:
 
For these examples, we will use a small product sales data set – 2 Customers, 2 Product Lines, 4 Products, and a variety of colors
 
 
 
 
Add an Include LOD at the Product level – 
 
 
 
When the viz includes the Product level and lower the average from the LOD and a simple average on quantity return the same result
 
 
 
As the hierarchy is compressed the value from the LOD is fixed at the product level
 
 
 
 
 
and  return the average at the product level even when it is no longer in the view
 
 
 
 
Exclude will eliminate the dimension from that calculation and the expression will be based on the next higher level in the view
 
With the same data set and
 
 
 
 
will return this when the detail is below the level of Product  – the same a simply summing the quantity
 
 
 
 
but as the hierarchy is compressed the values are locked at the Product level
 
 
 
 
 
and remains at that level 
 
 
 
 
 

Step 8 – Measure Filters

 
Measure filters limit the view based on the values in the data table (not the categorization created by the Dimensions) and can be applied to the aggregate level or the row-level data.
 
When you drag the measure to the rows shelf a window will open giving you to filter at the Row Level "All Values" or at any of the noted levels of aggregation – 
 
 
Continuing with the previous data set and filtering for the Sum(Quantity)  – a continuous range filter will open – 
 
 
 
Change the filter and any total value from the view that is not in the filter range is filtered out
 
 
 
when the hierarchy is collapsed to Product Line the only 2 totals that in the range are 14 and 19 – the 94 and 62 are filtered out of the viz
 
 
 
 

Now, look at how the Row Level Measure filter produces a different result

The filter is NOT an aggregate (no SUM() around the measure)
But for this view, the result is the same as the aggregate example



Collapse the hierarchy as before the results are different – and 2 totals 37 and 55 are not filtered out –  Why?  – 


The filter is applied at the row level in the data table and all the values between 10 – 35 are included – Then when the measure is brought to the viz they are summed – 


 

Step 9 – Grand Totals 


Much like Blending Data – Grand totals are a large topic and will be addressed in later posts – But note that Grand Totals are before Table Calculations – 


Step 10 – Table Calculations


As the name implies, Table Calculations are applied to the underlying table for the individual worksheet.  At the bottom of the order of operations, they work with the fully filtered data and are always aggregates.  They also can't be used in calculations that precede them in the order of operation – e.g. Table Calculations, Sets, or Top N – 

They are extremely powerful tools used to compare data, total, look up or provide running sums across or within the data table -understand their position in the order of operation is at the bottom and the data table has been set. Using Table Calculations will be the subject on another FAQ so come back.


Hope this helps provide a better sense of what the order of operation is and how it is important when creating your viz


A workbook containing the examples used here can be found and downloaded from 
Link to workbook

 

Presented at Pittsburgh TUG 12/17/20 see Video link  at the 45-minute mark

Enjoy and let me know if you have questions
Jim

Leave a Reply

Your email address will not be published.