Deeper look at CPU utilization : TMAM Example

This is a follow-up to my previous post, "Deeper look at CPU utilization : The power of PMU events".

So let’s go back to my previous example using the General Exploration view of Intel VTune:

Capture 12

One of the highlighted metrics is “DTLB Overhead”. It is an estimate of the performance penalty paid for missing the first-level data TLB (DTLB), which includes hitting in the second-level data TLB (STLB) as well as performing a hardware page walk on an STLB miss.
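To make the definition above a bit more concrete, here is a minimal sketch of how such an estimate could be put together from raw counts. This is not VTune's exact formula: the function name, the parameter names and the default STLB hit cost of 7 cycles are all assumptions chosen for illustration, and the real events and constants depend on the microarchitecture.

```python
def dtlb_overhead(stlb_hits, page_walk_cycles, total_cycles, stlb_hit_cost=7):
    """Estimate the fraction of cycles lost to first-level DTLB misses.

    Hypothetical model: every DTLB miss that hits in the STLB costs a fixed
    number of cycles (stlb_hit_cost, an assumed value), and every STLB miss
    costs the cycles spent in the hardware page walk.
    """
    if total_cycles <= 0:
        raise ValueError("total_cycles must be positive")
    penalty_cycles = stlb_hits * stlb_hit_cost + page_walk_cycles
    return penalty_cycles / total_cycles


# Made-up counter values, only to show the arithmetic.
overhead = dtlb_overhead(
    stlb_hits=1_000_000,
    page_walk_cycles=13_000_000,
    total_cycles=400_000_000,
)
print(f"DTLB overhead: {overhead:.1%}")  # DTLB overhead: 5.0%
```

With these made-up numbers, 20 million of the 400 million cycles are attributed to DTLB misses.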

One of the recommendations is to use the famous large page sizes! So let’s do it!
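Why would large pages help? A rough way to see it is the "TLB reach": the amount of memory that can be translated without a miss, which is simply the number of entries times the page size. The sketch below assumes a hypothetical TLB with 64 entries and compares 4 KiB pages with 2 MiB pages. Real TLBs often have a different number of entries for each page size, so take the numbers as an illustration only.

```python
KIB = 1024
MIB = 1024 * KIB


def tlb_reach_bytes(entries, page_size_bytes):
    """Memory covered by a TLB holding `entries` translations."""
    return entries * page_size_bytes


ENTRIES = 64  # hypothetical entry count, not taken from a specific CPU

small = tlb_reach_bytes(ENTRIES, 4 * KIB)
large = tlb_reach_bytes(ENTRIES, 2 * MIB)

print(f"4 KiB pages: {small // KIB} KiB")     # 4 KiB pages: 256 KiB
print(f"2 MiB pages: {large // MIB} MiB")     # 2 MiB pages: 128 MiB
print(f"ratio: {large // small}x")            # ratio: 512x
```

With the same number of entries, the larger page size covers 512 times more memory, so far fewer accesses should miss the DTLB.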

After restarting my database instance with large pages enabled and rerunning my program, this is what we get:
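Before comparing results, it is worth checking that large pages are really in use. The post does not say which operating system was used, so the following is a sketch that assumes Linux, where `/proc/meminfo` exposes the `HugePages_*` and `Hugepagesize` fields. The helper names are made up for this example.

```python
def parse_hugepage_info(meminfo_text):
    """Extract the HugePages_* and Hugepagesize fields from /proc/meminfo text."""
    info = {}
    for line in meminfo_text.splitlines():
        key, sep, rest = line.partition(":")
        if sep and (key.startswith("HugePages_") or key == "Hugepagesize"):
            # Values look like "    1024" or "    2048 kB"; keep the number.
            info[key] = int(rest.split()[0])
    return info


def read_hugepage_info(path="/proc/meminfo"):
    """Read and parse the huge page fields on a Linux system (assumed path)."""
    with open(path) as f:
        return parse_hugepage_info(f.read())


# Sample text with made-up values, so the example also runs off Linux.
sample = """MemTotal:       16384000 kB
HugePages_Total:    1024
HugePages_Free:      100
Hugepagesize:       2048 kB
"""
info = parse_hugepage_info(sample)
in_use = info["HugePages_Total"] - info["HugePages_Free"]
print(f"{in_use} huge pages of {info['Hugepagesize']} kB in use")
# 924 huge pages of 2048 kB in use
```

On a real system, calling `read_hugepage_info()` after the instance has started should show `HugePages_Free` lower than `HugePages_Total` if the instance allocated its memory from the huge page pool.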

capture 13

 

Using large pages has significantly reduced the “DTLB Overhead” metric, and it is no longer highlighted! We have also slightly improved our CPI from 0.553 to 0.532 and reduced our execution time from 5.37 to 5.20.
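To put the CPI and execution time figures in perspective, here is a small sketch that turns the before and after values quoted above into relative reductions. The function name is just an illustration.

```python
def relative_reduction(before, after):
    """Return the reduction from `before` to `after` as a fraction of `before`."""
    return (before - after) / before


cpi_gain = relative_reduction(0.553, 0.532)
time_gain = relative_reduction(5.37, 5.20)

print(f"CPI reduced by {cpi_gain:.1%}")              # CPI reduced by 3.8%
print(f"Execution time reduced by {time_gain:.1%}")  # Execution time reduced by 3.2%
```

Both numbers moved by roughly the same amount, between 3% and 4%.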

This was just a very simple example of using TMAM (Top-down Microarchitecture Analysis Method)!

That’s it 😀

