Explanation of Statistics/2
- Instruction or data mispredictions/hits: should be small (~0)
- L1-L2 bandwidth used ([MB/s] average per process)
- total number of bytes moved between L1 and L2 divided by total run time
-
-
- Memory bandwidth used ([MB/s] average per process)
- total number of bytes moved between L2 an memory divided by total time
-
-
- Performance Mflop/s: larger is better
- it is computed from the #fp instructions divided by total run time
- the madd instruction counts as 1, so this statistic usually greatly underestimates the performance in Mflop/s
-
- Time accessing memory/Total time: should be small (~0)
#bytes moved = (L1 cache miss * L1 cache line size + Quadwords written back *16 bytes)
#bytes moved = (L2 cache miss * L2 cache line size + Quadwords written back *16 bytes)