|
|
|
|
Compute Event Statistics | ||||
---|---|---|---|---|
Ntasks | Avg | Min | Max | |
PM_CYC | * | 13600813080.10 | 12263265612 (20) | 16521803203 (80) |
PM_FPU0_CMPL | * | 144938310.49 | 143350547 (27) | 189732170 (0) |
PM_FPU1_CMPL | * | 95645162.97 | 94108230 (64) | 108848810 (0) |
PM_FPU_FMA | * | 167599115.97 | 167287688 (159) | 208546794 (0) |
PM_INST_CMPL | * | 16511364892.48 | 13858144445 (39) | 22570219644 (64) |
PM_LD_CMPL | * | 5325135266.16 | 4443475626 (39) | 7321049030 (64) |
PM_ST_CMPL | * | 3364185584.42 | 2854809401 (39) | 4493389034 (64) |
PM_TLB_MISS | * | 12623075.34 | 10886496 (112) | 14513818 (65) |
Communication Event Statistics (100.00% detail, -3.2229e-04 error) | |||||||
---|---|---|---|---|---|---|---|
Buffer Size | Ncalls | Total Time | Min Time | Max Time | %MPI | %Wall | |
MPI_Barrier | 0 | 3360 | 568.217 | 0.000 | 4.013 | 18.37 | 6.06 |
MPI_Waitall | 128 | 79500 | 554.948 | 0.000 | 4.047 | 17.95 | 5.92 |
MPI_Waitall | 4624 | 477 | 482.188 | 0.000 | 7.413 | 15.59 | 5.14 |
MPI_Waitall | 2112 | 876000 | 247.277 | 0.000 | 1.448 | 8.00 | 2.64 |
MPI_Waitall | 1088 | 876000 | 217.913 | 0.000 | 1.632 | 7.05 | 2.32 |
MPI_Waitall | 4160 | 730000 | 205.392 | 0.000 | 1.054 | 6.64 | 2.19 |
MPI_Waitall | 576 | 876000 | 183.123 | 0.000 | 2.547 | 5.92 | 1.95 |
MPI_Waitall | 320 | 876000 | 157.252 | 0.000 | 0.868 | 5.09 | 1.68 |
MPI_Isend | 32 | 4467908 | 112.708 | 0.000 | 0.002 | 3.64 | 1.20 |
MPI_Isend | 512 | 1908000 | 40.701 | 0.000 | 0.043 | 1.32 | 0.43 |
MPI_Isend | 1024 | 1590000 | 35.463 | 0.000 | 0.036 | 1.15 | 0.38 |
MPI_Isend | 128 | 1989408 | 31.907 | 0.000 | 0.041 | 1.03 | 0.34 |
MPI_Irecv | 32 | 4467908 | 31.102 | 0.000 | 0.002 | 1.01 | 0.33 |
MPI_Isend | 64 | 1910385 | 29.211 | 0.000 | 0.042 | 0.94 | 0.31 |
MPI_Isend | 256 | 1908000 | 29.164 | 0.000 | 0.043 | 0.94 | 0.31 |
MPI_Waitall | 2080 | 72000 | 24.534 | 0.000 | 1.198 | 0.79 | 0.26 |
MPI_Waitall | 4128 | 60000 | 20.243 | 0.000 | 1.005 | 0.65 | 0.22 |
MPI_Waitall | 1056 | 72000 | 17.011 | 0.000 | 1.581 | 0.55 | 0.18 |
MPI_Waitall | 544 | 72000 | 15.360 | 0.000 | 1.719 | 0.50 | 0.16 |
MPI_Waitall | 288 | 72000 | 11.344 | 0.000 | 1.099 | 0.37 | 0.12 |
MPI_Irecv | 128 | 1989408 | 9.432 | 0.000 | 0.014 | 0.31 | 0.10 |
MPI_Irecv | 512 | 1908000 | 9.278 | 0.000 | 0.014 | 0.30 | 0.10 |
MPI_Irecv | 256 | 1908000 | 8.847 | 0.000 | 0.014 | 0.29 | 0.09 |
MPI_Irecv | 64 | 1910385 | 8.826 | 0.000 | 0.015 | 0.29 | 0.09 |
MPI_Irecv | 1024 | 1590000 | 7.333 | 0.000 | 0.012 | 0.24 | 0.08 |
MPI_Waitall | 0 | 1762880 | 7.074 | 0.000 | 0.044 | 0.23 | 0.08 |
MPI_Waitall | 735216 | 3 | 3.976 | 0.000 | 3.976 | 0.13 | 0.04 |
MPI_Waitall | 352 | 6000 | 2.159 | 0.000 | 1.696 | 0.07 | 0.02 |
MPI_Isend | 96 | 91500 | 1.969 | 0.000 | 0.011 | 0.06 | 0.02 |
MPI_Waitall | 2144 | 6000 | 1.912 | 0.000 | 0.805 | 0.06 | 0.02 |
MPI_Waitall | 1120 | 6000 | 1.847 | 0.000 | 0.683 | 0.06 | 0.02 |
MPI_Waitall | 608 | 6000 | 1.517 | 0.000 | 0.744 | 0.05 | 0.02 |
MPI_Waitall | 1296 | 477 | 1.483 | 0.000 | 0.014 | 0.05 | 0.02 |
MPI_Waitall | 15264 | 500 | 1.373 | 0.000 | 1.373 | 0.04 | 0.01 |
MPI_Waitall | 400 | 477 | 1.361 | 0.000 | 0.014 | 0.04 | 0.01 |
MPI_Waitall | 2208 | 6000 | 0.994 | 0.000 | 0.707 | 0.03 | 0.01 |
MPI_Waitall | 4192 | 5000 | 0.968 | 0.000 | 0.516 | 0.03 | 0.01 |
Load balance by task: HPM counters |
---|
by MPI rank, by MPI time |
Load balance by task: memory, flops, timings |
---|
by MPI rank, by MPI time |
Communication balance by task (sorted by MPI rank) |
---|
Communication balance by task (sorted by MPI time) |
Communication Topology : point to point data flow |
---|
|
Communication Topology : Connectivity (sorted by # neighbors) |
---|
Message Buffer Size Distributions: time |
---|
|
Message Buffer Size Distributions: Ncalls |
---|
|
Message Buffer Size Distributions: data volume |
---|
|
Switch volume by node |
---|
|
Memory usage by node |
---|
|