MPI Performance
- Low latency and high bandwidth.
- Fetchop-assisted fast message queuing
- Fast fetchop tree barriers
- Very fast MPI and SHMEM one-sided communication
- Interoperability with SHMEM
- Support for SSI to 512 P
- Automatic NUMA placement
- Optimized MPI collectives
- Internal MPI statistics reporting
- Integration with PCP
- Direct send/recv transfers
- No-impact thread safety support
- Runtime MPI tuning