Abstract: Barrier synchronization is an important communication pattern in high-performance supercomputers. This paper proposes a new NIC-based method for offloading barrier communication. The method modifies the traditional dissemination barrier algorithm so that barrier messages can be sent and received in parallel, which greatly reduces communication delay. Based on the new barrier algorithm, the paper also presents a descriptor-based hardware-software interface and its hardware implementation. Compared with a traditional barrier implementation, the proposed method greatly improves performance.
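
For orientation, the traditional dissemination barrier that the proposed method builds on can be sketched as follows. This is a minimal software illustration of the textbook algorithm only, not the paper's parallel NIC-offloaded variant; the function names (`num_rounds`, `dissemination_schedule`, `simulate`) are hypothetical and chosen for this sketch. In round k, process i signals process (i + 2^k) mod n and waits for a signal from process (i - 2^k) mod n, so the barrier completes in ceil(log2 n) sequential rounds.

```python
def num_rounds(n):
    """Rounds needed for n processes: ceil(log2(n)), computed with integers."""
    return (n - 1).bit_length()


def dissemination_schedule(rank, n):
    """Return (round, send_to, recv_from) for each round of one process."""
    schedule = []
    for k in range(num_rounds(n)):
        dist = 1 << k  # 2**k
        schedule.append((k, (rank + dist) % n, (rank - dist) % n))
    return schedule


def simulate(n):
    """Check that after all rounds every process has, directly or
    transitively, heard from every other process."""
    known = [{i} for i in range(n)]
    for k in range(num_rounds(n)):
        dist = 1 << k
        # All processes exchange in the same round, so build the next
        # state from a snapshot of the current one.
        known = [known[i] | known[(i - dist) % n] for i in range(n)]
    return all(s == set(range(n)) for s in known)


if __name__ == "__main__":
    for step in dissemination_schedule(0, 8):
        print(step)
    print(simulate(5))
```

In this baseline each round depends on the previous one having completed, which is the serialization that a parallel send-and-receive scheme would aim to relax.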