LIU Jie, CHI Li-hua, XIE Lin-chuan, WANG Yang, GAN Xin-biao, FENG Hua,HU Qing-feng
(Science and Technology on Parallel and Distributed Processing Laboratory, National Univ of Defense Technology, Changsha, Hunan 410073,China) 在知网中查找 在百度中查找 在本站中查找
DSP processor can be used to solve the high performance computation problems, which has the characteristics of high computing performance and low power. Matrix multiplication algorithm is the kernel of many scientific and technology computation, so it is of importance for theorem and practice. Based on general purpose DSP (GPDSP), a new parallel algorithm for matrix multiplication was proposed. And a peak performance model for matrix multiplication was built. From the peak performance model, an architecture of GPDSP was set up, and the parameter of GPDSP with Tflops was given, which includes the number of pipe-line, the number of SIMD registers, the breadth and latency for the hierarchical memories.