|
[1]
|
王飞, 王建业, 张安堂. 实对称矩阵特征值分解高速并行算法的FPGA实现[J]. 空军工程大学(自然科学版), 2008, 9(6): 67-70.
|
|
[2]
|
位寅生, 谭久彬, 郭荣. MUSIC空间谱估计并行运算算法[J]. 系统工程与电子技术, 2012, 34(1): 12-16.
|
|
[3]
|
程豪, 张云泉, 张先轶, 李玉成. CPU-GPU并行矩阵乘法的实现与性能分析[J]. 软件技术与数据库, 2010, 36(13): 24-26.
|
|
[4]
|
伍湘君, 黄丽萍. 超级计算机上矩阵乘的并行计算与实现[J]. 应用气象学报, 2005, 16(1): 122-127.
|
|
[5]
|
唐俊奇. 多处理机中矩阵乘法的算法研究[J]. 中国西部科技, 2007, 2: 4-8.
|
|
[6]
|
L. E. Cannon. A cellular computer to implement the Kalman filter algorithm. Montana State University, 1969.
|
|
[7]
|
L. S. Blackford, J. Demmel, J. Dongarra, I. Duff, S. Hammarling, G. Henry, M. Heroux, L. Kaufman, A. Lumsdaine, A. Petitet, R. Pozo, K. Remington and R. C. Whaley. An updated set of basic linear algebra subprograms (BLAS). ACM Transactions on Mathe- matical Software, 2002, 28(2): 135-151.
|
|
[8]
|
S. Robinson. Toward an optimal algorithm for matrix multipli- cation. SIAM News, 2005,
http://www.siam.org/pdf/news/174.pdf
|
|
[9]
|
H. Cohn, R. Kleinberg, B. Szegedy and C. Umans. Group-theo- retic algorithms for matrix multiplication. Proceedings of the 46th Annual Symposium on Foundations of Computer Science, 23-25 October 2005.
|
|
[10]
|
迟学斌. 高性能并行计算[M]. 北京: 中国科学院计算机网络信息中心, 2005: 31-36.
|
|
[11]
|
Y. Fournier, J. Bonelle, C. Moulinec, Z. Shang, A. G. Sunderland and J. C. Uribe. Optimizing Code_Saturne computations on Peta- scale systems. Computers & Fluids, 2011, 45: 103-108.
|