[1]PATERA A T. A spectral element method for fluid dynamics: Laminar flow in a channel expansion[J]. Journal of Computational Physics, 1984, 54(3): 468-488.
[2]KOTHE D. Toward predictive modeling of nuclear reactor performance: application development experiences, challenges, and plans in CASL[R/OL]. [2020-04-16]. Baton Rouge: Louisiana State University. https:∥www.casl.gov/sites/default/files/docs/CASL-U-2014-0359-000.pdf.
[3]PIETER W. High performance computing in fluid dynamics: Proceedings of the summer school on high performance computing in fluid dynamics held at Delft University of Technology[M]. The Netherlands: Springer Science & Business Media, 1996.
[4]YTTERSTRÖM A. A tool for partitioning structured multiblock meshes for parallel computational mechanics[J]. The International Journal of Supercomputer Applications and High Performance Computing, 1997, 11(4): 336-343.
[5]SAULE E, BAS E O, CATALYUEREK U V. Load-balancing spatially located computations using rectangular partitions[J]. Journal of Parallel and Distributed Computing, 2012, 72(10): 1201-1214.
[6]POTHEN A, SIMON H D, LIOU K P. Partitioning sparse matrices with eigenvectors of graphs[J]. Siam Journal on Matrix Analysis & Applications, 1990, 11(3): 430-452.
[7]BARNARD S T, SIMON H D. Fast multilevel implementation of recursive spectral bisection for partitioning unstructured problems[J]. Concurrency: Practice and Experience, 1994, 6(2): 101-117.
[8]BARNARD S T. PMRSB: Parallel multilevel recursive spectral bisection[C]∥Kennedy K. Proceedings of the 1995 ACM/IEEE Conference on Supercomputing. San Diego: IEEE, 1995: 27.
[9]BARNARD S T, SIMON H. A parallel implementation of multilevel recursive spectral bisection for application to adaptive unstructured meshes[R]. US: Society for Industrial and Applied Mathematics, 1995.
[10]王琥,李光耀,钟志华. 有限元并行计算自动分区方法的优化[J]. 计算机辅助设计与图形学学报,2005(8):1766-1772.
WANG Hu, LI Guangyao, ZHONG Zhihua. Optimization on automatic mesh partition method in parallel finite element computation[J]. Journal of Computer-Aided Design & Computer Graphics, 2005(8): 1766-1772(in Chinese).
[11]FOURNIER Y, BONELLE J, MOULINEC C, et al. Optimizing code_saturne computations on petascale systems[J]. Computers & Fluids, 2011, 45(1): 103-108.
[12]OJHA R, PAWAR P, GUPTA S, et al. Performance optimization of OpenFOAM on clusters of Intel Xeon Phi (TM) processors[C]∥Dinkar S. 2017 IEEE 24th International Conference on High Performance Computing Workshops (HiPCW). Jaipur, India: IEEE, 2017: 51-59.
[13]孟德龙,文敏华,韦建文,等. 神威太湖之光上OpenFOAM的移植与优化[J]. 计算机科学,2017,44(10):64-70.
MENG Delong, WEN Minhua, WEI Jianwen, et al. Porting and optimizing OpenFOAM on Sunway TaihuLight System[J]. Computer Science, 2017, 44(10): 64-70(in Chinese).
[14]李芳,李志辉,徐金秀,等. 基于十亿亿次国产超算系统的流体力学软件众核适应性研究[J]. 计算机科学,2020,47(1):24-30.
LI Fang, LI Zhihui, XU Jinxiu, et al. Research on adaptation of CFD software based on manvcore architecture of 100P domestio supercomputing system[J]. Computer Science, 2020, 47(1): 24-30(in Chinese).
[15]TOMBOULIDES A G, LEE J C Y, ORSZAG S A. Numerical simulation of low Mach number reactive flows[J]. Journal of Scientific Computing, 1997, 12(2): 139-167.
[16]OFFERMANS N, MARIN O, SCHANEN M, et al. On the strong scaling of the spectral element solver Nek5000 on petascale systems[C]∥Proceedings of the Exascale Applications and Software Conference 2016. New York: Association for Computing Machinery, 2016: 1-10.
[17]GRAHAM A. Kronecker products and matrix calculus with applications[M]. New York: Courier Dover Publications, 2018.
[18]JOHNSON D S, STOCKMEYER L. Some simplified NP-complete graph problems[J]. Theoretical Computer Science, 1976, 1: 237-267.
[19]FIEDLER M. A property of eigenvectors of nonnegative symmetric matrices and its application to graph theory[J]. Czechoslovak Mathematical Journal, 1975, 25(4): 619-633.
[20]CHAN T, GILBERT J R, TENG S H. Geometric spectral partitioning[M]. [S. l.]: Xerox Corporation, Palo Alto Research Center, 1994: 1-10.
[21]FU H, LIAO J, YANG J, et al. The Sunway TaihuLight Supercomputer: System and applications[J]. Science China Information Sciences, 2016, 59(7): 1-16.
[22]WANG X, ZHOU Z, HU C, et al. Accelerating and tuning small matrix multiplications on Sunway TaihuLight: A case study of spectral element CFD Code Nek5000[J]. The International Journal of High Performance Computing Applications, 2020, 34(2): 178-186.
[23]JIANG L, YANG C, MA W. Enabling highly efficient batched matrix multiplications on SW26010 many-core processor[J]. ACM Transactions on Architecture and Code Optimization, 2020, 17(1): 1-23.
|