Title | Optimising the performance of the spectral/hp element method with collective linear algebra operations |
Publication Type | Journal Article |
Year of Publication | 2016 |
Authors | D. Moxey, C.d. Cantwell, R.m. Kirby, S.j. Sherwin |
Journal | Computer Methods in Applied Mechanics and Engineering |
Volume | 310 |
Pagination | 628-645 |
ISSN | 0045-7825 |
Abstract | As computing hardware evolves, increasing core counts mean that memory bandwidth is becoming the deciding factor in attaining peak performance of numerical methods. High-order finite element methods, such as those implemented in the spectral/hp framework Nektar++, are particularly well-suited to this environment. Unlike low-order methods that typically utilise sparse storage, matrices representing high-order operators have greater density and richer structure. In this paper, we show how these qualities can be exploited to increase runtime performance on nodes that comprise a typical high-performance computing system, by amalgamating the action of key operators on multiple elements into a single, memory-efficient block. We investigate different strategies for achieving optimal performance across a range of polynomial orders and element types. As these strategies all depend on external factors such as BLAS implementation and the geometry of interest, we present a technique for automatically selecting the most efficient strategy at runtime. |
DOI | 10.1016/j.cma.2016.07.001 |