Publication Details

Accelerating Hybrid Local Domain Decomposition for the k-Wave Toolbox on Multi-GPU Systems

KUNÍK Oliver and JAROŠ Jiří. Accelerating Hybrid Local Domain Decomposition for the k-Wave Toolbox on Multi-GPU Systems. Ostrava, 2024.
Czech title
Zrychlení hybridního lokálního rozložení domény pro k-Wave toolbox na multi-GPU systémech
Type
presentation,poster
Language
english
Authors
Keywords

k-Wave, HPC, Hybrid decomposition, Local decomposition, CUDA, Multi-GPU

Abstract

The k-Wave toolbox is designed for high-fidelity acoustic wave simulations using Fourier collocation for spatial derivatives, but its performance is constrained by communication overhead on multi-CPU and multi-GPU systems. We present a hybrid local domain decomposition approach that partitions the simulation domain into subdomains, each assigned to a GPU with configurable resolution. Using CUDA and cuFFT for Fourier transforms and NVLink for halo exchanges, our method minimizes inter-subdomain communication and accelerates multi-GPU performance. Testing on the Karolina supercomputer shows strong scalability and accuracy, especially with uniform-resolution subdomains, and proves effective even for large-scale simulations beyond single-GPU memory limits.

Published
2024
Pages
1
Conference
8th Users' Conference of IT4Innovations, Ostrava, CZ
Place
Ostrava, CZ
Back to top