Wang, Pan, Cheng, Zhiquan, Martin, Ralph Robert, Liu, Huahai, Cai, Xun and Li, Sikun 2013. NUMA-aware image compositing on multi-GPU platform. The Visual Computer 29 (6-8) , pp. 639-649. 10.1007/s00371-013-0803-7 |
Preview |
PDF
Download (1MB) | Preview |
Abstract
Sort-last parallel rendering is widely used. Recent GPU developments mean that a PC equipped with multiple GPUs is a viable alternative to a high-cost supercomputer: the Fermi architecture of a single GPU supports uniform virtual addressing, providing a foundation for non-uniform memory access (NUMA) on multi-GPU platforms. Such hardware changes require the user to reconsider the parallel rendering algorithms. In this paper, we thoroughly investigate the NUMA-aware image compositing problem, which is the key final stage in sort-last parallel rendering. Based on a proven radix-k strategy, we find one optimal compositing algorithm, which takes advantage of NUMA architecture on the multi-GPU platform. We quantitatively analyze different image compositing modes for practical image compositing, taking into account peer-to-peer communication costs between GPUs. Our experiments on various datasets show that our image compositing method is very fast, an image of a few megapixels can be composited in about 10 ms by eight GPUs.
Item Type: | Article |
---|---|
Date Type: | Publication |
Status: | Published |
Schools: | Computer Science & Informatics |
Uncontrolled Keywords: | Multi-GPU System; Parallel rendering; Image compositing |
Additional Information: | PDF uploaded in accordance with publisher's policy http://www.sherpa.ac.uk/romeo/issn/0178-2789/ [accessed 06/05/2015] The final publication is available at Springer via http://dx.doi.org/10.1007/s00371-013-0803-7 |
Publisher: | Springer |
ISSN: | 0178-2789 |
Last Modified: | 02 Dec 2024 07:30 |
URI: | https://orca.cardiff.ac.uk/id/eprint/50739 |
Citation Data
Cited 3 times in Scopus. View in Scopus. Powered By Scopus® Data
Actions (repository staff only)
![]() |
Edit Item |