Graphics Processing Unit (GPU) Performance on an N-Body Problem

Report No. ARL-CR-0629
Authors: Pat Collins
Date/Pages: August 2009; 26 pages
Abstract: The objective of this study is to evaluate the performance of clusters of Nvidia graphics processing units on an N-body problem derived from the computation of vector potentials. Two clusters are used for this purpose. The first is a 2-node, Intel Xeon system with a single Tesla S870 system cross connected to each node. The second is a 20-node Opteron system with one Quadro FX 5600 GPU per node. The results show a significant increase in performance when GPUs accelerate the computation. With 16 GPUs and a sufficiently large problem, an estimated 3 teraflops is achieved.
Distribution: Approved for public release
  Download Report ( 0.228 MBytes )
If you are visually impaired or need a physical copy of this report, please visit and contact DTIC.

Last Update / Reviewed: August 1, 2009