Performance of Quantum Chromodynamics (QCD) Simulations on the SGI Altix
Allalen Mohammed 1, Brehm Matthias 1, Stüben Hinnerk 2
1Leibniz Supercomputing Centre (LRZ), Garching, Germany
*e-mail: allalen@lrz.de
2Konrad-Zuse-Zentrum für Informationstechnik Berlin (ZIB), Berlin, Germany
Received: 7 July 2008; accepted 31 October 2008; published online: 10 December 2008
DOI: 10.12921/cmst.2008.14.01.69-75
OAI: oai:lib.psnc.pl:647
Abstract:
We study the performance and scaling of the Berlin Quantum Chromodynamics Program (BQCD) on the SGI Altix 4700 at the Leibniz Supercomputing Centre (LRZ). We employ different communication methods (MPI, MPI with two OpenMP threads per process, and the shmem library) and run the MPI version on both types of nodes of that machine. For comparison with other machines, we also measured performance on an IBM p690 cluster and a Cray XT4.