VASP benchmark on dual-core dual-opteron cluster
Posted: Fri Apr 07, 2006 2:06 am
Hi,
I think this benchmark might be interesting to some VASP
users, so I post the result here. We have managed to
compile VASP 4.6.26 with PGI 6.0.8/MPICH 1.2.6, with
GOTO library, on our Opteron 275 cluster with gigabit
ethernet switch.
The purpose of doing this benchmark is to find out the way
to improve the parallel performance of VASP on this cluster,
and find out the most effective way to run parallel VASP job
on dual-core dual-opteron cluster.
At the moment, each node has only 4GB memory and 4
cores. The test case is a system of 8 TiO2 uint cell. The
benchmark was done on a seperated 4-node cluster.
We are thinking to upgrade to infiniband switch and increase
to 8GB memory on each node.
Regards
Jyh-Shyong Ho, Ph.D.
Research Scientist
National Center for High Performance Computing
Hsinchu, Taiwan, ROC
8core2node
Total CPU time used (sec): 904.177
User time (sec): 801.994
System time (sec): 102.182
Elapsed time (sec): 1384.815
Minor page faults: 5446219
Major page faults: 0
Voluntary context switches: 900481
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
4core2node
Total CPU time used (sec): 1717.815
User time (sec): 1557.233
System time (sec): 160.582
Elapsed time (sec): 2019.904
Minor page faults: 10251676
Major page faults: 0
Voluntary context switches: 1875775
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
2core2node
Total CPU time used (sec): 3168.858
User time (sec): 2974.274
System time (sec): 194.584
Elapsed time (sec): 3608.532
Minor page faults: 19797483
Major page faults: 0
Voluntary context switches: 2477002
free energy TOTEN = -210.161120 eV
Iteration 1( 14)
4core4node
Total CPU time used (sec): 1671.648
User time (sec): 1530.916
System time (sec): 140.733
Elapsed time (sec): 1859.653
Minor page faults: 10251508
Major page faults: 0
Voluntary context switches: 2015003
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
8core4node
Total CPU time used (sec): 791.257
User time (sec): 703.924
System time (sec): 87.333
Elapsed time (sec): 1095.928
Minor page faults: 5446097
Major page faults: 0
Voluntary context switches: 699000
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
16core4node
Total CPU time used (sec): 518.280
User time (sec): 382.768
System time (sec): 135.512
Elapsed time (sec): 1483.999
Minor page faults: 2777032
Major page faults: 0
Voluntary context switches: 1526597
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
1core1node
Total CPU time used (sec): 5399.341
User time (sec): 5237.087
System time (sec): 162.254
Elapsed time (sec): 5435.051
Minor page faults: 32855418
Major page faults: 0
Voluntary context switches: 3
free energy TOTEN = -210.159666 eV
Iteration 1( 14)
2core1node
Total CPU time used (sec): 3318.183
User time (sec): 3117.879
System time (sec): 200.305
Elapsed time (sec): 3442.154
Minor page faults: 19797485
Major page faults: 0
Voluntary context switches: 639192
free energy TOTEN = -210.161120 eV
Iteration 1( 14)
4core1node
Total CPU time used (sec): 1746.881
User time (sec): 1584.151
System time (sec): 162.730
Elapsed time (sec): 2071.770
Minor page faults: 10251756
Major page faults: 0
Voluntary context switches: 1759164
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
I think this benchmark might be interesting to some VASP
users, so I post the result here. We have managed to
compile VASP 4.6.26 with PGI 6.0.8/MPICH 1.2.6, with
GOTO library, on our Opteron 275 cluster with gigabit
ethernet switch.
The purpose of doing this benchmark is to find out the way
to improve the parallel performance of VASP on this cluster,
and find out the most effective way to run parallel VASP job
on dual-core dual-opteron cluster.
At the moment, each node has only 4GB memory and 4
cores. The test case is a system of 8 TiO2 uint cell. The
benchmark was done on a seperated 4-node cluster.
We are thinking to upgrade to infiniband switch and increase
to 8GB memory on each node.
Regards
Jyh-Shyong Ho, Ph.D.
Research Scientist
National Center for High Performance Computing
Hsinchu, Taiwan, ROC
8core2node
Total CPU time used (sec): 904.177
User time (sec): 801.994
System time (sec): 102.182
Elapsed time (sec): 1384.815
Minor page faults: 5446219
Major page faults: 0
Voluntary context switches: 900481
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
4core2node
Total CPU time used (sec): 1717.815
User time (sec): 1557.233
System time (sec): 160.582
Elapsed time (sec): 2019.904
Minor page faults: 10251676
Major page faults: 0
Voluntary context switches: 1875775
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
2core2node
Total CPU time used (sec): 3168.858
User time (sec): 2974.274
System time (sec): 194.584
Elapsed time (sec): 3608.532
Minor page faults: 19797483
Major page faults: 0
Voluntary context switches: 2477002
free energy TOTEN = -210.161120 eV
Iteration 1( 14)
4core4node
Total CPU time used (sec): 1671.648
User time (sec): 1530.916
System time (sec): 140.733
Elapsed time (sec): 1859.653
Minor page faults: 10251508
Major page faults: 0
Voluntary context switches: 2015003
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
8core4node
Total CPU time used (sec): 791.257
User time (sec): 703.924
System time (sec): 87.333
Elapsed time (sec): 1095.928
Minor page faults: 5446097
Major page faults: 0
Voluntary context switches: 699000
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
16core4node
Total CPU time used (sec): 518.280
User time (sec): 382.768
System time (sec): 135.512
Elapsed time (sec): 1483.999
Minor page faults: 2777032
Major page faults: 0
Voluntary context switches: 1526597
free energy TOTEN = -210.160380 eV
Iteration 1( 14)
1core1node
Total CPU time used (sec): 5399.341
User time (sec): 5237.087
System time (sec): 162.254
Elapsed time (sec): 5435.051
Minor page faults: 32855418
Major page faults: 0
Voluntary context switches: 3
free energy TOTEN = -210.159666 eV
Iteration 1( 14)
2core1node
Total CPU time used (sec): 3318.183
User time (sec): 3117.879
System time (sec): 200.305
Elapsed time (sec): 3442.154
Minor page faults: 19797485
Major page faults: 0
Voluntary context switches: 639192
free energy TOTEN = -210.161120 eV
Iteration 1( 14)
4core1node
Total CPU time used (sec): 1746.881
User time (sec): 1584.151
System time (sec): 162.730
Elapsed time (sec): 2071.770
Minor page faults: 10251756
Major page faults: 0
Voluntary context switches: 1759164
free energy TOTEN = -210.160380 eV
Iteration 1( 14)