Performance: Difference between revisions
From Biowerkzeug Wiki
try sortable |
use <source/> |
||
(14 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
== Single processor performance == | |||
As a basic performance test we run the simulations in the <tt>test</tt> directory with these commands (on Linux): | As a basic performance test we run the simulations in the <tt>test</tt> directory with these commands (on Linux): | ||
cd testjobs | cd testjobs | ||
Line 6: | Line 7: | ||
<li>CPU properties are found with | <li>CPU properties are found with | ||
<pre>cat /proc/cpuinfo</pre> | <pre>cat /proc/cpuinfo</pre> | ||
and model names from [http://en.wikipedia.org/wiki/List_of_Intel_microprocessors | and model names from [http://en.wikipedia.org/wiki/List_of_Intel_microprocessors List of Intelmicroprocessors] for Intel (also see the [http://en.wikipedia.org/wiki/List_of_Intel_microprocessors#Detailed_x86_architecture_microprocessor_lists detailed lists of Intel x86 CPUs]) and [http://en.wikipedia.org/wiki/List_of_AMD_microprocessors List of AMD microprocessors] for AMD cpus (also see the [http://en.wikipedia.org/wiki/List_of_AMD_microprocessors#Detailed_microprocessor_release_lists detailed lists of AMD CPUs]).</li> | ||
<li>Also report the binary used (<tt>hippo</tt> or <tt>hippo_p3</tt>) and the revision.</li> | <li>Also report the binary used (<tt>hippo</tt> or <tt>hippo_p3</tt>) and the revision.</li> | ||
<li>Note that these tests only utilize a ''single core''.</li> | <li>Note that these tests only utilize a ''single core''.</li> | ||
<li>These results only give a rough idea of the ''relative'' performance of different architectures.</li> | <li>These results only give a rough idea of the ''relative'' performance of different architectures.</li> | ||
<li>The timings have errorbars of about ±2s</li> | |||
</ul> | </ul> | ||
Please add your own results. | Please add your own results. | ||
{| class="wikitable sortable" | {| class="wikitable sortable" | ||
| | |+ Benchmark (single core) and [[#Scaling|scaling]] results on multiple cores/cpus. '''cores''' indicates how many cores are available to the user on this cpu ''or'' the maximum number of cores on the node that were used for benchmarking scaling. | ||
! vendor | ! vendor | ||
! model | ! model | ||
Line 22: | Line 24: | ||
! binary | ! binary | ||
! revision | ! revision | ||
! scaling | |||
|- | |- | ||
| Intel | | Intel | ||
Line 30: | Line 33: | ||
| hippo | | hippo | ||
| rev32 <!-- deathspud --> | | rev32 <!-- deathspud --> | ||
|- | |||
| Intel | |||
| [http://en.wikipedia.org/wiki/List_of_Intel_Core_2_microprocessors#.22Yorkfield.22_.2845_nm.29 Core 2 Quad Q9550] | |||
| 2.83 | |||
| 4 | |||
| 1:05 | |||
| hippo | |||
| rev35 <!-- deathspud --> | |||
| [[Image:Scaling Q9550.png|thumb|none|Intel Quad Core Q9550 2.8 GHz]] | |||
|- | |- | ||
| Intel | | Intel | ||
Line 38: | Line 50: | ||
| hippo_p3 | | hippo_p3 | ||
| rev32 <!-- deathspud --> | | rev32 <!-- deathspud --> | ||
|- | |||
| Intel | |||
| [http://en.wikipedia.org/wiki/List_of_Intel_Xeon_microprocessors#.22Harpertown.22_.28standard-voltage.2C_45_nm.29 Quad Core Xeon E5420] | |||
| 2.5 | |||
| 8 = 2x4 | |||
| 1:16 | |||
| hippo | |||
| rev35 <!-- darthtater --> | |||
| [[Image:Scaling E5420.png|thumb|none|Dual Intel Quad Core Xeon E5420 2.5 GHz]] | |||
|- | |- | ||
| Intel | | Intel | ||
Line 65: | Line 86: | ||
| AMD | | AMD | ||
| [http://en.wikipedia.org/wiki/List_of_AMD_Phenom_microprocessors#.22Agena.22_.28B2_.26_B3.2C_65_nm.29 Phenom X4 9850] | | [http://en.wikipedia.org/wiki/List_of_AMD_Phenom_microprocessors#.22Agena.22_.28B2_.26_B3.2C_65_nm.29 Phenom X4 9850] | ||
| 2. | | 2.5 | ||
| 4 | | 4 | ||
| 1:36 | | 1:36 | ||
| hippo_p3 | | hippo_p3 | ||
| rev32 <!-- greenwulf --> | | rev32 <!-- greenwulf --> | ||
|- | |||
| AMD | |||
| [http://en.wikipedia.org/wiki/List_of_AMD_Phenom_microprocessors#.22Agena.22_.28B2_.26_B3.2C_65_nm.29 Phenom X4 9850] | |||
| 2.5 | |||
| 4 | |||
| 1:37 | |||
| hippo_p3 | |||
| rev35 <!-- greenwulf --> | |||
| [[Image:Scaling AMD X4 9840.png|thumb|none|AMD Phenom X4 9850 Quad Core 2.5 GHz]] | |||
|- | |- | ||
| AMD | | AMD | ||
Line 105: | Line 135: | ||
<small>(You don't ''have'' to link to the CPU wikipedia page; just put down whatever you got and even if you're unsure. If in doubt just put the output from | <small>(You don't ''have'' to link to the CPU wikipedia page; just put down whatever you got and even if you're unsure. If in doubt just put the output from | ||
<source lang="bash"> | |||
cat /proc/cpuinfo | \ | cat /proc/cpuinfo | \ | ||
awk 'BEGIN {FS=":"}; \ | awk 'BEGIN {FS=":"}; \ | ||
Line 112: | Line 143: | ||
/siblings/ {cores=$2}; \ | /siblings/ {cores=$2}; \ | ||
END {printf("|-\n| %s\n| %s\n| %.1f\n| %d\n| TIME\n| ?\n| ?\n", vendor, model, GHz, cores)}' | END {printf("|-\n| %s\n| %s\n| %.1f\n| %d\n| TIME\n| ?\n| ?\n", vendor, model, GHz, cores)}' | ||
</source> | |||
into the wiki table and replace ''TIME'' by what you measured.</small> | into the wiki table and replace ''TIME'' by what you measured.</small> | ||
== Scaling == | |||
All tests were run with Hippo beta rev35 on empty systems. See remarks on the [[Talk:Performance#Scaling|test methodology]]. | |||
In the table below, '''# cpus''' really means how many independent cores are available. | |||
{| class="wikitable sortable" | |||
|+ Scaling of the Hippo WALP test case using OpenMP on single-board machines | |||
! vendor | |||
! # cpus | |||
! walltime and scaling | |||
|- | |||
| Intel | |||
| 4 | |||
| [[Image:Scaling Q9550.png|thumb|none|Intel Quad Core Q9550 2.8 GHz]] | |||
|- | |||
| Intel | |||
| 8 | |||
| [[Image:Scaling E5420.png|thumb|none|Dual Intel Quad Core Xeon E5420 2.5 GHz]] | |||
|- | |||
| AMD | |||
| 4 | |||
| [[Image:Scaling AMD X4 9840.png|thumb|none|AMD Phenom X4 9850 Quad Core 2.5 GHz]] | |||
|} |
Latest revision as of 15:49, 8 December 2008
Single processor performance
As a basic performance test we run the simulations in the test directory with these commands (on Linux):
  cd testjobs
  time ./calc_testjobs
- We report the user time.
- CPU properties are found with
cat /proc/cpuinfo
and model names from List of Intel microprocessors for Intel (also see the detailed lists of Intel x86 CPUs) and List of AMD microprocessors for AMD CPUs (also see the detailed lists of AMD CPUs).
- Also report the binary used (hippo or hippo_p3) and the revision.
- Note that these tests only utilize a single core.
- These results only give a rough idea of the relative performance of different architectures.
- The timings have error bars of about ±2 s.
Please add your own results.
vendor | model | GHz | cores | time (min:s) | binary | revision | scaling |
---|---|---|---|---|---|---|---|
Intel | Core 2 Quad Q9550 | 2.83 | 4 | 1:04 | hippo | rev32 | |
Intel | Core 2 Quad Q9550 | 2.83 | 4 | 1:05 | hippo | rev35 | [plot: Intel Quad Core Q9550 2.8 GHz] |
Intel | Core 2 Quad Q9550 | 2.83 | 4 | 1:06 | hippo_p3 | rev32 | |
Intel | Quad Core Xeon E5420 | 2.5 | 8 = 2x4 | 1:16 | hippo | rev35 | [plot: Dual Intel Quad Core Xeon E5420 2.5 GHz] |
Intel | Quad Core Xeon E5420 | 2.5 | 4 | 1:16 | hippo | rev32 | |
Intel | Quad Core Xeon L5410 | 2.33 | 4 | 1:27 | hippo | rev32 | |
Intel | Quad Core Xeon L5410 | 2.33 | 4 | 1:31 | hippo_p3 | rev32 | |
AMD | Phenom X4 9850 | 2.5 | 4 | 1:36 | hippo_p3 | rev32 | |
AMD | Phenom X4 9850 | 2.5 | 4 | 1:37 | hippo_p3 | rev35 | [plot: AMD Phenom X4 9850 Quad Core 2.5 GHz] |
AMD | Opteron 248 | 2.2 | 1 | 2:18 | hippo_p3 | rev32 | |
Intel | Core Duo T2300 (Mac Mini) | 1.66 | 2 | 2:41 | hippo_p3 | rev32 | |
Intel | Xeon 2.4 | 2.4 | 1 | 3:03 | hippo_p3 | rev26 | |
AMD | Athlon XP 1800+ | 1.53 | 1 | 4:30 | hippo_p3 | rev26 | |
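Since the table is mainly useful for ''relative'' comparisons, the following sketch shows one way to turn two timings into a ratio. The helper names <tt>to_seconds</tt> and <tt>relative_time</tt> are made up for this example and are not part of Hippo. Keep the ±2 s error bars in mind when reading the result, and compare rows that use the same binary and revision where possible.

```shell
# Hypothetical helpers (not part of Hippo): compare two timings from the table.

# Convert a "min:s" timing such as 2:18 into seconds.
to_seconds() {
    echo "$1" | awk -F: '{ print $1 * 60 + $2 }'
}

# Print how many times longer the first timing is than the second.
relative_time() {
    awk -v a="$(to_seconds "$1")" -v b="$(to_seconds "$2")" \
        'BEGIN { printf("%.2f\n", a / b) }'
}

# Opteron 248 vs. Core 2 Quad Q9550, both with hippo_p3 rev32.
relative_time 2:18 1:06   # prints 2.09
```

With these two rows the Opteron 248 run takes roughly twice as long as the Q9550 run.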
You don't have to link to the CPU Wikipedia page; just put down whatever you have, even if you're unsure. If in doubt, just paste the output of
cat /proc/cpuinfo | \
awk 'BEGIN {FS=":"}; \
/vendor_id/ {vendor=$2}; \
/model name/ {model=$2}; \
/cpu MHz/ {GHz=$2/1000}; \
/siblings/ {cores=$2}; \
END {printf("|-\n| %s\n| %s\n| %.1f\n| %d\n| TIME\n| ?\n| ?\n", vendor, model, GHz, cores)}'
into the wiki table and replace TIME with what you measured.
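Note that <tt>siblings</tt> in <tt>/proc/cpuinfo</tt> counts logical CPUs per physical package, so it can include hyperthreads and does not cover a second socket. As a rough alternative, the sketch below counts distinct <tt>physical id</tt> values and multiplies by <tt>cpu cores</tt>, printing the result in the "8 = 2x4" style used in the table. The function name <tt>count_cores</tt> is hypothetical, and the sketch assumes an x86 Linux system where both fields are present (some virtual machines omit them).

```shell
# Hypothetical helper: read /proc/cpuinfo-style text on stdin and print
# "<total cores> = <packages>x<cores per package>".
# Assumes the "physical id" and "cpu cores" fields exist (x86 Linux).
count_cores() {
    awk -F: '
        /^physical id/ { ids[$2] = 1 }        # remember each distinct package
        /^cpu cores/   { cores = $2 + 0 }     # physical cores per package
        END {
            n = 0
            for (id in ids) n++
            printf("%d = %dx%d\n", n * cores, n, cores)
        }'
}

# Usage on Linux:
#   count_cores < /proc/cpuinfo
```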
Scaling
All tests were run with Hippo beta rev35 on empty (otherwise idle) systems. See remarks on the test methodology.
In the table below, # cpus is the number of independent cores available.
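vendor | # cpus | walltime and scaling |
---|---|---|
Intel | 4 | [plot: Intel Quad Core Q9550 2.8 GHz] |
Intel | 8 | [plot: Dual Intel Quad Core Xeon E5420 2.5 GHz] |
AMD | 4 | [plot: AMD Phenom X4 9850 Quad Core 2.5 GHz] |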
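The plots themselves are not reproduced here. As a reminder of how such scaling numbers are commonly derived (this page does not state exactly what the plots show), the sketch below computes speedup T(1)/T(n) and parallel efficiency T(1)/(n·T(n)) from walltimes. The function name <tt>scaling_table</tt> and the input numbers are hypothetical, chosen only for illustration; they are not measured Hippo results.

```shell
# Hypothetical helper: read lines of "<cores> <walltime in s>" on stdin,
# with the 1-core run first, and print "<cores> <speedup> <efficiency>".
scaling_table() {
    awk 'NR == 1 { t1 = $2 }   # walltime of the single-core run
         { printf("%d %.2f %.2f\n", $1, t1 / $2, t1 / ($2 * $1)) }'
}

# Made-up walltimes for illustration only (not measured values).
printf '1 100\n2 55\n4 30\n' | scaling_table
# prints:
# 1 1.00 1.00
# 2 1.82 0.91
# 4 3.33 0.83
```

An efficiency close to 1 means the extra cores are used well; values that drop with increasing core count indicate growing parallel overhead.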