[Radiance-general] Optimizing Radiance

Fri Apr 6 15:20:43 PDT 2012

Hi group, 

I'm setting up Radiance on a computer cluster with lots of nodes. Tweeking gcc flags is not my best quality(understatement), so I bring it here. 
Playing a bit with options which I found on Marc's benchmark page, I got rendering times per core over twice as long as on my 2.2 Ghz macbook. Doesn't make me happy :) 
Below I've pasted the cpuinfo from a node on the cluster. Does anyone in the group has an idea what flags I should give to gcc to optimize Radiance? 
The flags I've used so far - are:   -march=native -m64 -msse -msse2 -funroll-loops -ffast-math  -O3  -Dlinux -D_FILE_OFFSET_BITS=64

Cheers, 

iebele

cat /proc/cpuinfo:

processor	: 7
vendor_id	: GenuineIntel
cpu family	: 6
model		: 23
model name	: Intel(R) Xeon(R) CPU           L5420  @ 2.50GHz
stepping	: 10
cpu MHz		: 2493.445
cache size	: 6144 KB
physical id	: 0
siblings	: 4
core id		: 2
cpu cores	: 4
apicid		: 2
initial apicid	: 2
fpu		: yes
fpu_exception	: yes
cpuid level	: 13
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx lm constant_tsc arch_perfmon pebs bts rep_good aperfmperf pni dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm dca sse4_1 xsave lahf_lm tpr_shadow vnmi flexpriority
bogomips	: 4987.90
clflush size	: 64
cache_alignment	: 64
address sizes	: 38 bits physical, 48 bits virtual