2. semester

Plán pre 2.semester:

Zdrojáky:

Zdrojáky

Zmeny

Tento semester som vytvoril hlavnú kostru, do ktorej sa dajú pridávať ďalšie algoritmy. Do tejto kostry som pridal veci z prvého semestra a plus som ešte pridla ďalšie. Hlavná časť bolo skúsiť SIMD inštrukcie na ARM-e a porovnať to s Intelom(SSE).

Porovnávané algoritmy:

ARM(NEON)

Pozorovania:

Kompilové s flagmi: -mfloat-abi=softfp -mfpu=neon -g -O3 -std=gnu++0x -Wno-psabi
Crosskompiloval som to na Intely s gcc 4.4

Intel(SSE)

Pozorovania:

Kompilované s flagmi: -mssse3 -std=c++11 -g -O3

Použitý hardware

ARM

Processor	: ARMv7 Processor rev 2 (v7l)
BogoMIPS	: 996.14
Features	: swp half thumb fastmult vfp edsp thumbee neon vfpv3 
CPU implementer	: 0x41
CPU architecture: 7
CPU variant	: 0x3
CPU part	: 0xc08
CPU revision	: 2

Hardware	: OMAP3 Beagle Board
Revision	: 0020
Serial		: 0000000000000000

Intel

processor	: 0
vendor_id	: GenuineIntel
cpu family	: 6
model		: 15
model name	: Intel(R) Core(TM)2 Duo CPU     T7100  @ 1.80GHz
stepping	: 13
microcode	: 0xa1
cpu MHz		: 1801.000
cache size	: 2048 KB
physical id	: 0
siblings	: 2
core id		: 0
cpu cores	: 2
apicid		: 0
initial apicid	: 0
fdiv_bug	: no
f00f_bug	: no
coma_bug	: no
fpu		: yes
fpu_exception	: yes
cpuid level	: 10
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush
dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc arch_perfmon pebs bts aperfmperf pni
dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm lahf_lm ida dtherm tpr_shadow vnmi
flexpriority
bogomips	: 3591.04
clflush size	: 64
cache_alignment	: 64
address sizes	: 36 bits physical, 48 bits virtual
power management:

processor	: 1
vendor_id	: GenuineIntel
cpu family	: 6
model		: 15
model name	: Intel(R) Core(TM)2 Duo CPU     T7100  @ 1.80GHz
stepping	: 13
microcode	: 0xa1
cpu MHz		: 800.000
cache size	: 2048 KB
physical id	: 0
siblings	: 2
core id		: 1
cpu cores	: 2
apicid		: 1
initial apicid	: 1
fdiv_bug	: no
f00f_bug	: no
coma_bug	: no
fpu		: yes
fpu_exception	: yes
cpuid level	: 10
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush
dts acpi mmx fxsr sse sse2 ss ht tm pbe nx lm constant_tsc arch_perfmon pebs bts aperfmperf pni
dtes64 monitor ds_cpl vmx est tm2 ssse3 cx16 xtpr pdcm lahf_lm ida dtherm tpr_shadow vnmi
flexpriority
bogomips	: 3591.04
clflush size	: 64
cache_alignment	: 64
address sizes	: 36 bits physical, 48 bits virtual
power management: