Kepler vs Xeon Phi

Kepler vs Xeon Phi : our measures
and their complete source code
http://www.hpcmagazine.fr/en-couverture/kepler-vs-xeon-phi-nos-mesures/
Florent Duguet, PhD
CEO - Altimesh
http://www.altimesh.com/
... article in French
Presentation & translation by
Ronan Keryell (SILKAN / Aptina)

Some functional analogies...
● Vendor data
● Flops/memop: minimal ratio to avoid waiting for
memory

3 microbenchmarks
From theory to practice...
● 1 memory bound : read a vector
– K20: Naïve/vectorized with float4/use texture cache
– Phi : Naïve/vectorized/gather/aligned vector load
● 1 compute bound : Hörner approximation iterated
(expm1())^12 (= 12 add, 24 mul, 60 madd)
– K20: Naïve/vectorized with float4 or double4
– Phi : Naïve/intrinsics
● 1 latency bound : b[i] += a[i + index[k]]
– K20: Naïve/loop interchange/ __ldg to skip L2$
– Phi : Naïve/vectorized/gather/aligned vector load

Conclusion
● (...) = (vendor data)
● Warning : in this experimentation fma counts for
1 FLOP instead of usual (... and constructors !)
2 FLOP
● Disclaimer : examples available :-) on
http://www.hpcmagazine.fr/files/sources/003-Kepler-vs-Phi.zip

Kepler vs Xeon Phi

Recomendados

Recomendados

Más contenido relacionado

La actualidad más candente

La actualidad más candente (19)

Destacado

Destacado (12)

Similar a Kepler vs Xeon Phi

Similar a Kepler vs Xeon Phi (20)

Más de Mert Akın

Más de Mert Akın (20)

Último

Último (20)

Kepler vs Xeon Phi