Check out my first novel, midnight's simulacra!

SIMD

From dankwiki

Revision as of 18:38, 19 September 2009 by Dank (talk | contribs)

(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Jump to navigation Jump to search

x86

AVX (Advanced Vector eXtensions) -- to be introduced on Intel's Sandy Bridge (2010) and AMD's Bulldozer (2011), and implemented within the VEX coding scheme

SSE3

movddup -- move a double from a 8-byte-aligned memory location or lower half of XMM register to upper half, then duplicate upper half to lower half

SSE2

movapd -- move two packed doubles from a 16-byte-aligned memory location to XMM registers, or vice versa, or between two XMM registers.
- movupd -- movapd safe for unaligned memory references, with far inferior performance.
mulpd -- the multiplier is a 16-byte-aligned memory location or XMM register. the target XMM register serves as the multiplicand.

SSE

movaps -- move four packed singles from a 16-byte-aligned memory location to XMM registers, or vice versa, or between two XMM registers.
- movups -- movaps safe for unaligned memory references, with far inferior performance.

Fused Multiply-Add

The FMA instruction set extension to x86 should hit around 2011

Other Architectures

PowerPC implements AltiVec

See Also

"Why no FMA in AVX in Sandy Bridge?", Intel Developers Forum
SSE5 guide at AMD

Retrieved from "https://nick-black.com/dankwiki/index.php?title=SIMD&oldid=1007"

X86