SIMD
From dankwiki
x86
- AVX (Advanced Vector eXtensions) -- introduced on Intel's Sandy Bridge (2011) and AMD's Bulldozer (2011), and encoded using the VEX prefix scheme
SSE3
movddup -- load a double from an 8-byte memory location (16-byte alignment is not required) or from the lower half of an XMM register, and duplicate it into both halves of the target XMM register
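A minimal sketch of the duplicate-a-double operation using compiler intrinsics. It uses the SSE2 intrinsic `_mm_load1_pd`, which produces the same result as movddup from memory; SSE3 also offers `_mm_loaddup_pd` in `<pmmintrin.h>`, but that needs SSE3 enabled at compile time. Whether the compiler actually emits movddup depends on the target flags. The helper name `dup_double` is hypothetical.

```c
#include <assert.h>
#include <emmintrin.h>  /* SSE2 intrinsics */

/* Hypothetical helper: copy *src into both halves of an XMM register,
 * then write the two lanes to out[0] and out[1]. */
static void dup_double(const double *src, double out[2])
{
    assert(src && out);
    __m128d v = _mm_load1_pd(src);  /* both lanes = *src; no 16-byte alignment needed */
    _mm_storeu_pd(out, v);          /* unaligned store of both lanes */
}
```

Calling `dup_double(&x, out)` with `x = 3.5` leaves 3.5 in both `out[0]` and `out[1]`.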
SSE2
movapd -- move two packed doubles from a 16-byte-aligned memory location to an XMM register, or vice versa, or between two XMM registers
mulpd -- multiply two packed doubles. The multiplier is a 16-byte-aligned memory location or XMM register; the target XMM register serves as the multiplicand and receives the product
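The aligned load, packed multiply, and aligned store can be sketched with the SSE2 intrinsics `_mm_load_pd`, `_mm_mul_pd`, and `_mm_store_pd`, which typically compile to movapd and mulpd (the compiler is free to pick other encodings). The helper name `mul2_aligned` is hypothetical, and the sketch assumes both pointers are 16-byte aligned.

```c
#include <assert.h>
#include <stdint.h>
#include <emmintrin.h>  /* SSE2 intrinsics */

/* Hypothetical helper: dst[i] *= src[i] for i = 0, 1.
 * Both pointers must be 16-byte aligned, as the aligned moves require. */
static void mul2_aligned(double *dst, const double *src)
{
    assert(((uintptr_t)dst & 15) == 0);
    assert(((uintptr_t)src & 15) == 0);

    __m128d a = _mm_load_pd(dst);  /* aligned load of two doubles (multiplicand) */
    __m128d b = _mm_load_pd(src);  /* aligned load of two doubles (multiplier) */
    a = _mm_mul_pd(a, b);          /* lane-wise product */
    _mm_store_pd(dst, a);          /* aligned store of the result */
}
```

Arrays passed to this helper can be declared with `_Alignas(16)` (C11) to satisfy the alignment requirement; an unaligned address faults on the aligned load or store.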
Fused Multiply-Add
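- The FMA extensions to x86 shipped in two forms: four-operand FMA4 on AMD's Bulldozer (2011), and three-operand FMA3 on AMD's Piledriver (2012) and Intel's Haswell (2013)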
Other Architectures
- PowerPC implements AltiVec
See Also
- "Why no FMA in AVX in Sandy Bridge?", Intel Developers Forum
- SSE5 guide at AMD