
Dot product

Name: Anonymous 2011-08-02 14:18

Single precision floating point.
System V AMD64 ABI convention.


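; Assumed C view: float sses_dot(const float *a, const float *b), four floats each.
; rdi = a, rsi = b, both 16-byte aligned; result returned in the low lane of xmm0.
sses_dot:
    movaps xmm0, [rdi]      ; load a[0..3] (aligned load)
    mulps xmm0, [rsi]       ; element-wise products a[i] * b[i]
    haddps xmm0, xmm0       ; SSE3: pairwise sums of adjacent lanes
    haddps xmm0, xmm0       ; every lane now holds the full sum
    ret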
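
For anyone who wants to check what the two haddps do, here is a rough scalar model in C. It is only a sketch: haddps_model and dot4_model are names made up for this post, and it assumes the routine is meant for exactly four floats per vector.

```c
#include <assert.h>

/* Scalar model of "haddps dst, src": the low two lanes are the pairwise
   sums of dst, the high two lanes are the pairwise sums of src. */
void haddps_model(float dst[4], const float src[4])
{
    /* Compute into a temporary first, because dst and src may alias. */
    float r[4] = { dst[0] + dst[1], dst[2] + dst[3],
                   src[0] + src[1], src[2] + src[3] };
    for (int i = 0; i < 4; i++)
        dst[i] = r[i];
}

/* Hypothetical reference for the assembly routine: 4-element dot product. */
float dot4_model(const float a[4], const float b[4])
{
    float x[4];
    for (int i = 0; i < 4; i++)
        x[i] = a[i] * b[i];     /* mulps */
    haddps_model(x, x);         /* x = {p0+p1, p2+p3, p0+p1, p2+p3} */
    haddps_model(x, x);         /* every lane = p0+p1+p2+p3 */
    return x[0];                /* low lane, where the ABI returns a float */
}
```

With a = {1, 2, 3, 4} and b = {5, 6, 7, 8}, dot4_model(a, b) gives 70. Note that the sum is formed as (p0+p1)+(p2+p3), so rounding can differ slightly from a plain left-to-right loop.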

Name: Anonymous 2011-08-02 22:33

Also, to the ``lifthp faggots'' in this thread, does your Lisp compiler generate SSE SIMD instructions out of the box from regular code? I guarantee you that it does not.

Instead, you have to hack it in, and write your own XMM register allocator.

http://www.pvk.ca/Blog/Lisp/hacking_SSE_intrinsics-part_1.html
