/prog/ - Introduction to ASM

Name: Anonymous 2011-09-14 17:19

Anyone know a good way to start learning ASM? What's the best ASM language for x86?

Name: Anonymous 2011-09-14 17:32

You should probably start off with x86.

Name: Anonymous 2011-09-14 17:36

>>2 There are multiple assembly languages for x86. AT&T, intel, etc.

Name: Anonymous 2011-09-14 17:41

>>3
There are two major syntaxes for the IA32 assembly language, commonly known as x86, the AT&T and Intel, I personally prefer the Intel one.

Name: Anonymous 2011-09-14 19:18

>>2
>starting with a shitty CISC like x86
>not starting with glorious PowerPC, ARM or MIPS

Name: Anonymous 2011-09-14 19:22

http://homepage.mac.com/randyhyde/webster.cs.ucr.edu/www.artofasm.com/index.html

Name: Anonymous 2011-09-14 19:25

Oh, it doesn't look like AoA has been updated to 64 bit, but all you really need to know is that the 64 bit registers have the prefix r, e.g. rax, rbx, etc. There are a bunch of other changes but they aren't relevant to a nub.

Name: Anonymous 2011-09-14 19:43

Autoassembler!

CheatEngine.org

Name: Anonymous 2011-09-14 19:45

>>5
hipster faggot

Name: Anonymous 2011-09-14 19:52

>>9
Admit it, you know the x86 instruction set is a steaming pile of ass.

Name: Anonymous 2011-09-14 20:17

>>5

MIPS is trash

Name: tdavis 2011-09-14 20:31

An irregular instruction set by itself isn't necessarily bad, x86 has aged some. I'm not very interested in SSE/MMX. Are those any good, or is it gimacks?

Name: tdavis 2011-09-14 20:56

It's usually a bad idea to "argue with the answerbook" so the SSE/MMX must be good at least in some cases. I'd like to hear from anyone who has used them... just curious.

Name: Anonymous 2011-09-14 22:18

OP, start off with the Intel manuals. They have everything you need to know, and more.

>>4
AT&T syntax is horrid.

>>5
Start with x86 and you'll find the RISCs boring and the encodings somewhat inefficient.

>>6,7
AoA does not teach asm, but some bastardized "HLA" language which is part asm and part high-level language. No one takes it seriously for that reason.

>>10
It's complex but elegantly extended.

Name: Anonymous 2011-09-15 0:41

>>12
Of course they're good at what they do; they're specialized instructions intended for efficiency in a specific use case.

I've used them for a couple of demoscene demos back in the 90's and they helped my demos be rendered efficiently on the hardware that was available at that time.

Name: tdavis 2011-09-15 0:45

>>15

You lie. Give real info or shut-up.

What speed-up factor? I thought so.

Name: Anonymous 2011-09-15 2:12

>>16
SEE and MMX is good, but you will have increased latency and your data needs to be structured to accomodate the use of these instructions(typically vectors of similar variables)
[edit] Example

The following simple example demonstrates the advantage of using SSE. Consider an operation like vector addition, which is used very often in computer graphics applications. To add two single precision, four-component vectors together using x86 requires four floating-point addition instructions

vec_res.x = v1.x + v2.x;
vec_res.y = v1.y + v2.y;
vec_res.z = v1.z + v2.z;
vec_res.w = v1.w + v2.w;

This would correspond to four x86 FADD instructions in the object code. On the other hand, as the following pseudo-code shows, a single 128-bit 'packed-add' instruction can replace the four scalar addition instructions.

movaps xmm0,address-of-v1 ;xmm0 = v1.w | v1.z | v1.y | v1.x
addps xmm0,address-of-v2 ;xmm0 = v1.w+v2.w | v1.z+v2.z | v1.y+v2.y | v1.x+v2.x
movaps address-of-vec_res,xmm0

Name: Anonymous 2011-09-15 2:40

>>16
My demoscene days were from the 90's and I don't have that code or data anymore because I've moved on from that hobby. Why should I maintain esoteric knowledge about things that I will NEVER directly use in the future? Why should I invest effort into gathering esoteric data right now for the sole purpose of this discussion? If you really want that data, it won't be hard to get it yourself.

The point I was making is that the SSE/MMX systems are intended for a specific use case: problems that are easily divided into SIMD systems. I have used these instructions in the past to process some audio and graphics tasks in the case of my demos. Some other applications of this system would benefit tasks such as video codecs, audio codecs, general sorting and indexing, and recursive cyclic graph processing. I don't work with these instructions any more because I've found that I can use other people's libraries to solve these sorts of problems.

Name: Anonymous 2011-09-15 3:25

All these esoteric tricks lead to three common objectives:
1.Reduce the cycle count
2. Reduce cache misses
3. Reduce branching
SSE often does all three at negligible cost. In machines without a GPU, it would be a stable basis for software rendering.

Name: Anonymous 2011-09-15 5:08

>>8
nope. it's buggy as fuck and leaves symbols left over everywhere. enjoy your lack of scoping.

Name: Anonymous 2011-09-15 5:57

>>17
VEX too

Name: Anonymous 2011-09-15 7:18

>>17
Image processing is where I see a lot of MMX code.

Name: Anonymous 2011-09-15 12:24

What's the best ASM language for x86?
Try x86 assembly

Name: Anonymous 2011-09-15 14:20

>>22
Didn't SSE supersede MMX?

Name: Anonymous 2011-09-15 16:21

>>24
No. SSE uses XMMx registers while MMX uses MMx which actually alias to the x87 arithmetic stack.

Name: Anonymous 2011-09-15 16:37

>>25
I know, but what I meant was, doesn't SSE replace MMX' functionality?

Name: Anonymous 2011-09-15 17:03

>>16
The speed-up factor can be up to 8x for SSE2, but typically it's about 2x-4x for most tasks. LoseThos would probably benefit a lot from it specially if it does silly things such as software-painting the whole screen 60 times a second.

>>26
SSE was mostly floating point, whereas MMX is integer. SSE2 added integer support and is what effectively replaced MMX. Now we have the same story: AVX is FP only and therefore useless for a lot of stuff, AVX2 is going to add integer support.

Also while we're at it, MMX is 64-bit wide, SSE(2) is 128, and AVX(2) 256.

Name: Anonymous 2011-09-15 17:47

http://pastebin.com/ccLEs6Gr

The speedup here is often 10x but that might be because GCC sucks ass at codegen.

Name: Anonymous 2011-09-15 21:21

How can an OS author, who says he loves programming and built an OS for programming on x86-64 not know or care about SSE or AVX, or SIMD in general?

I find it hard to believe. I bet if he were to rewrite his video drivers using SSE for rasterization, he'd be able to switch over to a 32-bit RGBA VESA linear framebuffer with decent real-time performance. I mean, shit, you can run UT99 in software rendering with 32-bit color on modern hardware and good get good frame rates.

Name: Anonymous 2011-09-15 22:04

>>27
Now we have the same story: AVX is FP only and therefore useless for a lot of stuff
Oh come on, really? Why do they do this shit every time, do they learn nothing of past mistakes?

Name: tdavis 2011-09-15 22:20

Ooo Oooo! I see a spot whre I can add two 64-bit numbers at once! Woopiee!

Name: Anonymous 2011-09-15 23:14

>>31
kill yourself faggot

Name: Anonymous 2011-09-15 23:49

>>31
this.

assembly is nerdy like that.

Name: Anonymous 2011-09-16 2:33

>>30
Because there's only so much surface area to pack in new features on a given process node. Ivy Bridge with AVX2 is on 22nm, while Sandy Bridge with AVX was on 32nm.

Name: Anonymous 2011-09-16 2:39

>>31
Think bit-blitting. With SSE2, you can blit 16 bytes in a single instruction and maximize memory throughput if you ensure that the instruction pipeline is well padded. Want to handle things like translucency in 32-bit RGBA? No problem, you can blend 4-pixels at once. The integer bitwise SSE2 instructions let you handle things like sprites with transparent pixels as well.

Really, you're retarded if you can't see outside of the box at how SIMD is good for graphics.

It's no wonder you're consigned to a life of living in a basement apartment, living off of social security and disability payments, working on an OS that no one but yourself actually uses, and not even sharing your source code to the world. It sure is a sad existence. More so when you consider that your Lord and Savior, God Almighty himself is a figment of your imagination.

Name: Anonymous 2011-09-16 4:18

>>34
But Ivy Bridge won't have AVX2 support.

Name: Anonymous 2011-09-16 7:18

>>35
Do you really think he's the real thing and not just a troll like any other troll on this board, such as the ultrafinitist?

Introduction to ASM

1 Name: Anonymous 2011-09-14 17:19

2 Name: Anonymous 2011-09-14 17:32

3 Name: Anonymous 2011-09-14 17:36

4 Name: Anonymous 2011-09-14 17:41

5 Name: Anonymous 2011-09-14 19:18

6 Name: Anonymous 2011-09-14 19:22

7 Name: Anonymous 2011-09-14 19:25

8 Name: Anonymous 2011-09-14 19:43

9 Name: Anonymous 2011-09-14 19:45

10 Name: Anonymous 2011-09-14 19:52

11 Name: Anonymous 2011-09-14 20:17

12 Name: tdavis 2011-09-14 20:31

13 Name: tdavis 2011-09-14 20:56

14 Name: Anonymous 2011-09-14 22:18

15 Name: Anonymous 2011-09-15 0:41

16 Name: tdavis 2011-09-15 0:45

17 Name: Anonymous 2011-09-15 2:12

18 Name: Anonymous 2011-09-15 2:40

19 Name: Anonymous 2011-09-15 3:25

20 Name: Anonymous 2011-09-15 5:08

21 Name: Anonymous 2011-09-15 5:57

22 Name: Anonymous 2011-09-15 7:18

23 Name: Anonymous 2011-09-15 12:24

24 Name: Anonymous 2011-09-15 14:20

25 Name: Anonymous 2011-09-15 16:21

26 Name: Anonymous 2011-09-15 16:37

27 Name: Anonymous 2011-09-15 17:03

28 Name: Anonymous 2011-09-15 17:47

29 Name: Anonymous 2011-09-15 21:21

30 Name: Anonymous 2011-09-15 22:04

31 Name: tdavis 2011-09-15 22:20

32 Name: Anonymous 2011-09-15 23:14

33 Name: Anonymous 2011-09-15 23:49

34 Name: Anonymous 2011-09-16 2:33

35 Name: Anonymous 2011-09-16 2:39

36 Name: Anonymous 2011-09-16 4:18

37 Name: Anonymous 2011-09-16 7:18