What is a simple example of replacing c code with assembly to improve performance?

+4 A:

Here are the assembly coding pros:

Assembly code can take advantage of a processor's unique instructions as well as various specialised hardware resources. On the other hand, C code is generic, and must support various hardware platforms. Thus, it is difficult for C to support platform-specific code.
The assembly programmer is usually very familiar with the application and can make assumptions that are unavailable to the compiler.
The assembly programmer can use human creativity; the compiler, advanced as it may be, is merely an automatic program.

On the other hand, here are the assembly coding cons:

The assembly programmer has to handle time-consuming machine-level issues such as register allocation and instruction scheduling. With C code, these issues are taken care of by the compiler.
Assembly coding requires specialised knowledge of the DSP architecture and its instruction set, whereas C coding only requires knowledge of the C language—which is rather common.
With assembly code, it is extremely difficult and time consuming to port applications from one platform to another. Porting is relatively easy for C applications.

Sorantis 2009-09-07 19:51:45

Code optimization and register allocation are a well researched field and a good optimizer can do things you could only do with the most intimate knowledge of the workings of the CPU you are coding for. It can actually happen that apparently well devised assembly code runs slower than compiler optimized one.Assembly coding is very error prone, and at the level it is done, the compiler can usually find out just as well what and how to optimize as a human.A good optimizer will exploit all special instructions a CPU has to offer.

karx11erx 2009-09-09 11:27:12

Looking at performance or maintanability, there is hardly any area where assembly coding would have any advantages over using a good compiler.

karx11erx 2009-09-09 11:27:51

+2 A:

Here is a simple(ish) example - the Swap code for Watt-32.

In this case, __asm is used to integrate assembly code inline with C/C++ code throughout for performance. Since this is an alternative, cross-platform network stack, there are many points where keeping the performance as critical as possible is important.

Reed Copsey 2009-09-07 19:55:48

MSalters 2009-09-09 11:37:38

+1 A:

The syntax will depend of your compiler; I use gcc, and it supports a couple of different ways to inline assembler code.

Check this link for description and examples: http://www.ibiblio.org/gferg/ldp/GCC-Inline-Assembly-HOWTO.html#s4

cjcela 2009-09-08 17:32:52

+6 A:

I'm not a game developer, but I write almost nothing but assembly code for a living (I'm a library writer). Generally this is for performance reasons, but I also do it to work around compiler bugs, or to use hardware features like condition flags that are actually easier to express in assembly than in C.

I'm usually writing complete functions in assembly, so I tend to write .s files that are assembled into object files and linked into an executable or library. People who just need to move a single loop into assembly often use inline assembly in their C source, which is supported by most compilers via some sort of intrinsic.

"Simple" examples are pretty rare, since if it was simple, the compiler would do a sufficiently good job and there would be no need for assembly.

Stephen Canon 2009-09-08 18:31:00

+1 A:

You will find very little inline assembly in most modern games for the PC, Xbox 360 or PS3. Modern optimizing compilers do a fairly good job of instruction scheduling and register allocation so the performance gain from writing inline assembly is rarely worth the effort any more. Inline assembly is not even supported for 64 bit code in Visual Studio.

Inline assembly used to be important for accessing hardware specific instructions that the compiler would not automatically use. With modern compilers intrinsics are the preferred way of accessing hardware specific instructions. In games intrinsics are often used for math heavy code to access hardware specific vector math instructions (using SSE on the PC or VMX on the Xbox 360 / PS3 PPU or the SPU instruction set on the PS3 SPUs). Intrinsics are platform/compiler specific extensions that look like regular C/C++ functions but map directly to single instructions on the underlying hardware. You can see the documentation for the x86 and x64 intrinsics in Visual Studio on MSDN.

You may still find some really performance critical bits of code written in assembly in some games but generally whole functions will be written in assembly rather than using bits of inline assembly in C/C++ code. I haven't seen any inline assembly in any PC/Xbox 360/PS3 games in any of the code I've worked on in the last 5 years or so.

mattnewport 2009-09-09 00:47:58

+2 A:

I have given up assembly coding years ago when I found that an optimizing C++ compiler would beat me hands down when it came to performance because the people building the optimizer consider all sorts of things, like pipeline stalls, partially parallel execution of subsequent, independent code fragments (a good optimizer can rearrange your code a fair bit), and the disadvantages of assembly code (hard to read, hard to debug, not portable) by far outweigh the advantages it used to have in those days when compilers didn't have good optimizers.

If I was you I wouldn't bother myself with assembly coding for normal programming tasks. Even if you could save a CPU clock cycle here or there, looking at the overall performance of a complex application the effect is negligable.

karx11erx 2009-09-09 11:22:13

+1 A:

Michael Abrash wrote a book called the Graphics Programming Black Book. It is definitely worth a read. You can get the PDFs for free online here.

Michael Abrash's classic Graphics Programming Black Book is a compilation of Michael's previous writings on assembly language and graphics programming (including from his "Graphics Programming" column in Dr. Dobb's Journal). Much of the focus of this book is on profiling and code testing, as well as performance optimization. It also explores much of the technology behind the Doom and Quake 3-D games, and 3-D graphics problems such as texture mapping, hidden surface removal, and the like. Thanks to Michael for making this book available.

Mike Houston 2009-09-09 11:33:46

+1 A:

Even if you do use some api like __asm to inline assembly code, there is an overhead involved. The compiler will first dump all yr registers, (or the ones that you are using in your inlined code depending if the compiler chooses to optimize), then inline your code, then restore those registers. I feel that if there is no SIGNIFICANT advantage of inlining assembly code, it should be avoided, given the tradeoff between maintainability, porting, correctness, readability and performance..

Lok 2010-10-21 02:17:57

This is only true of the lamest compilers. Most compilers let you specify what registers are effected by an `asm` block. It can use that information to avoid spilling unless it's actually necessary. That said, there can still be overhead, but it's usually in the form of preventing the optimizer from reordering instructions across the `asm` block.

Stephen Canon 2010-10-21 02:41:42

ansaurus

tags:

views:

answers:

What is a simple example of replacing c code with assembly to improve performance?

related questions