ansaurus

Question

C++ optimization of reference-to-pointer argument

Answer 1

+1 A:

The variant with a temporary variable could be faster since it doesn't imply that every change to the pointer is reflected back to the argument and the compiler has better chances on generating faster code. However the right way to test this is to compile and look at the disassembly.

Meanwhile this has noting to do with avoiding aliasing. In fact, the variant with a temporary variant does employ aliasing - now you have two pointers into the same array and that's exactly what aliasing is.

sharptooth 2009-10-28 10:13:32

I don't understand your reasoning for faster code.

Martin York 2009-10-28 10:19:16

Well, it depends on the compiler dumbness. With a temporary it can allocate it on a register and only write to the argument once. Without a temporary it might decide to update the argument on each change.

sharptooth 2009-10-28 10:21:02

But it could just as easily put the parameter in a register while the function executes.

Martin York 2009-10-28 10:24:54

Actually there's more to it than dumbness -- it depends on whether the compiler can assume that nothing aliases the pointer or not.

Crashworks 2009-10-28 10:25:19

Answer 2

+1 A:

I would use a temporary if there is a possibility that the function is transactional.

i.e. the function succeeds or fails completely (no middle ground).
In this case I would use a temp to maintain state while the function executes and only assign back to the in_out parameter when the function completes successfully.

If the function exits prematurely (ie via exception) then we have two situations:

With a temporary (the external pointer is unchanged)
Using the parameter directly the external state is modified to reflect position.

I don't see any optimization advantages to either method.

Martin York 2009-10-28 10:23:21

Good answer, this and sellibitze' answer are right on the money. It does depend on whether "fail completely" means abandon the input altogether (then just throwing an exception suffices), but where I need to recognize one of several alternatives in the input, properly transactional functions would definitely be the way to go. Thanks! (Sorry I can't upvote this yet.)

mjmt 2009-10-29 01:26:40

Answer 3

A:

One thought that comes immediately in mind: exception safety. If you throw an exception during parsing, the use of a temporary variable is what you should do to provide strong exception safety: Either the function call succeeded completely or it didn't do anything (from a user's perspective).

sellibitze 2009-10-28 10:24:27

If that is what you require. If for example the function parses a stream you may expect the pointer to represent the point in the stream the exception happened.

Martin York 2009-10-28 10:30:18

True. What I tried to bring across is that *if* the strong guarantee is what you want, the temporary pointer variable is the way to go. This "error position" could be part of the exception object, for example.

sellibitze 2009-10-28 10:54:56

Answer 4

A:

Crashworks 2009-10-28 10:31:22

There is no "restrict" in C++, which is what the tag points to. That's a C99 thing, and I don't think it's going into C++0x.

David Thornley 2009-10-28 16:55:30

Nonsense. Both GCC and MSVC support it in their current implementations.

Crashworks 2009-10-28 20:03:38

Answer 5

+2 A:

All good answers, but if you're worried about performance optimization, the actual parsing is going to take nearly all of the time, so pointer aliasing will probably be "in the noise".

Mike Dunlavey 2009-10-28 16:32:18

Unless the parsing involves reading from and writing to the pointer a lot. Then its load-hit-stores would be in the inner loop.

Crashworks 2009-10-28 20:04:48

@Crash: Hi again. You're right. If I'm tuning code like that, if I still think it's not fast enough, and that inner loop is the "bottleneck", then I will care about it, absolutely.

Mike Dunlavey 2009-10-28 20:14:20

... when I write a parser, it's mostly spending time like this: `while(WHITE(*p)) p++;`, either that or looking up symbols in a table. It's usually not a performance issue unless the input is *massive*.

Mike Dunlavey 2009-10-28 20:22:28

In this case, the LHS would be in the `p++` expression (because p itself is being written to by reference). Definitely it's something you'd need to measure before changing; it might be a big deal, might not. But it can be a fortyfold difference in how long that `p++` op takes, so in a tight loop like that, it would be significant.

Crashworks 2009-10-28 20:44:29

I mention this just because I actually did get a huge speedup out of a parser once by doing this very thing.

Crashworks 2009-10-28 20:45:07

@Crash: Boy, that's bare-metal coding!

Mike Dunlavey 2009-10-28 21:32:40

@Crashworks: You're right, GCC even at -O3 is doing a store smack in the middle of a loop, strange that it's not deferred. @Mike: The thing here though is *p since p is a reference really means **p, and the difference doesn't seem to be optimized away completely (at least by GCC which is a dealbreaker). But yeah I know this isn't a big deal, I really just wanted to avoid writing my code one way or the other without understanding why. Thanks very much for the help.

mjmt 2009-10-29 00:37:20

mjmt -- it's not odd that it's not deferred; it's what the compiler has to do unless you make it promises about pointer aliasing. see my cruelly downvoted answer at bottom.

Crashworks 2009-10-29 02:54:48

Answer 6

A:

How does the compiler know that p_in_out isn't aliased somehow? It really can't optimize away writing the data back through the reference.

struct foo {
    setX(int); setY(int); 
    const char* current_pos;
} x;
parse_foo(x.current_pos, x);

I look at this and ask why you didn't just return the pointer Then you don't have a reference to a pointer and you don't have to worry about modify the original.

const char* parse_foo(const char* p, foo& out) {
    //use p;
    return p;
}

It also means you can call the function with an rvalue:

p = parse_foo(p+2, out);

jmucchiello 2009-10-29 19:00:03

ansaurus

tags:

views:

answers:

C++ optimization of reference-to-pointer argument

related questions