views: 706
answers: 11

Before you react from the gut, as I did initially, please read the whole question. I know they make you feel dirty, I know we've all been burned before, and I know it's not "good style", but: are public fields ever OK?

I'm working on a fairly large-scale engineering application that creates and works with an in-memory model of a structure (anything from a high-rise building to a bridge to a shed; it doesn't matter). There is a TON of geometric analysis and calculation involved in this project. To support this, the model is composed of many tiny immutable read-only structs representing things like points, line segments, etc. Some of the values of these structs (like the coordinates of the points) are accessed tens or hundreds of millions of times during a typical program execution. Because of the complexity of the models and the volume of calculation, performance is absolutely critical.

I feel that we're doing everything we can to optimize our algorithms, performance-test to find bottlenecks, use the right data structures, etc. I don't think this is a case of premature optimization. Performance tests show order-of-magnitude (at least) boosts when accessing fields directly rather than through a property on the object. Given this information, and the fact that we can also expose the same information as properties to support data binding and other situations... is this OK? Remember: read-only fields on immutable structs. Can anyone think of a reason I'm going to regret this?

Here's a sample test app:


using System;
using System.Diagnostics;

struct Point {
    public Point(double x, double y, double z) {
        _x = x;
        _y = y;
        _z = z;
    }

    public readonly double _x;
    public readonly double _y;
    public readonly double _z;

    public double X { get { return _x; } }
    public double Y { get { return _y; } }
    public double Z { get { return _z; } }
}

class Program {
    static void Main(string[] args) {
        const int loopCount = 10000000;

        var point = new Point(12.0, 123.5, 0.123);

        var sw = new Stopwatch();
        double x, y, z;
        double calculatedValue;
        sw.Start();
        for (int i = 0; i < loopCount; i++) {
            x = point._x;
            y = point._y;
            z = point._z;
            calculatedValue = point._x * point._y / point._z;
        }
        sw.Stop();
        double fieldTime = sw.ElapsedMilliseconds;
        Console.WriteLine("Direct field access: " + fieldTime);

        sw.Reset();
        sw.Start();
        for (int i = 0; i < loopCount; i++) {
            x = point.X;
            y = point.Y;
            z = point.Z;
            calculatedValue = point.X * point.Y / point.Z;
        }
        sw.Stop();
        double propertyTime = sw.ElapsedMilliseconds;
        Console.WriteLine("Property access: " + propertyTime);

        double totalDiff = propertyTime - fieldTime;
        Console.WriteLine("Total difference: " + totalDiff);
        double averageDiff = totalDiff / loopCount;
        Console.WriteLine("Average difference: " + averageDiff);

        Console.ReadLine();
    }
}


result:
Direct field access: 3262
Property access: 24248
Total difference: 20986
Average difference: 0.00020986


It's only 21 seconds in total, but why not take it?

+18  A: 

Given that you're dealing with immutable objects with readonly fields, I would say that you have hit the one case where I don't find public fields to be a dirty habit.

Fredrik Mörk
Agreed, but ideally only when there is a demonstrable performance requirement.
Marc Gravell
+2  A: 

I know you feel kind of dirty doing this, but it isn't uncommon for rules and guidelines to get shot to hell when performance becomes an issue. For example, quite a few high-traffic websites using MySQL rely on data duplication and denormalized tables. Others go even crazier.

Moral of the story - it may go against everything you were taught or advised, but the benchmarks don't lie. If it works better, just do it.

ryeguy
I'd add a caveat: "if it works better, *and* you actually need the better performance".
LukeH
@Luke Many people just assume you need the performance. I'd add one more caveat that you don't need the performance until there's a problem.
Imagist
+8  A: 

IMO, the "no public fields" rule is one of those rules that is technically correct but, unless you are designing a library intended for public consumption, unlikely to cause you any problems if you break it.

Before I get too massively downvoted, I should add that encapsulation is a good thing. Given the invariant "the Value property must be null if HasValue is false", this design is flawed:

class A {
    public bool HasValue;
    public object Value;
}

However, given that invariant, this design is equally flawed:

class A {
    public bool HasValue { get; set; }
    public object Value { get; set; }
}

The correct design is

class A {
    public bool HasValue { get; private set; }
    public object Value { get; private set; }

    public void SetValue(bool hasValue, object value) {
        if (!hasValue && value != null)
            throw new ArgumentException();
        this.HasValue = hasValue;
        this.Value    = value;
    }
}

(and even better would be to provide an initializing constructor and make the class immutable).
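
A minimal sketch of that immutable version, for instance:

class A {
    public bool HasValue { get; private set; }
    public object Value { get; private set; }

    // The invariant is checked once, at construction; with no setters
    // exposed, it can never be violated afterwards.
    public A(bool hasValue, object value) {
        if (!hasValue && value != null)
            throw new ArgumentException("value must be null when hasValue is false");
        this.HasValue = hasValue;
        this.Value    = value;
    }
}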

erikkallen
You had me on edge the entire post as to whether to upvote you, but you earned my upvote with the immutability recommendation.
Imagist
+4  A: 

If you really need that extra performance, then it's probably the right thing to do. If you don't need the extra performance then it's probably not.

Rico Mariani has a couple of related posts on this.

LukeH
A: 

Here's some scenarios where it is OK (from the Framework Design Guidelines book):

  • DO use constant fields for constants that will never change.
  • DO use public static readonly fields for predefined object instances.

And where it is not:

  • DO NOT assign instances of mutable types to readonly fields.
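
To make those concrete, here is a quick sketch of each case (type names are hypothetical, not from the book):

using System.Collections.Generic;

static class PhysicalConstants {
    // DO: a constant field for a value that will never change.
    public const double SpeedOfLight = 299792458.0; // metres per second
}

class Color {
    public readonly byte R, G, B;
    public Color(byte r, byte g, byte b) { R = r; G = g; B = b; }

    // DO: public static readonly fields for predefined instances.
    public static readonly Color Red   = new Color(255, 0, 0);
    public static readonly Color White = new Color(255, 255, 255);
}

class BadExample {
    // DO NOT: readonly fixes the reference, not the object, so callers
    // can still mutate the list through this field.
    public static readonly List<string> Names = new List<string>();
}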

From what you have stated, I don't understand why your trivial properties aren't being inlined by the JIT.

RichardOD
+1  A: 

Personally, the only time I would consider using public fields is in a very implementation-specific private nested class.

Other times it just feels too "wrong" to do it.

The CLR will take care of performance by optimising out the method/property (in release builds), so that shouldn't be an issue.

NathanE
Right: private implementation details. And even there I would only do it if there are pressing performance or time-pressure reasons. +1
peSHIr
+1  A: 

Try compiling a release build and running the exe directly instead of through the debugger. If the application is run under a debugger, the JIT compiler will not inline the property accessors. I was not able to replicate your results; in fact, each test I ran indicated that there was virtually no difference in execution time.

But, like the others, I am not completely opposed to direct field access, especially because it is easy to make the field private and add a public property accessor at a later time without needing to make any other code modifications to get the application to compile.

Edit: Okay, my initial tests used an int data type instead of double. I see a huge difference when using doubles. With ints, direct access vs. property access is virtually the same; with doubles, property access is about 7x slower than direct access on my machine. This is somewhat puzzling to me.

Also, it is important to run the tests outside of the debugger. Even in release builds the debugger adds overhead which skews the results.

Brian Gideon
Great points. Here are the results I get now. Release build (through debugger): Direct field access: 112, Property access: 494, Total difference: 382, Average difference: 3.82E-05. Release build (direct): Direct field access: 4, Property access: 115, Total difference: 111, Average difference: 1.11E-05. Certainly not no difference, but arguably negligible.
MKing
Interesting. I admit that I crafted my own test and did not test your exact code, but I am surprised that you saw that much difference. The difference I saw was always less than 1%. If I get time I will use your exact code.
Brian Gideon
Good analysis
Marc Gravell
Okay, I used your exact code and got some surprising results. I edited my answer to reflect that. It seems that properties with double return types are not getting inlined? Strange.
Brian Gideon
You cannot change a field to a property, recompile only one side, and then expect existing references to work. Field references are not compiled the same way as property references. Try using an external library with a field referenced from a console app; now change the field to a property and recompile only the library. Run the app, and the reference will fail.
Brett Ryan
@Brett, yeah, that is because you have technically changed the public interface. You will also have a problem with ref parameters, code that uses reflection may fail, and attributes attached to a field may not be allowable on a property. I could go on and on. But for the most part the change is pretty easy.
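
For example, a hypothetical two-assembly sketch of the breakage Brett describes:

// Library.dll, version 1 -- App.exe is compiled against this:
public class Settings {
    public int Timeout = 30;   // a public field
}

// The consuming code in App.exe:
//     var s = new Settings();
//     Console.WriteLine(s.Timeout);   // compiles to IL: ldfld int32 Settings::Timeout

// Library.dll, version 2 -- the field becomes a property and only the
// library is recompiled:
//
//     public int Timeout { get; set; }   // accesses now compile to a get_Timeout() call
//
// Running the old App.exe against version 2 fails with a
// MissingFieldException: field access and property access use different
// IL instructions (ldfld vs. call), so the app must be recompiled.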
Brian Gideon
+1  A: 

Not that I disagree with the other answers, or with your conclusion... but I'd like to know where you got the order-of-magnitude performance difference stat from. As I understand it, any simple property (with no code other than direct access to the field) should be inlined by the JIT compiler as a direct access anyway.

The advantage of using properties even in these simple cases (in most situations) is that writing it as a property allows for future changes that might modify the property. (Although in your case there would not be any such changes in the future, of course.)

Charles Bretana
The order-of-magnitude stat comes from testing; see the results in the question: Direct field access: 3262, Property access: 24248. In this case it's more like 7.4x slower (not 10x), but similar, and other tests produce different results.
MKing
A: 

If you modify your test to use the temporary variables you assign, rather than accessing the properties directly in your calculation, you will see a large performance improvement:

        sw.Start();
        for (int i = 0; i < loopCount; i++)
        {
            x = point._x;
            y = point._y;
            z = point._z;
            calculatedValue = x * y / z;
        }
        sw.Stop();
        double fieldTime = sw.ElapsedMilliseconds;
        Console.WriteLine("Direct field access: " + fieldTime);

        sw.Reset();
        sw.Start();
        for (int i = 0; i < loopCount; i++)
        {
            x = point.X;
            y = point.Y;
            z = point.Z;
            calculatedValue = x * y / z;
        }
        sw.Stop();
        double propertyTime = sw.ElapsedMilliseconds;
        Console.WriteLine("Property access: " + propertyTime);
Venr
A: 

Perhaps I'll repeat someone else, but here's my take, if it helps.

Teachings exist to give you the tools you need to handle situations like this with some ease.

The Agile software development methodology says to deliver the product to your client first, no matter what the code looks like; afterwards you may optimize and make your code "beautiful", according to the state of the art.

Here, either you or your client requires performance. Within your project, PERFORMANCE is CRUCIAL, if I understand correctly.

So I guess you'll agree that we don't care what the code looks like or whether it respects the "art". Do what you have to do to make it fast and powerful! Properties let your code "format" the data I/O if required, but a property getter is a method call that then reads the backing member, so a non-inlined property access costs an extra indirection compared to reading the field directly. If performance is that critical, just do it and make your immutable members public. :-)

This reflects some others' points of view too, if I read correctly. :)

Have a good day!

Will Marcouiller
+22  A: 

Your test isn't really being fair to the property-based versions. The JIT is smart enough to inline simple properties so that they have a runtime performance equivalent to that of direct field access, but it doesn't seem smart enough (today) to detect when the properties access constant values.

In your example, the entire loop body of the field access version is optimized away, becoming just:

for (int i = 0; i < loopCount; i++)
00000025  xor         eax,eax 
00000027  inc         eax  
00000028  cmp         eax,989680h 
0000002d  jl          00000027 
}

whereas the second version is actually performing the floating-point division on each iteration:

for (int i = 0; i < loopCount; i++)
00000094  xor         eax,eax 
00000096  fld         dword ptr ds:[01300210h] 
0000009c  fdiv        qword ptr ds:[01300218h] 
000000a2  fstp        st(0) 
000000a4  inc         eax  
000000a5  cmp         eax,989680h 
000000aa  jl          00000096 
}

Making just two small changes to your application, to make it more realistic, renders the two operations practically identical in performance.

First, randomize the input values so that they aren't constants and the JIT can't remove the division entirely.

Change from:

Point point = new Point(12.0, 123.5, 0.123);

to:

Random r = new Random();
Point point = new Point(r.NextDouble(), r.NextDouble(), r.NextDouble());

Secondly, ensure that the results of each loop iteration are used somewhere:

Before each loop, set calculatedValue = 0 so they both start at the same point. After each loop, call Console.WriteLine(calculatedValue.ToString()) to make sure the result is "used" and the compiler doesn't optimize it away. Finally, change the body of the loop from "calculatedValue = ..." to "calculatedValue += ..." so that every iteration's result is used.
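
Putting those edits together, the modified field-access loop looks roughly like this (the property loop is changed the same way):

var r = new Random();
var point = new Point(r.NextDouble(), r.NextDouble(), r.NextDouble());

double calculatedValue = 0;   // both loops start from the same value
sw.Start();
for (int i = 0; i < loopCount; i++) {
    // += makes every iteration's result observable, so the JIT
    // cannot discard the loop body as dead code
    calculatedValue += point._x * point._y / point._z;
}
sw.Stop();
Console.WriteLine(calculatedValue.ToString()); // "use" the final result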

On my machine, these changes (with a release build) yield the following results:

Direct field access: 133
Property access: 133
Total difference: 0
Average difference: 0

Just as we expect, the x86 for each of these modified loops is identical (except for the loop address):

000000dd  xor         eax,eax 
000000df  fld         qword ptr [esp+20h] 
000000e3  fmul        qword ptr [esp+28h] 
000000e7  fdiv        qword ptr [esp+30h] 
000000eb  fstp        st(0) 
000000ed  inc         eax  
000000ee  cmp         eax,989680h 
000000f3  jl          000000DF (This loop address is the only difference)
StarPacker
Great answer. I knew the difference shouldn't be that big.
Meta-Knight
Thanks for such a thorough response! You're right, and this more closely mimics what would happen in my actual application.
MKing
Also, how did you get the JIT compiled x86 code?
MKing
Just run the optimized release build outside of the debugger (starting a managed process with a debugger attached changes the JIT behavior). Then, while it is waiting at the Console.ReadLine at the end, attach the VS debugger to the process (Tools -> Attach To Process). Finally, right-click a stack frame for the relevant method in the Call Stack window and select "Go To Disassembly".
StarPacker
Awesome answer. This demonstrates a *crucial* factor when measuring performance: ALWAYS MEASURE REAL-WORLD PERFORMANCE. Unless you are an expert on building benchmarks, you WILL write a benchmark that measures some completely meaningless thing, like this one did.
Eric Lippert