views: 457
answers: 7

I have designed a C# console application to merge and split huge files (about 4 GB in size) using an OOP design. It involves reading/writing XML, flat files, and images. I have classes for readers and writers.

The merging took about 00:12, while the splitting took more than 04:30 hours. I then improved the splitting to about 00:50 by distributing the output files into subdirectories rather than using a single directory.
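Roughly, the subdirectory bucketing I ended up with looks like this (a simplified sketch with illustrative names and bucket count, not my actual code):

using System.IO;

// Simplified sketch: spread the output files across a fixed number of subfolders
// instead of writing them all into one flat directory.
static class OutputBucketing
{
    const int BucketCount = 256; // illustrative; keeps each folder to a manageable size

    public static string GetOutputPath(string outputRoot, string fileName)
    {
        // Derive a stable bucket index from the file name.
        int bucket = (fileName.GetHashCode() & 0x7FFFFFFF) % BucketCount;
        string dir = Path.Combine(outputRoot, bucket.ToString("D3"));
        Directory.CreateDirectory(dir); // does nothing if the folder already exists
        return Path.Combine(dir, fileName);
    }
}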

My boss is asking me to convert everything to static procedural programming, with no objects. He says 00:12 for merging compared to 00:50 for splitting is not balanced. He wants the splitting done in 00:30 by converting the code to static methods.

Now, I know static calls are faster according to this. However, I disagree that making everything static will be better, since I will have to use "ref" and "out" parameters in methods.

My questions are:

  1. Why is splitting files into subdirectories so much faster than using a single output directory (i.e. for a huge number of files, >200,000)?
  2. Is there a better way than converting my code from objects to static methods to achieve higher performance?
+8  A: 

Did you profile your program?

You should profile your code. Objects are fast; unoptimized code is dead slow.

After you optimize it, this task will be I/O-bound anyway (meaning it spends most of its time waiting for the disk to fetch the next chunk of data).

And yeah, your boss is better off doing bossy things like playing golf or dilberting around, not telling you bullshit about software design. 'Cause you're not trying to play golf for him, are you?

alamar
I think that's a "no"
jerryjvl
Ask your boss for something like this: http://www.red-gate.com/products/ants_performance_profiler/index.htm
tanascius
... or just try out the demo
Stormenet
"cause you're not trying to play golf for him, do you?" Great!
Dmitriy Matveev
"Don't piss in our elevators; we don't ride around on your toilets, after all."
alamar
A: 

Many file systems have performance problems when the number of entries in a directory increases beyond a certain limit. Which one are you using?

If you add a logging function to the debug version of your program, you may get an indication of where the most time is spent. That's where the optimization should take place.
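For example, something along these lines (a hypothetical helper just to show the idea; the exact logging mechanism is up to you):

using System;
using System.Diagnostics;

// Hypothetical timing helper for the debug build: wrap a phase of the work
// and write how long it took to the debug output.
static class PhaseTimer
{
    public static T Measure<T>(string phase, Func<T> work)
    {
        Stopwatch sw = Stopwatch.StartNew();
        T result = work();
        sw.Stop();
        Debug.WriteLine(phase + ": " + sw.ElapsedMilliseconds + "ms");
        return result;
    }
}

// usage (LoadXml is a made-up example): var doc = PhaseTimer.Measure("read xml", () => LoadXml(path));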

Pim
+2  A: 

I can answer number 1: having many files in a single directory gives you poor performance. It doesn't have anything to do with your code - it's a Windows thing (or an NTFS thing, I don't know). Splitting things up into different subdirectories indeed improves performance a lot.

As for number 2, I highly doubt that using static methods will make a huge difference. Using static methods is faster, but only marginally so. We're talking microseconds here. There's probably something else going on. There's only one way to find out, and that is, like alamar says, to profile your code.

You can use a tool like ANTS to profile your code and see which operations are the bottleneck. It can list the time spent in every method of your program, so you can see what takes the most time, which could really be anything. At least then you know what to optimize.

Razzie
+4  A: 

The difference between an instance call and a static call is so minuscule that I would happily wager it has nothing to do with your performance issue. At all. Yes, a static call is technically faster (by a tiny, tiny amount), but that is nothing compared to all the file I/O you are doing. As has already been stated: profile your code, and stop worrying about things like this (premature optimisation). Most likely the bottleneck is poor collection performance, perhaps fixable with a dictionary, etc.

Timings:

static: 154ms
instance: 156ms

So 2ms difference over 50M calls! Forget about it...

Based on:

using System;
using System.Diagnostics;
using System.Runtime.CompilerServices;

class Program
{
    static void Main()
    {
        StaticMethod(); // warm up the JIT
        Program p = new Program();
        p.InstanceMethod(); // warm up the JIT

        const int LOOP = 50000000; // 50M
        Stopwatch watch = Stopwatch.StartNew();
        for (int i = 0; i < LOOP; i++) StaticMethod();
        watch.Stop();
        Console.WriteLine("static: " + watch.ElapsedMilliseconds + "ms");

        watch = Stopwatch.StartNew();
        for (int i = 0; i < LOOP; i++) p.InstanceMethod();
        watch.Stop();
        Console.WriteLine("instance: " + watch.ElapsedMilliseconds + "ms");
    }
    // NoInlining/NoOptimization keep the empty methods from being optimised away.
    [MethodImpl(MethodImplOptions.NoInlining | MethodImplOptions.NoOptimization)]
    void InstanceMethod() { }
    [MethodImpl(MethodImplOptions.NoInlining | MethodImplOptions.NoOptimization)]
    static void StaticMethod() { }
}


edit:

If we assume (for example) that we create a new object every 20 calls (if (i % 20 == 0) p = new Program();), then the metrics change to:

static: 174ms
instance: 873ms

Again - nowhere near enough to indicate a bottleneck when spread over 50M calls, and we're still under a second!
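For reference, the instance loop with that change dropped in (replacing the second loop in the program above):

// the instance loop, re-creating the object every 20 iterations
watch = Stopwatch.StartNew();
for (int i = 0; i < LOOP; i++)
{
    if (i % 20 == 0) p = new Program();
    p.InstanceMethod();
}
watch.Stop();
Console.WriteLine("instance: " + watch.ElapsedMilliseconds + "ms");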

Marc Gravell
You're doing it slightly wrong; the instance example should feature some object instantiations (like, on every tenth iteration, p = new Program()).
alamar
@alamar - that depends on how many objects are created during execution...
Marc Gravell
Some definitely would be; otherwise it's still nearly static.
alamar
+4  A: 

Your task sounds like it should definitely be IO-bound, not CPU-bound. Micro-optimising by removing proper OO design would be madness. The difference between static methods and instance methods is usually unmeasurably small (if it's even present) anyway.

As alamar says, you should profile your app before going any further. There's a free profiler available from Microsoft or you could use JetBrains dotTrace profiler. There are others, of course - those are the two I've used.

Just as an indication of whether it's IO-bound or CPU-bound, if you run task manager while the app is running, how much CPU is the process taking? And is the disk thrashing the whole time?
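If you want to check that in code rather than eyeballing task manager, a rough sketch (RunSplit here is just a placeholder for your actual splitting routine) is to compare CPU time consumed against wall-clock time:

using System;
using System.Diagnostics;

class IoBoundCheck
{
    static void Main()
    {
        // Compare CPU time consumed to wall-clock time for the splitting phase.
        Process proc = Process.GetCurrentProcess();
        TimeSpan cpuBefore = proc.TotalProcessorTime;
        Stopwatch wall = Stopwatch.StartNew();

        RunSplit(); // placeholder for the real splitting code

        wall.Stop();
        proc.Refresh();
        TimeSpan cpuUsed = proc.TotalProcessorTime - cpuBefore;
        Console.WriteLine("wall: {0:F1}s, cpu: {1:F1}s",
            wall.Elapsed.TotalSeconds, cpuUsed.TotalSeconds);
        // CPU time far below wall time means you're mostly waiting on the disk (I/O-bound).
    }

    static void RunSplit() { /* your splitting code here */ }
}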

Putting a vast number of files in a directory will slow down access to that directory, but only when you actually create or open a file, or list the files in the directory. I'm surprised it makes quite that much difference, admittedly. However, having 200,000 files in a directory sounds pretty unmanageable anyway. Using a hierarchical approach is likely to be better in terms of using these files afterwards.

Why does your boss think that the merge and split should take the same amount of time in the first place?

Jon Skeet
How do you profile application execution performance with the Microsoft CLR memory profiler?
Dmitriy Matveev
It's not just a memory profiler. It provides call graphs too, so you can see if you're calling certain methods vast numbers of times.
Jon Skeet
A: 
  1. It is impossible to answer this without knowing your file system. But as others have noted, file systems generally are not optimized for directories with massive numbers of entries.
  2. I think rejecting OOP due to a possible (you haven't profiled) ~10% speed increase is ridiculous, particularly when the page says, "please do not take this data too literally".

Finally, though you haven't given much information, I see no reason to think this "imbalance" is odd. Writing is slower than reading, sometimes significantly so.

Matthew Flaschen
A: 

My answers are:

  1. Depending on your OS and file system, performance starts to degrade after 20-30k files/subfolders. It's a fact of life. See NTFS Performance and Large Volumes of Files and Directories.

  2. The claim that non-OO code is faster than OO code is ridiculous. You cannot know what your performance bottleneck is until you profile the code. See the answers to this question for good information: Performance anti-patterns

Binary Worrier