ansaurus

Question

Answer 1

+14 A:

For me, the output looks a bit different:

 testA:  1 wallclock secs ( 0.16 usr +  0.11 sys =  0.27 CPU) @ 37.04/s (n=10)
        (warning: too few iterations for a reliable count)
 testB:  0 wallclock secs ( 0.09 usr +  0.02 sys =  0.11 CPU) @ 90.91/s (n=10)
        (warning: too few iterations for a reliable count)

Benchmarking this with a decent value of iterations (I chose 100,000), I get this:

 testA: 23 wallclock secs (12.15 usr + 10.05 sys = 22.20 CPU) @ 4504.50/s (n=100000)
 testB: 11 wallclock secs ( 6.02 usr +  5.57 sys = 11.59 CPU) @ 8628.13/s (n=100000)

A look at the code tells me that those two subs probably spend most of their time globbing the files, so I did this:

my @files = glob "C:\\Windows\\System32\\*";
my $time = timethese( 1_000_000, {
                testA => sub {
                                map $_->[0],
                                    sort {$a->[1] <=> $b->[1]}
                                        map [$_, -s $_],
                                             @files;
                         },
                testB => sub {
                            sort { -s $a <=> -s $b } @files;
                         },
                }
        );

And get:

 testA: 103 wallclock secs (56.93 usr + 45.61 sys = 102.54 CPU) @ 9752.29/s (n=1000000)
 testB: -1 wallclock secs ( 0.12 usr +  0.00 sys =  0.12 CPU) @ 8333333.33/s (n=1000000)
        (warning: too few iterations for a reliable count)

Something smells fishy here, doesn't it?

So, let's take a look at the docs:

perldoc -f sort

In scalar context, the behaviour of "sort()" is undefined.

Aha! So let's try it again:

my @files = glob "C:\\Windows\\System32\\*";
my $time = timethese( 100_000, {
                testA => sub {
                              my @arr=  map $_->[0],
                                    sort {$a->[1] <=> $b->[1]}
                                        map [$_, -s $_],
                                             @files;
                         },
                testB => sub {
                            my @arr = sort { -s $a <=> -s $b } @files;
                         },
                }
        );

This gives me:

 testA: 12 wallclock secs ( 7.44 usr +  4.55 sys = 11.99 CPU) @ 8340.28/s (n=100000)
 testB: 34 wallclock secs ( 6.44 usr + 28.30 sys = 34.74 CPU) @ 2878.53/s (n=100000)

So. To answer your questions: A Schwartzian transform will help you whenever you use it in a meaningful way. Benchmarking will show this when you benchmark in a meaningful way.

innaM 2009-02-27 11:15:45

The problem was with sort returning a scalar value as you mention. Changing that fixed the problem. Could you please check if the map expression is "wrong"? The manual does state that map works as map BLOCK LIST // map EXPR, LIST.In my case both of them work just fine.

muteW 2009-02-27 11:40:15

I thought the warnings I got where triggered by that use of map. I'll check.

innaM 2009-02-27 12:23:53

You're right. But where did those warnings go??

innaM 2009-02-27 12:25:48

One possible explanation could be that the warnings were because the directory entries return a size of 0. Was this being stored as undef in $a->[1] or in $b->[1]?

muteW 2009-02-27 15:48:07

No. But you got me on the right track. The dir I used for testing at first contains a very, very weird file. I'll delete that bugger and then edit my answer a bit.

innaM 2009-02-27 16:06:48

Historical note: enough different potential behaviors have been suggested for scalar-context-sort that it was decided it was best to leave it officially undefined. Here's one: http://www.nntp.perl.org/group/perl.perl5.porters/2004/06/msg92477.html

ysth 2009-03-01 04:11:35

Answer 2

+5 A:

Apart from the excellent answer by Manni, another factor to consider is that some caching might be going on in your example, e.g. on the file system level.

If the same file is accessed multiple times, the FS might do some caching resulting in less drastic time savings of the Schwartzian Transform than expected.

2009-02-27 12:38:24

Good point. The costlier it is to obtain your sorting criteria, the more dramatic the effects of the transform.

innaM 2009-02-27 13:18:29

Answer 3

+2 A:

I know this is technically already answered rather completely, but I had a couple relevant side notes.

First, I usually prefer cmpthese() to timethese() because it tells you which is better in a human readable and informative way instead of just presenting times.

Second, for theoretical problems like this, I usually try to avoid syscalls entirely where possible, since the kernel can make you wait forever if it's in the mood to do so -- not really a fair test.

Thrid, It's interesting to note that the transform is always more expensive if the list is already sorted: If you set $point_of_interest=2, the transform wins; but if you set $point_of_interest=1, the regular sort will win. I find that result quite interesting and worth mentioning.

use strict;
use Benchmark qw(cmpthese);

my $point_of_interest = 2;
my @f = (1 .. 10_000) x $point_of_interest;
my @b;

cmpthese( 500, {
    transform => sub {
        @b = 
        map  {$_->[0]}
        sort {$a->[1] <=> $b->[1]}
        map  {[$_, expensive($_)]}
        @f
    },  
    vanilla => sub { @b = sort { expensive($a) <=> expensive($b) } @f },
}); 

sub expensive {
    my $arg = shift;

    $arg .= "_3272";
    $arg += length "$arg";
    $arg = $arg ** 17;
    $arg = sqrt($arg);

    $arg;
}

jettero 2009-02-27 15:59:37

Curiously, before this post I wasn't a liar, but after it I am. My cmpthese() seems to always find vanilla faster no matter what, though my timethese() seems to show what I claimed above.

jettero 2009-02-27 16:09:37

I get the expected results for both cmpthese and timethese. I disagree about your point about the readability of the output of cmpthese. I find it hard to read. Maybe, just maybe that's why you now think you're a liar?

innaM 2009-02-27 16:19:40

No, clearly that's subjective. I think I actually prefer timethese() now that I notice how much different the result is under cmpthese(). I wonder why the difference. Seems to happen under my perl5.6.1 too.

jettero 2009-02-27 16:35:03

Do you really think that cmpthese gave you the wrong results? At the risk of repeating myself, I personally find it very hard to read the output of cmpthese and often confuse the fastest and the slowest results.

innaM 2009-02-27 16:59:44

It is possible I simply misread them since I'm getting the results I expect now. It's more likely that I simply read the words wrong than the output though -- I'm dyslexic.

jettero 2009-02-27 20:05:07

Answer 4

+3 A:

For a thorough examination of this case, see my Perlmonks post "Wasting Time Thinking about Wasted Time". I also expand on that in the "Benchmarking" chapter in Mastering Perl. As others have already noted, it's the benchmarking code that's the problem, not the transform. It's a mistake in Intermediate Perl.

However, to answer your question, a cached-key transform is useful when the sort-key computation is expensive and you'd have to compute it many times. There's a trade off between the extra work to cache the sort key and the cycles you save doing it. Typically, the more elements you have to sort the better the cached-key transform will perform.

brian d foy 2009-02-27 19:57:39

Wow. How could I miss this all those years?

innaM 2009-02-27 20:21:54

ansaurus

tags:

views:

answers:

When are Schwartzian Transforms useful?

related questions