ansaurus

Question

Is 'shift' evil for processing Perl subroutine parameters?

Answer 1

+1 A:

It isn't intrinsically evil, but using it to pull off the arguments of a subroutine one by one is comparatively slow and requires a greater number of lines of code.

David Dorward 2009-08-25 13:53:09

-1 for incorrect information. As @John Siracusa says, it's a common Perl idiom to use shift to process arguements.

Tony Miller 2009-08-25 13:57:27

I don't see where he contested that it wasn't a common Perl idiom. He did say it was slower (which is true, because the argument array must be manipulated instead of just accessed).

JoshJordan 2009-08-25 14:03:40

Tony is right, kind of.Based on my benchmarking, shift is faster for 2 parameters or less.List assign is faster for 3 or more.Since anything that takes 3 or 4 or 10 params should probably not be doing positional parameters in that way, using the shift style is reasonable.

Adam Kennedy 2009-08-25 15:27:30

@JoshJordan: It would be slower in all cases for the reason you gave, but... it's been heavily optimized in the perl interpreter, resulting in it being faster to shift a single argument than to copy it, as shown by the benchmarks in John Siracusa's answer. (List assignment is still faster for grabbing a large number of params, though.)

Dave Sherohman 2009-08-25 19:10:08

Answer 2

+34 A:

The use of shift to unpack arguments is not evil. It's a common convention and may be the fastest way to process arguments (depending on how many there are and how they're passed). Here's one example of a somewhat common scenario where that's the case: a simple accessor.

use Benchmark qw(cmpthese);

sub Foo::x_shift { shift->{'a'} }
sub Foo::x_ref   { $_[0]->{'a'} }
sub Foo::x_copy  { my $s = $_[0]; $s->{'a'} }

our $o = bless {a => 123}, 'Foo';

cmpthese(-2, { x_shift => sub { $o->x_shift },
               x_ref   => sub { $o->x_ref   },
               x_copy  => sub { $o->x_copy  }, });

The results on perl 5.8.8 on my machine:

            Rate  x_copy   x_ref x_shift
x_copy  772761/s      --    -12%    -19%
x_ref   877709/s     14%      --     -8%
x_shift 949792/s     23%      8%      --

Not dramatic, but there it is. Always test your scenario on your version of perl on your target hardware to find out for sure.

shift is also useful in cases where you want to shift off the invocant and then call a SUPER:: method, passing the remaining @_ as-is.

sub my_method
{
  my $self = shift;
  ...
  return $self->SUPER::my_method(@_);
}

If I had a very long series of my $foo = shift; operations at the top of a function, however, I might consider using a mass copy from @_ instead. But in general, if you have a function or method that takes more than a handful of arguments, using named parameters (i.e., catching all of @_ in a %args hash or expecting a single hash reference argument) is a much better approach.

John Siracusa 2009-08-25 13:54:03

You should show when it's actually faster to no use shift. I think with either 2 or 3 args it's faster to copy @_. As a rule of thumb I use shift when there's 1 arg and copy @_ when there's more than 1.

mpeters 2009-08-25 18:06:30

Answer 3

+15 A:

It is not evil, it is a taste sort of thing. You will often see the styles used together:

sub new {
    my $class = shift;
    my %self = @_;
    return bless \%self, $class;
}

I tend to use shift when there is one argument or when I want to treat the first few arguments differently than the rest.

Chas. Owens 2009-08-25 13:59:34

Answer 4

+6 A:

Assigning @_ to a list can bring some helpul addtional features.

It makes it slightly easier to add additional named parameters at a later date, as you modify your code Some people consider this a feature, similar to how finishing a list or a hash with a trailing ',' makes it slightly easier to append members in the future.

If you're in the habit of using this idiom, then shifting the arguments might seem harmful, because if you edit the code to add an extra argument, you could end up with a subtle bug, if you don't pay attention.

e.g.

sub do_something {
 my ($foo) = @_;
}

later edited to

sub do_something {
  my ($foo,$bar) = @_; # still works
}

however

sub do_another_thing {
  my $foo = shift;
}

If another colleague, who uses the first form dogmatically (perhaps they think shift is evil) edits your file and absent-mindedly updates this to read

sub do_another_thing {
  my ($foo, $bar)  = shift; # still 'works', but $bar not defined
}

and they may have introduced a subtle bug.

Assigning to @_ can be more compact and efficient with vertical space, when you have a small number of parameters to assign at once. It also allows for you to supply default arguments if you're using the hash style of named function parameters

e.g.

 my (%arguments) = (user=>'defaultuser',password=>'password',@_);

I would still consider it a question of style / taste. I think the most important thing is to apply one style or the other with consistency, obeying the principle of least surprise.

cms 2009-08-25 14:16:01

I don't think that `my ($foo, $bar) = shift;` is a *subtle* bug at all. It's a brick to the head, no?

Telemachus 2009-08-25 14:17:37

I've seen it happen too many times. Stupid mistake of course, but it's surprisingly easy to become 'code blind' when you're reading lots of it.

cms 2009-08-25 14:21:38

Typically, if you unpack by shift, an additional param is added by adding another line. `my $foo = shift; my $bar = shift;` I've never seen what you describe

daotoad 2009-08-25 15:56:09

It's just my guess as to why someone might think shifting is 'evil'. I wouldn't agree with that conclusion myself.

cms 2009-08-25 16:20:17

Answer 5

+1 A:

I don't think shift is evil. The use of shift shows your willingness to actually name variables - instead of using $_[0].

Personally, I use shift when there's only one parameter to a function. If I have more than one parameter, I'll use the list context.

 my $result = decode($theString);

 sub decode {
   my $string = shift;

   ...
 }

 my $otherResult = encode($otherString, $format);

 sub encode {
   my ($string,$format) = @_;
   ...
 }

Kevin 2009-08-25 14:20:05

Answer 6

+7 A:

This is, as others have said, a matter of taste.

I generally prefer to shift my parameters into lexicals because it gives me a standard place to declare a group of variables that will be used in the subroutine. The extra verbosity gives me a nice place to hang comments. It also makes it easy to provide default values in a tidy fashion.

sub foo {
    my $foo = shift;       # a thing of some sort.
    my $bar = shift;       # a horse of a different color.
    my $baz = shift || 23; # a pale horse.

    # blah
}

If you are really concerned about the speed of calling your routines, don't unpack your arguments at all--access them directly using @_. Be careful, those are references to the caller's data you are working with. This is a common idiom in POE. POE provides a bunch of constants that you use to get positional parameters by name:

sub some_poe_state_handler {
    $_[HEAP]{some_data} = 'chicken';
    $_[KERNEL]->yield('next_state');
}

Now the big stupid bug you can get if you habitually unpack params with shift is this one:

sub foo {
    my $foo = shift;
    my $bar = shift;
    my @baz = shift;

    # I should really stop coding and go to bed.  I am making dumb errors.

}

I think that consistent code style is more important than any particular style. If all my coworkers used the list assignment style, I'd use it too.

If my coworkers said there was a big problem using shift to unpack, I'd ask for a demonstration of why it is bad. If the case is solid, then I'd learn something. If the case is bogus, I could then refute it and help stop the spread of anti-knowledge. Then I'd suggest that we determine a standard method and follow it for future code. I might even try to set up a Perl::Critic policy to check for the decided upon standard.

daotoad 2009-08-25 16:15:32

+1 but note that if the arguments need comments (and there are more than two of them), better to use named arguments and provide a defaults hash.

Sinan Ünür 2009-08-25 18:42:01

I usually set my limit at 3, but absolutely.

daotoad 2009-08-25 20:59:02

Answer 7

A:

Perl::Critic is your friend here. It follows the the "standards" set up in Damian Conway's book Perl Best Practices. Running it with --verbose 11 gives you an explanation on why things are bad. Not unpacking @_ first in your subs is a severity 4 (out of 5). E.g:

echo 'package foo; use warnings; use strict; sub quux { my foo= shift; my (bar,baz) = @_;};1;' | perlcritic -4 --verbose 11

Always unpack @_ first at line 1, near 'sub quux { my foo= shift; my (bar,baz) = @_;}'.
  Subroutines::RequireArgUnpacking (Severity: 4)
    Subroutines that use `@_' directly instead of unpacking the arguments to
    local variables first have two major problems. First, they are very hard
    to read. If you're going to refer to your variables by number instead of
    by name, you may as well be writing assembler code! Second, `@_'
    contains aliases to the original variables! If you modify the contents
    of a `@_' entry, then you are modifying the variable outside of your
    subroutine. For example:

       sub print_local_var_plus_one {
           my ($var) = @_;
           print ++$var;
       }
       sub print_var_plus_one {
           print ++$_[0];
       }

       my $x = 2;
       print_local_var_plus_one($x); # prints "3", $x is still 2
       print_var_plus_one($x);       # prints "3", $x is now 3 !
       print $x;                     # prints "3"

    This is spooky action-at-a-distance and is very hard to debug if it's
    not intentional and well-documented (like `chop' or `chomp').

    An exception is made for the usual delegation idiom
    `$object->SUPER::something( @_ )'. Only `SUPER::' and `NEXT::' are
    recognized (though this is configurable) and the argument list for the
    delegate must consist only of `( @_ )'.

olle 2009-08-25 16:30:45

This appears to be mistakenly attributing a case of using @_ directly, rather than taking local copies. This is indeed, almost always the wrong thing to do. However, neither shifting, nor assigning @_ to a list suffer from the problems described in the output.However, cargo-culting around this utility's output might explain why people think that operators like shift are evil, I suppose.

cms 2009-08-25 17:06:08

Don't put too much faith into Perl Best Practices. There are too many situations to apply a single rule to all of them.

brian d foy 2009-08-25 23:08:25

Answer 8

+1 A:

There is an optimization for list assignment.

The only reference I could find, is this one.

5.10.0 inadvertently disabled an optimization, which caused a measurable performance drop in list assignment, such as is often used to assign function parameters from @_ . The optimisation has been re-instated, and the performance regression fixed.

This is an example of the affected performance regression.

sub example{
  my($arg1,$arg2,$arg3) = @_;
}

Brad Gilbert 2009-08-25 17:48:12

If anybody has a better reference, please feel free to leave a comment.

Brad Gilbert 2009-08-25 17:48:47

ansaurus

tags:

views:

answers:

Is 'shift' evil for processing Perl subroutine parameters?

related questions