ansaurus

Question

How can I read and parse chunks of data into a Perl hash of arrays?

Answer 1

A:

Problems with your state machine, I think you may use this logic:

if(!$head)
{
  # seek and get head
} 
else
{
  if (!$total) 
  {
    # seek and get total
  }
  else
  {
    # seek run
    # if found :
      # push run to temp and decrease total
      # if total eq 0 :
        # push temp to bighash
        # reset head, total and temp
  }
}

oraz 2010-04-16 06:43:55

Answer 2

A:

The code looks correct but I'll strongly recommend adding:

use warnings
use strict

in everything but the most trivial one liners, also add

 elsif ($head && /^$/) {

to your last condition, to catch problems.

piotr 2010-04-16 06:52:18

@piotr: it still doesn't work. There is duplicate of arrays inside each hash.

neversaint 2010-04-16 06:56:12

can you paste your Dumper output?

piotr 2010-04-16 12:25:31

Answer 3

+1 A:

Based on your code, here is one way to do it

my $head;
my %result;
while (<>) {
    chomp;
    next if (/^\#/);

    if ( /^\d{1,2}:(\w+)/ ) {
        $result{$1} = []; 
        $head = $1; # $head will be used to know which key the following values
                    # will be assigned to
    }
    elsif (/^Run \#\d+: (\w+),.*/) {
        push(@{$result{$head}},$1); #Add the number found to the array that is assigned to the                        
                                    #last key found
    } 
}

ccheneson 2010-04-16 11:02:20

Answer 4

+2 A:

Replace

push @{$bighash{$head}}, [@temp];

with

push @{$bighash{$head}}, @temp;

You only have one array per $head value, right? The second statement adds all the values in @temp to the arrayref in $bighash{$head}. The first form, on the other hand, constructs an array reference out of the items in @temp and pushes that to $bighash{$head}, giving you an arrayref of arrayrefs.

Alternately you might want

$bighash{$head} = [@temp];

If you only expect to encounter each $head value once.

rjh 2010-04-16 11:04:53

Answer 5

+5 A:

An alternative way to do parsing like this is to read entire paragraphs. For more information on the input record separator ($/), see perlvar.

For example:

use strict;
use warnings;
use Data::Dumper qw(Dumper);
my %bighash;

{
    local $/ = "\n\n"; # Read entire paragraphs.
    while (my $paragraph = <>){
        # Filter out comments and handle extra blank lines between sections.
        my @lines = grep {/\S/ and not /^\#/} split /\n/, $paragraph;
        next unless @lines;

        # Extract the key and the SRR* items.
        my $key = $lines[0];
        $key =~ s/^\d+://;
        $bighash{$key} = [map { /^Run \#\d+: +(SRR\d+)/ ? $1 : () } @lines];
    }
}

print Dumper(\%bighash);

FM 2010-04-16 12:07:39

Absolutely. There is really no need to mess with anything else.

Sinan Ünür 2010-04-16 13:53:58

ansaurus

tags:

views:

answers:

How can I read and parse chunks of data into a Perl hash of arrays?

related questions