ansaurus

Question

How can I parse just part of a file with Perl?

Answer 1

+5 A:

You'll need a regular expression. Something like the following should work

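while (<>) {
  # Only store a pair when the line actually matches; after a failed
  # match, $1 and $2 do not describe the current line.
  if (/(Grade[0-9]+)\s*([0-9]+\.[0-9]+)/) {
    $op{$1} = $2;
  }
}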

as a filter. The %op hash will store the grade names and scores. This is preferable to automatically instantiating variables.

Noufal Ibrahim 2010-10-18 16:05:20

I had a typo in my regexp. I've fixed it now.

Noufal Ibrahim 2010-10-18 16:27:03

Answer 2

A:

Creating dynamic variable names is probably not going to help you much in producing a graph; using an array is almost certainly a better idea.

However, if you really think you want to do this:

while (my $line = <$your_infile_handler>){
   if ($line =~ m/(.*) = ([0-9.]*)/){
      no strict 'refs';   # $$1 is a symbolic reference; it dies under "use strict"
      $$1 = $2;
   }
}

should accomplish this.

Wooble 2010-10-18 16:07:19

Hi Wooble, you are right. I've been looking at some scripts that produce graphs from data, like GD (http://www.ibm.com/developerworks/library/os-perlgdchart/), and they do mention creating an array such as Data [] []. But I'm not entirely sure how to populate that array by parsing the file. I'm going to give this a few tries and post back with any difficulties I'm having ...

c0d3rs 2010-10-18 16:35:26
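One way to populate such an array while parsing could look like the sketch below. The helper name build_chart_data and the sample lines are made up for illustration, and the input format (a label, whitespace or a dash, then a number with an optional %) is an assumption about the file. The result is a reference to two parallel arrays, labels first and values second, which is the general shape charting modules such as GD::Graph work with.

```perl
use strict;
use warnings;

# Hypothetical helper: turn "label value" lines into [ \@labels, \@values ].
sub build_chart_data {
    my @lines = @_;
    my ( @labels, @values );
    for my $line (@lines) {
        # label, then either a dash/equals separator or plain whitespace,
        # then a number with an optional trailing percent sign
        next unless $line =~ /^\s*(\w+)(?:\s*[-=]\s*|\s+)(\d+(?:\.\d+)?)\s*%?\s*$/;
        push @labels, $1;
        push @values, $2;
    }
    return [ \@labels, \@values ];
}

# Assumed sample input; lines that do not look like data are skipped.
my $data = build_chart_data( "Grade1 80%", "=====", "Grade2 75.5 %" );
print join( ",", @{ $data->[0] } ), "\n";    # labels
print join( ",", @{ $data->[1] } ), "\n";    # values
```

Keeping labels and values in separate arrays preserves the order in which they appear in the file, which a hash would not.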

Answer 3

+2 A:

You want to use a hash. Something like this should do the trick:

my %grades = (); # this is a hash
open(my $fh, '<', "grade_file.txt") or die $!;
while( my $line = <$fh> ) {
     if( my( $name, $grade ) = $line =~ /^(Grade\d+)\s(\d+\.\d+\%)/ ) {
         $grades{$name} = $grade;
     }
}
close($fh);

Your %grades hash would then contain the name and grade pairs. (Access a value with my $value = $grades{'Grade1'};)

Also just a note. The language is called "Perl", not "PERL". Many people in the Perl community get upset about it :-)

Cfreak 2010-10-18 16:07:23

Hi All - Thanks for the replies. However, I must admit a mistake I made: I mentioned that it says Grade1 - 80 %, Grade2 - 80 %, etc. The problem is that your solution uses 'Grade' as a selection criterion in the regex. However, that is only one file. Most of my other files have individual names in them, as in: Mike 80%, Shawn 60%, Jason 44%. So it makes i

c0d3rs 2010-10-18 16:26:16

Also, thanks for letting me know about Perl! I'll not make the mistake a second time ;).

c0d3rs 2010-10-18 16:30:07
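For files that use arbitrary names instead of a Grade prefix, the pattern can be loosened so that any word starting with a letter counts as a label. The following is only a sketch under assumptions: parse_scores is a made-up helper name, scores may be whole numbers or decimals, and an optional dash between name and score is tolerated.

```perl
use strict;
use warnings;

# Hypothetical helper: collect "name score%" pairs into a hash without
# assuming that the name starts with "Grade".
sub parse_scores {
    my ($text) = @_;
    my %scores;
    for my $line ( split /\n/, $text ) {
        # name, then a dash or whitespace, then a number followed by %
        if ( my ( $name, $score ) =
            $line =~ /^\s*([A-Za-z]\w*)(?:\s*-\s*|\s+)(\d+(?:\.\d+)?)\s*%/ )
        {
            $scores{$name} = $score;
        }
    }
    return \%scores;
}

# Assumed sample input mixing both styles mentioned in the thread.
my $scores = parse_scores("Mike 80%\nShawn 60%\nGrade1 - 44.5 %\n");
print "$_ => $scores->{$_}\n" for sort keys %$scores;
```

Because the label pattern is this permissive, it is worth restricting the match to the relevant part of the file so that unrelated lines ending in a percentage are not picked up.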

Answer 4

+3 A:

If you can guarantee that your points of interest are nested between two =s (and there isn't an odd number of these demarcations in a given file), the flip-flop operator is a handy thing here:

use strict;    # These two pragmas go a long, ...
use warnings;  # ... long way in helping you code better

my %scores;    # Create a hash of scores

while (<>) {   # The diamond operator processes all files ...
               # ... supplied at command-line, line-by-line

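    next unless /^=+$/ ... /^=+$/; # The flip-flop operator used to filter
                                   # out only 'grades'. Three dots, because
                                   # with '..' and identical patterns the
                                   # range would end on the line it starts.
    next if /^=+$/;                # Skip the delimiter lines themselves

    my ( $name, $grade ) = split;  # This usage of split will break ...
                                   # ... the current line into an array

    $scores{$name} = $grade;       # Associate grade with name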
}

Zaid 2010-10-18 18:42:32

+1 for mentioning the flip flop operator. Interesting.

Noufal Ibrahim 2010-10-19 06:03:31
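One detail of the flip-flop operator matters when the start and end patterns are identical: the two-dot form tests the end pattern on the same line that started the range, while the three-dot form waits until the next line. The sketch below compares the two; select_lines and the sample lines are invented for the demonstration.

```perl
use strict;
use warnings;

# Hypothetical helper: return the lines a flip-flop selects when the start
# and end patterns are the same. A true first argument picks '...'.
sub select_lines {
    my ( $three_dots, @lines ) = @_;
    my @kept;
    for (@lines) {
        my $in_range = $three_dots
            ? ( /^=+$/ ... /^=+$/ )    # end pattern tested from the next line on
            : ( /^=+$/ ..  /^=+$/ );   # end pattern tested on the starting line too
        push @kept, $_ if $in_range;
    }
    return @kept;
}

my @sample = ( "header", "=====", "Mike 80%", "=====", "footer" );
print join( "|", select_lines( 0, @sample ) ), "\n";   # prints =====|=====
print join( "|", select_lines( 1, @sample ) ), "\n";   # prints =====|Mike 80%|=====
```

With three dots the delimiter lines are still part of the selected range, so they need to be skipped separately before the data lines are split.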

Answer 5

A:

See Zaid's answer for an example of using the flip-flop operator (which is what I would recommend). However, if you run into difficulties with that (sometimes the DWIMmery might get in the way), you can also explicitly maintain state while reading the file line-by-line:

#!/usr/bin/perl

use strict; use warnings;

my %grades;
my $interesting;

while ( my $line = <DATA> ) {
    if ( not $interesting and $line =~ /^=+\s*\z/ ) {
        $interesting = 1;
        next;
    }
    if ( $interesting ) {
        if ( $line =~ /^=+\s*$/ ) {
            $interesting = 0;
            next;
        }
        elsif ( my ($name, $grade) = $line =~ /^(\w+)\s+(\d+\.\d+%)/ ) {
            # Keep an array in case the same name occurs
            # multiple times
            push @{ $grades{$name} }, $grade;
        }
    }
}

use YAML;
print Dump \%grades;

Sinan Ünür 2010-10-18 20:25:47
