ansaurus

Question

Perl Oneliner to parse multiple conditions in regex

Answer 1

+1 A:

while(<>){
 chomp;
 s/\[|\]//g;
 if ($_ =~ /^>/){
    @s = split /\s+/;
    print "$s[0] $s[1] $s[3]\n";
 }    
}

$ perl -F"\s+" -lane '$F[3]=~s/\]//;$F[1]=~s/\[//;print "$F[0] $F[1] $F[3]";' file
>AF001546_1 88 462
>AF001543_1 88 261

ghostdog74 2010-03-02 06:09:28

Answer 2

+1 A:

try this perl -lne 'print "$1 $2 $3 $4" if /(\w+)_\d+\D+(\d+)\D+(\d+)](\D+)/m'

you need to use the modifier /m

coder 2010-03-02 06:10:23

No. the /m modifier only changes ^ and $ which aren't even in your regex. Futhermore, the -n switch means it's processing a line at a time anyway.

p00ya 2010-03-02 06:14:55

yep i agree. i want to insist on /m

coder 2010-03-02 06:24:33

Answer 3

+1 A:

Depending on how flexible the whitespace is, this is fairly readable:

print "$1 $2 $3 $4" if /([^_]+)_\d+ \[(\d+) - (\d+)\] (?:\d+ )?(.*)/

p00ya 2010-03-02 06:12:08

Answer 4

+1 A:

perl -lne 'print "$1 $2 $3 $4" if /(>\w+)\D+(\d+)\D+(\d+)\D+\d*\s+(\w+)/'

Amarghosh 2010-03-02 06:12:36

Answer 5

+2 A:

You use the following code also

use strict;
use warnings;

my $str=">AF001546_1 [88 - 462] 1 MGQQ";

if($str=~/(\w+)\s\D([0-9]{2}) - ([0-9]{3})\D\s\d\s(.*)/)
{
     print "$1 $2 $3 $4\n";
}

muruga 2010-03-03 09:57:39

ansaurus

tags:

views:

answers:

Perl Oneliner to parse multiple conditions in regex

related questions