ansaurus

Question

Answer 1

+2 A:

First of all, the keyword is elsif, not elseif. Second, you don't need it. You can compress the code in the get_file_data loop to:
```
next if $line =~ /^\s*$|^>/; 
$sequence .= $line;
```
As long as you're going to use regular expressions, you might as well match all the cases you want to ignore in one pattern, unless it gets too unwieldy. If you later find another case to skip, you can add it as another alternation. Say you wanted to exclude lines that begin with #-. Then you would just add it like so: /^\s*$|^>|^#-/
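To illustrate the extended alternation, here is a minimal sketch. The helper name is_ignorable and the sample lines are made up for the example, and the chomp is my addition (it keeps newlines out of the sequence); none of it comes from the original script.

```perl
use strict;
use warnings;

# Hypothetical helper (not in the original script): true for lines to skip.
# Skips blank lines, FASTA header lines (">"), and lines starting with "#-".
sub is_ignorable {
    my ($line) = @_;
    return $line =~ /^\s*$|^>|^#-/ ? 1 : 0;
}

# Made-up sample input, not data from the question.
my @lines = ( ">seq1 example header\n", "MKVLAG\n", "\n", "#- a note\n", "GIMSTA\n" );

my $sequence = '';
for my $line (@lines) {
    next if is_ignorable($line);
    chomp $line;          # assumption: drop the newline so the sequence is contiguous
    $sequence .= $line;
}

print "$sequence\n";
```

Only the two residue lines survive the filter, so adding a new case to ignore is a one-token change to the pattern.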
Another thing: my position=(); needs the @ sigil before position, as in my @position=();. Otherwise, perl thinks you're trying to do something tricky with a call to position().
You need the following changes:
```
 my $h= '[VLIM]';   
 my $s= '[AG]';
 my $x= '[ARNDCEQGHILKMFPSTWYV]';
```
The character classes have to be quoted strings. Otherwise, you're just assigning to $h an array reference with a single slot, populated by whatever would be returned from the sub VLIM.
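Once the classes are plain strings, they interpolate into a pattern like any other variable. A small sketch, assuming a made-up two-residue motif (one $h residue followed by one $s residue); the real motif in the question may differ, and has_motif is a hypothetical name:

```perl
use strict;
use warnings;

my $h = '[VLIM]';   # character class kept as a plain string
my $s = '[AG]';

# Hypothetical motif for illustration only: $h followed by $s.
my $motif = qr/$h$s/;

# Hypothetical helper: true if the sequence contains the motif.
sub has_motif {
    my ($sequence) = @_;
    return $sequence =~ $motif ? 1 : 0;
}

print has_motif('KKVAKK') ? "match\n" : "no match\n";
print has_motif('KKKKKK') ? "match\n" : "no match\n";
```

Compiling the pattern once with qr// keeps the building blocks readable and lets you reuse the motif in several matches.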

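Third, don't use $&. Replace pos($sequence)-length($&)+1 with:
```
push @positions, $-[0];
```
or better yet, use English:
```
use English qw<-no_match_vars>;
...
push @positions, $LAST_MATCH_START[0];
```
Note that $-[0] is the zero-based offset of the match start, so add 1 if you want the same 1-based position that the original expression gave.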
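Putting that together, a sketch of what match_positions could look like. The argument order and the test sequence are assumptions for the example, not taken from the question:

```perl
use strict;
use warnings;

# Returns the 1-based start position of every match of $regexp in $sequence.
# The signature is an assumption; adapt it to the real script.
sub match_positions {
    my ( $regexp, $sequence ) = @_;
    my @positions;
    while ( $sequence =~ /$regexp/ig ) {
        push @positions, $-[0] + 1;   # $-[0] is the 0-based start offset
    }
    return @positions;
}

# Made-up sequence: "VA" starts at position 3 and "LG" at position 7.
my @found = match_positions( '[VLIM][AG]', 'KKVAKKLGK' );
print "@found\n";
```

This avoids $& entirely, so the rest of the program doesn't pay the performance penalty that the match variables impose on older perls.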

I would suggest the following for the file reading:
```
use IO::File;
...
# Use real file handles
my $fh = IO::File->new( "<seq.txt" )
    or die "Cannot open seq.txt: $!";
get_file_data( $fh ); # They can be passed
...
sub get_file_data {
    my $file_handle = shift;
    ...
    # Reading line by line avoids loading the whole file into memory
    while ( my $line = <$file_handle> ) {
        next if $line =~ /^\s*$|^>/;
        $sequence .= $line;
    }
    ...
}
```
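If you want to try the handle-passing version without touching the disk, here is a self-contained sketch. It uses an in-memory file handle as a stand-in for seq.txt; the FASTA contents are made up, and the chomp and the return value are my assumptions about what the elided parts do.

```perl
use strict;
use warnings;

# Sketch of get_file_data taking an already-open handle.
sub get_file_data {
    my ($file_handle) = @_;
    my $sequence = '';
    while ( my $line = <$file_handle> ) {
        next if $line =~ /^\s*$|^>/;   # skip blank lines and FASTA headers
        chomp $line;                   # assumption: keep newlines out of the sequence
        $sequence .= $line;
    }
    return $sequence;
}

# In-memory handle standing in for seq.txt; the contents are made up.
my $fasta = ">example header\nMKVLAG\n\nGIMSTA\n";
open my $fh, '<', \$fasta or die "Cannot open in-memory handle: $!";
my $sequence = get_file_data($fh);
close $fh;

print "$sequence\n";
```

Because the sub only sees a handle, the same code works for a real file, STDIN, or a string, which also makes it easy to test.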

A suggestion for going forward -- it helps me immensely:

A. Install Smart::Comments

B. Put this at the top of your script:
```
 use Smart::Comments;
```
C. Every time you're not sure what you've got so far, like if you wanted to see the current contents of $sequence, place the following in the code:
```
### $sequence
exit 0;
```
This just shows the value and exits. When you get too many printouts, delete them.

Axeman 2009-05-06 20:47:29

@axeman: changes made.

shubster 2009-05-06 20:50:56

Answer 2

A:

Use "elsif" instead of "elseif".
Are @file_data and @fasta_file_data supposed to be the same thing?

In match_positions:

Remove the parentheses around the sub name.
Change "my position" to "my @position".
Change the pattern from /regexp/ig to /$regexp/ig.

Nathan Kitchen 2009-05-06 21:02:08

@Nathan Kitchen: changes made

shubster 2009-05-06 21:05:20

yes they are the same and I changed it.

shubster 2009-05-06 21:10:23

I don't think they should be. @fasta... holds the lines from the input, but the other holds the other lines. You seem to be passing get_file_data the name of the file, but then just jumping to the point where you've read it into an array. Please see my #5 for how to make that a reality.

Axeman 2009-05-06 21:46:10

Search for motif in protein sequence?
