ansaurus

Question

How to capture results of regex and replace patterns in bash

Answer 1

+2 A:

PHP IS an option. If you master PHP, you can run it from shell. Run

php myfile.php

and get the output right into console. If the PHP file is executable and begins with

#!/path/to/php/executable

then you can run

./myfile.php

I'm no big expert in bash programming, but in order to obtain the list of files that match a certain patter you can use the command

ls -l | grep "pattern_unquoted"

I suggest you to go for the PHP ;-)

djechelon 2010-10-10 00:35:32

Thanks for the suggestion. However, I won't be running the script from command line. It will live in a cronned shell script.

Jordan 2010-10-10 00:42:49

+1: If Jordan can program PHP and not Bash, then PHP is probably the way forward... particularly if Jordan is the one who is going to have to maintain the script.

Johnsyweb 2010-10-10 00:46:02

@Jordan: If you can run from the command-line, you can run from Cron.

Johnsyweb 2010-10-10 00:46:55

Answer 2

+3 A:

A different take on the problem:

#!/bin/sh

YOUR_MAX_SEQ=3

find /path/to/files -maxdepth 1 -name 'LOG_*.csv' -print \
  | sed -e 's/\.csv$//' \
  | awk -F_ '$3 > SEQ { print }' SEQ=$YOUR_MAX_SEQ

Brief explanation:

Find all files in /path/to/files matching LOG_*.csv
Chop the .csv off the end of each line
Using _ as a separator, print lines where the third field is greater than $YOUR_MAX_SEQ

This will leave you with a list of the files that met your criteria. Optionally, you could pipe the output through sed to stick the .csv back on.

If you're comfortable with PHP, you'd probably be comfortable with Perl, too.

Blrfl 2010-10-10 01:21:11

Hi. Thanks for this. However it does not seem to handle the part about incrementing the sequence on files where current sequence is less than 3 and then saving file name with new incremented file name

Jordan 2010-10-10 01:46:46

Replace the `print` in the `awk` with `printf "%s_%s_%03d.csv\n", $1, $2, $3+1 }' SEQ=$YOUR_MAX_SEQ`.

Blrfl 2010-10-10 09:53:31

Oops... The `SEQ=$YOUR_MAX_SEQ` doesn't belong in my last comment.

Blrfl 2010-10-11 03:19:44

Answer 3

A:

First of all, your question is tagged [bash], but your shebang is #!/bin/sh. I'm going to assume Bash.

#!/bin/bash
function check_file()
{
    # example file name "LOG_20101031144515_001.csv"
    filename=$1

    # attempt to get the sequence (ex. 001) part of file

    seq=${filename%.csv}
    seq=${seq##*_}

    # if sequence is greater than 003, then raise alert

    if (( 10#$seq > 3 ))
    then
        echo "Alert!"
    else
        # else change file name to next sequence (ex. LOG_20101031144515_002.csv)
        printf -v newseq "%03d" $((seq + 1))
        echo "${filename/$seq/$newseq}" # or you could set a variable or do a mv
    fi
}

Dennis Williamson 2010-10-11 04:22:13

@Dennis. Thank you for the feedback; it looks very close to what I am trying to do. However, I copied the code verbatim into a test file and it's throwing the following error about the function's final closing curly bracket: syntax error near unexpected token `}'

Jordan 2010-10-11 05:06:53

@Dennis. Again, thank you. I got beyond curly bracket syntax issue; the script was missing "fi" to close the if block. Also, the script was missing the bit to increment the seq; I added that as "seq=$(( seq + 1 ))". Now I am stuck on the substitution in the last echo. In the line, echo "${filename%%$seq.csv}$newseq.csv", $seq is not being replaced by $newseq. Rather the "$newseq.csv" is being appended and the resulting output is LOG_20101031144515_001.csv002.csv rather than LOG_20101031144515_002.csv

Jordan 2010-10-11 05:25:40

Success! After some hunting, I found the bash substitution pattern and replaced the echo line in Dennis' code, with: echo "${filename/$seq/$newseq}

Jordan 2010-10-11 05:41:07

@Jordan: Sorry about the omissions and errors. I've edited my answer.

Dennis Williamson 2010-10-11 15:19:41

ansaurus

tags:

views:

answers:

How to capture results of regex and replace patterns in bash

related questions