ansaurus

Question

ubuntu/linux bash: traverse directory and subdirectories to work with files

Answer 1

+6 A:

Any reason you can't use find for this? Stick the per-file operation you want in its own script (I've called it dostufftomyfile.sh below), then do:

find $dir -type f -print0 | xargs -0 dostufftomyfile.sh

Replacing $dir with the top level directory you'll be searching from...

Edited to add... When you write the shell script, make sure you put $@ in double-quotes... e.g., you'll want your dostufftomyfile.sh script to have this structure:

#!/bin/sh
for f in "$@"
do
    echo "Processing file: $f"
    # Do something to file $f
done

If you don't quote $@, any filename containing spaces will be split into several separate arguments (which I suspect you won't want) :-)
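To make the quoting point concrete, here is a minimal sketch that counts how many arguments a loop sees with and without the double quotes. The function names and the sample filenames are made up for the demonstration.

```shell
#!/bin/sh
# Count the arguments seen by a loop over "$@" (quoted).
count_quoted() {
    n=0
    for f in "$@"; do n=$((n + 1)); done
    echo "$n"
}

# Same loop, deliberately unquoted: each argument is re-split on whitespace.
count_unquoted() {
    n=0
    for f in $@; do n=$((n + 1)); done
    echo "$n"
}

quoted=$(count_quoted "my file.txt" "notes.txt")
unquoted=$(count_unquoted "my file.txt" "notes.txt")
echo "quoted: $quoted, unquoted: $unquoted"   # prints: quoted: 2, unquoted: 3
```

With the quotes, "my file.txt" stays one argument; without them it turns into "my" and "file.txt".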

Chris J 2010-10-17 17:51:10

Why not `find "$dir" -type f -exec dostufftomyfile.sh {} +`?

enzotib 2010-10-17 18:43:45

It depends on how efficiently you want it to run and how many files you're dealing with. -exec will execute the script once for each file. Piping through xargs results in fewer executions of the script, as xargs batches files up and passes multiple files through on the command line. There's nothing fundamentally wrong with using -exec, it's just that it can be slower :-)

Chris J 2010-10-17 19:23:00

The `-exec ... +` construct (as opposed to `-exec ... ;`) passes multiple files in much the same way that `xargs` does. Unfortunately, not all implementations of `find` support this. Note: you should quote `"$dir"` in the `find` command.
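A rough way to see the difference between the two forms is to count how many times the command gets started. This is only a sketch, assuming a find that accepts the `+` terminator; the temporary directory and file names are invented for the demonstration.

```shell
#!/bin/sh
# Build a scratch directory with three files (one name contains a space).
tmp=$(mktemp -d)
touch "$tmp/a.txt" "$tmp/b.txt" "$tmp/my file.txt"

# Each started sh prints its argument count on one line,
# so the number of output lines equals the number of invocations.
per_file=$(find "$tmp" -type f -exec sh -c 'echo "$#"' sh {} \; | wc -l | tr -d ' ')
batched=$(find "$tmp" -type f -exec sh -c 'echo "$#"' sh {} + | wc -l | tr -d ' ')

echo "invocations with ';': $per_file"   # 3, one per file
echo "invocations with '+': $batched"    # 1, all three files in one call

rm -rf "$tmp"
```

For very long file lists the `+` form may still start the command more than once, since each batch has to fit within the system's argument-length limit.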

Gordon Davisson 2010-10-17 20:06:45

I've tried the find command multiple times and it kept listing directories as well as files. This does what I needed nicely, but I'd still like to be able to control the files and directories the way my own script does.

vzybilly 2010-10-18 05:26:53

@Gordon - Ahh -- I've not come across the '+' construct before; guess that's what comes of learning trad UNIX and then learning the GNU extensions as and when I need them :-)

Chris J 2010-10-18 21:59:53

@vzybilly ... you have to specify "-type f". If you don't specify this, then you'll get every kind of entry back (files, directories, and so on). -type specifies the type of file you want 'find' to return, so it's 'f' for file, 'd' for directory, 'l' for sym-link, etc. If you're specifying "-type f" and it's still returning directories, then you'll need to provide more information (how you're calling find, etc). If you mean that the file is given as the *full path name*, then in the 'dostufftomyfile.sh' script, you can get the base file name by calling basename, e.g.: filenameonly=`basename "$f"` (note the use of backticks, and that there must be no spaces around the '=').
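As a small illustration of the -type filter and of basename, the following sketch builds a throwaway directory tree and inspects it. All paths and variable names here are hypothetical.

```shell
#!/bin/sh
# Scratch tree: one subdirectory containing one file.
tmp=$(mktemp -d)
mkdir "$tmp/sub dir"
touch "$tmp/sub dir/my file.txt"

# -type f matches regular files only; -type d matches directories only.
files=$(find "$tmp" -type f | wc -l | tr -d ' ')
dirs=$(find "$tmp" -type d | wc -l | tr -d ' ')   # counts $tmp itself plus "sub dir"

# Strip the leading directories from the full path that find reports.
f=$(find "$tmp" -type f)
name=$(basename "$f")
echo "files=$files dirs=$dirs name=$name"   # prints: files=1 dirs=2 name=my file.txt

rm -rf "$tmp"
```

The `$(...)` form does the same job as backticks and is easier to nest and quote.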

Chris J 2010-10-18 22:02:36

Answer 2

A:

Chris J's answer is the preferred way to do things if you can put the per-file stuff in a separate command(/script). If you want everything in a single script, my favorite incantation is something like this:

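# GNU find (Ubuntu) has no -x flag; -xdev is its equivalent (do not cross filesystem boundaries)
while IFS="" read -r -d $'\000' file <&3; do
    dostuffwith "$file"
done 3< <(find "$dir" -xdev -mindepth 1 -type f -print0)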

See BashFAQ #20 and #89 for explanations and some other options. Note that this only works in bash (i.e. the script must start with #!/bin/bash). Also, it processes entries in whatever order find returns them (which follows the directory listing and is not guaranteed to be sorted), rather than files-before-subdirectories.
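If the processing order matters, one option is to make it explicit by sorting the NUL-delimited list before the loop reads it. This is a sketch only: it assumes bash and a sort that supports -z (GNU coreutils does), and the scratch directory, file names, and the names array are made up for the demonstration.

```shell
#!/bin/bash
# Scratch directory with files created in non-alphabetical order.
tmp=$(mktemp -d)
touch "$tmp/b.txt" "$tmp/a.txt" "$tmp/c d.txt"

names=()
# sort -z keeps the NUL delimiters, so names with spaces stay intact;
# LC_ALL=C gives a plain byte-wise ordering.
while IFS= read -r -d '' file <&3; do
    names+=("${file##*/}")   # keep only the part after the last slash
done 3< <(find "$tmp" -type f -print0 | LC_ALL=C sort -z)

printf '%s\n' "${names[@]}"   # a.txt, b.txt, "c d.txt", one per line

rm -rf "$tmp"
```

Because the loop reads from a process substitution rather than a pipe, the names array is still populated after the loop ends.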

If you really want to step through the files "by hand" (i.e. to get more control over the traversal order), here's how I'd do it:

#!/bin/bash

process_dir() {
    local -a subdirs=()
    echo "Scanning directory: $1"

    # Scan the directory, processing files and collecting subdirs
    for file in "$1"/*; do
        if [[ -f "$file" ]]; then
            echo "Processing file: $file"
            # actually deal with the file here...
        elif [[ -d "$file" ]]; then
            subdirs+=("$file")
            # If you don't care about processing all files before subfolders, just do:
            # process_dir "$file"
        fi
    done

    # Now go through the subdirs
    for d in "${subdirs[@]}"; do
        process_dir "$d"
    done
}

clear
if [[ -z "$1" ]]; then
    read -r -p "Please enter a directory for me to scan: " dir
else
    dir="$1"
fi
process_dir "$dir"

Gordon Davisson 2010-10-17 19:31:58

this works lovely for what I need to do, Thanks ^^

vzybilly 2010-10-18 05:28:12

Answer 3

A:

You get the error "No such file ...." because of this line:

ARRAY=( $(ls -d */) )

When it's expanded, directory names containing whitespace get split and stored in the array as separate elements, e.g. Desktop/test_files/folder 1/folder 2/"folder 3"/.

In the array, element 0 will be Desktop/test_files/folder, element 1 will be 1/folder and so on. That's why your script can't find the directory.

You can set IFS to $'\n' before assigning to the array, so that the output is split on newlines only:

OLDIFS=$IFS
IFS=$'\n'
ARRAY=($(ls -d */))
IFS="$OLDIFS"
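An alternative that avoids parsing the output of ls altogether is to let the shell glob do the work, since glob results are never re-split on whitespace. The sketch below compares the two approaches by counting the resulting words; the directory names are invented for the demonstration.

```shell
#!/bin/sh
# Scratch directory with two names containing spaces and one without.
tmp=$(mktemp -d)
mkdir "$tmp/folder 1" "$tmp/folder 2" "$tmp/plain"

# Unquoted command substitution: the ls output is split on every space.
split_count=$(cd "$tmp" && set -- $(ls -d */) && echo "$#")

# Plain glob: one word per directory, whatever the name contains.
glob_count=$(cd "$tmp" && set -- */ && echo "$#")

echo "ls words: $split_count, glob words: $glob_count"   # prints: ls words: 5, glob words: 3

# In bash the same glob can fill an array directly: ARRAY=( */ )
rm -rf "$tmp"
```

Unlike the IFS approach, the glob also copes with names that contain newlines.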

ghostdog74 2010-10-17 23:39:18
