ansaurus

Question

Can grep show only words that match search pattern?

Answer 1

+6 A:

Just awk...no need combination of tools

# awk '{for(i=1;i<=NF;i++){if($i~/^th/){print $i}}}' file
the
the
the
this
thoroughly

ghostdog74 2009-10-10 00:54:12

Yuo: grep is just the wrong tool for this job.

dmckee 2009-10-10 01:31:49

Answer 2

+7 A:

Try grep -o

grep -oh "\w*th\w*" *

Edit: matching from Phil's comment

Dan Midwood 2009-10-10 01:01:36

That doesn't work.

Kinopiko 2009-10-10 01:07:36

I've been upvoted but I just realised it doesn't work. Maybe some regex will do it. This only outputs a "th" for each match.

Dan Midwood 2009-10-10 01:08:45

Oh, right. You just need to match all the word-constituent characters on either side: grep -o "\w*th\w*"

Phil 2009-10-10 01:20:37

the words 'another' and 'other' appears in your output due to \w* in front of th.

ghostdog74 2009-10-10 01:29:14

@ghostdog74 I assumed the another and other were the filenames, not the content of a file.

Dan Midwood 2009-10-10 01:35:01

That works now.

Kinopiko 2009-10-10 01:37:12

I think that other and another should appear if they are in the input text (which, if I understand correctly, they are not).

dmckee 2009-10-10 01:46:09

@Dan, as you can see from OP's sample output, "another" and "other" doesn't appear. Because you have grepped \w*th, the \w* in front of "th" would grab these 2 words as well...

ghostdog74 2009-10-10 01:58:31

@ghostdog74 If the input contains another and other then they will be included in the output. It looks from the Q that "some-other-text-file" and "yet-another-text-file" are file names and the question is about matching in multiple files.

Dan Midwood 2009-10-10 02:06:40

@Dan , ah i see... my bad for misinterpreting the output.

ghostdog74 2009-10-10 03:15:35

This worked for me - thank you!

Neil Baldwin 2009-10-10 08:57:42

Answer 3

A:

You could pipe your grep output into Perl like this:

grep "th" * | perl -n -e'while(/(\w*th\w*)/g) {print "$1\n"}'

Kinopiko 2009-10-10 01:06:09

that won't give the correct result. also, if using Perl, no need to use grep. do everything in Perl.

ghostdog74 2009-10-10 01:15:16

Thanks for pointing out the error, ghostdog74. I have changed it to print all the words on the line, not just the first.

Kinopiko 2009-10-10 01:26:45

like i said, grep is not necessary. perl -n -e'while(/(\s+th\w*)/g) {print "$1\n"}' file

ghostdog74 2009-10-10 01:30:53

I don't think it's important here to avoid using grep.

Kinopiko 2009-10-10 01:33:23

up to you. i am just illustrating a point. If its not necessary, don't do it. that extra "|" will cost you one process more.

ghostdog74 2009-10-10 02:03:23

OK, thanks for your comments.

Kinopiko 2009-10-10 02:11:40

Answer 4

+3 A:

You could translate spaces to newlines and then grep, e.g.:

cat * | tr ' ' '\n' | grep th

Adam Rosenfield 2009-10-10 01:43:06

Nice. I should have thought of that.

dmckee 2009-10-10 01:51:24

no need cat. tr ' ' '\n' < file | grep th. Slow for big files.

ghostdog74 2009-10-10 02:00:15

This didn't work. The output still contained the filename and the entire line from the file that contained the match.Anyway, one of the other solutions offered worked.Thanks for the input though.

Neil Baldwin 2009-10-10 08:59:38

@ghostdog74: good point, although if you have more than file, you'll need to use cat. @Neil Baldwin: are you sure you typed it in right? When there's only one input file (stdin in this case), grep doesn't print the filename.

Adam Rosenfield 2009-10-10 14:58:15

@Adam - yes, sorry Adam, it does work with one file but not multiple.

Neil Baldwin 2009-10-10 15:52:19

@Neil Baldwin: just list all of your files as parameters to cat, it works fine with multiple files

Adam Rosenfield 2009-10-10 18:41:28

@Adam - so where you've got 'file' in the example, I would just put 'file1 file2 file3' etc. ?

Neil Baldwin 2009-10-10 20:27:14

Answer 5

A:

You can olso try pcregrep. There is also -w option in grep but in some cases it doesn't work as expected: (from wikipedia)

cat fruitlist.txt
apple
apples
pineapple
apple-
apple-fruit
fruit-apple

grep -w apple fruitlist.txt
apple
apple-
apple-fruit
fruit-apple

Maciek Sawicki 2009-11-14 12:15:02

Answer 6

A:

cat *-text-file | grep -Eio "th[a-z]+"

Mumbling Mac 2010-09-14 15:30:51

ansaurus

tags:

views:

answers:

Can grep show only words that match search pattern?

related questions