ansaurus

Question

Answer 1

+5 A:

Try [^/]+ instead of .*?:

sed 's|\(http://[^/]*/\).*|\1|g'

Gumbo 2009-07-09 10:51:34

sed 's|\(http:\/\/[^\/]+\)|\1|' still spews out the whole thing.

Joel 2009-07-09 10:55:20

@Joel: edited version should work.

chaos 2009-07-09 16:55:21

Answer 2

+17 A:

Neither basic nor extended Posix/GNU regex recognizes the non-greedy quantifier; you need PCRE. Fortunately that's pretty easy to get:

perl -pe 's|(http://.*?/).*|\1|'

chaos 2009-07-09 10:58:23

Works perfectly.

Joel 2009-07-09 11:02:05

Answer 3

+1 A:

sed 's|(http:\/\/[^\/]+\/).*|\1|'

Lucero 2009-07-09 10:58:59

Answer 4

A:

another way, not using regex, is to use fields/delimiter method eg

string="http://www.suepearson.co.uk/product/174/71/3816/"
echo $string | awk -F"/" '{print $1,$2,$3}' OFS="/"

ghostdog74 2009-07-09 10:59:12

Answer 5

A:

sed does not support "non greedy" operator.

You have to use "[]" operator to exclude "/" from match.

sed 's,\(http://[^/]*\)/.*,\1,'

P.S. there is no need to backslash "/".

andcoz 2009-07-09 11:08:44

Answer 6

A:

sed -E interprets regular expressions as extended (modern) regular expressions

stepancheg 2009-07-09 11:25:07

Non greedy regex matching in sed?