ansaurus

Question

Regular Expression - Capture and Replace Select Sequences

Answer 1

A:

I don't have sed available to test with at the moment, but wouldn't

sed -r 's/(....),(....),(.*\.ext)(http.*\.ext)/\1,\2,\3\n\1,\2,\4/g'

do the trick?

Edit: removed the lazy quantifier
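
A quick way to check the command above is to run it against a made-up line. The sample input here (two four-character fields followed by two example.com URLs run together) is an assumption, not data from the question, and the sketch relies on GNU sed for both -r and \n in the replacement. Although .* is greedy, the regex engine backtracks until the rest of the pattern (http.*\.ext) can still match, so the third group should stop at the end of the first URL.

```shell
# Hypothetical sample line: two 4-character fields followed by two URLs run together.
line='AB12,CD34,http://example.com/a.exthttp://example.com/b.ext'

# GNU sed: -r enables extended regexes; \n in the replacement inserts a newline.
out=$(printf '%s\n' "$line" |
  sed -r 's/(....),(....),(.*\.ext)(http.*\.ext)/\1,\2,\3\n\1,\2,\4/g')

printf '%s\n' "$out"
```

With GNU sed this should print two lines, each carrying both leading fields followed by one of the URLs.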

Jens 2010-05-28 07:07:27

Very good idea (I hope the part before the URLs is that constant). But I thought that sed doesn't support lazy quantifiers.

Tim Pietzcker 2010-05-28 07:11:10

It does not? *sigh* Let me think...

Jens 2010-05-28 07:16:59

Well, I think it should work without the laziness, too.

Jens 2010-05-28 07:22:29

Answer 2

+1 A:

If the number of URLs in each line is guaranteed to be two, you can use:

sed -r "s/([A-Z0-9,]{10})(.+\.ext)(.+\.ext)/\1\2\n\1\3/" < input
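
As a rough illustration, here is the same command fed from a variable instead of a file. The two input lines are invented for the example and assume a 10-character prefix (two four-character uppercase/digit fields plus their two commas, which is what {10} counts). GNU sed is assumed for -r and for \n in the replacement.

```shell
# Hypothetical input: each line has a 10-character prefix (two 4-character
# uppercase/digit fields plus their commas) followed by exactly two URLs.
input='AB12,CD34,http://example.com/a.exthttp://example.com/b.ext
XY99,ZZ00,http://example.com/c.exthttp://example.com/d.ext'

# sed applies the substitution to each input line in turn.
out=$(printf '%s\n' "$input" |
  sed -r "s/([A-Z0-9,]{10})(.+\.ext)(.+\.ext)/\1\2\n\1\3/")

printf '%s\n' "$out"
```

Because the pattern does not mention "http", it should also cope with other URL schemes, but a ".ext" appearing in the middle of a URL (for instance in a host name) would likely throw off the split.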

Amarghosh 2010-05-28 07:18:56

Answer 3

+1 A:

This does not require the first two fields to be a particular width or limit the set of (non-comma) characters between the commas. Instead, it keys on the commas themselves.

sed 's/\(\([^,]*,\)\{2\}\)\(.*\.ext\)\(http:.*\)/\1\3\n\1\4/' inputfile.txt

You could change the "2" to however many comma-delimited fields precede the URLs.
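
One way to make the field count adjustable is to wrap the command in a small shell function. This is only a sketch: split_urls and its argument are hypothetical names, the sample lines are made up, and GNU sed is assumed for \n in the replacement.

```shell
# split_urls N: hypothetical helper; N is the number of leading comma-delimited fields.
split_urls() {
  # Double quotes let the shell substitute $1; the backslashes reach sed unchanged.
  sed "s/\(\([^,]*,\)\{$1\}\)\(.*\.ext\)\(http:.*\)/\1\3\n\1\4/"
}

# Made-up lines with two and three leading fields.
two=$(printf '%s\n' 'AB12,CD34,http://example.com/a.exthttp://example.com/b.ext' | split_urls 2)
three=$(printf '%s\n' 'a,b,c,http://example.com/x.exthttp://example.com/y.ext' | split_urls 3)

printf '%s\n' "$two" "$three"
```

Each call should yield two lines, with the full prefix of two or three fields repeated in front of each URL.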

Dennis Williamson 2010-05-28 10:58:43
