ansaurus

Question

Answer 1

A:

Try this regular expression:

(^|\n).+(\n[ \t]+.+)*

This assumes that ^ marks the start of the string, \n is the line-break character, and . does not match line breaks.
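A minimal sketch of how the expression above might be applied, written in Python as an assumption (the answer does not name a language). Python's default `re` behaviour satisfies the stated conditions: without `re.MULTILINE`, `^` matches only at the start of the string, and without `re.DOTALL`, `.` does not match line breaks. The name `split_terms` and the sample text are hypothetical.

```python
import re

# The expression from the answer, unchanged.
TERM_RE = re.compile(r"(^|\n).+(\n[ \t]+.+)*")

# Hypothetical sample input: each term line is followed by indented definitions.
SAMPLE = (
    "term 1\n"
    "\tdefinition 1\n"
    "\tdefinition 2\n"
    "\n"
    "term 2\n"
    "\tdefinition 1\n"
    "\tdefinition 2\n"
    "\tdefinition 3\n"
)

def split_terms(text):
    """Return a list of (term, [definitions]) tuples found in text."""
    entries = []
    for match in TERM_RE.finditer(text):
        # Every match except the first begins with the "\n" consumed by (^|\n).
        lines = match.group(0).lstrip("\n").split("\n")
        term = lines[0]
        definitions = [line.strip() for line in lines[1:]]
        entries.append((term, definitions))
    return entries

for term, definitions in split_terms(SAMPLE):
    print(term, definitions)
```

Because the leading line break is part of each match after the first, the sketch strips it before splitting the block into lines.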

Gumbo 2010-02-07 21:33:23

Answer 2

A:

Assuming an implementation that

Matches across multiple lines (/.../m)
Uses ^ to indicate the start of a line

this should match one "term":

^[^\t][^\n]+\n(\t[^\n]+\n)+
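As a rough illustration, the same idea can be tried in Python. This is a sketch under assumptions: it uses `^` with `re.MULTILINE` as the line-start anchor (Python's `\A` matches only at the very start of the input), and the name `parse_dict` and the sample text are hypothetical.

```python
import re

# A line that does not start with a TAB, followed by one or more TAB-indented lines.
# re.MULTILINE makes ^ match at the start of every line.
TERM_RE = re.compile(r"^[^\t][^\n]+\n(\t[^\n]+\n)+", re.MULTILINE)

# Hypothetical sample input; every line, including the last, ends with "\n".
SAMPLE = (
    "term 1\n"
    "\tdefinition 1\n"
    "\tdefinition 2\n"
    "\n"
    "term 2\n"
    "\tdefinition 1\n"
)

def parse_dict(text):
    """Map each term to the list of its TAB-indented definitions."""
    result = {}
    for match in TERM_RE.finditer(text):
        # [^\t] also accepts a line break, so a match may begin with the blank
        # separator line; strip surrounding newlines before splitting.
        head, *definitions = match.group(0).strip("\n").split("\n")
        result[head] = [d.lstrip("\t") for d in definitions]
    return result

print(parse_dict(SAMPLE))
```

Note that the pattern requires every definition line to end with a line break, so a final line without a trailing newline would not be picked up.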

calmh 2010-02-07 21:37:50

Answer 3

A:

Match a line with a leading non-whitespace character followed by one or more lines with leading TABs:

$ perl -0777 -pe 's/^(\S.+\n(^\t.+\n)+)/<term>\n$1<\/term>\n/mg' dict
<term>
term 1
        definition 1
        definition 2
</term>

<term>
term 2
        definition 1
        definition 2
        definition 3
</term>
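For comparison, a possible Python counterpart of the substitution above, offered as a sketch rather than a drop-in replacement. It reads the whole input as one string (as Perl's slurp mode does) and wraps each term block in `<term>` tags. The name `wrap_terms` and the sample text are hypothetical, and the definitions are assumed to be indented with real TAB characters.

```python
import re

# A line starting with a non-whitespace character, followed by one or more
# TAB-indented lines. re.MULTILINE makes ^ match at the start of each line.
BLOCK_RE = re.compile(r"^(\S.+\n(?:^\t.+\n)+)", re.MULTILINE)

# Hypothetical sample input, standing in for the contents of the "dict" file.
SAMPLE = (
    "term 1\n"
    "\tdefinition 1\n"
    "\tdefinition 2\n"
    "\n"
    "term 2\n"
    "\tdefinition 1\n"
    "\tdefinition 2\n"
    "\tdefinition 3\n"
)

def wrap_terms(text):
    """Wrap every term block in <term>...</term>, leaving other text unchanged."""
    # \1 re-inserts the matched block; \n in the template becomes a line break.
    return BLOCK_RE.sub(r"<term>\n\1</term>\n", text)

print(wrap_terms(SAMPLE), end="")
```

Lines that are not followed by at least one TAB-indented line are left as they are, since the pattern requires one or more definition lines.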

Greg Bacon 2010-02-07 21:39:20

Parse text using regular expressions
