ansaurus

Question

With Parsec, how do I parse zero or more foo1 terminated by foo2 and all separated by dot?

Answer 1

A:

Try something like

many (foo1 >>= (\v -> char '.' >> return v)) >>= \v1 ->
  foo2 >>= \v2 ->
  -- ...
  -- combine v1 & v2 somehow

(Just a sketch, of course.)

In general, the many combinator is Parsec's equivalent of Kleene star; and if you're going to add something simple like a trailing dot to an existing parser, using >> / >>= may actually be cleaner and simpler than using do notation.

Michał Marczyk 2010-03-03 19:27:15

Answer 2

A:

sure, it would catch the foo2 case. Using for your foo1, Leiden's word:

let a = sepBy word (char '.')
parseTest a "foo.bar.baz"
parseTest a "foo"
parseTest a ".baz"

ja 2010-03-03 19:42:21

Answer 3

+2 A:

First, you want endBy instead of sepBy:

do k <- endBy foo1 (char'.')
   j <- foo2

Second, it would

catch the just foo2 case

From the documentation:

endBy p sep parses zero or more occurrences of p, separated by sep. Returns a list of values returned by p.

Alexey Romanov 2010-03-03 22:13:35

Answer 4

+3 A:

You want endBy, not sepBy.

foo = do k <- foo1 `endBy` char '.'
         j <- foo2
         ...

That will force the separator to be present after each occurrence of foo1.

Of course, endBy is trivially replaceable by many, which may be clearer.

foo = do k <- many $ foo1 <* char '.' 
         j <- foo2
         ...

or, without Control.Applicative:

foo = do k <- many $ do x <- foo1; char '.'; return x
         j <- foo2
         ...

Edward Kmett 2010-03-04 03:44:56

ansaurus

tags:

views:

answers:

With Parsec, how do I parse zero or more foo1 terminated by foo2 and all separated by dot?

related questions