ansaurus

Question

Partitioning in clojure with a lazy collection of strings

Answer 1

+5 A:

Not quite sure what you're going for, but the following does what your first example does, and does so lazily.

Step-by-step for clarity:

user=> (def str-coll ["abcd" "efgh" "jklm"])
#'user/str-coll
user=> (map seq str-coll)
((\a \b \c \d) (\e \f \g \h) (\j \k \l \m))
user=> (flatten *1)
(\a \b \c \d \e \f \g \h \j \k \l \m)
user=> (partition 3 *1)
((\a \b \c) (\d \e \f) (\g \h \j) (\k \l \m))

All together now:

(->> str-coll 
  (map seq)
  flatten
  (partition 3))

Alex Taggart 2010-07-27 23:29:51

No need to flatten; just concatenate the character sequences with mapcat: `(partition-all 3 (mapcat seq str-coll))`

Jürgen Hötzel 2010-07-28 15:19:07
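The two suggestions so far differ in one detail worth spelling out: `partition` drops a trailing group with fewer than n items, while `partition-all` keeps it. A rough sketch follows; the `uneven` collection is a made-up input, not from the question, chosen so that its character count is not a multiple of 3.

```clojure
(def str-coll ["abcd" "efgh" "jklm"])

;; 12 characters divide evenly by 3, so both approaches agree here.
(partition 3 (flatten (map seq str-coll)))
;; => ((\a \b \c) (\d \e \f) (\g \h \j) (\k \l \m))
(partition-all 3 (mapcat seq str-coll))
;; => ((\a \b \c) (\d \e \f) (\g \h \j) (\k \l \m))

;; Hypothetical input with 7 characters, to show the trailing group.
(def uneven ["abcd" "efg"])

(partition 3 (mapcat seq uneven))
;; => ((\a \b \c) (\d \e \f))
(partition-all 3 (mapcat seq uneven))
;; => ((\a \b \c) (\d \e \f) (\g))
```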

ataggart and Jürgen, thanks much for the solutions: mapping to a seq was exactly what I was missing. Getting over that hurdle led me to realize that partition-by wasn't acting as lazily as I'd hoped. While each partition is provided in a lazy manner, the individual components of each partition are not; so partitioning the initial file at delimiters does not provide the desired lazy strings that feed into this.

Brad Chapman 2010-07-28 15:54:34
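One way to observe the behaviour described above is to count how many source elements are realized when only the first character of the first partition is requested. This is a minimal sketch: the `|` delimiter, the sample string, and the names `realized` and `counting-seq` are assumptions for illustration. The printed count depends on how the Clojure version in use implements `partition-by`, so no expected number is shown.

```clojure
;; Counts how many source elements have been realized so far.
(def realized (atom 0))

(defn counting-seq
  "Lazily walks s, bumping `realized` each time an element is produced."
  [s]
  (map (fn [c] (swap! realized inc) c) s))

;; Split at the (assumed) delimiter character.
(def parts (partition-by #(= % \|) (counting-seq "abcd|efgh|jklm")))

(def first-part (first parts))

(println "first element:" (first first-part))
(println "source elements realized so far:" @realized)
```

If the count printed is larger than 1, more of the input was consumed than the single character that was asked for.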

@Jürgen: mapcat isn't lazy (it uses apply), hence why I didn't use it.

Alex Taggart 2010-07-28 17:50:45

@ataggart: Nope, it is! Just check: `(type (mapcat seq str-coll))`. Why should apply prevent laziness?

Jürgen Hötzel 2010-07-28 19:43:20

@Jürgen: mapcat returns a lazy seq, but how that comes into existence isn't fully lazy. See my additional "answer" for more info.

Alex Taggart 2010-07-28 20:30:39

Jürgen Hötzel 2010-07-28 20:50:03

@Jürgen: concat is irrelevant since the problem is the non-laziness of apply immediately prior to invoking concat. I urge you to read the more detailed answer I provided, and/or look at the implementation of apply.

Alex Taggart 2010-07-28 22:02:36
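The disagreement above can be probed empirically by counting how many input strings are converted with `seq` before the first character of the `mapcat` result is available. A minimal sketch, assuming a plain list as input so that the chunked-sequence behaviour of vectors does not inflate the count. The names `touched`, `counting-seq`, and `str-list` are hypothetical, and the exact count varies with the Clojure version, so none is asserted.

```clojure
;; Counts how many strings have been handed to seq so far.
(def touched (atom 0))

(defn counting-seq
  "Like seq, but records each call in `touched`."
  [s]
  (swap! touched inc)
  (seq s))

;; A list is not a chunked seq, so map walks it one element at a time.
(def str-list (list "abcd" "efgh" "jklm" "nopq" "rstu" "vwxy"))

(def first-char (first (mapcat counting-seq str-list)))

(println "first character:" first-char)
(println "strings converted so far:" @touched)
```

A count of 1 would indicate fully lazy behaviour; a count above 1 but below the input size would indicate that some, but not all, arguments are realized ahead of time.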

@Jürgen: Since concat works on strings, seq is unnecessary: `(partition-all 3 (apply concat str-coll))` and `(partition-all 3 (mapcat identity str-coll))` both work too :)

Rafał Dowgird 2010-07-30 07:48:42
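A quick check that the variants mentioned in this thread agree on the original example; the name `expected` is introduced here only for illustration.

```clojure
(def str-coll ["abcd" "efgh" "jklm"])

(def expected '((\a \b \c) (\d \e \f) (\g \h \j) (\k \l \m)))

;; concat calls seq on each argument, so strings can be passed directly.
(= expected
   (partition-all 3 (mapcat seq str-coll))
   (partition-all 3 (apply concat str-coll))
   (partition-all 3 (mapcat identity str-coll)))
;; => true
```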

Answer 2

+1 A:

Alex Taggart 2010-07-28 20:26:16

There is no "get all those n lazy seqs". concat is invoked with a "lazy argument list". You can check this in practice with our example by setting the string collection to an infinite lazy list: `(def str-coll (repeat "abcd"))`. Then just take part of the result: `(take 10 (partition-all 3 (mapcat seq str-coll)))`

Jürgen Hötzel 2010-07-29 04:34:23
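For reference, the infinite-input check described in the comment above can be written out as follows, with the result worked out by hand from the repeating a, b, c, d pattern. The name `result` is an assumption added for illustration.

```clojure
;; An infinite lazy sequence of the same string.
(def str-coll (repeat "abcd"))

(def result (take 10 (partition-all 3 (mapcat seq str-coll))))

result
;; => ((\a \b \c) (\d \a \b) (\c \d \a) (\b \c \d) (\a \b \c)
;;     (\d \a \b) (\c \d \a) (\b \c \d) (\a \b \c) (\d \a \b))
```

That this returns at all shows the pipeline does not need to consume the whole input, though it does not by itself show how far ahead of the requested items it reads.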

Please see the edit at the bottom of my post for more proof of my claim.

Alex Taggart 2010-07-29 05:15:32

Alex Taggart 2010-07-29 05:21:33

Though your point regarding apply working with an infinite series has me baffled.

Alex Taggart 2010-07-29 05:40:42
