ansaurus

Question

Answer 1

+2 A:

Do you have to work with the incomplete list while it's being built up - i.e. the elements have to be at the correct positions?

If you really have to, binary trees might help to make index-based updates more efficient,

but if not, just collect the values as they come in and sort afterwards.

Dario 2010-10-08 17:41:08

That is an excellent point. Thanks.

Nathon 2010-10-08 18:40:05

... or collect values in sorted order.

Hynek -Pichi- Vychodil 2010-10-08 23:29:18

Answer 2

+2 A:

There is an array module that may have the semantics you are looking for. But it is a functional container and may not have the performance characteristics you are looking for, esp. it you are dealing with a large array.

However, your first attempt may have some merit, just do it recursively. (If you know the list sizes going in.)

Completely untested, but start with somthing like this:

aggregator(X, Y) ->
    aggregator(X, Y, []).

aggregator(_, 0, Accum) ->
    Accum;
aggregator(X, Y, Accum) ->
    aggregator(X, Y-1, [aggr_x(X, Y, [])|Accum]).

aggr_x(0, _, AccX) ->
    AccX;
aggr_x(X, Y AccX) ->
    receive
      {Value, {X, Y}} ->
        aggr_x(X-1, Y, [Value|AccX]) ->
    end.

Keep in mind that this will receive high indexed elements first, and that for large data sets, your inbox may get very full, and you will need to consider the performance of matching on a receive with a deep inbox.

dsmith 2010-10-08 18:04:48

I'm looking into the array module. I don't see how to do the first approach recursively though.

Nathon 2010-10-08 18:39:10

Sorry, this was two separate suggestions-- use the array module or do the receive recursively based on pattern matching. The code that I give here is essentially that same as your first snip-it of code, but with a single receive that is called recursively. And it returns arrays instead of tuples-of-tuples.

dsmith 2010-10-08 18:57:52

Answer 3

+3 A:

At end is an explanation of how I am wrong.

Arrays are implemented so that when functional semantics allow, they are updated in O(1) time. That is, if it only is referenced once and the reference is dropped as soon as the new array is computed, the new array will be computed by overwriting the changed entry. As this is all handled by the compiler/interpreter, you still have the safety of functional programming.

Because Erlang arrays are only one-dimensional, you need to do the index computation yourself.. Perhaps having one aggregator per row would be reasonable and help when running on many processors?

Example code assumes 0 is Origo for Row and Column and that every position will be received exactly once.

aggregator(Rows, Cols)->
  Size = Rows*Cols,
  aggregator(Cols, array:new(Size), Size).

aggregator(Cols, Array, 0)->Array;
aggregator(Cols, Array, N)->
  receive
    {Value, {Col, Row}}->
      aggregator(Cols, array:set(Col+Row*Cols, Value, Array), N-1)
  end.

I have tested the fixed array structure by running:

test_array(Size, Times)->
  test_array(Size, array:new(Size, {default, 0}), Times).
test_array(_Size, Array, 0)->
  Array;
test_array(Size, Array, Times) ->
  X = random:uniform(Size)-1,
  V = array:get(X, Array),
  test_array(Size, array:set(X, V+1, Array), Times-1).

with Times=100000000 and Size at [10, 100, 1000, 10000, 100000]. Wallclock times in seconds were: [101,129,140,182,241] This is very roughly about O(log(Size)) where I would have expected about O(1). Thus I have showed I was wrong, but perhaps in the future array will behave more like I thought it did.

Koistinen 2010-10-09 22:27:05

Are you referring to the array() data type, or are you referring to lists? The array() data type is simply a tuple and I wouldn't think that the compiler knows anything specific about it. Does this optimization you are referring to apply generally to all tuples? Do you know of any reading materials explaining this? Thanks

dsmith 2010-10-09 23:00:15

I have tested a fixed array and it seems that you are right and I remembered wrongly. I'll update my answer reflect this.

Koistinen 2010-10-10 13:48:13

ansaurus

tags:

views:

answers:

Replace in a list in Erlang

related questions