views:

458

answers:

6

This question is about a data structure I thought of. It is a dynamic array, like std::vector<> in C++, except the removal algorithm is different.

In a normal dynamic array, when an element is removed, all the elements after it must be shifted down, which is O(n), unless it's the last element, in which case it's O(1).

In this one, when any element is removed, it is replaced by the last element and the array shrinks by one. This of course loses the ordering of the elements, but now removal of any element is constant time.
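
For concreteness, here's a minimal sketch of that removal in C++ (this trick is sometimes called "swap and pop"; the helper name unordered_remove is just for illustration):

    #include <cstddef>
    #include <utility>
    #include <vector>

    // Overwrite the element at index i with the last element, then shrink
    // the vector by one. O(1), but the order of the remaining elements is
    // not preserved.
    template <typename T>
    void unordered_remove(std::vector<T>& v, std::size_t i) {
        if (i + 1 < v.size())            // removing the last element needs no move
            v[i] = std::move(v.back());  // replace with the last element
        v.pop_back();                    // drop the now-redundant last slot
    }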

A list has the same removal time, but this structure also has random access. The caveat is that you don't know what you're accessing, since the ordering can get jumbled, so what use is random access anyway? Plus, a list won't invalidate any pointers/iterators to its elements.

So meh, this structure seems rather useless except for the very specific task of strictly walking through elements and perhaps removing them along the way. A list can do the same, but this has better cache performance.

So, does this strange/useless structure have a name, and does it have any uses? Or just a nice little brain storm?

+4  A: 

This idea is used in the Knuth (Fisher–Yates) shuffle. An element picked at random is replaced with the last one in the array. Since what we want is a random permutation anyway, the reordering doesn't matter.
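
For example, a sketch of the shuffle in C++ (names are illustrative): each iteration swaps a randomly picked element with the last element of the unshuffled prefix, which is exactly the "replace with the last one" move:

    #include <cstddef>
    #include <random>
    #include <utility>
    #include <vector>

    // Fisher-Yates / Knuth shuffle: pick a random element from the
    // not-yet-fixed prefix and swap it into that prefix's last slot.
    void knuth_shuffle(std::vector<int>& a, std::mt19937& rng) {
        for (std::size_t n = a.size(); n > 1; --n) {
            std::uniform_int_distribution<std::size_t> pick(0, n - 1);
            std::swap(a[pick(rng)], a[n - 1]);
        }
    }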

Rafał Dowgird
The reordering *does* matter; that's where the uniform randomness of the final permutation comes from!
ShreevatsaR
@ShreevatsaR: I mean the reordering of the elements not picked yet - they will be reordered anyway, so changes to their order introduced by picking other elements do not matter. Of course, this requires a (simple) proof that this reordering doesn't affect the uniformity of the final distribution.
Rafał Dowgird
Thanks, I chose this since it's closest to what I thought of, plus it makes me feel like I was almost as smart as Knuth.
GMan
A: 

Hm, does that algorithm really have O(1) removal time?

That would mean that

  1. Finding the element to remove is O(1)
  2. Finding the last element (which will replace the deleted element) is O(1)
  3. Finding the second-to-last element (the new "last" element) is O(1)

...which is not possible in any data structure I can come up with, although a doubly linked list could fulfill these constraints, given that you've already got a pointer to the element to remove.

Christoffer
What? Removal times have nothing to do with search times. So yes, this has O(1) removal time.
GMan
Store the size of the array SIZE and a pointer PTR in a struct representing the array. (1) is PTR+n, where n is the index of the element to remove, which is O(1). (2) is PTR+(SIZE-1), which is O(1). (3) is PTR+(SIZE-2), probably realized by SIZE--, but still O(1).
Kevin Montrose
A standard array? I believe GMan meant removal by index, not value.
Autoplectic
Ah, of course. I assumed removal by value; that's what has been relevant at work lately.
Christoffer
+2  A: 

I remember using this method plenty of times before. But I don't know a name for it.

Simple example: in a computer game you are iterating over all the "bad guys" and calculating their movements, etc. One thing that can happen to them is to disappear (their dead body has finished fading away and is 99% transparent now). At that point you remove them from the list as you describe, and resume iterating without increasing the iteration counter.
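
A sketch of that loop in C++ (BadGuy and its members are made up for illustration):

    #include <cstddef>
    #include <utility>
    #include <vector>

    struct BadGuy {
        bool dead = false;
        void update() { /* move, animate, fade out, ... */ }
    };

    // Update everyone and remove the dead in O(1) each. After a
    // swap-removal, i is deliberately NOT advanced, so the element
    // swapped in from the back still gets processed this frame.
    void update_all(std::vector<BadGuy>& guys) {
        for (std::size_t i = 0; i < guys.size(); /* no ++i here */) {
            guys[i].update();
            if (guys[i].dead) {
                guys[i] = std::move(guys.back());
                guys.pop_back();
            } else {
                ++i;
            }
        }
    }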

Something similar to this is done in a binary heap when deleting an item, but there the next step is to restore the heap property, which makes deletion O(log n).
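
For instance, popping the minimum of a binary min-heap stored in a vector starts with the same move (a sketch; the heap is assumed non-empty):

    #include <cstddef>
    #include <utility>
    #include <vector>

    // Remove the root by replacing it with the last element, then
    // sift down to restore the heap property: O(log n) overall.
    int heap_pop_min(std::vector<int>& h) {
        int top = h[0];
        h[0] = std::move(h.back());
        h.pop_back();
        std::size_t i = 0;
        for (;;) {
            std::size_t l = 2 * i + 1, r = 2 * i + 2, smallest = i;
            if (l < h.size() && h[l] < h[smallest]) smallest = l;
            if (r < h.size() && h[r] < h[smallest]) smallest = r;
            if (smallest == i) break;
            std::swap(h[i], h[smallest]);
            i = smallest;
        }
        return top;
    }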

yairchu
+3  A: 

So, does this strange/useless structure have a name, and does it have any uses?

I've used something similar in simulations of multi-process systems.

In a scheduler for processes implemented as state machines, each process is either waiting for an external event, active or completed. The scheduler has an array of pointers to the processes.

Initially each process is active, and the scheduler keeps two indices: one just past the last waiting process and one for the first completed process. These start at zero and at the length of the array, respectively.

V-- waiting
[ A-active, B-active, C-active, D-active ]
                             completed --^
^- run

To step the processes to their next states, the scheduler iterates over the array and runs each process in turn. If a process reports that it is waiting, it's swapped with the process just after the last waiting process in the array, and the waiting index is incremented.

           V-- waiting
[ A-waiting, B-active, C-active, D-active ]
                              completed --^
             ^- run

If it reports that it has completed, it's swapped with the process before the first completed process, and the completed index is decremented.

           V-- waiting
[ A-waiting, D-active, C-active, B-completed ]
                   completed --^
             ^- run

So as the scheduler runs and processes transition from active to waiting or completed, the array becomes ordered with all the waiting processes at the start, all the active ones in the middle, and the completed ones at the end.

                      V-- waiting
[ A-waiting, C-waiting, D-active, B-completed ]
                   completed --^
                        ^- run

After either a certain number of iterations, or when there are no more active processes, the completed processes are cleaned out of the array and external events are processed:

                      V-- waiting
[ A-waiting, C-waiting, D-completed, B-completed ]
          completed --^
                        ^- run == completed so stop

This is similar in that it uses swapping to remove items from a collection, but it removes items at both ends rather than just one, leaving the 'collection' of active processes in the middle.
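
A rough sketch of one pass of such a scheduler (Process, State, and run_pass are illustrative names, and step() is stubbed):

    #include <cstddef>
    #include <utility>
    #include <vector>

    enum class State { Waiting, Active, Completed };

    struct Process {
        State step() { return State::Active; }  // stub: run one step of the state machine
    };

    // One pass over the window [waiting, completed) of active processes.
    // Newly waiting processes are swapped to the front of the window and
    // newly completed ones to the back, keeping the array partitioned as
    // [ waiting... | active... | completed... ].
    void run_pass(std::vector<Process>& procs,
                  std::size_t& waiting, std::size_t& completed) {
        for (std::size_t run = waiting; run < completed; ) {
            switch (procs[run].step()) {
            case State::Waiting:
                // The process just after the waiting prefix has already
                // been run this pass, so advance past it after the swap.
                std::swap(procs[run], procs[waiting++]);
                ++run;
                break;
            case State::Completed:
                // The process swapped in from the back hasn't run yet,
                // so don't advance run.
                std::swap(procs[run], procs[--completed]);
                break;
            case State::Active:
                ++run;
                break;
            }
        }
    }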

Pete Kirkham
+1  A: 

It's called a Set.

Dave Gamble
Commonly, set element removal is O(log(n)).
Stefan Monov
A: 

I don't know of a name for it, but it is better than a list in certain cases.

In particular, this would be vastly superior to a singly or doubly linked list when the elements are small. Because you store everything contiguously, there's no extra pointer overhead per element.

Michael Anderson