ansaurus

Question

Pattern name for flippable data structure?

Answer 1

+1 A:

Your objects should have one role and responsibility. In your case, should the ContinuousScalarField be responsible for interpolating?

Perhaps you might be better off doing something like:

IInterpolator interpolator = field.GetInterpolator();
Measurement measurement = interpolator.InterpolateAt(...);

I hope this makes sense, but without fully understanding your problem domain it's hard to give you a more coherent answer.

jonnii 2008-10-29 18:31:15

Answer 2

+4 A:

If an object has two modes like this, I would suggest exposing two interfaces to the client. If the object is in append mode, then you make sure that the client can only ever use the IAppendable implementation. To flip to query mode, you add a method to IAppendable such as AsQueryable. To flip back, call IQueryable.AsAppendable.

You can implement IAppendable and IQueryable on the same object, and keep track of the state in the same way internally, but having two interfaces makes it clear to the client what state the object is in, and forces the client to deliberately make the (expensive) switch.
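As a rough illustration, here is a minimal Java sketch of the two-interface idea. Only the names IAppendable, IQueryable, AsQueryable and AsAppendable come from the text above; FlippableField, add and sum are hypothetical, and a plain running sum stands in for whatever expensive structures the real object would build.

```java
import java.util.ArrayList;
import java.util.List;

// Append-mode view: the client can add values or deliberately flip.
interface IAppendable {
    void add(double value);
    IQueryable asQueryable();   // the explicit, expensive switch
}

// Query-mode view: the client can query or flip back.
interface IQueryable {
    double sum();               // stand-in for a real query such as interpolation
    IAppendable asAppendable();
}

// One object implements both interfaces and tracks its mode internally.
final class FlippableField implements IAppendable, IQueryable {
    private final List<Double> values = new ArrayList<>();
    private boolean queryMode = false;
    private double cachedSum;   // stand-in for the expensive derived structures

    // Clients start with the append-only view.
    static IAppendable create() {
        return new FlippableField();
    }

    @Override
    public void add(double value) {
        if (queryMode) {
            throw new IllegalStateException("call asAppendable() before adding");
        }
        values.add(value);
    }

    @Override
    public IQueryable asQueryable() {
        // Rebuild the derived data from scratch on every flip.
        cachedSum = 0.0;
        for (double v : values) {
            cachedSum += v;
        }
        queryMode = true;
        return this;
    }

    @Override
    public double sum() {
        if (!queryMode) {
            throw new IllegalStateException("call asQueryable() before querying");
        }
        return cachedSum;
    }

    @Override
    public IAppendable asAppendable() {
        queryMode = false;
        return this;
    }
}
```

The runtime checks guard against a client that keeps a stale reference to the other interface after a flip; the static types alone cannot prevent that.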

Tim Robinson 2008-10-29 18:35:16

I've done something similar before and it works quite well. It follows the spirit of making it easy to do things right and hard to do things wrong, which is always an issue with this sort of problem.

Phil Nash 2008-10-30 21:04:17

Answer 3

A:

You could have a state variable. Have a method for starting the high level processing, which will only work if the STATE is in SECTION-1. It will set the state to SECTION-2, and then to SECTION-3 when it is done computing. If there's a request to the program to interpolate a given point, it will check if the state is SECTION-3. If not, it will request the computations to begin, and then interpolate the given data.

This way, you accomplish both - the program will perform its computations at the first request to interpolate a point, but can also be requested to do so earlier. This would be convenient if you wanted to run the computations overnight, for example, without needing to request an interpolation.
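A minimal Java sketch of that state variable, under some assumptions: the enum constants COLLECTING, COMPUTING and READY are hypothetical names for SECTION-1, SECTION-2 and SECTION-3, the class and method names are made up, and a simple mean stands in for the real computation. Resetting the state when a sample is added is also an assumption, based on the question's remark that adding a value invalidates the computed structures.

```java
import java.util.ArrayList;
import java.util.List;

final class StatefulField {
    // Hypothetical names for SECTION-1, SECTION-2 and SECTION-3.
    enum State { COLLECTING, COMPUTING, READY }

    private final List<Double> samples = new ArrayList<>();
    private State state = State.COLLECTING;
    private double mean; // stand-in for the expensive precomputed structures

    State state() {
        return state;
    }

    void add(double sample) {
        samples.add(sample);
        state = State.COLLECTING; // any earlier computation is now stale
    }

    // May be requested early (for example overnight); only runs from COLLECTING.
    void compute() {
        if (state != State.COLLECTING) {
            return;
        }
        state = State.COMPUTING;
        double total = 0.0;
        for (double s : samples) {
            total += s;
        }
        mean = samples.isEmpty() ? 0.0 : total / samples.size();
        state = State.READY;
    }

    // Placeholder "interpolation": ignores x and returns the mean.
    double interpolateAt(double x) {
        if (state != State.READY) {
            compute(); // first query triggers the computation if nobody asked earlier
        }
        return mean;
    }
}
```

This sketch is single-threaded; the intermediate COMPUTING state only becomes observable if compute() runs on another thread, which would need synchronization that is not shown here.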

Elie 2008-10-29 18:35:52

Answer 4

A:

"I've just used lazy-evaluation to construct the data structures" -- Good

"if the user calls the "field.add()" method again, I have to completely discard those data structures and start over from scratch." -- Interesting

"in the standard use case, the caller never adds another value to the collection after starting to issue queries" -- Whoops, false alarm, actually not interesting.

Since lazy eval fits your use case, stick with it. That's a very heavily used model because it is so delightfully reliable and fits most use cases very well.

The only reasons for rethinking this are (a) a use case change (mixed adding and interpolation), or (b) performance optimization.

Since use case changes are unlikely, you might consider the performance implications of breaking up interpolation. For example, during idle time, can you precompute some values? Or with each add is there a summary you can update?

Also, a highly stateful (and not very meaningful) flip method isn't so useful to clients of your class. However, breaking interpolation into two parts might still be helpful to them -- and help you with optimization and state management.

You could, for example, break interpolation into two methods.

public void interpolateAt( Point3d p );
public Measurement interpolatedMeasurement();

This borrows the relational database Open and Fetch paradigm. Opening a cursor can do a lot of preliminary work, and may start executing the query; you don't know. Fetching the first row may do all the work, or execute the prepared query, or simply fetch the first buffered row. You don't really know. You only know that it's a two-part operation. The RDBMS developers are free to optimize as they see fit.

S.Lott 2008-10-29 18:52:33

What if the application is multi-threaded? If the "interpolateAt" method returns void, and the "interpolatedMeasurement" method fetches the result, it seems like a huge potential source of race conditions.

benjismith 2008-10-29 19:17:19

@benjismith: there's a race only if the fetch is ignorant of the state of the interpolate. This is the place where it's handy for a thread to queue up the results while the measurement call waits and dequeues a result.
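One way to read that, sketched in Java with a BlockingQueue from java.util.concurrent. The class name QueuedInterpolator and the slowInterpolate helper are hypothetical, doubles stand in for Point3d and Measurement, and doubling the input is only a placeholder for the real math.

```java
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.LinkedBlockingQueue;

final class QueuedInterpolator {
    // Finished results wait here until a caller fetches them.
    private final BlockingQueue<Double> results = new LinkedBlockingQueue<>();

    // Starts the work on a background thread and returns immediately.
    void interpolateAt(double x) {
        Thread worker = new Thread(() -> results.add(slowInterpolate(x)));
        worker.setDaemon(true);
        worker.start();
    }

    // Blocks until a result is available, then removes and returns it.
    double interpolatedMeasurement() {
        try {
            return results.take();
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while waiting for a result", e);
        }
    }

    // Placeholder for the expensive computation.
    private static double slowInterpolate(double x) {
        return 2.0 * x;
    }
}
```

The fetch can no longer run ahead of the computation, because take() waits. Results come out in completion order, though, so this sketch suits a single requesting thread; with several concurrent callers there is nothing here to match a result to its request.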

S.Lott 2008-10-29 19:31:16

Answer 5

+2 A:

I generally prefer to have an explicit change, rather than lazily recomputing the result. This approach makes the performance of the utility more predictable, and it reduces the amount of work I have to do to provide a good user experience. For example, if this occurs in a UI, where do I have to worry about popping up an hourglass, etc.? Which operations are going to block for a variable amount of time, and need to be performed in a background thread?

That said, rather than explicitly changing the state of one instance, I would recommend the Builder Pattern to produce a new object. For example, you might have an aggregator object that does a small amount of work as you add each sample. Then instead of your proposed void flip() method, I'd have an Interpolator interpolator() method that gets a copy of the current aggregation and performs all your heavy-duty math. Your interpolateAt method would be on this new Interpolator object.

If your usage patterns warrant, you could do simple caching by keeping a reference to the interpolator you create and returning it to multiple callers, only clearing it when the aggregator is modified.

This separation of responsibilities can help yield more maintainable and reusable object-oriented programs. An object that can return a Measurement at a requested Point is very abstract, and perhaps a lot of clients could use your Interpolator as one strategy implementing a more general interface.
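Here is a minimal Java sketch of that arrangement, including the simple caching. The interpolator() and interpolateAt names follow the text above; SampleAggregator, PointEvaluator and the one-dimensional double samples are assumptions made to keep the example short, and nearest-sample lookup is just a placeholder for the heavy-duty math.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical general interface that clients could depend on.
interface PointEvaluator {
    double interpolateAt(double x);
}

// Immutable snapshot; in a real version the constructor would do the heavy math.
final class Interpolator implements PointEvaluator {
    private final double[] xs;
    private final double[] ys;

    Interpolator(double[] xs, double[] ys) {
        this.xs = xs;
        this.ys = ys;
    }

    // Placeholder: returns the value of the nearest sample.
    @Override
    public double interpolateAt(double x) {
        if (xs.length == 0) {
            throw new IllegalStateException("no samples");
        }
        int best = 0;
        for (int i = 1; i < xs.length; i++) {
            if (Math.abs(xs[i] - x) < Math.abs(xs[best] - x)) {
                best = i;
            }
        }
        return ys[best];
    }
}

// Cheap to add to; builds a new Interpolator on request.
final class SampleAggregator {
    private final List<double[]> samples = new ArrayList<>();
    private Interpolator cached; // cleared whenever the samples change

    void add(double x, double y) {
        samples.add(new double[] { x, y });
        cached = null;
    }

    // Copies the current samples into a new Interpolator, reusing the last one if nothing changed.
    Interpolator interpolator() {
        if (cached == null) {
            double[] xs = new double[samples.size()];
            double[] ys = new double[samples.size()];
            for (int i = 0; i < samples.size(); i++) {
                xs[i] = samples.get(i)[0];
                ys[i] = samples.get(i)[1];
            }
            cached = new Interpolator(xs, ys);
        }
        return cached;
    }
}
```

Because each Interpolator works on its own copy, an interpolator handed out earlier keeps answering from its snapshot even after more samples are added to the aggregator.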

I think that the analogy you added is misleading. Consider an alternative analogy:

Key[] data = new Key[...];
data[idx++] = new Key(...); /* Fast! */
...
Arrays.sort(data); /* Slow! */
...
boolean contains = Arrays.binarySearch(data, datum) >= 0; /* Fast! */

This can work like a set, and actually, it gives better performance than Set implementations (which are implemented with hash tables or balanced trees).

A balanced tree can be seen as an efficient implementation of insertion sort. After every insertion, the tree is in a sorted state. The predictable time requirements of a balanced tree are due to the fact the cost of sorting is spread over each insertion, rather than happening on some queries and not others.

The rehashing of hash tables does result in less consistent performance, and because of that, hash tables aren't appropriate for certain applications (perhaps a real-time microcontroller). But even the rehashing operation depends only on the load factor of the table, not the pattern of insertion and query operations.

For your analogy to hold strictly, you would have to "sort" (do the hairy math) your aggregator with each point you add. But it sounds like that would be cost-prohibitive, and that leads to the builder or factory method patterns. These make it clear to your clients when they need to be prepared for the lengthy "sort" operation.

erickson 2008-10-29 18:56:05

I think you're right. The Builder pattern is the way to go here.

benjismith 2008-10-30 17:24:24
