Simplifying algorithm testing for researchers.

views:

135

answers:

+1 Q:

Simplifying algorithm testing for researchers.

I work in a group that does a large mix of research development and full shipping code.

Half the time I develop processes that run on our real time system ( somewhere between soft real-time & hard real-time, medium real-time? )

The other half I write or optimize processes for our researchers who don't necessarily care about the code at all.

Currently I'm working on a process which I have to fork into two different branches.

There is a research version for one group, and a production version that will need to occasionally be merged with the research code to get the latest and greatest into production.

To test these processes you need to setup a semi complicated testing environment that will send the data we analyze to the process at the correct time (real time system).

I was thinking about how I could make the:

Idea
Implement
Test
GOTO #1

Cycle as easy, fast and pain free as possible for my colleagues.

One Idea I had was to embed a scripting language inside these long running processes. So as the process run's they could tweak the actual algorithm & it's parameters.

Off the bat I looked at embedding:

Lua (useful guide)
Python (useful guide)

These both seem doable and might actually fully solve the given problem.

Any other bright idea's out there?

Recompiling after a 1-2 line change, redeploying to the test environment and restarting just sucks.

The system is fairly complicated and hopefully I explained it half decently.

Not sure I understand your system, but if the build and deployment is too complicated, maybe you could automate it? If deployment is completely automatic, would that solve the problem?

I don't understand how a scripting language would solve the problem? If you change your algorithm, you still need to restart calculation from the beginning, don't you?

sleske 2009-05-17 23:20:02

The system is setup so you start your process and it waits for data. Data is fed into the system on a every N number of minutes cycle. I had envisioned that I would read the script on ever data receive event. So you could make a change during or in the middle of an update and the next update would run with your new change.

Brian Gianforcaro 2009-05-17 23:28:55

And while build system isn't exactly wonderful, it's not really the real problem.

Brian Gianforcaro 2009-05-17 23:30:41

It kind of sounds like what you need is CruiseControl or something similar; every time hyou touch the baseline code, it rebuilds and reruns tests.

Charlie Martin 2009-05-18 01:08:11

I have buildbot setup, for continuous integration. But the results arn't really testable. It's more of a case of a scientist having to sit down and analayze the results.

Brian Gianforcaro 2009-05-18 01:22:10

It sounds then like the test process itself is your problem; having a scientist have to be involved seems suboptimal. Why not build a collection of tests with known inputs compared to known outputs?

Charlie Martin 2009-05-18 01:31:44

Brian Gianforcaro 2009-05-18 01:57:12

Couldn't you just capture test cases from past data?

Charlie Martin 2009-05-18 02:49:44

+1 A:

Embedding Lua is much easier than embedding Python.

Lua was designed from the start to be embedded; Python's embeddability was grafted on after the fact.
Lua is about 20x smaller and simpler than Python.

You don't say much about your build process, but building and testing can be simplified significantly by using a really powerful version of make. I use Andrew Hume's mk, but you would probably be even better off investing the time to master Glenn Fowler's nmake, which can add dependencies on the fly and eliminate the need for a separate configuration step. I don't ordinarily recommend nmake because it is somewhat complicated, but it is very clear that Fowler and his group have built into nmake solutions for lots of scaling and portability problems. For your particular situation, it may be worth the effort to master it.

Norman Ramsey 2009-05-18 01:44:02

We currently do use a rather sophisticated make based build system. It uses on the fly dependencies as you describe, it's great. Good suggestions anyway. And I am leaning more towards lua, I agree with your bullets.

Brian Gianforcaro 2009-05-18 01:59:39

+2 A:

If you can change enough of the program through a script to be useful, without a full recompile, maybe you should think about breaking the system up into smaller parts. You could have a "server" that handles data loading etc and then the client code that does the actual processing. Each time the system loads new data, it could check and see if the client code has been re-compiled and then use it if that's the case.

I think there would be a couple of advantages here, the largest of which would be that the whole system would be much less complex. Now you're working in one language instead of two. There is less of a chance that people can mess things up when moving from python or lua mode to c++ mode in their heads. By embedding some other language in the system you also run the risk of becoming dependent on it. If you use python or lua to tweek the program, those languages either become a dependency when it becomes time to deploy, or you need to back things out to C++. If you choose to port things to C++ theres another chance for bugs to crop up during the switch.

TwentyMiles 2009-05-18 02:31:00

ansaurus

tags:

views:

answers:

Simplifying algorithm testing for researchers.

related questions