ansaurus

Question

Chained data computation. Make and alternatives ?

Answer 1

+1 A:

Some alternatives that spring to mind:

Ant has pretty nice support for customizing dependencies using Java.
SCons allows you to write custom dependency code using Python.

JesperE 2009-04-16 04:55:33

Answer 2

+1 A:

Two other alternatives are

Jam the boost project uses it and
QMake used by the QT

oykuo 2009-04-16 05:10:45

Answer 3

+1 A:

Rake is a Ruby implementation of Dependency-Oriented Programming that is heavily inspired by Make and Ant, but much cleaner and nicer to use.

Recently, there has been a newcomer on the scene, which is called Tap. It also allows Dependency-Oriented Programming but extends it with concepts such as Workflows. It was designed by a PhD biochemistry student who works in a biomolecular research lab, specifically to do exactly the things you mention: keeping scientific data derived from experiments up to date.

Jörg W Mittag 2009-04-16 22:39:26

I don't really like ruby, but I see some nice ideas here. Do you know if similar initiatives exist for python ?

Stefano Borini 2009-04-17 00:45:32

Answer 4

+1 A:

Assuming it is possible to discover that the database records are newer, then it should be possible to write a program that sets the date of a sentinel file to the date of the newest data record (or "now", if that is simpler) in the relevant source tables. Doing that for each database or query will give you a collection of sentinel files that can be used along with your existing CSV source files to feed the dependency tree and drive the whole calculation with standard make.

One easy answer to getting the sentinels updated on every build would be to use a build script that runs the data proxy generator followed by make in place of just the make command itself.

It should be possible to arrange for make to automatically update the sentinels as part of the normal dependency checks. Something like the following (untested) should do the trick:

all: result
clean:
        -rm table_*.txt
        -rm step*.csv
        -rm results.txt

results.txt: step2.csv
        write_report -o results.txt step2.csv

step1.csv: source.csv table_A.txt 
        do_step1 -o step1.csv source.csv

step2.csv: step1.csv table_B.txt
        do_step2 -o step2.csv

table_A.txt:
        touch_sentinel -o table_A.txt rawdata.sqlite A 

table_B.txt:
        touch_sentinel -o table_B.txt otherdata.sqlite B

where touch_sentinel creates an output file dated since the latest update to a table in a database. Determining how to learn the date is an exercise to the reader...

RBerteig 2009-04-17 22:05:17

_very_ interesting. Thanks!

Stefano Borini 2009-04-17 23:42:35

ansaurus

tags:

views:

answers:

Chained data computation. Make and alternatives ?

related questions