ansaurus

Question

Answer 1

+7 A:

A VM is a big task to consider. Have you considered basing your VM on something like LLVM?

LLVM will provide a good base to start from and there are plenty of example projects which you can use for understanding.

Jeff Foster 2009-11-28 12:20:10

Answer 2

+5 A:

Before writing a JIT ("Just-in-time") compiler, you should at least consider how you would write a "Way-ahead-of-time" compiler.

That is, given a program consisting of instructions for your VM, how would you produce a program consisting of x86 (or whatever) instructions, that does the same as the original program? How would you optimise the output for different instruction sets, and different versions of the same architecture? The example opcode you've given has quite a complicated implementation, so which opcodes would you implement "inline" by just emitting code that does the job, and which would you implement by emitting a call to some shared code?
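To make the inline-versus-call question concrete, here is a minimal sketch in C of a "way-ahead-of-time" translator. Everything in it is an assumption made for illustration: the opcodes, the `Insn` struct, the `translate` function and the `vm_concat` runtime helper are hypothetical and do not come from the question, and the output is only x86-64-flavoured assembly text, not something tuned for any real assembler. Cheap opcodes are expanded inline, while `OP_CONCAT` stands in for a "complicated" opcode and becomes a call to shared code.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

/* Hypothetical opcodes for an imaginary stack VM (not from the question). */
typedef enum { OP_PUSH, OP_ADD, OP_CONCAT, OP_RET } Opcode;

typedef struct {
    Opcode op;
    int arg;            /* only used by OP_PUSH */
} Insn;

/* Append text to out, refusing to overflow the buffer. */
static int emit(char *out, size_t cap, size_t *used, const char *text)
{
    size_t len = strlen(text);
    if (*used + len + 1 > cap)
        return -1;
    memcpy(out + *used, text, len + 1);
    *used += len;
    return 0;
}

/* Translate a VM program into assembly-like text.
   Returns 0 on success, -1 on an unknown opcode or a full buffer. */
int translate(const Insn *code, size_t n, char *out, size_t cap)
{
    size_t used = 0;
    char line[64];

    assert(out != NULL && cap > 0);
    out[0] = '\0';

    for (size_t i = 0; i < n; i++) {
        const char *text = line;
        switch (code[i].op) {
        case OP_PUSH:   /* trivial: emit inline */
            snprintf(line, sizeof line, "    push %d\n", code[i].arg);
            break;
        case OP_ADD:    /* short enough to emit inline */
            text = "    pop rax\n    add [rsp], rax\n";
            break;
        case OP_CONCAT: /* "complicated": call shared runtime code */
            text = "    call vm_concat\n";
            break;
        case OP_RET:
            text = "    pop rax\n    ret\n";
            break;
        default:
            return -1;
        }
        if (emit(out, cap, &used, text) != 0)
            return -1;
    }
    return 0;
}
```

Calling `translate` on a small `Insn` array fills the buffer with one chunk of text per opcode. A JIT has to make the same per-opcode choice, only at run time and writing machine code rather than text.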

A JIT has to be able to do this, and it also has to make decisions while the VM is running about which code it does it to, when it does it, and how it represents the resulting mixture of VM instructions and native instructions.

If you're not already an assembly-jockey, then I don't recommend writing a JIT. That's not to say "don't do it ever", but you should become an assembly-jockey before you start in earnest.

An alternative would be to write a non-JIT compiler to convert your VM instructions (or the original scripting language) to Java bytecode, or LLVM, as Jeff Foster says. Then let the toolchain for that bytecode do the difficult, CPU-dependent work.

Steve Jessop 2009-11-28 14:05:29

Answer 3

+1 A:

Steve Jessop has a point: a JIT compiler is way harder than a normal compiler. And a normal compiler is hard by itself.

But, reading the last part of the question, I wonder if you really want a JIT compiler.

If your problem is like this:

I want to create a ray tracing program which allows users to provide their own shader procedures etc. using my own domain-specific language. It goes OK. I have my language defined and an interpreter implemented, and it works nicely and correctly. But it's slow: how can I execute it as native code?

Then here's what I used to do in similar situations:

Translate your user-provided procedures to C functions that can be called from your program.
Write them out to a normal C source file with the proper #includes etc.
Compile them as a .dll (or .so on *nix) using a normal C compiler.
Load the .dll dynamically in your program, look up your function pointers and use them in your ray tracer in place of the interpreted versions.
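The first two steps can be sketched roughly as follows, in C. This assumes a made-up shader DSL whose procedures have already been parsed into postfix (RPN) tokens; `rpn_to_c`, `emit_shader_source` and the `double f(double x, double y)` signature are hypothetical names chosen for the example. It builds a parenthesised C expression and wraps it in a complete source file.

```c
#include <assert.h>
#include <stdio.h>
#include <string.h>

#define MAX_DEPTH 16
#define MAX_EXPR  256

/* Turn postfix tokens into a fully parenthesised C expression.
   Operand tokens are copied as-is; a real translator would validate them.
   Returns 0 on success, -1 on malformed input or overflow. */
static int rpn_to_c(const char *const *tok, size_t n, char *expr, size_t cap)
{
    char stack[MAX_DEPTH][MAX_EXPR];
    size_t depth = 0;

    for (size_t i = 0; i < n; i++) {
        const char *t = tok[i];
        int is_op = strlen(t) == 1 && strchr("+-*/", t[0]) != NULL;

        if (is_op) {
            char joined[MAX_EXPR];
            int len;
            if (depth < 2)
                return -1;      /* a binary operator needs two operands */
            len = snprintf(joined, sizeof joined, "(%s %s %s)",
                           stack[depth - 2], t, stack[depth - 1]);
            if (len < 0 || (size_t)len >= sizeof joined)
                return -1;
            depth -= 1;         /* two operands replaced by one result */
            strcpy(stack[depth - 1], joined);
        } else {
            if (depth == MAX_DEPTH || strlen(t) >= MAX_EXPR)
                return -1;
            strcpy(stack[depth++], t);
        }
    }
    if (depth != 1 || strlen(stack[0]) >= cap)
        return -1;
    strcpy(expr, stack[0]);
    return 0;
}

/* Write a complete C translation unit that defines one function. */
int emit_shader_source(const char *name, const char *const *tok, size_t n,
                       char *out, size_t cap)
{
    char expr[MAX_EXPR];
    int len;

    assert(name != NULL && out != NULL);
    if (rpn_to_c(tok, n, expr, sizeof expr) != 0)
        return -1;
    len = snprintf(out, cap,
                   "#include <math.h>\n"
                   "\n"
                   "double %s(double x, double y)\n"
                   "{\n"
                   "    return %s;\n"
                   "}\n",
                   name, expr);
    return (len < 0 || (size_t)len >= cap) ? -1 : 0;
}
```

The generated text would then be written to a file and built as a shared library by whatever C compiler is available (the exact command line depends on the toolchain). For the last step, the usual calls are `dlopen` and `dlsym` on POSIX systems, or `LoadLibrary` and `GetProcAddress` on Windows.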

Some notes:

In some environments this might be impossible: there may be no access to a C compiler, or a system policy may forbid you from loading your own DLL. So check before you try it.
Do not discard your interpreter. Keep it as the reference implementation of your language.

Tomek Szpakowicz 2009-11-28 14:36:41

"(dumb but it happens)". Seems a bit weird to have a C compiler but no dynamic linking. But not having a C compiler at all is pretty common, if you consider that most code doesn't run on PCs...

Steve Jessop 2009-12-01 17:15:04

@Steve: I guess I'll remove this comment. It was about a system policy restricting the right to use your own code (exes, DLLs etc.), not about the lack of a compiler. I know it happens. Anyway, if a user cannot load their own code (as opposed to code installed by an admin), a program incorporating a JIT needs to operate with somehow elevated privileges as well. In some environments you won't be able to execute a block of data as binary code (protection against buffer overflows etc.), so you'll need to load it as a shared library anyway.

Tomek Szpakowicz 2009-12-01 17:35:25

Yes, or depending on the system it might be the other way round: JITs can allocate executable memory with mid-level privileges, but it requires digital signing (or kernel-level privileges) to authorise a DLL to be loaded. I can only assume the reasoning is that anyone smart enough to write a JIT and demonstrate it working is smart enough not to write malicious code into memory and execute it. Whereas any fool can load a library, and hence shouldn't be allowed to ;-)

Steve Jessop 2009-12-01 18:13:59

@Steve: And anyone smart enough to create a nuclear bomb is also smart enough not to do this... Egh... Wait... Hmm... Damn it!

Tomek Szpakowicz 2009-12-01 18:36:16

Yeah, I'm not saying the reasoning is clinical, just that they must feel something along those lines - the risk/reward ratio for one is lower than the other.

Steve Jessop 2009-12-01 19:14:49

Designing a virtual machine with JIT
