Let's make some things clear:
1. In every test emulation setup time calculated into the results. It takes 1.5 secs for a simple "RTS" program. (Later this time will be gone, eg. buliding jump tables take a lot of time, but it has to do only once.) So, I could decrease the running times with this value, but I want to be as correct as possible, and that vaules wouldn't be the ones what I actually measured.
2. Emulation is highly clock-speed dependant. On a 604/233 system results were a lot better, than on 604/180 actually is. (I could have explanation for this, but it is not really interesting, rather technical.) I had just no opportunity of getting such system right now. So, measuring the speed on a higher clocked system WILL imply better results.
3. The tests are sort. This is true, but on some system these tests take AGES to run. At the beginning I had a slower machine, and the emulation was slower too, that is why I chose these tests. Now everything run better, but I don't want to change the tests, because of these are the base of comparsion of the recent results.
4. Emulation is beta, not finished yet. AOS4 is beta not finished yet. I have ideas for improvements, but right now I am about stabilization and integration.
(BTW, I don't know what is wrong with julia test, it is running just fine on AmigaOS3.x, AmigaOS4 and UAE. Except MorphOS. Where is the fault then? :-D Just a joke, don't take too serious...)