3. The tests are sort. This is true, but on some system these tests take AGES to run. At the beginning I had a slower machine, and the emulation was slower too, that is why I chose these tests. Now everything run better, but I don't want to change the tests, because of these are the base of comparsion of the recent results.
Running some real apps could be more fair in the future... Also making tests on OS4 without JIT could be interesting. To see how much PPC native OS can speed up 68k apps in the interpretive mode etc.