Welcome, Guest. Please login or register.

Author Topic: What is the real power of Akiko chip in cd32 ?  (Read 52502 times)

Description:

0 Members and 1 Guest are viewing this topic.

Offline Piru

  • \' union select name,pwd--
  • Hero Member
  • *****
  • Join Date: Aug 2002
  • Posts: 6946
    • Show only replies by Piru
    • http://www.iki.fi/sintonen/
Re: What is the real power of Akiko chip in cd32 ?
« Reply #59 on: October 30, 2011, 10:02:20 PM »
Quote from: Karlos;665843
Not sure with DoomAttack, to be honest. I doubt it's rendering to the framebuffer directly, given the way Doom renders the scene columns at at a time. You'd at least want to use a buffer that could hold say four 8-bit pixel columns so that you could then transfer them as 32-bit words over the bus. That would be 800 bytes for a 200 pixel display. Or 1600, if you wanted to have a nice 16 column, cache-aligned wide buffer :)
Ah the cache size vs resolutions were discussed in the PPC ADoom port readme:
http://www.aminet.net/game/shoot/ADoomPPC.readme

Dunno where that overflow comes from though and how you'd go on calculating fast/slow resolutions for various 68k.
 

Offline Piru

  • \' union select name,pwd--
  • Hero Member
  • *****
  • Join Date: Aug 2002
  • Posts: 6946
    • Show only replies by Piru
    • http://www.iki.fi/sintonen/
Re: What is the real power of Akiko chip in cd32 ?
« Reply #60 on: October 30, 2011, 10:05:06 PM »
Quote from: Karlos;665846
How is 6888x extended precision floating point performance, for example? IIRC, the PPC only supports 64-bit IEEE 754, so any accurate emulation is probably going to end up hampered trying to find interesting ways of handling the extra precision. Or, is it (as I suspect) the case that only extended precision read/write are really emulated and any extended precision operations are all carried out at 64-bit instead?

You're right, for performance reasons all FPU calculations are performed by using the native FPU datatypes. This means that the FPU does give slightly different results under emulation. It's fast though.
 

Offline Karlos

  • Sockologist
  • Global Moderator
  • Hero Member
  • *****
  • Join Date: Nov 2002
  • Posts: 16878
  • Country: gb
  • Thanked: 5 times
    • Show only replies by Karlos
Re: What is the real power of Akiko chip in cd32 ?
« Reply #61 on: October 30, 2011, 10:17:23 PM »
Quote from: Piru;665851
You're right, for performance reasons all FPU calculations are performed by using the native FPU datatypes. This means that the FPU does give slightly different results under emulation. It's fast though.

Which is fair enough. Other than a few esoteric fractal generators, did anything use extended precision?

I know there are methods of doubling up (if you'll pardon the pun) and using two floats to represent a number with greater precision, though I can't recall the specifics at the moment.

-edit-

Yep, I remember now, it was for GPGPU stuff where real double precision may not be supported (or, it might be too slow):

http://forums.nvidia.com/index.php?showtopic=73067
« Last Edit: October 30, 2011, 10:35:01 PM by Karlos »
int p; // A
 

Offline Karlos

  • Sockologist
  • Global Moderator
  • Hero Member
  • *****
  • Join Date: Nov 2002
  • Posts: 16878
  • Country: gb
  • Thanked: 5 times
    • Show only replies by Karlos
Re: What is the real power of Akiko chip in cd32 ?
« Reply #62 on: October 30, 2011, 10:38:47 PM »
So, getting back on topic: Akiko is great if you need to do C2P on a CPU as weak as the one the CD32 ships with. Unfortunately, that also tends to imply that you are restricted to very simple engines since the moment you've got a 50MHz 030 expansion in there, it might be faster just to use CPU C2P...
int p; // A
 

Offline matthey

  • Hero Member
  • *****
  • Join Date: Aug 2007
  • Posts: 1294
    • Show only replies by matthey
Re: What is the real power of Akiko chip in cd32 ?
« Reply #63 on: October 30, 2011, 11:24:49 PM »
Quote from: Karlos;665852
Which is fair enough. Other than a few esoteric fractal generators, did anything use extended precision?

Yes. All fp values are extended to extended precision in registers and all calculations use extended precision on the 68k FPU unless specified to use less precision. More precision means less chance of rounding and cumulative errors and there is no speed difference so it's generally best to keep the extra precision and round the value to the needed precision before storing or outputting. Even the 68060 which traps extended precision immediate floats still does full extended precision fpu calculations and register updates. Very few programs rely on the extra precision of the floating point.

I have heard that the CD32 patched graphics.library/WriteChunkyPixels() to use the Akiko so it might have transparently sped up more programs than Wing Commander while not making them incompatible to non CD32 Amigas. The docs for this function mention installing and using HW chunky to planar although it's a bit cryptic.
« Last Edit: October 31, 2011, 12:34:54 AM by matthey »
 

Offline Cammy

Re: What is the real power of Akiko chip in cd32 ?
« Reply #64 on: October 31, 2011, 01:39:45 AM »
I'm really sorry, everyone! I checked the archive on the FTP and it was incomplete, I guess it never finished uploading the first time. I have re-uploaded it and checked that it's the right size now, so if anyone would like to try and re-download it, it should work fine now.

I'm very sorry for linking to a bad archive.
A1200 030@28Mhz/2MB+32MB/RTC/KS3.1/IDE-CF+4GB/4-Way Clockport Expander/IndivisionAGA/PCMCIA NIC
A1200 020@14Mhz/2MB+8MB/FPU/RTC/KS3.0/IDE-CF+2GB/S-Video
CD32 020@14Mhz/2MB+8MB/RTC/KS3.1/IDE-CF+4GB
A600 030@30Mhz/2MB+64MB/RTC/IDE-CF+4GB/Subway USB/S-Video/PCMCIA NIC/USB Numeric Keypad+Hub+Mouse+Control Pad
A500 000@7Mhz/512kB+512kB/ROM Switcher/KS3.1+1.3/S-Video

Get AmigaOS
 

Offline bbond007

  • Hero Member
  • *****
  • Join Date: Mar 2009
  • Posts: 1517
    • Show only replies by bbond007
Re: What is the real power of Akiko chip in cd32 ?
« Reply #65 on: October 31, 2011, 05:06:04 AM »
Quote from: Cammy;665864
I'm really sorry, everyone! I checked the archive on the FTP and it was incomplete, I guess it never finished uploading the first time. I have re-uploaded it and checked that it's the right size now, so if anyone would like to try and re-download it, it should work fine now.

I'm very sorry for linking to a bad archive.


Cammy,

I tried to download it again and this time I got "unexpected end of archive". Maybe it just still uploading? I'll try again later...

thanks

_nate
 

Offline Cammy

Re: What is the real power of Akiko chip in cd32 ?
« Reply #66 on: October 31, 2011, 07:29:48 AM »
Please try again, I've re-uploaded it from my Amiga just to be sure, and then re-downloaded and extracted it.
A1200 030@28Mhz/2MB+32MB/RTC/KS3.1/IDE-CF+4GB/4-Way Clockport Expander/IndivisionAGA/PCMCIA NIC
A1200 020@14Mhz/2MB+8MB/FPU/RTC/KS3.0/IDE-CF+2GB/S-Video
CD32 020@14Mhz/2MB+8MB/RTC/KS3.1/IDE-CF+4GB
A600 030@30Mhz/2MB+64MB/RTC/IDE-CF+4GB/Subway USB/S-Video/PCMCIA NIC/USB Numeric Keypad+Hub+Mouse+Control Pad
A500 000@7Mhz/512kB+512kB/ROM Switcher/KS3.1+1.3/S-Video

Get AmigaOS
 

Offline Karlos

  • Sockologist
  • Global Moderator
  • Hero Member
  • *****
  • Join Date: Nov 2002
  • Posts: 16878
  • Country: gb
  • Thanked: 5 times
    • Show only replies by Karlos
Re: What is the real power of Akiko chip in cd32 ?
« Reply #67 on: October 31, 2011, 08:18:55 AM »
Quote from: matthey;665859
Yes. All fp values are extended to extended precision in registers and all calculations use extended precision on the 68k FPU unless specified to use less precision.


I realise that, what I meant was, are there many applications which depend on being able to use the additional precision? I know some fractal generators used "long double" types for increased precision at high magnification, but were there any others?
int p; // A
 

Offline psxphill

Re: What is the real power of Akiko chip in cd32 ?
« Reply #68 on: October 31, 2011, 10:37:44 AM »
Quote from: matthey;665859
I have heard that the CD32 patched graphics.library/WriteChunkyPixels() to use the Akiko so it might have transparently sped up more programs than Wing Commander while not making them incompatible to non CD32 Amigas.

I was under the impression that commodore only documented using Akiko by calling WriteChunkyPixels.
 

Offline matthey

  • Hero Member
  • *****
  • Join Date: Aug 2007
  • Posts: 1294
    • Show only replies by matthey
Re: What is the real power of Akiko chip in cd32 ?
« Reply #69 on: October 31, 2011, 12:58:20 PM »
Quote from: psxphill;665888
I was under the impression that commodore only documented using Akiko by calling WriteChunkyPixels.


Didn't the CD32 version of WingCommander need patching to run on regular Amiga's though? It could have easily been other incompatibilities than using the Akiko directly. I wouldn't be too surprised to see a game use undocumented programming to give a speed increase. A more impressive game was a competitive advantage as long as it worked on the majority of hardware, in many game developers minds.
 

Offline Digiman

  • Hero Member
  • *****
  • Join Date: May 2010
  • Posts: 1045
    • Show only replies by Digiman
Re: What is the real power of Akiko chip in cd32 ?
« Reply #70 on: November 03, 2011, 08:23:26 PM »
Quote from: Karlos;665804
I think it's you that has rather missed the point. The 50% it gave for an 020 won't scale upwards linearly with faster systems. Akiko gave a 50% speed increase on slow systems where the time it takes to do C2P per frame is comparable to the time it takes to do everything else, that is to say, systems which are too slow to run the game at all. Going from 3FPS to 5FPS doesn't at all imply a 15FPS game will go to 22FPS.

For an engine of a given complexity, as the CPU gets faster, the portion time it takes to do C2P on the CPU becomes less and less of the overall time of execution time until you hit the magical copyspeed bottleneck. At that point, performing C2P takes no more processor time than simply copying data from Fast RAM to Chip RAM.

Some fast 030 C2P routines are already approaching copyspeed, once you start using 040, they get there. On these systems, using an Akiko would likely result in an even slower FPS count than a copyspeed software C2P routine. The reason for that is the way Akiko works. You have to write your chunky data to it, then read back the planar result then write that to Chip RAM all using the CPU. All that copying back and forth takes longer than simply reading the source data from Fast RAM, transforming it then writing it to Chip RAM.

At copyspeed for AGA, even an infinitely fast CPU/Fast RAM combination (where  the Chip RAM write speed of ~7MB/s becomes the only rate limiting factor) would be limited to around 90fps at 320x256 for 256 colours.

Doom uses a fairly simple (and efficient) 2D BSP and optimised column and span texturing routines for walls and floors and a sprite routine for objects, each of which has been pretty well-tuned. For more complex game engines, the time taken to render the frame becomes dominant and so C2P time becomes even less important. Take Quake for example. Here you have a full 3D BSP and a much more general-case textured/shaded/depth-buffered polygon rendering system. All much more CPU work than Doom. On 68K, eliminating the C2P all together (i.e. using an RTG card) only gives a comparatively small increase in fps over AGA. For 603/604 PPC systems, you see a bigger improvement because they are able to render the frame significantly faster than the 040/060 can and the whole argument about time spent rendering versus time spent C2P/copying becomes relevant again.


Simple point is nobody bought 040s to play games 68030 was a minimal improvement mhz per mhz over 020 ergo A4000/030 25mhz was slow and dead end for games.

My point is spot on thanks.  So getting back on topic in relevance to A1200 and CD32 (ie 1992 and 1993) chunky screen gaming, and accepting the known fact that A1200 had to compete out of the box for less than half the price of a 386DX for £899 with monitor blah blah...... if you wanted Wolfenstein style games having Akiko in A1200 rather than not would have made a difference of 50%, and even if Commodore had the brains to use a cheaper 28mhz 020 in A4000 (which is same speed as 25mhz 030 really) OR eventually launched the A1200 28mhz 020 pizza box successor it still makes a difference at the bottom end.

We are talking games and Amigas for games use are stock A1200 + 2mb Fast Ram at best, so yes Akiko would have been usefull (as would have IDE connector and Fast Ram SIMM slot on CD32 motherboard).

If anyone had programmed 030 games worth a sh1t then I might be swayed but fact is 90% of AGA games are badly programmed mildly breathed over crap.

Or Akiko on A1200 AND proper programming would have made games on Amiga 1200 for 99% of people expecting to play better games than a 1985 A1000 a bit less 'old' looking in 93/94.

I fail to see how £3000 A4000 with 040 50mhz and 16mb Ram performs playing  Doom engines compared to £1500 PC running them the same speed is remotely relevant and yet more thread dilution and off topic dreaming. WHO CARES? It's like saying a Mercedes 6L V12 pissed all over a Ford Fiesta 1.6L straight 4 engine....true but bollox that is not helpful :)
 

Offline ChaosLord

  • Hero Member
  • *****
  • Join Date: Nov 2003
  • Posts: 2608
    • Show only replies by ChaosLord
    • http://totalchaoseng.dbv.pl/news.php
Re: What is the real power of Akiko chip in cd32 ?
« Reply #71 on: November 03, 2011, 09:00:33 PM »
Quote from: Digiman;666422
Simple point is nobody bought 040s to play games
I did.
Wanna try a wonderfull strategy game with lots of handdrawn anims,
Magic Spells and Monsters, Incredible playability and lastability,
English speech, etc. Total Chaos AGA
 

Offline psxphill

Re: What is the real power of Akiko chip in cd32 ?
« Reply #72 on: November 04, 2011, 12:17:32 AM »
Quote from: Digiman;666422
SWe are talking games and Amigas for games use are stock A1200 + 2mb Fast Ram at best, so yes Akiko would have been usefull (as would have IDE connector and Fast Ram SIMM slot on CD32 motherboard).

akiko was too little too late. AGA had trouble competing with the SNES and it's mode 7 graphics.
 
The advantage of the Amiga was always the power of custom chips instead of using brute processing power.
 
The blitter in AGA was basically the same as the one they wire wrapped in the early 80's, that was their mistake. They could even have lived with bitplanes, because it's only cpu rendering which makes that beneficial.
 

Offline NovaCoder

Re: What is the real power of Akiko chip in cd32 ?
« Reply #73 on: November 04, 2011, 02:28:56 AM »
Akiko wasn't really used by any commercial games so it doesn't really matter how powerful it was ;)

A fast 030 can almost do C2P as fast as the bus can accept the data coming from the accelerator so the performance hit of having to do chunky-to-planar each frame is not really a big deal.  

All of my recent Amiga coding has been with chunky graphics and C2P so I basically write code without having to worry about how the graphics are actually being rendered.

An interesting note, with my recent port of ScummVM over to AGA (which uses the chipset for double buffering, all of the graphics are generated as chunky screen in FASTRAM) I've found that most speed limitations are not with the AGA chipset, C2P or even the slow bus....it's normally my 030 which is too slow to generate the chunky screen in time.   This is true up to 320x200 8bit, for SuperVGA (640x480 8bit) the AGA chipset/bus speed/C2P lets the side down and you really need to use RTG ;)
« Last Edit: November 04, 2011, 03:06:32 AM by NovaCoder »
Life begins at 100 MIPS!


Nice Ports on AmiNet!
 

Offline Karlos

  • Sockologist
  • Global Moderator
  • Hero Member
  • *****
  • Join Date: Nov 2002
  • Posts: 16878
  • Country: gb
  • Thanked: 5 times
    • Show only replies by Karlos
Re: What is the real power of Akiko chip in cd32 ?
« Reply #74 from previous page: November 04, 2011, 09:56:10 AM »
Quote from: Digiman;666422
Simple point is nobody bought 040s to play games 68030 was a minimal improvement mhz per mhz over 020 ergo A4000/030 25mhz was slow and dead end for games.


Speak for yourself. I got my first 040 not long after the AB3D2 demo first appeared ;)


Quote

My point is spot on thanks.  So getting back on topic in relevance to A1200 and CD32 (ie 1992 and 1993) chunky screen gaming, and accepting the known fact that A1200 had to compete out of the box for less than half the price of a 386DX for £899 with monitor blah blah......


Sorry, but no. Perhaps if the Akiko were implemented in a somewhat better fashion it would be useful, but it wasn't.

Let's imagine we have a 020+FastRAM+Akiko.

Having to first calculate 32 pixels of worth of data in fast ram (no data cache on 020), then copy it as 8 longwords to the Akiko. Then read it back again as 8 long words, then write each of those 8 longword to different addresses (offset into each bitplane) in your chip RAM. Rinse and repeat until the entire buffer has been converted.

Had the Akiko been implemented in such a fashion that you fed it your 8 bitplane addresses and then simply wrote 8 longwords of chunky pixels to it at a time and it in turn wrote them to ChipRAM incrementing the address pointers as it went, then it wouldn't have sucked.
int p; // A