Author Topic: CopyMem Quick & Small released! (Read 14245 times)

guest11527 · « **Reply #59 on:** January 05, 2015, 03:33:51 PM »

Quote from: Georg;781067

Maybe smart if this works on a "per cliprect" basis, but does it?

Actually, it is a single buffer. Manually clipping the text before rendering it to screen would complicate matters a lot. Clipping is done in BltTemplate() of the graphics library once rendering is complete.

Quote from: Georg;781067

Quote from: Georg;781067
Otherwise for things like text output in hidden simple refresh windows (like output in a shell window while compiling something, with the source code text editor in the front hiding all or most of it) it can do a lot of unnecessary work in the off-screen buffers.
I wouldn't be so sure. Look, you have to clip at some point. You can either clip while rendering the glyphs (which is what 1.3 did) or clip only once. Given that the complexity of the clipping is pretty high compared to rendering the text itself, it is probably better to "do the additional work" because it results in a much simpler algorithm. I believe the right approach is to optimize for the *typical* case, and the typical case is that the window you render text to is front-most, thus no clipping done.
Quote from: Georg;781067
Similar for long text strings where big parts may ends up being clipped away. Like maybe in a listview gadget.

Actually, the typical ASL/Reqtools requester isn't *that* stupid. I don't know how MUI works, but the system requesters only render those lines that are actually visible on the screen and not those that are clipped away completely.

guest11527 · « **Reply #60 on:** January 05, 2015, 03:43:43 PM »

Quote from: Thorham;781066

No, it's not, hence the reason FBlit+FText makes a real difference.

How much, and is that due to FBlit? How does that work on graphics cards?

Quote from: Thorham;781066

Which is slow, because you get additional memory accesses. Far better to do everything in registers, write to chipmem and be able to use the CPU pipeline on 68020+.

Well, there isn't really much chance to avoid memory accesses. You can probably get away rendering in fast ram for graphics cards in first place and then copy directly to the screen, but in one way or another, you need to fiddle all the bits in the right places to begin with, and there isn't much to be optimized *unless* you restrict yourself to some "nice" font sizes. Optimizing topaz.8 is pretty easy and you can double the speed of the Os, but that's really the exception.

Quote from: Thorham;781066

You can write a properly optimized font renderer for any normal text editor font size. You can also take syntax coloring in account and not write all bit planes for each character.

Actually, all this bit-plane handling is pretty much obsolete in first place (I mean, custom-chip graphics), but leave this as it is: Rendering only a single bitplane is pretty dangerous for an Os function because it cannot know what else is on the screen. For the program, it may be possible (I believe ViNCed even does that, but my memory is fading) - but you don't need a new Os function for that, or need to write your own renderer. You can just set the rastport flags.

Thorham · « **Reply #61 on:** January 05, 2015, 04:45:44 PM »