Twitter updates
- @exileous Download the CUDA 5 toolkit and take a look at the PTX 3.1 documentation :) 20 hours ago
Development
Graphics
Calendar
May 2012 M T W T F S S « Mar 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31
Tag Archives: optimisation
War story: Cache if it you can
The first post in what I hope becomes a series of war stories, right from the trenches. If you have some piece of code which pops up at the top of your profiler output, and you’re about to show off … Continue reading
Compiler magic
Just seen this: Inside template heavy code, Visual C++ 8 was able to unroll a loop and add a prefetchnta call to prefetch data – first time I’ve seen this with VC++ 8. Well done, folks! [Update:] This seems even … Continue reading
Cache-aware programming
I’ve been working today on a project, and after the first implementation session I ran it through a profiler to see whether I had some obvious performance bottlenecks. Turned out not to be the case, but looking through the code, … Continue reading
AMD releases “Barcelona” software optimization guide
AMD just released the software optimization guide for their K10 (“Barcelona”) CPU line. You can get it from here. Seems the SSE units are really 128 bit wide (as expected). They improved the instruction fetching to fill the wider execution … Continue reading