From 97% to 66% CPU usage with very little effort

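Here are the flags that make GHC generate Really Fast Code (TM). They send compilation through GCC (-fvia-C) and pass it the -optc- options, so floating-point math uses SSE2 and the output is tuned for a Pentium M; change -march to match your own CPU: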

-fexcess-precision -fvia-C -optc-O2 -optc-mfpmath=sse -optc-msse2 -optc-march=pentium-m
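
For anyone who wants to try these flags, here is a minimal sketch of the kind of floating-point-heavy loop they are aimed at. The file name Mix.hs, the functions sample and energy, and the 440 Hz / 44.1 kHz values are all made up for illustration; this is not code from my project. The compile line in the comment assumes a GHC old enough to still have the via-C backend. As far as I know, newer releases dropped it, so the flag is ignored or rejected there.

```haskell
-- Mix.hs (hypothetical file name, for illustration only)
--
-- Compile with the flags from this post, assuming a GHC that still
-- supports -fvia-C:
--   ghc -fexcess-precision -fvia-C -optc-O2 -optc-mfpmath=sse \
--       -optc-msse2 -optc-march=pentium-m --make Mix.hs
module Main (main) where

import Data.List (foldl')

-- One sample of a 440 Hz sine wave at a 44.1 kHz sample rate
-- (illustrative values, not taken from the original project).
sample :: Int -> Double
sample n = sin (2 * pi * 440 * fromIntegral n / 44100)

-- Sum of squared samples; the strict left fold keeps the
-- accumulator evaluated instead of building up thunks.
energy :: Int -> Double
energy count = foldl' (\acc n -> acc + sample n * sample n) 0 [0 .. count - 1]

main :: IO ()
main = print (energy 1000000)
```

Build it once with and once without the flags and compare CPU usage or run time. How much you gain depends on the program and the machine.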

Now I can add some more/better audio stuff to d01234...