>>31
It runs just fine. Only about 3 times as slow as the C version.
It processes about 35MB of binary input data in about 10 seconds, generating about 90MB of output metadata. This is the entirety of my dataset, which I'd say is perfectly fine.
Originally the code was slow and it took about a minute to do the entire thing, but after I profiled my code I managed to locate the bottlenecks and optimize them away. In case you didn't know, SBCL and ClozureCL both generate native code. I even added some inline assembly for SBCL/x86 which improves some float-related's operation's performance by quite a good deal. A slightly slower, but portable version is provided for the rest of the implementations (the version is chosen by using the conditional reader macros).
All in all, I'm perfectly happy with this and wouldn't want it anyway else. Had I written this in C, I can see myself cutting corners and maybe even making it slower as algorithmic optimizations are a lot more costlier in C (you have to rewrite a lot more code if you want to make major changes), while here I barely need to rewrite anything, the macros generate a lot of my code, so I only need to make a few changes to the code generators.