Slightly better than >>7, but still nowhere near OP's (the fastest but largest portable one, on x86 at least) or the asm versions.
Strangely enough, xlatb isn't faster than OP's on x86, even though it's a dedicated table-lookup instruction (and was kept in 64-bit mode, for some reason).
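
For anyone who hasn't met xlatb: it replaces AL with the byte at [RBX + AL] (AL zero-extended), i.e. a single 256-entry byte table lookup. A rough portable model of that, just to show what the instruction computes; `xlat_model` is a made-up name, not anything from the code posted in this thread:

```c
#include <stdint.h>

/* Portable model of xlatb: AL = byte at [RBX + zero-extended AL].
   `table` stands in for the base address in RBX, `al` for the AL register.
   xlat_model is a hypothetical helper name used only for illustration. */
static inline uint8_t xlat_model(const uint8_t table[256], uint8_t al)
{
    return table[al];
}
```

As far as I know, compilers turn a lookup like this into an ordinary indexed byte load rather than xlatb, so the portable version is a fair stand-in for comparing against it.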