BTW2: I studied some CryptoNessi block cipher proposals and must
admit that I was a bit disappointed when I found out that the
so-called legacy block cipher Khazad is about half as fast as AES.
It uses an 8x8 MDS matrix as linear mixing layer as well as you do
in your F function. I think that's one of the reasons why it's so
slow compared to AES or Anubis. (At least in software is)
[...]
What about the key schedule ?
A paper would be cool. I don't like reading other ppl's
C code