A while ago, I downloaded the text version of the Dialects of China project,
which derives most of it's data from Hanyu Fangyin Zihui, a list of readings
of 2700 characters across almost 20 dialects. I've converted the file and
zipped it up. Unzipped it is 5105 kilobytes, zipped it is 460 kb. The
original file docmas9.txt was over 55000 lines long. A bit of programming
went a long way.
http://www.dylanwhs.ukgateway.net/download/doc.htm
Also needs Big5 fonts, plus a font to display IPA characters like "lucida
sans unicode".
Cheers,
Dyl.
Nice work.