I’ve been having to use a lot of utf8 recently, and being old school I still use the command line a lot.
One project has been importing a lot of international data into a mysql database.
The database is in utf8, but when I used the command line, non latin1 data was coming back corrupted.
It turned out that the command line doesn’t automatically detect the character set, so it was printing as if it was latin1. There is a flag that can be passed in called –default-character-set and this can be set to utf8. Once set, utf8 data is correctly displayed on my terminal.
mysql --default-character-set=utf8 testdatabase
This is also useful when piping in utf8 data.