Hello,
I'm running eggdrop 1.8, patched to include UTF-8 support. My system LANG is set to UTF-8. My chat client supports UTF-8. The server I'm connecting to supports UTF-8. I have no troubles at all sending any sort of valid UTF-8 character to the channel my bot is connected to.
I have a simple script that connects to Microsoft translate and returns the translated result. It is working for many languages already. However when translating from Russian, Japanese, Chinese and possibly others it encounters a problem.
in my proc I experimented with "putlog $text" immediately at the beginning of the proc, where $text is everything the bot thinks the user has entered for that binding.
For example:
User enters: "!trans ru|en очень хорошо"
logfile shows: "ru|en >G5=L E>@>H>"
Naturally this does not work out to be the correct translation.
If I urlencode очень хорошо to %D0%BE%D1%87%D0%B5%D0%BD%D1%8C%20%D1%85%D0%BE%D1%80%D0%BE%D1%88%D0%BE and send it to the translator I get the expected result of "very well".
For some reason some UTF-8 characters work for example:
User enters: "!trans fr|en très bien"
logfile shows: "fr|en tr▒s bien"
However the character is correctly percent-encoded to %C8%A8 in that case and I get the expected result of "very well".
Thanks for any insight.