hi, when converting from unicode to utf8, the utf8 representation can be quite a bit longer than the unicode text especially when double byte is involved.
So for example #30011;#20687;#26908;#32034; converts to 12 utf8 characters ????.
Has anyone written a formula or found a way to reduce the length of the produced utf8 characters so they can fit into existsing tables without having to increase table columns ?
Umm, very difficult; if you talk about numbers or IP numbers you can save some bytes using a base 64 to 128 encoding, but when talking about text I don't know.