![]() | |
![]() |
| | Thread Tools | Display Modes |
#1
| ||||
| ||||
|
|
I have looked into my Linux box and found this in /usr/share/i18n/charmaps/BIG5.gz: % Chinese charmap for BIG5 (CP950) % version: 0.92 % Contact: Tung-Han Hsieh <thhsieh (AT) linux (DOT) org.tw % Yuan-Chung Cheng <platin (AT) ms31 (DOT) hinet.net % Distribution and use is free, even for comercial purpose. % % This charmap is converted from: % ftp://ftp.unicode.org/Public/MAPPING...DOWS/CP950.TXT % ... There "my" characters are in. That's a Microsoft's definition, not a standard. I think there should be a reason why the Unicode org. does not use it. |
|
Don't you agree that it is strange that I can (for EUC_TW) copy "to" file without error but I can not copy "from" file without error? I'm not quite sure what you are saying. Are you complaining that (for example) 0xe7a281 in UTF-8 does not convert to EUC_TW? |
|
BTW, what do you think about below? FYI, CNS 11643-1993 is the standard character set and EUC_TW is the one of the encodings. That means your problem below will disappear. |
|
WARNING: copy: line 2, LocalToUtf: could not convert (0x8ea3cfd0) EUC_TW to UTF-8. Ignored WARNING: copy: line 3, LocalToUtf: could not convert (0x8ea3c4ce) EUC_TW to UTF-8. Ignored WARNING: copy: line 4, LocalToUtf: could not convert (0x8ea3bdfe) EUC_TW to UTF-8. Ignored Hum. These seem to be CNS 11643-1993, plane 3. Currently PostgreSQL supports only: CNS 11643-1993, plane 0 CNS 11643-1993, plane 1 CNS 11643-1993, plane 2 CNS 11643-1993, plane 15 Would you like to have support for rest of CNS 11643-1993 planes: CNS 11643-1993, plane 3 CNS 11643-1993, plane 4 CNS 11643-1993, plane 5 CNS 11643-1993, plane 6 CNS 11643-1993, plane 7 support for upcoming 7.4? -- Tatsuo Ishii |
![]() |
| Thread Tools | |
| Display Modes | |
| |