This is the mail archive of the
cygwin-developers
mailing list for the Cygwin project.
Re: default charset for imlicit locale specificatio
I implemented that locally. However...
On Jan 20 11:07, Corinna Vinschen wrote:
> 874 ANSI/Thai -> CP874 (== ISO-IR-166 used on Linux)
> 932 SJIS -> SJIS
This should probably better be
932 SJIS -> EUCJP
> 936 GB2312 -> GBK
> 949 ANSI/Korean -> EUCKR
> 950 Big-5 -> Big-5
> 1250 ANSI/Central European -> ISO-8859-2
> 1251 ANSI/Cyrillic -> ISO-8859-5
> 1252 ANSI/Latin 1 -> ISO-8859-1
> 1253 ANSI/Greek -> ISO-8859-7
> 1254 ANSI/Turkish -> ISO-8859-9
> 1255 ANSI/Hebrew -> ISO-8859-8
> 1256 ANSI/Arabic -> ISO-8859-6
> 1257 ANSI/Baltic -> ISO-8859-4
> 1258 ANSI/Vietnamese -> UTF-8
> 65001 UTF-8 -> UTF-8
>
> Is that a valid transition?
>
> What's missing is a transition to ISO-8859-15 for languages with the
> EUR currency letter. I assume that's by adding the @euro modifier?
I also noticed that on Linux two-letter settings like "de" or "ja" do not
change the charset from ASCII to something else. Only full-qualified
language_TERRITORY settings do that:
de: ANSI_X3.4-1968
de_DE: ISO-8859-1
de_DE@euro ISO-8859-15
ja: ANSI_X3.4-1968
ja_JP: EUC-JP
Is that something we should do as well? It's very simple to implement.
Corinna
--
Corinna Vinschen Please, send mails regarding Cygwin to
Cygwin Project Co-Leader cygwin AT cygwin DOT com
Red Hat