This is the mail archive of the cygwin-developers mailing list for the Cygwin project.



Re: default charset for implicit locale specification


I implemented that locally.  However...

On Jan 20 11:07, Corinna Vinschen wrote:
>   874	ANSI/Thai		-> CP874 (== ISO-IR-166 used on Linux)
>   932	SJIS			-> SJIS

This should probably be

    932 SJIS			-> EUCJP

>   936	GB2312			-> GBK
>   949	ANSI/Korean		-> EUCKR
>   950	Big-5			-> Big-5
>  1250	ANSI/Central European	-> ISO-8859-2
>  1251	ANSI/Cyrillic		-> ISO-8859-5
>  1252	ANSI/Latin 1		-> ISO-8859-1
>  1253	ANSI/Greek		-> ISO-8859-7
>  1254	ANSI/Turkish		-> ISO-8859-9
>  1255	ANSI/Hebrew		-> ISO-8859-8
>  1256	ANSI/Arabic		-> ISO-8859-6
>  1257	ANSI/Baltic		-> ISO-8859-4
>  1258	ANSI/Vietnamese		-> UTF-8
> 65001	UTF-8			-> UTF-8
> 
> Is that a valid transition?
> 
> What's missing is a transition to ISO-8859-15 for languages with the
> EUR currency letter.  I assume that's by adding the @euro modifier?
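
For illustration only, the proposed table can be written as a plain lookup.
This is a minimal sketch in Python, not the Cygwin code: it covers just the
rows quoted in this mail, uses the EUCJP amendment for code page 932, and
the name default_charset_for_codepage and the None fallback for code pages
outside the table are assumptions.

```python
# Proposed defaults from the quoted table: Windows ANSI code page -> charset.
# Entry 932 uses the EUCJP amendment suggested in this mail.
CODEPAGE_TO_CHARSET = {
    874: "CP874",
    932: "EUCJP",
    936: "GBK",
    949: "EUCKR",
    950: "Big-5",
    1250: "ISO-8859-2",
    1251: "ISO-8859-5",
    1252: "ISO-8859-1",
    1253: "ISO-8859-7",
    1254: "ISO-8859-9",
    1255: "ISO-8859-8",
    1256: "ISO-8859-6",
    1257: "ISO-8859-4",
    1258: "UTF-8",
    65001: "UTF-8",
}


def default_charset_for_codepage(codepage):
    """Return the proposed charset for a code page.

    Returns None for code pages that are not in the table; that fallback
    is a hypothetical choice, the mail does not define one.
    """
    return CODEPAGE_TO_CHARSET.get(codepage)
```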

I also noticed that on Linux two-letter settings like "de" or "ja" do not
change the charset from ASCII to something else.  Only fully qualified
language_TERRITORY settings do that:

  de:		ANSI_X3.4-1968
  de_DE:	ISO-8859-1
  de_DE@euro:	ISO-8859-15

  ja:		ANSI_X3.4-1968
  ja_JP:	EUC-JP

Is that something we should do as well?  It's very simple to implement.
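
The rule observed on Linux could look roughly like the following. This is a
minimal sketch in Python, not the Cygwin implementation: the function name
implicit_charset is hypothetical, the territory table holds only the two
examples from this mail, and both the ASCII fallback for unknown locales and
applying @euro to any language_TERRITORY setting are assumptions.

```python
ASCII = "ANSI_X3.4-1968"

# Only the two examples given in this mail; a real table would be larger.
TERRITORY_DEFAULTS = {
    "de_DE": "ISO-8859-1",
    "ja_JP": "EUC-JP",
}


def implicit_charset(locale):
    """Pick a charset for a locale string that names no charset itself."""
    # Split off an optional modifier such as "@euro".
    base, _, modifier = locale.partition("@")

    # Two-letter settings like "de" or "ja" stay on ASCII.
    if "_" not in base:
        return ASCII

    # Assumption: @euro selects ISO-8859-15 for any language_TERRITORY.
    if modifier == "euro":
        return "ISO-8859-15"

    # Assumption: locales missing from the table fall back to ASCII.
    return TERRITORY_DEFAULTS.get(base, ASCII)
```

With these assumptions, implicit_charset("de") gives ANSI_X3.4-1968 and
implicit_charset("de_DE@euro") gives ISO-8859-15, matching the list above.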


Corinna

-- 
Corinna Vinschen                  Please, send mails regarding Cygwin to
Cygwin Project Co-Leader          cygwin AT cygwin DOT com
Red Hat

