This is the mail archive of the cygwin mailing list for the Cygwin project.



Re: Non-trivial Cyrillic question


Alexander Shopov <ash@contact.bg> writes:

> However when I pipe it to less
>
> cat myfile | less
>
> it gets broken
> I get things like: <CF><D0><C5><CF><C8><D1><CA><C0>

You should tell less which characters are printable by setting the
LESSCHARSET or LESSCHARDEF environment variable (see man less for
details).
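
For example, something along these lines might work. It is only a
sketch: I am assuming the file is cp1251-encoded and that the terminal
font can actually draw the characters. The LESSCHARDEF string is my own
choice for illustration, not a value taken from your setup.

```shell
# LESSCHARDEF describes each byte value in order:
#   .  = normal (printable), c = control, b = binary;
#   a number repeats the letter that follows it.
# Layout used here (an assumption, adjust to taste):
#   bytes 0-31    mostly binary, with BS/TAB/LF/FF/CR as control
#   bytes 32-126  printable
#   byte  127     binary
#   bytes 128-255 printable (this is the part that matters for Cyrillic)
export LESSCHARDEF='8bcccbcc18b95.b128.'

# Then page the file as before, e.g.:
#   cat myfile | less
```

With the variable exported, less should show bytes 128-255 as they are
instead of the <CF><D0>... notation.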

> ls *
>
> it is displayed like
> ????.txt

ls uses the current locale to determine which characters are
printable. AFAIK Cygwin isn't locale-aware (I may be mistaken); a
possible workaround is the --show-control-chars switch, which makes ls
print all characters literally.
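
A minimal sketch of the workaround, assuming GNU ls (the alias is just
a hypothetical convenience, not something Cygwin sets up for you):

```shell
# Print file names byte-for-byte instead of replacing
# nonprintable characters with '?'.
ls --show-control-chars

# Optional: make the switch the default for interactive use.
alias ls='ls --show-control-chars'
```

Whether the names then look right still depends on the console font
and code page.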

BTW, is cp1251 the only 8-bit encoding used by the Bulgarian locale in MS
Windows? In Russia two encodings are used: cp866 as OEM (used, e.g.,
in the console) and cp1251 as ANSI.
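
To see why the distinction matters, the bytes from the less output
quoted above can be decoded both ways. This is only a sketch: it
assumes an iconv that knows the names CP1251 and CP866, and a
UTF-8-capable terminal to view the result; the variable names are made
up for the example.

```shell
# The eight bytes CF D0 C5 CF C8 D1 CA C0, written as octal escapes
# so that printf handles them portably.
bytes='\317\320\305\317\310\321\312\300'

# Decoded as cp1251 (ANSI) these are Cyrillic capital letters.
as_ansi=$(printf "$bytes" | iconv -f CP1251 -t UTF-8)

# Decoded as cp866 (OEM) the same bytes map to different characters.
as_oem=$(printf "$bytes" | iconv -f CP866 -t UTF-8)

printf 'cp1251: %s\ncp866:  %s\n' "$as_ansi" "$as_oem"
```

If the console uses one code page and the file the other, the text
comes out as the wrong characters even when every byte is printable.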

-- 
 							With regards, Roman.



