This is the mail archive of the cygwin@cygwin.com mailing list for the Cygwin project.



Re: Unicode files


At 21:18 2003-01-12, Clancy Malcolm wrote:
>Can cygwin programs like grep process a unicode file?
>
>I have a Windows 2000 backup log file which seems to be a unicode file.
>When I cat the file under cygwin it displays with spaces between every
>second character: e.g.
>
>ÿ_B a c k u p   S t a t u s
> O p e r a t i o n :   B a c k u p
> A c t i v e   b a c k u p   d e s t i n a t i o n :   4 m m   D D S
> M e d i a   n a m e :   " T S C 4 - 2 0 0 3 - 0 1 - 1 3 - 2 - 3 5 p "
>
>When I cat it under linux it displays as a normal text file.  I have a
>script which greps for particular patterns and emails matching lines, but
>it won't work with these log files.
>
>Any suggestions greatly appreciated.

For the ntbtlog.txt file I found this worked as a conversion tool for me.

	tr -d '\0\r' < InputFile > OutputFile

But that is the only unicode file I've had to deal with.
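
The tr approach works when the file is UTF-16 little-endian and holds only
ASCII-range characters: each character is then stored as its ASCII byte
followed by a NUL byte, so deleting the NULs (and the carriage returns)
leaves plain text. The two byte-order-mark bytes at the start of the file
are not removed by tr, and any non-ASCII characters would come out damaged.
A rough sketch of both approaches follows, assuming iconv is installed; the
file names and the sample content are made up for illustration and are not
taken from the real log.

```shell
# Build a small UTF-16LE sample with a byte order mark and a CRLF line
# ending (hypothetical stand-in for the real backup log).
printf '\377\376B\000a\000c\000k\000u\000p\000\r\000\n\000' > sample_utf16.log

# Approach from the message: delete NUL bytes and carriage returns.
# The BOM bytes (0xFF 0xFE) remain at the start of stripped.txt.
tr -d '\0\r' < sample_utf16.log > stripped.txt

# Alternative sketch: let iconv decode UTF-16 (it uses the BOM to pick
# the byte order), then delete the carriage returns.
iconv -f UTF-16 -t UTF-8 sample_utf16.log | tr -d '\r' > converted.txt

# The converted file can now be searched like any other text file.
grep -c 'Backup' converted.txt
```

If the file has no byte order mark, naming the byte order explicitly
(for example -f UTF-16LE) is probably safer than relying on iconv's default.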

-- 
Jon H. LaBadie                  jcyg@jgcomp.com
 JG Computing
 4455 Province Line Road        (609) 252-0159
 Princeton, NJ  08540-4322      (609) 683-7220 (fax)


