This is the mail archive of the cygwin mailing list for the Cygwin project.


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: pdftk and apropos - general questions


On 2009-03-04, Mike Marchywka wrote:

> > Mike Marchywka wrote:
> >> I've had a persistent problem getting apropos to work
> >> as it never finds anything appropriate. Is there
> >> something I need to do to make this work?
> >>
> > After each setup session, you need to run, /usr/sbin/makewhatis -u.
> 
> 
> Thanks but I did get that far after earlier hints and you list
> below is about what I ended up with too. One problem
> I ran into was trying to extract sensical text from the 
> IRS instructions.

I have that problem with the printed versions.

> I used the pdftotext utility IIRC from 
> 
> http://www.foolabs.com/xpdf/download.html
> 
> and it didn't seem to be able to separate multi-column text
> automatically ( with sed and awk I got what I needed but what
> a mess).

Did you use the -layout option to pdftotext?  It makes a huge
difference on the documents I've converted, but they've all been
single column.

Regards,
Gary



--
Unsubscribe info:      http://cygwin.com/ml/#unsubscribe-simple
Problem reports:       http://cygwin.com/problems.html
Documentation:         http://cygwin.com/docs.html
FAQ:                   http://cygwin.com/faq/


Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]