[ITP] pstotext-1.9 - Extract text from PostScript and PDF files

Jari Aalto+mail.linux letters@hotpop.com
Fri Mar 5 01:04:00 GMT 2004


pstotext extracts text (in the ISO 8859-1 character set) from a PostScript
or PDF (Portable Document Format) file. Thus, pstotext is similar to the
ps2ascii program that comes with ghostscript. The output of pstotext is
however better than that of ps2ascii, because pstotext deals better with
punctuation and ligatures.

http://packages.debian.org/unstable/text/pstotext

a)

wget --non-verbose  \
  http://tierra.dyndns.org:81/cygwin/pstotext/pstotext-1.9-1-src.tar.bz2 \
  http://tierra.dyndns.org:81/cygwin/pstotext/pstotext-1.9-1.tar.bz2 \
  http://tierra.dyndns.org:81/cygwin/pstotext/setup.hint

b) or use

mkdir pstotext ; cd pstotext
wget -q -O - http://tierra.dyndns.org:81/cygwin/pstotext/get.sh | sh


Jari

-- 
http://tiny-tools.sourceforge.net/
Swatch @time   http://www.mir.com.my/iTime/itime.htm
               http://www.ryanthiessen.com/swatch/resources.htm
Use Licenses!  http://www.linuxjournal.com/article.php?sid=6225
Which Licence? http://www.linuxjournal.com/article.php?sid=4825
OSI Licences   http://www.opensource.org/licenses/



More information about the Cygwin-apps mailing list