pstotext – Extract ASCII from PostScript and PDF
A Unix program that extracts ASCII text from PostScript and PDF (Acrobat) files. pstotext uses Ghostscript, but does a more careful job with kerned characters and nonstandard font encodings than Ghostscript's ps2ascii utility.
Pstotext is no longer held on CTAN; documentation and downloads are available from its home page.
|License||Free license not otherwise listed, or more than one free license applies|
convert one format of file to another
Maybe you are interested in the following packages as well.
- catdoc: Text extractor for word files
- rtf2tex: Convert RTF to TeX
- hyperlatex: A restricted LaTeX system that also produces HTML
- pdbf-toolkit: A Toolkit for Creating Janiform Data Documents