public inbox for speakup@linux-speakup.org
From: "Michael Whapples" <mwhapples@aim.com>
To: "Speakup is a screen review system for Linux." <speakup@braille.uwo.ca>
Subject: Re: Anyone able to OCR a PDF file?
Date: Tue, 3 Jan 2012 17:38:12 -0000
Message-ID: <66B84B1D49004B7D8DCE9C34AF6CB22E@layla>
In-Reply-To: <20120103164040.GA12039@sonata.rednote.net>

I have mostly used Cuneiform for Linux myself. I cannot remember whether it
can handle PDF files natively (possibly; it certainly accepts more than
TIFF), but you could run the PDF through a conversion tool first (memory
seems to say pdf2tiff).
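
As a rough illustration of the convert-then-OCR approach, here is a minimal
Python sketch. It is only a sketch under assumptions that are not from this
thread: it uses pdftoppm (from poppler-utils) for the conversion rather than
the pdf2tiff tool recalled above, it uses tesseract as the OCR engine, and it
assumes both are installed and on the PATH. The function names and the
"page" file prefix are hypothetical.

```python
import subprocess
from pathlib import Path


def rasterize_command(pdf, prefix, dpi=300):
    """Build the pdftoppm call that writes one TIFF per PDF page.

    Assumption: poppler-utils is installed. Output files are named
    <prefix>-<page number>.tif.
    """
    return ["pdftoppm", "-tiff", "-r", str(dpi), str(pdf), str(prefix)]


def ocr_command(image):
    """Build the tesseract call for one page image.

    tesseract takes an output base name and appends ".txt" itself.
    """
    image = Path(image)
    return ["tesseract", str(image), str(image.with_suffix(""))]


def page_number(image):
    """Extract the trailing page number so pages sort numerically."""
    return int(Path(image).stem.rsplit("-", 1)[1])


def ocr_pdf(pdf, work_dir):
    """Convert a graphical PDF to TIFF pages, OCR each, return the text."""
    work_dir = Path(work_dir)
    work_dir.mkdir(parents=True, exist_ok=True)
    prefix = work_dir / "page"  # hypothetical prefix for the page images

    subprocess.run(rasterize_command(pdf, prefix), check=True)

    texts = []
    for image in sorted(work_dir.glob("page-*.tif*"), key=page_number):
        subprocess.run(ocr_command(image), check=True)
        texts.append(image.with_suffix(".txt").read_text())
    return "\n".join(texts)
```

Calling ocr_pdf("scan.pdf", "ocr-work") would leave the page images and
per-page text files in ocr-work and return the combined text. A resolution
of 300 dpi is a common starting point for OCR, not a requirement; another
engine such as Cuneiform could be swapped in by changing ocr_command.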

Michael Whapples

-----Original Message----- 
From: Janina Sajka
Sent: Tuesday, January 03, 2012 4:40 PM
To: speakup@braille.uwo.ca
Subject: Anyone able to OCR a PDF file?

Has anyone figured out how to get one of the Linux OCR engines (like
tesseract) to accept a graphical file (other than .tiff) as input? In
particular I'm going to be swamped with graphical PDF files this year.
Printing these just to scan them seems both wasteful and inefficient.

I know people do this on other OS's. Has anyone suggestions on how to do
this in Linux?

All suggestions greatly appreciated.

Janina

-- 

Janina Sajka, Phone: +1.443.300.2200
sip:janina@asterisk.rednote.net

Chair, Open Accessibility janina@a11y.org
Linux Foundation http://a11y.org

Chair, Protocols & Formats
Web Accessibility Initiative http://www.w3.org/wai/pf
World Wide Web Consortium (W3C)


