From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from int-mx1.corp.redhat.com (int-mx1.corp.redhat.com [172.16.44.254]) by listman.redhat.com (Postfix) with ESMTP id CF8023ED7B for ; Mon, 10 Dec 2001 14:29:01 -0500 (EST) Received: from mail.redhat.com (mail.redhat.com [199.183.24.239]) by int-mx1.corp.redhat.com (8.11.6/8.11.6) with SMTP id fBAJT1p30418 for ; Mon, 10 Dec 2001 14:29:01 -0500 Received: (from mail@localhost) by mail.redhat.com (8.11.0/8.8.7) id fBAJT1114119 for blinux-list@listman.redhat.com; Mon, 10 Dec 2001 14:29:01 -0500 Received: from dc.cis.okstate.edu (dc.cis.okstate.edu [139.78.100.219]) by mail.redhat.com (8.11.0/8.8.7) with ESMTP id fBAJT1w14115 for ; Mon, 10 Dec 2001 14:29:01 -0500 Received: from dc.cis.okstate.edu (localhost.cis.okstate.edu [127.0.0.1]) by dc.cis.okstate.edu (8.11.3/8.11.3) with ESMTP id fBAJT0Y75172 for ; Mon, 10 Dec 2001 13:29:00 -0600 (CST) (envelope-from martin@dc.cis.okstate.edu) Message-Id: <200112101929.fBAJT0Y75172@dc.cis.okstate.edu> To: blinux-list@redhat.com Subject: Re: OCR In Linux Date: Mon, 10 Dec 2001 13:29:00 -0600 From: "Martin G. McCormick" X-Loop: blinux-list@redhat.com Sender: blinux-list-admin@redhat.com Errors-To: blinux-list-admin@redhat.com X-BeenThere: blinux-list@redhat.com X-Mailman-Version: 2.0.1 Precedence: bulk Reply-To: blinux-list@redhat.com List-Help: List-Post: List-Subscribe: , List-Id: Linux for blind general discussion List-Unsubscribe: , List-Archive: Since this topic has come up, I have a few questions. Do any of the free OCR packages run in command line mode? I don't mind that they may not automatically turn the image for you to take care of upside down pages, etc, but how does the quality of the conversion compare with something like OsCar or Reading Edge? I have found those products to do a pretty fair job on standard printed pages or typewritten material. If I could duplicate that functionality in Linux, I would be happy as the OsCar system I use here at work is approaching 9 years old and the HP scanner will probably crash and burn one of these days. It is my understanding that you do the OCR under UNIX by scanning a Jpeg image with the scanner and then feed that image in to the OCR program where you get a text file or standard output if text can be decoded. If the output indicates differently when tables or columns are involved, one should be able to write filters in C or perl or even awk to read them more easily. Martin McCormick WB5AGZ Stillwater, OK OSU Center for Computing and Information Services Network Operations Group