Date: Thu, 12 Aug 2010 16:33:53 +0300 (IDT)
From: Geoff Shang
To: Linux for blind general discussion
Subject: Re: Extracting ASCII text from a PDF Document

On Thu, 12 Aug 2010, Martin McCormick wrote:

> I have a PDF document that does have embedded ASCII text in it.

I use pdftotext from the poppler-utils package. Note that by default it writes to an output file rather than sending the text to stdout; give "-" as the output filename if you want the text on stdout.

Geoff.
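
For anyone who wants to script this, here is a minimal sketch in Python. It assumes Python 3.7 or later and that pdftotext from poppler-utils is on the PATH. The helper names pdftotext_command and extract_text are made up for illustration; the only pdftotext behaviour relied on is that "-" as the output filename means stdout.

```python
import shutil
import subprocess
import sys


def pdftotext_command(pdf_path, out_path=None):
    """Build the pdftotext argument list (hypothetical helper).

    When out_path is None, "-" is passed as the output filename,
    which pdftotext treats as "write the text to stdout".
    """
    target = out_path if out_path is not None else "-"
    return ["pdftotext", pdf_path, target]


def extract_text(pdf_path):
    """Run pdftotext and return the extracted text as a string."""
    if shutil.which("pdftotext") is None:
        # Assumption: the tool comes from the poppler-utils package.
        raise RuntimeError("pdftotext not found; install poppler-utils")
    result = subprocess.run(
        pdftotext_command(pdf_path),
        capture_output=True,  # collect stdout instead of printing it
        check=True,           # raise CalledProcessError on failure
    )
    # Decode leniently, since the output encoding can vary by document.
    return result.stdout.decode("utf-8", errors="replace")


if __name__ == "__main__" and len(sys.argv) == 2:
    # Usage: python extract.py document.pdf
    sys.stdout.write(extract_text(sys.argv[1]))
```

Building the argument list separately from running it keeps the command easy to inspect, and passing a list to subprocess.run avoids any shell quoting problems with filenames that contain spaces.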