From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from int-mx08.intmail.prod.int.phx2.redhat.com (int-mx08.intmail.prod.int.phx2.redhat.com [10.5.11.21]) by lists01.pubmisc.prod.ext.phx2.redhat.com (8.13.8/8.13.8) with ESMTP id o7CFY7D9021136 for ; Thu, 12 Aug 2010 11:34:07 -0400 Received: from mx1.redhat.com (ext-mx05.extmail.prod.ext.phx2.redhat.com [10.5.110.9]) by int-mx08.intmail.prod.int.phx2.redhat.com (8.13.8/8.13.8) with ESMTP id o7CFY26n009397 for ; Thu, 12 Aug 2010 11:34:02 -0400 Received: from x.it.okstate.edu (x.it.okstate.edu [139.78.2.13]) by mx1.redhat.com (8.13.8/8.13.8) with ESMTP id o7CFXnQq016403 for ; Thu, 12 Aug 2010 11:33:50 -0400 Received: from x.it.okstate.edu (localhost.cis.okstate.edu [127.0.0.1]) by x.it.okstate.edu (8.14.3/8.14.3) with ESMTP id o7CFXnCV079176 for ; Thu, 12 Aug 2010 10:33:49 -0500 (CDT) (envelope-from martin@x.it.okstate.edu) Message-Id: <201008121533.o7CFXnCV079176@x.it.okstate.edu> To: Linux for blind general discussion Subject: Re: Extracting ASCII text from a PDF Document MIME-Version: 1.0 Content-Type: text/plain; charset="us-ascii" Content-ID: <79174.1281627229.1@x.it.okstate.edu> Date: Thu, 12 Aug 2010 10:33:49 -0500 From: Martin McCormick X-RedHat-Spam-Score: -0.01 (T_RP_MATCHES_RCVD) X-Scanned-By: MIMEDefang 2.67 on 10.5.11.21 X-Scanned-By: MIMEDefang 2.67 on 10.5.110.9 X-loop: blinux-list@redhat.com X-BeenThere: blinux-list@redhat.com X-Mailman-Version: 2.1.12 Precedence: junk Reply-To: Linux for blind general discussion List-Id: Linux for blind general discussion List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 12 Aug 2010 15:34:07 -0000 Kirk Reiser writes: > pdftotext is a different program, mine with the -v argument returns: > > pdftotext version 3.02 > Copyright 1996-2007 Glyph & Cog, LLC > > > It also outputs to a file with the basename but containing a .txt > extension. I believe it is part of the xpdf utilities. Thank you very much. I do have pdftotext and I probably need to upgrade it as mine is 3.00 but it read the document just fine. I got confused and thought pstotext was what I needed as the man page says it will convert a postscript or pdf document to ASCII text. Anyway, it looks like the problem is solved by calling the right application. Martin McCormick