From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from int-mx04.intmail.prod.int.phx2.redhat.com (int-mx04.intmail.prod.int.phx2.redhat.com [10.5.11.17]) by lists01.pubmisc.prod.ext.phx2.redhat.com (8.13.8/8.13.8) with ESMTP id o7CEI3PT012836 for ; Thu, 12 Aug 2010 10:18:03 -0400 Received: from mx1.redhat.com (ext-mx01.extmail.prod.ext.phx2.redhat.com [10.5.110.5]) by int-mx04.intmail.prod.int.phx2.redhat.com (8.13.8/8.13.8) with ESMTP id o7CEHw97022338 for ; Thu, 12 Aug 2010 10:17:58 -0400 Received: from server1.shellworld.net (shellworld.net [69.60.117.94]) by mx1.redhat.com (8.13.8/8.13.8) with ESMTP id o7CEHk5F026683 for ; Thu, 12 Aug 2010 10:17:46 -0400 Received: by server1.shellworld.net (Postfix, from userid 1028) id A767B22A5D; Thu, 12 Aug 2010 09:17:45 -0500 (CDT) Received: from localhost (localhost [127.0.0.1]) by server1.shellworld.net (Postfix) with ESMTP id A0F5B22A57 for ; Thu, 12 Aug 2010 07:17:45 -0700 (PDT) Date: Thu, 12 Aug 2010 07:17:45 -0700 From: Hart Larry To: Linux for blind general discussion Subject: Re: Extracting ASCII text from a PDF Document In-Reply-To: Message-ID: References: <201008121249.o7CCncJI077980@x.it.okstate.edu> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII; format=flowed X-RedHat-Spam-Score: 0 () X-Scanned-By: MIMEDefang 2.67 on 10.5.11.17 X-Scanned-By: MIMEDefang 2.67 on 10.5.110.5 X-loop: blinux-list@redhat.com X-BeenThere: blinux-list@redhat.com X-Mailman-Version: 2.1.12 Precedence: junk Reply-To: Linux for blind general discussion List-Id: Linux for blind general discussion List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 12 Aug 2010 14:18:04 -0000 Well, certainly a majority of pdf files never read well, seemingly better results with pdftohtml, but if I have that wrong try pdf2html, since I am not at home, I cannot check. Anyway the results are somewhat smoother. Hart