From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from smtp01.mrf.mail.rcn.net ([207.172.4.60]) by speech.braille.uwo.ca with esmtp (Exim 3.35 #1 (Debian)) id 18uatR-0007dG-00 for ; Sun, 16 Mar 2003 11:21:49 -0500 Received: from 208-59-175-194.c3-0.slvr-ubr2.lnh-slvr.md.cable.rcn.com ([208.59.175.194] helo=computer.ACB.org) by smtp01.mrf.mail.rcn.net with esmtp (Exim 3.35 #4) id 18uatO-0002TX-00 for speakup@braille.uwo.ca; Sun, 16 Mar 2003 11:21:46 -0500 Message-Id: <5.1.0.14.2.20030316112013.025057c0@198.144.194.210> X-Sender: ccrawford@198.144.194.210 X-Mailer: QUALCOMM Windows Eudora Version 5.1 Date: Sun, 16 Mar 2003 11:22:05 -0500 To: speakup@braille.uwo.ca From: Charles Crawford Subject: Re: word documents In-Reply-To: <15988.10894.462403.836384@localhost.localdomain> References: Mime-Version: 1.0 Content-Type: text/plain; charset="us-ascii"; format=flowed Sender: speakup-admin@braille.uwo.ca Errors-To: speakup-admin@braille.uwo.ca X-BeenThere: speakup@braille.uwo.ca X-Mailman-Version: 2.0.11 Precedence: bulk Reply-To: speakup@braille.uwo.ca List-Help: List-Post: List-Subscribe: , List-Id: Speakup is a screen review system for Linux. List-Unsubscribe: , List-Archive: Hmmm. I never thought about the proprietary issue with MS-Word. I wonder if we should not be talking with Microsoft to get at least the formatting info available? Oh yeah, didn't Gates give the chinese open source Windows? Hmmm. -- charlie. At 02:41 AM 03/16/2003 -0500, you wrote: >There are Word document viewers for Linux console. The one I use is >called wv. Another is called antiword. No doubt, there are more. >Because Word is a proprietary format, and the specification is not >available, the authors of programs such as wv have had to >reverse-engineer a bit. Because of this, certain things in the Word >document may not decode as well as we'd like. Nonetheless, I use wv >and get reasonable results when converting from Word to html. The >resulting html source is quite bloated, but, it's there. > >For pdf conversion, there's pdftotext. This is part of the xpdf >package, and may already be on your system. Surprise, it was already >on my stock installation of RH 7.2. the one thing I don't like about >pdftotext-s rendering, is that hyperlinks get lost. To preserve the >navigability of pdf documents, I visit , and submit >the url of a pdf document (assuming I've found it on the web) to the >form. What comes back is a nice html rendering (links and all). > > >Hope this helps, > > >-Dave > > >_______________________________________________ >Speakup mailing list >Speakup@braille.uwo.ca >http://speech.braille.uwo.ca/mailman/listinfo/speakup