From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from qsmtp3.america.net ([209.17.206.58]) by speech.braille.uwo.ca with esmtp (Exim 3.35 #1 (Debian)) id 18ucRL-0008J5-00 for ; Sun, 16 Mar 2003 13:00:56 -0500 Received: from dhcp1-69.stmarys.gmpexpress.net ([63.147.51.69]) by qsmtp3.america.net with esmtp (Exim 4.10) id 18ucJR-000764-00 for speakup@braille.uwo.ca; Sun, 16 Mar 2003 12:52:45 -0500 Date: Sun, 16 Mar 2003 13:00:44 -0500 (EST) From: Jude DaShiell To: Subject: Re: word documents In-Reply-To: <5.1.0.14.2.20030316112013.025057c0@198.144.194.210> Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: speakup-admin@braille.uwo.ca Errors-To: speakup-admin@braille.uwo.ca X-BeenThere: speakup@braille.uwo.ca X-Mailman-Version: 2.0.11 Precedence: bulk Reply-To: speakup@braille.uwo.ca List-Help: List-Post: List-Subscribe: , List-Id: Speakup is a screen review system for Linux. List-Unsubscribe: , List-Archive: Actually, the msword format is one of the iso standards formats. It's a long time since I've had any dealings with Microsoft so I forgot the number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford wrote: > Date: Sun, 16 Mar 2003 11:22:05 -0500 > From: Charles Crawford > Reply-To: speakup@braille.uwo.ca > To: speakup@braille.uwo.ca > Subject: Re: word documents > > Hmmm. I never thought about the proprietary issue with MS-Word. I wonder > if we should not be talking with Microsoft to get at least the formatting > info available? Oh yeah, didn't Gates give the > chinese open source Windows? Hmmm. > > -- charlie. > At 02:41 AM 03/16/2003 -0500, you wrote: > >There are Word document viewers for Linux console. The one I use is > >called wv. Another is called antiword. No doubt, there are more. > >Because Word is a proprietary format, and the specification is not > >available, the authors of programs such as wv have had to > >reverse-engineer a bit. Because of this, certain things in the Word > >document may not decode as well as we'd like. Nonetheless, I use wv > >and get reasonable results when converting from Word to html. The > >resulting html source is quite bloated, but, it's there. > > > >For pdf conversion, there's pdftotext. This is part of the xpdf > >package, and may already be on your system. Surprise, it was already > >on my stock installation of RH 7.2. the one thing I don't like about > >pdftotext-s rendering, is that hyperlinks get lost. To preserve the > >navigability of pdf documents, I visit , and submit > >the url of a pdf document (assuming I've found it on the web) to the > >form. What comes back is a nice html rendering (links and all). > > > > > >Hope this helps, > > > > > >-Dave > > > > > >_______________________________________________ > >Speakup mailing list > >Speakup@braille.uwo.ca > >http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > _______________________________________________ > Speakup mailing list > Speakup@braille.uwo.ca > http://speech.braille.uwo.ca/mailman/listinfo/speakup > -- Jude