From mboxrd@z Thu Jan 1 00:00:00 1970 Received: from 208-59-175-194.c3-0.slvr-ubr2.lnh-slvr.md.cable.rcn.com ([208.59.175.194] helo=localhost.localdomain) by speech.braille.uwo.ca with esmtp (Exim 3.35 #1 (Debian)) id 18usy0-0004zZ-00 for ; Mon, 17 Mar 2003 06:39:44 -0500 Received: from localhost.localdomain (localhost.localdomain [127.0.0.1]) by localhost.localdomain (8.12.5/8.12.5) with ESMTP id h2HBThuL005237 for ; Mon, 17 Mar 2003 06:29:43 -0500 Received: from localhost (ccrawford@localhost) by localhost.localdomain (8.12.5/8.12.5/Submit) with ESMTP id h2HBTh4d005233 for ; Mon, 17 Mar 2003 06:29:43 -0500 X-Authentication-Warning: localhost.localdomain: ccrawford owned process doing -bs Date: Mon, 17 Mar 2003 06:29:43 -0500 (EST) From: ccrawford@acb.org X-X-Sender: ccrawford@localhost.localdomain To: speakup@braille.uwo.ca Subject: Re: word documents In-Reply-To: Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: speakup-admin@braille.uwo.ca Errors-To: speakup-admin@braille.uwo.ca X-BeenThere: speakup@braille.uwo.ca X-Mailman-Version: 2.0.11 Precedence: bulk Reply-To: speakup@braille.uwo.ca List-Help: List-Post: List-Subscribe: , List-Id: Speakup is a screen review system for Linux. List-Unsubscribe: , List-Archive: If that is the case, then why is it so difficult to read afile? Not sure. -- Charlie. On Sun, 16 Mar 2003, Jude DaShiell wrote: > Actually, the msword format is one of the iso standards formats. It's a > long time since I've had any dealings with Microsoft so I forgot the > number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford > wrote: > > > Date: Sun, 16 Mar 2003 11:22:05 -0500 > > From: Charles Crawford > > Reply-To: speakup@braille.uwo.ca > > To: speakup@braille.uwo.ca > > Subject: Re: word documents > > > > Hmmm. I never thought about the proprietary issue with MS-Word. I wonder > > if we should not be talking with Microsoft to get at least the formatting > > info available? Oh yeah, didn't Gates give the > > chinese open source Windows? Hmmm. > > > > -- charlie. > > At 02:41 AM 03/16/2003 -0500, you wrote: > > >There are Word document viewers for Linux console. The one I use is > > >called wv. Another is called antiword. No doubt, there are more. > > >Because Word is a proprietary format, and the specification is not > > >available, the authors of programs such as wv have had to > > >reverse-engineer a bit. Because of this, certain things in the Word > > >document may not decode as well as we'd like. Nonetheless, I use wv > > >and get reasonable results when converting from Word to html. The > > >resulting html source is quite bloated, but, it's there. > > > > > >For pdf conversion, there's pdftotext. This is part of the xpdf > > >package, and may already be on your system. Surprise, it was already > > >on my stock installation of RH 7.2. the one thing I don't like about > > >pdftotext-s rendering, is that hyperlinks get lost. To preserve the > > >navigability of pdf documents, I visit , and submit > > >the url of a pdf document (assuming I've found it on the web) to the > > >form. What comes back is a nice html rendering (links and all). > > > > > > > > >Hope this helps, > > > > > > > > >-Dave > > > > > > > > >_______________________________________________ > > >Speakup mailing list > > >Speakup@braille.uwo.ca > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > _______________________________________________ > > Speakup mailing list > > Speakup@braille.uwo.ca > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > >