* word documents
@ Joseph C. Lininger
` Aaron Howell
` (5 more replies)
0 siblings, 6 replies; 27+ messages in thread
From: Joseph C. Lininger @ UTC (permalink / raw)
To: Speakup List
Hello all,
A couple of questions. Is there any way to read a word document in Linux?
Yes I know it's a propriotary format, but that's what my prof in class
is using. Second, is there a way to read a PDF?
--
Joseph C. Lininger
jbahm@pcdesk.net
^ permalink raw reply [flat|nested] 27+ messages in thread* Re: word documents word documents Joseph C. Lininger @ ` Aaron Howell ` Dave Hunt ` (4 subsequent siblings) 5 siblings, 0 replies; 27+ messages in thread From: Aaron Howell @ UTC (permalink / raw) To: speakup There are a couple of ways to achieve this. word2x or catdoc will both convert word documents to text with varying degrees of accuracy. You can email pdf documents to pdf2txt@sun.trace.wisc.edu as an attachment and you'll get them back in plain text, or you can install xpdf which comes with a pdftotext utility. The advantage of the xpdf route is that its a very simple patch to make it ignore pdf encryption so if you're ok with building a source package, you can have yourself a pdf converter that doesn't worry about silly conversions restrictions like acrobat reader does. Regards Aaron On Sun, Mar 16, 2003 at 12:06:05AM -0700, Joseph C. Lininger wrote: > Hello all, > A couple of questions. Is there any way to read a word document in Linux? > Yes I know it's a propriotary format, but that's what my prof in class > is using. Second, is there a way to read a PDF? > > -- > Joseph C. Lininger > jbahm@pcdesk.net > > > _______________________________________________ > Speakup mailing list > Speakup@braille.uwo.ca > http://speech.braille.uwo.ca/mailman/listinfo/speakup ^ permalink raw reply [flat|nested] 27+ messages in thread
* word documents word documents Joseph C. Lininger ` Aaron Howell @ ` Dave Hunt ` Igor Gueths ` Charles Crawford ` Jude DaShiell ` (3 subsequent siblings) 5 siblings, 2 replies; 27+ messages in thread From: Dave Hunt @ UTC (permalink / raw) To: speakup There are Word document viewers for Linux console. The one I use is called wv. Another is called antiword. No doubt, there are more. Because Word is a proprietary format, and the specification is not available, the authors of programs such as wv have had to reverse-engineer a bit. Because of this, certain things in the Word document may not decode as well as we'd like. Nonetheless, I use wv and get reasonable results when converting from Word to html. The resulting html source is quite bloated, but, it's there. For pdf conversion, there's pdftotext. This is part of the xpdf package, and may already be on your system. Surprise, it was already on my stock installation of RH 7.2. the one thing I don't like about pdftotext-s rendering, is that hyperlinks get lost. To preserve the navigability of pdf documents, I visit <access.adobe.com>, and submit the url of a pdf document (assuming I've found it on the web) to the form. What comes back is a nice html rendering (links and all). Hope this helps, -Dave ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Dave Hunt @ ` Igor Gueths ` Dave Hunt ` Charles Crawford 1 sibling, 1 reply; 27+ messages in thread From: Igor Gueths @ UTC (permalink / raw) To: speakup Hi Dave. I have a not so good result with pdftotext. Basically I converted a fairly large manual written in pdf format to text, the result was a textfile that didn't have the line rap set properly. The result was having about half a line of text, and the other half of the line had been lost. Needless to say the conversion was not accurate. have you been able to get around this problem and if so how? May you code in the power of the source, may the kernel, libraries, and utilities be with you, throughout all distributions until the end of the epoch. On Sun, 16 Mar 2003, Dave Hunt wrote: > There are Word document viewers for Linux console. The one I use is > called wv. Another is called antiword. No doubt, there are more. > Because Word is a proprietary format, and the specification is not > available, the authors of programs such as wv have had to > reverse-engineer a bit. Because of this, certain things in the Word > document may not decode as well as we'd like. Nonetheless, I use wv > and get reasonable results when converting from Word to html. The > resulting html source is quite bloated, but, it's there. > > For pdf conversion, there's pdftotext. This is part of the xpdf > package, and may already be on your system. Surprise, it was already > on my stock installation of RH 7.2. the one thing I don't like about > pdftotext-s rendering, is that hyperlinks get lost. To preserve the > navigability of pdf documents, I visit <access.adobe.com>, and submit > the url of a pdf document (assuming I've found it on the web) to the > form. What comes back is a nice html rendering (links and all). > > > Hope this helps, > > > -Dave > > > _______________________________________________ > Speakup mailing list > Speakup@braille.uwo.ca > http://speech.braille.uwo.ca/mailman/listinfo/speakup > ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Igor Gueths @ ` Dave Hunt 0 siblings, 0 replies; 27+ messages in thread From: Dave Hunt @ UTC (permalink / raw) To: speakup In general, I don't use pdftotext. I suggested it as an option. I much prefer to submit my pdfs to access.adobe.com. If the pdf is on a web site. I just read the resulting html in a browser. Otherwise, I have the text mailed back. My attempts to use pdftotext have, thus far, been on small, simple files, and the resulting text has been usable. I code in the power of The Source, maintaining all kernel, dependent libraries, etc... -Dave ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Dave Hunt ` Igor Gueths @ ` Charles Crawford ` Jude DaShiell ` Danny Crone 1 sibling, 2 replies; 27+ messages in thread From: Charles Crawford @ UTC (permalink / raw) To: speakup Hmmm. I never thought about the proprietary issue with MS-Word. I wonder if we should not be talking with Microsoft to get at least the formatting info available? Oh yeah, didn't Gates give the chinese open source Windows? Hmmm. -- charlie. At 02:41 AM 03/16/2003 -0500, you wrote: >There are Word document viewers for Linux console. The one I use is >called wv. Another is called antiword. No doubt, there are more. >Because Word is a proprietary format, and the specification is not >available, the authors of programs such as wv have had to >reverse-engineer a bit. Because of this, certain things in the Word >document may not decode as well as we'd like. Nonetheless, I use wv >and get reasonable results when converting from Word to html. The >resulting html source is quite bloated, but, it's there. > >For pdf conversion, there's pdftotext. This is part of the xpdf >package, and may already be on your system. Surprise, it was already >on my stock installation of RH 7.2. the one thing I don't like about >pdftotext-s rendering, is that hyperlinks get lost. To preserve the >navigability of pdf documents, I visit <access.adobe.com>, and submit >the url of a pdf document (assuming I've found it on the web) to the >form. What comes back is a nice html rendering (links and all). > > >Hope this helps, > > >-Dave > > >_______________________________________________ >Speakup mailing list >Speakup@braille.uwo.ca >http://speech.braille.uwo.ca/mailman/listinfo/speakup ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Charles Crawford @ ` Jude DaShiell ` Janina Sajka ` (2 more replies) ` Danny Crone 1 sibling, 3 replies; 27+ messages in thread From: Jude DaShiell @ UTC (permalink / raw) To: speakup Actually, the msword format is one of the iso standards formats. It's a long time since I've had any dealings with Microsoft so I forgot the number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford wrote: > Date: Sun, 16 Mar 2003 11:22:05 -0500 > From: Charles Crawford <CCrawford@ACB.org> > Reply-To: speakup@braille.uwo.ca > To: speakup@braille.uwo.ca > Subject: Re: word documents > > Hmmm. I never thought about the proprietary issue with MS-Word. I wonder > if we should not be talking with Microsoft to get at least the formatting > info available? Oh yeah, didn't Gates give the > chinese open source Windows? Hmmm. > > -- charlie. > At 02:41 AM 03/16/2003 -0500, you wrote: > >There are Word document viewers for Linux console. The one I use is > >called wv. Another is called antiword. No doubt, there are more. > >Because Word is a proprietary format, and the specification is not > >available, the authors of programs such as wv have had to > >reverse-engineer a bit. Because of this, certain things in the Word > >document may not decode as well as we'd like. Nonetheless, I use wv > >and get reasonable results when converting from Word to html. The > >resulting html source is quite bloated, but, it's there. > > > >For pdf conversion, there's pdftotext. This is part of the xpdf > >package, and may already be on your system. Surprise, it was already > >on my stock installation of RH 7.2. the one thing I don't like about > >pdftotext-s rendering, is that hyperlinks get lost. To preserve the > >navigability of pdf documents, I visit <access.adobe.com>, and submit > >the url of a pdf document (assuming I've found it on the web) to the > >form. What comes back is a nice html rendering (links and all). > > > > > >Hope this helps, > > > > > >-Dave > > > > > >_______________________________________________ > >Speakup mailing list > >Speakup@braille.uwo.ca > >http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > _______________________________________________ > Speakup mailing list > Speakup@braille.uwo.ca > http://speech.braille.uwo.ca/mailman/listinfo/speakup > -- Jude <dashielljt(at)gmpexpress-dot-net> ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Jude DaShiell @ ` Janina Sajka ` Kenny Hitt ` ccrawford 2 siblings, 0 replies; 27+ messages in thread From: Janina Sajka @ UTC (permalink / raw) To: speakup Jude DaShiell writes: > > Actually, the msword format is one of the iso standards formats. OH? That's news to me. Care to document this assertion? ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Jude DaShiell ` Janina Sajka @ ` Kenny Hitt ` ccrawford ` ccrawford 2 siblings, 1 reply; 27+ messages in thread From: Kenny Hitt @ UTC (permalink / raw) To: speakup That's the first time I've heard of this. Where's your proof? Is this just another example of you spouting BS? Kenny On Sun, Mar 16, 2003 at 01:00:44PM -0500, Jude DaShiell wrote: > Actually, the msword format is one of the iso standards formats. It's a > long time since I've had any dealings with Microsoft so I forgot the > number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford > wrote: > > > Date: Sun, 16 Mar 2003 11:22:05 -0500 > > From: Charles Crawford <CCrawford@ACB.org> > > Reply-To: speakup@braille.uwo.ca > > To: speakup@braille.uwo.ca > > Subject: Re: word documents > > > > Hmmm. I never thought about the proprietary issue with MS-Word. I wonder > > if we should not be talking with Microsoft to get at least the formatting > > info available? Oh yeah, didn't Gates give the > > chinese open source Windows? Hmmm. > > > > -- charlie. > > At 02:41 AM 03/16/2003 -0500, you wrote: > > >There are Word document viewers for Linux console. The one I use is > > >called wv. Another is called antiword. No doubt, there are more. > > >Because Word is a proprietary format, and the specification is not > > >available, the authors of programs such as wv have had to > > >reverse-engineer a bit. Because of this, certain things in the Word > > >document may not decode as well as we'd like. Nonetheless, I use wv > > >and get reasonable results when converting from Word to html. The > > >resulting html source is quite bloated, but, it's there. > > > > > >For pdf conversion, there's pdftotext. This is part of the xpdf > > >package, and may already be on your system. Surprise, it was already > > >on my stock installation of RH 7.2. the one thing I don't like about > > >pdftotext-s rendering, is that hyperlinks get lost. To preserve the > > >navigability of pdf documents, I visit <access.adobe.com>, and submit > > >the url of a pdf document (assuming I've found it on the web) to the > > >form. What comes back is a nice html rendering (links and all). > > > > > > > > >Hope this helps, > > > > > > > > >-Dave > > > > > > > > >_______________________________________________ > > >Speakup mailing list > > >Speakup@braille.uwo.ca > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > _______________________________________________ > > Speakup mailing list > > Speakup@braille.uwo.ca > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > -- > Jude <dashielljt(at)gmpexpress-dot-net> > > > _______________________________________________ > Speakup mailing list > Speakup@braille.uwo.ca > http://speech.braille.uwo.ca/mailman/listinfo/speakup > ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Kenny Hitt @ ` ccrawford ` Kenny Hitt 0 siblings, 1 reply; 27+ messages in thread From: ccrawford @ UTC (permalink / raw) To: speakup Now let's not get over the top with stuf like"sputing bs." -- charlie. On Sun, 16 Mar 2003, Kenny Hitt wrote: > That's the first time I've heard of this. Where's your proof? > Is this just another example of you spouting BS? > > Kenny > > > On Sun, Mar 16, 2003 at 01:00:44PM -0500, Jude DaShiell wrote: > > Actually, the msword format is one of the iso standards formats. It's a > > long time since I've had any dealings with Microsoft so I forgot the > > number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford > > wrote: > > > > > Date: Sun, 16 Mar 2003 11:22:05 -0500 > > > From: Charles Crawford <CCrawford@ACB.org> > > > Reply-To: speakup@braille.uwo.ca > > > To: speakup@braille.uwo.ca > > > Subject: Re: word documents > > > > > > Hmmm. I never thought about the proprietary issue with MS-Word. I wonder > > > if we should not be talking with Microsoft to get at least the formatting > > > info available? Oh yeah, didn't Gates give the > > > chinese open source Windows? Hmmm. > > > > > > -- charlie. > > > At 02:41 AM 03/16/2003 -0500, you wrote: > > > >There are Word document viewers for Linux console. The one I use is > > > >called wv. Another is called antiword. No doubt, there are more. > > > >Because Word is a proprietary format, and the specification is not > > > >available, the authors of programs such as wv have had to > > > >reverse-engineer a bit. Because of this, certain things in the Word > > > >document may not decode as well as we'd like. Nonetheless, I use wv > > > >and get reasonable results when converting from Word to html. The > > > >resulting html source is quite bloated, but, it's there. > > > > > > > >For pdf conversion, there's pdftotext. This is part of the xpdf > > > >package, and may already be on your system. Surprise, it was already > > > >on my stock installation of RH 7.2. the one thing I don't like about > > > >pdftotext-s rendering, is that hyperlinks get lost. To preserve the > > > >navigability of pdf documents, I visit <access.adobe.com>, and submit > > > >the url of a pdf document (assuming I've found it on the web) to the > > > >form. What comes back is a nice html rendering (links and all). > > > > > > > > > > > >Hope this helps, > > > > > > > > > > > >-Dave > > > > > > > > > > > >_______________________________________________ > > > >Speakup mailing list > > > >Speakup@braille.uwo.ca > > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > > > _______________________________________________ > > > Speakup mailing list > > > Speakup@braille.uwo.ca > > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > -- > > Jude <dashielljt(at)gmpexpress-dot-net> > > > > > > _______________________________________________ > > Speakup mailing list > > Speakup@braille.uwo.ca > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > _______________________________________________ > Speakup mailing list > Speakup@braille.uwo.ca > http://speech.braille.uwo.ca/mailman/listinfo/speakup > ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` ccrawford @ ` Kenny Hitt ` Erik Heil ` William F. Acker WB2FLW +1-303-722-7209 0 siblings, 2 replies; 27+ messages in thread From: Kenny Hitt @ UTC (permalink / raw) To: speakup Hi. Jude DaShiell regularly posts wrong information. People have asked him to get his facts straight before posting and he still doesn't. Experienced Linux users know he's full of it, but new users won't. Linux is confusing enough for a new user without someone giving them wrong information. Kenny On Mon, Mar 17, 2003 at 07:31:01AM -0500, ccrawford@acb.org wrote: > Now let's not get over the top with stuf like"sputing bs." > > -- charlie. > On Sun, 16 Mar > 2003, Kenny Hitt wrote: > > > That's the first time I've heard of this. Where's your proof? > > Is this just another example of you spouting BS? > > > > Kenny > > > > > > On Sun, Mar 16, 2003 at 01:00:44PM -0500, Jude DaShiell wrote: > > > Actually, the msword format is one of the iso standards formats. It's a > > > long time since I've had any dealings with Microsoft so I forgot the > > > number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford > > > wrote: > > > > > > > Date: Sun, 16 Mar 2003 11:22:05 -0500 > > > > From: Charles Crawford <CCrawford@ACB.org> > > > > Reply-To: speakup@braille.uwo.ca > > > > To: speakup@braille.uwo.ca > > > > Subject: Re: word documents > > > > > > > > Hmmm. I never thought about the proprietary issue with MS-Word. I wonder > > > > if we should not be talking with Microsoft to get at least the formatting > > > > info available? Oh yeah, didn't Gates give the > > > > chinese open source Windows? Hmmm. > > > > > > > > -- charlie. > > > > At 02:41 AM 03/16/2003 -0500, you wrote: > > > > >There are Word document viewers for Linux console. The one I use is > > > > >called wv. Another is called antiword. No doubt, there are more. > > > > >Because Word is a proprietary format, and the specification is not > > > > >available, the authors of programs such as wv have had to > > > > >reverse-engineer a bit. Because of this, certain things in the Word > > > > >document may not decode as well as we'd like. Nonetheless, I use wv > > > > >and get reasonable results when converting from Word to html. The > > > > >resulting html source is quite bloated, but, it's there. > > > > > > > > > >For pdf conversion, there's pdftotext. This is part of the xpdf > > > > >package, and may already be on your system. Surprise, it was already > > > > >on my stock installation of RH 7.2. the one thing I don't like about > > > > >pdftotext-s rendering, is that hyperlinks get lost. To preserve the > > > > >navigability of pdf documents, I visit <access.adobe.com>, and submit > > > > >the url of a pdf document (assuming I've found it on the web) to the > > > > >form. What comes back is a nice html rendering (links and all). > > > > > > > > > > > > > > >Hope this helps, > > > > > > > > > > > > > > >-Dave > > > > > > > > > > > > > > >_______________________________________________ > > > > >Speakup mailing list > > > > >Speakup@braille.uwo.ca > > > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > > > > > > _______________________________________________ > > > > Speakup mailing list > > > > Speakup@braille.uwo.ca > > > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > > > > -- > > > Jude <dashielljt(at)gmpexpress-dot-net> > > > > > > > > > _______________________________________________ > > > Speakup mailing list > > > Speakup@braille.uwo.ca > > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > _______________________________________________ > > Speakup mailing list > > Speakup@braille.uwo.ca > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > _______________________________________________ > Speakup mailing list > Speakup@braille.uwo.ca > http://speech.braille.uwo.ca/mailman/listinfo/speakup > ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Kenny Hitt @ ` Erik Heil ` William F. Acker WB2FLW +1-303-722-7209 1 sibling, 0 replies; 27+ messages in thread From: Erik Heil @ UTC (permalink / raw) To: speakup -----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 Kenny. MSWord an ISO format? This simply is not true. - ----- Original Message ----- From: "Kenny Hitt" <kennyhitt@knology.net> To: <speakup@braille.uwo.ca> Sent: Monday, March 17, 2003 11:06 AM Subject: Re: word documents > Hi. Jude DaShiell regularly posts wrong information. People have asked > him to get his facts straight before posting and he still doesn't. > Experienced Linux users know he's full of it, but new users won't. > Linux is confusing enough for a new user without someone giving them > wrong information. > > Kenny > > > On Mon, Mar 17, 2003 at 07:31:01AM -0500, ccrawford@acb.org wrote: > > Now let's not get over the top with stuf like"sputing bs." > > > > -- charlie. > > On Sun, 16 Mar > > 2003, Kenny Hitt wrote: > > > > > That's the first time I've heard of this. Where's your proof? > > > Is this just another example of you spouting BS? > > > > > > Kenny > > > > > > > > > On Sun, Mar 16, 2003 at 01:00:44PM -0500, Jude DaShiell wrote: > > > > Actually, the msword format is one of the iso standards formats. It's a > > > > long time since I've had any dealings with Microsoft so I forgot the > > > > number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford > > > > wrote: > > > > > > > > > Date: Sun, 16 Mar 2003 11:22:05 -0500 > > > > > From: Charles Crawford <CCrawford@ACB.org> > > > > > Reply-To: speakup@braille.uwo.ca > > > > > To: speakup@braille.uwo.ca > > > > > Subject: Re: word documents > > > > > > > > > > Hmmm. I never thought about the proprietary issue with MS-Word. I wonder > > > > > if we should not be talking with Microsoft to get at least the formatting > > > > > info available? Oh yeah, didn't Gates give the > > > > > chinese open source Windows? Hmmm. > > > > > > > > > > -- charlie. > > > > > At 02:41 AM 03/16/2003 -0500, you wrote: > > > > > >There are Word document viewers for Linux console. The one I use is > > > > > >called wv. Another is called antiword. No doubt, there are more. > > > > > >Because Word is a proprietary format, and the specification is not > > > > > >available, the authors of programs such as wv have had to > > > > > >reverse-engineer a bit. Because of this, certain things in the Word > > > > > >document may not decode as well as we'd like. Nonetheless, I use wv > > > > > >and get reasonable results when converting from Word to html. The > > > > > >resulting html source is quite bloated, but, it's there. > > > > > > > > > > > >For pdf conversion, there's pdftotext. This is part of the xpdf > > > > > >package, and may already be on your system. Surprise, it was already > > > > > >on my stock installation of RH 7.2. the one thing I don't like about > > > > > >pdftotext-s rendering, is that hyperlinks get lost. To preserve the > > > > > >navigability of pdf documents, I visit <access.adobe.com>, and submit > > > > > >the url of a pdf document (assuming I've found it on the web) to the > > > > > >form. What comes back is a nice html rendering (links and all). > > > > > > > > > > > > > > > > > >Hope this helps, > > > > > > > > > > > > > > > > > >-Dave > > > > > > > > > > > > > > > > > >_______________________________________________ > > > > > >Speakup mailing list > > > > > >Speakup@braille.uwo.ca > > > > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > > > > > > > > > _______________________________________________ > > > > > Speakup mailing list > > > > > Speakup@braille.uwo.ca > > > > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > > > > > > > -- > > > > Jude <dashielljt(at)gmpexpress-dot-net> > > > > > > > > > > > > _______________________________________________ > > > > Speakup mailing list > > > > Speakup@braille.uwo.ca > > > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > > > > _______________________________________________ > > > Speakup mailing list > > > Speakup@braille.uwo.ca > > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > > > _______________________________________________ > > Speakup mailing list > > Speakup@braille.uwo.ca > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > _______________________________________________ > Speakup mailing list > Speakup@braille.uwo.ca > http://speech.braille.uwo.ca/mailman/listinfo/speakup > -----BEGIN PGP SIGNATURE----- Version: PGP 8.0 iQA/AwUBPnX0NM6y+CHlb+ZeEQLdKACgtJnTnFz09zMQp3abkyYVe60RkDMAoIpZ yuJ0XSNBUqNP1Lzc+W3s73fA =NLcV -----END PGP SIGNATURE----- ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Kenny Hitt ` Erik Heil @ ` William F. Acker WB2FLW +1-303-722-7209 1 sibling, 0 replies; 27+ messages in thread From: William F. Acker WB2FLW +1-303-722-7209 @ UTC (permalink / raw) To: speakup Thank you! Thank you!! Thank you!!! The sad Truth is that Jude isn't the only example, albeit an egregious one, and this list isn't the only place where such crap is found. I guess some folks have to post whether they have something worthwhile to say or not. -- Bill in Denver On Mon, 17 Mar 2003, Kenny Hitt wrote: > Hi. Jude DaShiell regularly posts wrong information. People have asked > him to get his facts straight before posting and he still doesn't. > Experienced Linux users know he's full of it, but new users won't. > Linux is confusing enough for a new user without someone giving them > wrong information. > > Kenny ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Jude DaShiell ` Janina Sajka ` Kenny Hitt @ ` ccrawford ` Joseph C. Lininger 2 siblings, 1 reply; 27+ messages in thread From: ccrawford @ UTC (permalink / raw) To: speakup If that is the case, then why is it so difficult to read afile? Not sure. -- Charlie. On Sun, 16 Mar 2003, Jude DaShiell wrote: > Actually, the msword format is one of the iso standards formats. It's a > long time since I've had any dealings with Microsoft so I forgot the > number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford > wrote: > > > Date: Sun, 16 Mar 2003 11:22:05 -0500 > > From: Charles Crawford <CCrawford@ACB.org> > > Reply-To: speakup@braille.uwo.ca > > To: speakup@braille.uwo.ca > > Subject: Re: word documents > > > > Hmmm. I never thought about the proprietary issue with MS-Word. I wonder > > if we should not be talking with Microsoft to get at least the formatting > > info available? Oh yeah, didn't Gates give the > > chinese open source Windows? Hmmm. > > > > -- charlie. > > At 02:41 AM 03/16/2003 -0500, you wrote: > > >There are Word document viewers for Linux console. The one I use is > > >called wv. Another is called antiword. No doubt, there are more. > > >Because Word is a proprietary format, and the specification is not > > >available, the authors of programs such as wv have had to > > >reverse-engineer a bit. Because of this, certain things in the Word > > >document may not decode as well as we'd like. Nonetheless, I use wv > > >and get reasonable results when converting from Word to html. The > > >resulting html source is quite bloated, but, it's there. > > > > > >For pdf conversion, there's pdftotext. This is part of the xpdf > > >package, and may already be on your system. Surprise, it was already > > >on my stock installation of RH 7.2. the one thing I don't like about > > >pdftotext-s rendering, is that hyperlinks get lost. To preserve the > > >navigability of pdf documents, I visit <access.adobe.com>, and submit > > >the url of a pdf document (assuming I've found it on the web) to the > > >form. What comes back is a nice html rendering (links and all). > > > > > > > > >Hope this helps, > > > > > > > > >-Dave > > > > > > > > >_______________________________________________ > > >Speakup mailing list > > >Speakup@braille.uwo.ca > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > _______________________________________________ > > Speakup mailing list > > Speakup@braille.uwo.ca > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` ccrawford @ ` Joseph C. Lininger 0 siblings, 0 replies; 27+ messages in thread From: Joseph C. Lininger @ UTC (permalink / raw) To: Speakup List The reason it is so hard to read the document is that even though there is in fact an ISO standard for the .doc format, Microsoft Word does not correctly implement it. This means that any program that conforms to the standard may not be able to read .doc files generated by word. -- Joseph C. Lininger jbahm@pcdesk.net On Mon, 17 Mar 2003 ccrawford@acb.org wrote: > If that is the case, then why is it so difficult to read afile? Not > sure. > > -- Charlie. > On Sun, 16 Mar 2003, Jude DaShiell wrote: > > > Actually, the msword format is one of the iso standards formats. It's a > > long time since I've had any dealings with Microsoft so I forgot the > > number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford > > wrote: > > > > > Date: Sun, 16 Mar 2003 11:22:05 -0500 > > > From: Charles Crawford <CCrawford@ACB.org> > > > Reply-To: speakup@braille.uwo.ca > > > To: speakup@braille.uwo.ca > > > Subject: Re: word documents > > > > > > Hmmm. I never thought about the proprietary issue with MS-Word. I wonder > > > if we should not be talking with Microsoft to get at least the formatting > > > info available? Oh yeah, didn't Gates give the > > > chinese open source Windows? Hmmm. > > > > > > -- charlie. > > > At 02:41 AM 03/16/2003 -0500, you wrote: > > > >There are Word document viewers for Linux console. The one I use is > > > >called wv. Another is called antiword. No doubt, there are more. > > > >Because Word is a proprietary format, and the specification is not > > > >available, the authors of programs such as wv have had to > > > >reverse-engineer a bit. Because of this, certain things in the Word > > > >document may not decode as well as we'd like. Nonetheless, I use wv > > > >and get reasonable results when converting from Word to html. The > > > >resulting html source is quite bloated, but, it's there. > > > > > > > >For pdf conversion, there's pdftotext. This is part of the xpdf > > > >package, and may already be on your system. Surprise, it was already > > > >on my stock installation of RH 7.2. the one thing I don't like about > > > >pdftotext-s rendering, is that hyperlinks get lost. To preserve the > > > >navigability of pdf documents, I visit <access.adobe.com>, and submit > > > >the url of a pdf document (assuming I've found it on the web) to the > > > >form. What comes back is a nice html rendering (links and all). > > > > > > > > > > > >Hope this helps, > > > > > > > > > > > >-Dave > > > > > > > > > > > >_______________________________________________ > > > >Speakup mailing list > > > >Speakup@braille.uwo.ca > > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > > > _______________________________________________ > > > Speakup mailing list > > > Speakup@braille.uwo.ca > > > http://speech.braille.uwo.ca/mailman/listinfo/speakup > > > > > > > > > > _______________________________________________ > Speakup mailing list > Speakup@braille.uwo.ca > http://speech.braille.uwo.ca/mailman/listinfo/speakup > ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Charles Crawford ` Jude DaShiell @ ` Danny Crone ` Ann Parsons 1 sibling, 1 reply; 27+ messages in thread From: Danny Crone @ UTC (permalink / raw) To: speakup What about Star Office? Is there a form of Star Office we could use? At 11:22 3/16/03 -0500, you wrote: >Hmmm. I never thought about the proprietary issue with MS-Word. I >wonder if we should not be talking with Microsoft to get at least the >formatting info available? Oh yeah, didn't Gates give the >chinese open source Windows? Hmmm. > >-- charlie. >At 02:41 AM 03/16/2003 -0500, you wrote: >>There are Word document viewers for Linux console. The one I use is >>called wv. Another is called antiword. No doubt, there are more. >>Because Word is a proprietary format, and the specification is not >>available, the authors of programs such as wv have had to >>reverse-engineer a bit. Because of this, certain things in the Word >>document may not decode as well as we'd like. Nonetheless, I use wv >>and get reasonable results when converting from Word to html. The >>resulting html source is quite bloated, but, it's there. >> >>For pdf conversion, there's pdftotext. This is part of the xpdf >>package, and may already be on your system. Surprise, it was already >>on my stock installation of RH 7.2. the one thing I don't like about >>pdftotext-s rendering, is that hyperlinks get lost. To preserve the >>navigability of pdf documents, I visit <access.adobe.com>, and submit >>the url of a pdf document (assuming I've found it on the web) to the >>form. What comes back is a nice html rendering (links and all). >> >> >>Hope this helps, >> >> >>-Dave >> >> >>_______________________________________________ >>Speakup mailing list >>Speakup@braille.uwo.ca >>http://speech.braille.uwo.ca/mailman/listinfo/speakup > > >_______________________________________________ >Speakup mailing list >Speakup@braille.uwo.ca >http://speech.braille.uwo.ca/mailman/listinfo/speakup ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Danny Crone @ ` Ann Parsons 0 siblings, 0 replies; 27+ messages in thread From: Ann Parsons @ UTC (permalink / raw) To: speakup Hi all, No star office yet. Be patient. Ann P. -- Ann K. Parsons email: akp@eznet.net ICQ Number: 33006854 WEB SITE: http://home.eznet.net/~akp "All that is gold does not glitter. Not all those who wander are lost." JRRT ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents word documents Joseph C. Lininger ` Aaron Howell ` Dave Hunt @ ` Jude DaShiell ` Ann Parsons ` (2 subsequent siblings) 5 siblings, 0 replies; 27+ messages in thread From: Jude DaShiell @ UTC (permalink / raw) To: Speakup List If you can compile it, there's antiword 0.33 for linux to handle the Microsoft problem. To handle the adobe problem, there's ps2ascii.On Sun, 16 Mar 2003, Joseph C. Lininger wrote: > Date: Sun, 16 Mar 2003 00:06:05 -0700 (Mountain Standard Time) > From: Joseph C. Lininger <jbahm@pcdesk.net> > Reply-To: speakup@braille.uwo.ca > To: Speakup List <speakup@braille.uwo.ca> > Subject: word documents > > Hello all, > A couple of questions. Is there any way to read a word document in Linux? > Yes I know it's a propriotary format, but that's what my prof in class > is using. Second, is there a way to read a PDF? > > -- Jude <dashielljt(at)gmpexpress-dot-net> ^ permalink raw reply [flat|nested] 27+ messages in thread
* word documents word documents Joseph C. Lininger ` (2 preceding siblings ...) ` Jude DaShiell @ ` Ann Parsons ` Charles Crawford [not found] ` <Pine.LNX.4.33.0303160909310.714-100000@athame.gmpexpress.n et> 5 siblings, 0 replies; 27+ messages in thread From: Ann Parsons @ UTC (permalink / raw) To: speakup Hi all, Dunnow about PDF, but you can read Word docs with antiword. It's a debian package, suspect that it's a red hat one too. Ann P. -- Ann K. Parsons email: akp@eznet.net ICQ Number: 33006854 WEB SITE: http://home.eznet.net/~akp "All that is gold does not glitter. Not all those who wander are lost." JRRT ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents word documents Joseph C. Lininger ` (3 preceding siblings ...) ` Ann Parsons @ ` Charles Crawford ` smart profs, was: " Gregory Nowak ` Ann Parsons [not found] ` <Pine.LNX.4.33.0303160909310.714-100000@athame.gmpexpress.n et> 5 siblings, 2 replies; 27+ messages in thread From: Charles Crawford @ UTC (permalink / raw) To: speakup You can tell the "Prof" in class to convert the ms-word files to text. I would assume someone with the brains to become a professor has the minimum amount of smarts to convert a file. Janina knows about reading PDF files. I believe we should be telling purveyors of PDF to include TXT or HTML files of the same materials. maybe they will get the message? Well, maybe not. Guess I have to convert the message to PDF first? Grin. -- charlie. At 12:06 AM 03/16/2003 -0700, you wrote: >Hello all, >A couple of questions. Is there any way to read a word document in Linux? >Yes I know it's a propriotary format, but that's what my prof in class >is using. Second, is there a way to read a PDF? > >-- >Joseph C. Lininger >jbahm@pcdesk.net > > >_______________________________________________ >Speakup mailing list >Speakup@braille.uwo.ca >http://speech.braille.uwo.ca/mailman/listinfo/speakup ^ permalink raw reply [flat|nested] 27+ messages in thread
* smart profs, was: Re: word documents ` Charles Crawford @ ` Gregory Nowak ` Ann Parsons 1 sibling, 0 replies; 27+ messages in thread From: Gregory Nowak @ UTC (permalink / raw) To: speakup On Sun, Mar 16, 2003 at 11:18:54AM -0500, Charles Crawford wrote: > You can tell the "Prof" in class to convert the ms-word files to text. I > would assume someone with the brains to become a professor has the minimum > amount of smarts to convert a file. Interesting that you bring that up. We have to tell our computer science 201 professor how to bring up his lecture notes in pdf on the screen, and how to use a lot of basic things in windblows. The guy either is doing a very excellent job of pretending he is dumb, or he really doesn't know how to use windblows. Now that makes me ask the question, how did this guy become a computer science professor? Also, some of what he's teaching, I've done before in high school. But, listening to the guy I have no clue what he's talking about when he's lecturing, even when I know that I've learned this stuff before, and understand it. This isn't just me, since I've talked to a couple of other class mates who have been through some of this stuff before, and they don't get anything that he's lecturing on either. If this is how those of us who know this stuff feel, I wonder what the majority of students who have taken none of this stuff before feel? Sorry about the off topic rant, but Charley's comment above really begged me to write something about the topic. Ok, I'll stop wasting band width. Greg ^ permalink raw reply [flat|nested] 27+ messages in thread
* Re: word documents ` Charles Crawford ` smart profs, was: " Gregory Nowak @ ` Ann Parsons 1 sibling, 0 replies; 27+ messages in thread From: Ann Parsons @ UTC (permalink / raw) To: speakup Hi all, Charlie, you're expecting people to have common sense. My friend commented the other day that he didn't know why it was called "common sense" because in all his life, he hadn't been able to find much of it at all. Ann P. -- Ann K. Parsons email: akp@eznet.net ICQ Number: 33006854 WEB SITE: http://home.eznet.net/~akp "All that is gold does not glitter. Not all those who wander are lost." JRRT ^ permalink raw reply [flat|nested] 27+ messages in thread
[parent not found: <Pine.LNX.4.33.0303160909310.714-100000@athame.gmpexpress.n et>]
* Re: word documents [not found] ` <Pine.LNX.4.33.0303160909310.714-100000@athame.gmpexpress.n et> @ ` Charles Crawford 0 siblings, 0 replies; 27+ messages in thread From: Charles Crawford @ UTC (permalink / raw) To: speakup Now June, Now there ya go giving good suggestions again. Now I can't wine on about the problem of Word. Oh well, guess I will just have to go back to how we don't have turkish tafffy anymore. Smile. -- charlie Crawford. At 09:12 AM 03/16/2003 -0500, you wrote: >If you can compile it, there's antiword 0.33 for linux to handle the >Microsoft problem. To handle the adobe problem, there's ps2ascii.On Sun, >16 Mar 2003, Joseph C. Lininger wrote: > > > Date: Sun, 16 Mar 2003 00:06:05 -0700 (Mountain Standard Time) > > From: Joseph C. Lininger <jbahm@pcdesk.net> > > Reply-To: speakup@braille.uwo.ca > > To: Speakup List <speakup@braille.uwo.ca> > > Subject: word documents > > > > Hello all, > > A couple of questions. Is there any way to read a word document in Linux? > > Yes I know it's a propriotary format, but that's what my prof in class > > is using. Second, is there a way to read a PDF? > > > > > >-- >Jude <dashielljt(at)gmpexpress-dot-net> > > >_______________________________________________ >Speakup mailing list >Speakup@braille.uwo.ca >http://speech.braille.uwo.ca/mailman/listinfo/speakup ^ permalink raw reply [flat|nested] 27+ messages in thread
* re: word documents @ Jude DaShiell ` Chuck Hallenbeck 0 siblings, 1 reply; 27+ messages in thread From: Jude DaShiell @ UTC (permalink / raw) To: speakup Before I lost that email message from George Allen at Microsoft I remember forwarding it along to Dr. Chuck Hallenbech since he at the time was trying to pull that format into his sss software. If he no longer has that message then it can't be documented quickly. -- Jude <dashielljt(at)gmpexpress-dot-net> ^ permalink raw reply [flat|nested] 27+ messages in thread
* re: word documents Jude DaShiell @ ` Chuck Hallenbeck 0 siblings, 0 replies; 27+ messages in thread From: Chuck Hallenbeck @ UTC (permalink / raw) To: speakup > Before I lost that email message from George Allen at Microsoft I remember > forwarding it along to Dr. Chuck Hallenbech since he at the time was > trying to pull that format into his sss software. If he no longer has > that message then it can't be documented quickly. I do not recall getting such a document and we never attempted to support MS-Word documents in any of our browsers. Sorry, I cannot help on this one, and in fact my reaction was the same as Janina's concerning the MS-Word format being described in one of the ISO standards. Sounds like a pipe dream to me. > -- The Moon is Waxing Gibbous (97% of Full) So visit me sometime at http://www.mhonline.net/~chuckh or you might reach me at chuckh on the jabber network. ^ permalink raw reply [flat|nested] 27+ messages in thread
[parent not found: <20030316235502.7926.4073.Mailman@speech.braille.uwo.ca>]
* Re: word documents [not found] <20030316235502.7926.4073.Mailman@speech.braille.uwo.ca> @ ` Thomas Ward 0 siblings, 0 replies; 27+ messages in thread From: Thomas Ward @ UTC (permalink / raw) To: speakup Hi, here is what I use. For reading and converting MS Word documents I I use a program called antiword. There is an rpm package for Mandrake Linux which I have also installed on Red Hat as well. It is my understanding Debian has a binary version of antiword as well. The other option I use is Star Office 6.0 with gnopernicus cvs version, but at this point the Star Office accessability isn't vary good.Neither is it vary easy to get gnome 2.2 and gnopernicus going. As for pdf files I installed xpdf package, and use pdftotext to convert pdf files to text files. ^ permalink raw reply [flat|nested] 27+ messages in thread
[parent not found: <20030317062559.19403.13142.Mailman@speech.braille.uwo.ca>]
* Re: word documents [not found] <20030317062559.19403.13142.Mailman@speech.braille.uwo.ca> @ ` Thomas Ward 0 siblings, 0 replies; 27+ messages in thread From: Thomas Ward @ UTC (permalink / raw) To: speakup Hi, Dannie. Well, in theory we can use Star Office 6.0 or Open Office 1.0 for Linux, but Star Office accessability is still in it's infancy. For you and everyone else who is interested in X-windows accessability and Star Office 6 accessability I'll outline what I know, what current state is, and and where this is going. First, you'll need an updated distribution of Linux that will have the Gnome 2.2 desktop, or update your Linux system, libraries, etc to gnome 2.2. I know for a fact that Red Hat 8.1 and Mandrake 9.1 will be shipping sometime this spring with Gnome 2.2. After obtaining Gnome 2.2 you will need to check out gnome common, gnome-speech libraries, and gnopernicus from cvs, and build them. This often the most frustrating step. If you are using Star office 6 you will need the Java virtual machine in jre 1.4 or better, install it, and then install the Java accessability bridge for Linux. At last you will need to install Star Office 6. Fortunately, I did not have to buy it because it ships with Mandrake Linux Delux already. After all this accessability wasn't all that fantastic, or anything I could write home about. However, future versions may improve as development continues. I assume since Sun is pushing the development of gnopernicus, and are also the creators of Star office accessability will improve over time, and when they have more time to deal on specific applications and not have to worry about the core which is still under development. I will also inform everyone here that this information is a couple of months old. I haven't been able to get the vary latest gnopernicus compiled yet, and I have not updated the java access, etc. I'm waiting for Mandrake 9.1 to be released before I attempt to upgrade gnopernicus, the java access, etc. ^ permalink raw reply [flat|nested] 27+ messages in thread
end of thread, other threads:[~ UTC | newest]
Thread overview: 27+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
word documents Joseph C. Lininger
` Aaron Howell
` Dave Hunt
` Igor Gueths
` Dave Hunt
` Charles Crawford
` Jude DaShiell
` Janina Sajka
` Kenny Hitt
` ccrawford
` Kenny Hitt
` Erik Heil
` William F. Acker WB2FLW +1-303-722-7209
` ccrawford
` Joseph C. Lininger
` Danny Crone
` Ann Parsons
` Jude DaShiell
` Ann Parsons
` Charles Crawford
` smart profs, was: " Gregory Nowak
` Ann Parsons
[not found] ` <Pine.LNX.4.33.0303160909310.714-100000@athame.gmpexpress.n et>
` Charles Crawford
Jude DaShiell
` Chuck Hallenbeck
[not found] <20030316235502.7926.4073.Mailman@speech.braille.uwo.ca>
` Thomas Ward
[not found] <20030317062559.19403.13142.Mailman@speech.braille.uwo.ca>
` Thomas Ward
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox; as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).