public inbox for speakup@linux-speakup.org
 help / color / mirror / Atom feed
* Re: word documents
       [not found] <20030316235502.7926.4073.Mailman@speech.braille.uwo.ca>
@  ` Thomas Ward
  0 siblings, 0 replies; 26+ messages in thread
From: Thomas Ward @  UTC (permalink / raw)
  To: speakup

Hi, here is what I use. For reading and converting MS Word documents I 
I use a program called antiword. There is an rpm package for Mandrake 
Linux which I have also installed on Red Hat as well. It is my 
understanding Debian has a binary version of antiword as well.
The other option I use is Star Office 6.0 with gnopernicus cvs version, 
but at this point the Star Office accessability isn't vary good.Neither is 
it vary easy to get gnome 2.2 and gnopernicus going.
As for pdf files I installed xpdf package, and use pdftotext to convert 
pdf files to text files.





^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
             ` Kenny Hitt
               ` Erik Heil
@              ` William F. Acker WB2FLW +1-303-722-7209
  1 sibling, 0 replies; 26+ messages in thread
From: William F. Acker WB2FLW +1-303-722-7209 @  UTC (permalink / raw)
  To: speakup

Thank you!  Thank you!! Thank you!!!

     The sad Truth is that Jude isn't the only example, albeit an 
egregious one, and this list isn't the only place where such crap is 
found.  I guess some folks have to post whether they have something 
worthwhile to say or not.
-- 
          Bill in Denver


On Mon, 17 Mar 2003, Kenny Hitt wrote:

> Hi.  Jude DaShiell regularly posts wrong information.  People have asked
> him to get his facts straight before posting and he still doesn't.
> Experienced Linux users know he's full of it, but new users won't.
> Linux is confusing enough for a new user without someone giving them
> wrong information.
> 
>           Kenny


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
             ` Kenny Hitt
@              ` Erik Heil
               ` William F. Acker WB2FLW +1-303-722-7209
  1 sibling, 0 replies; 26+ messages in thread
From: Erik Heil @  UTC (permalink / raw)
  To: speakup

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

 Kenny. MSWord an ISO format?  This simply is not true.
- ----- Original Message -----
From: "Kenny Hitt" <kennyhitt@knology.net>
To: <speakup@braille.uwo.ca>
Sent: Monday, March 17, 2003 11:06 AM
Subject: Re: word documents


> Hi.  Jude DaShiell regularly posts wrong information.  People have asked
> him to get his facts straight before posting and he still doesn't.
> Experienced Linux users know he's full of it, but new users won't.
> Linux is confusing enough for a new user without someone giving them
> wrong information.
>
>           Kenny
>
>
> On Mon, Mar 17, 2003 at 07:31:01AM -0500, ccrawford@acb.org wrote:
> > Now let's not get over the top with stuf like"sputing bs."
> >
> > -- charlie.
> > On Sun, 16 Mar
> > 2003, Kenny Hitt wrote:
> >
> > > That's the first time I've heard of this.  Where's your proof?
> > > Is this just another example of you spouting BS?
> > >
> > >           Kenny
> > >
> > >
> > > On Sun, Mar 16, 2003 at 01:00:44PM -0500, Jude DaShiell wrote:
> > > > Actually, the msword format is one of the iso standards formats.
It's a
> > > > long time since I've had any dealings with Microsoft so I forgot the
> > > > number of the standard they sent me.On Sun, 16 Mar 2003, Charles
Crawford
> > > > wrote:
> > > >
> > > > > Date: Sun, 16 Mar 2003 11:22:05 -0500
> > > > > From: Charles Crawford <CCrawford@ACB.org>
> > > > > Reply-To: speakup@braille.uwo.ca
> > > > > To: speakup@braille.uwo.ca
> > > > > Subject: Re: word documents
> > > > >
> > > > > Hmmm.   I never thought about the proprietary issue with MS-Word.
I wonder
> > > > > if we should not be talking with Microsoft to get at least the
formatting
> > > > > info available?  Oh yeah, didn't Gates give the
> > > > > chinese open source Windows?  Hmmm.
> > > > >
> > > > > -- charlie.
> > > > > At 02:41 AM 03/16/2003 -0500, you wrote:
> > > > > >There are Word document viewers for Linux console.  The one I use
is
> > > > > >called wv.  Another is called antiword.  No doubt, there are
more.
> > > > > >Because Word is a proprietary format, and the specification is
not
> > > > > >available, the authors of programs such as wv have had to
> > > > > >reverse-engineer a bit.  Because of this, certain things in the
Word
> > > > > >document may not decode as well as we'd like.  Nonetheless, I use
wv
> > > > > >and get reasonable results when converting from Word to html.
The
> > > > > >resulting html source is quite bloated, but, it's there.
> > > > > >
> > > > > >For pdf conversion, there's pdftotext.  This is part of the xpdf
> > > > > >package, and may already be on your system.  Surprise, it was
already
> > > > > >on my stock installation of RH 7.2.  the one thing I don't like
about
> > > > > >pdftotext-s rendering, is that hyperlinks get lost.  To preserve
the
> > > > > >navigability of pdf documents, I visit <access.adobe.com>, and
submit
> > > > > >the url of a pdf document (assuming I've found it on the web) to
the
> > > > > >form.  What comes back is a nice html rendering (links and all).
> > > > > >
> > > > > >
> > > > > >Hope this helps,
> > > > > >
> > > > > >
> > > > > >-Dave
> > > > > >
> > > > > >
> > > > > >_______________________________________________
> > > > > >Speakup mailing list
> > > > > >Speakup@braille.uwo.ca
> > > > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > > > >
> > > > >
> > > > > _______________________________________________
> > > > > Speakup mailing list
> > > > > Speakup@braille.uwo.ca
> > > > > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > > > >
> > > >
> > > > --
> > > > Jude <dashielljt(at)gmpexpress-dot-net>
> > > >
> > > >
> > > > _______________________________________________
> > > > Speakup mailing list
> > > > Speakup@braille.uwo.ca
> > > > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > > >
> > >
> > > _______________________________________________
> > > Speakup mailing list
> > > Speakup@braille.uwo.ca
> > > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > >
> >
> >
> > _______________________________________________
> > Speakup mailing list
> > Speakup@braille.uwo.ca
> > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> >
>
> _______________________________________________
> Speakup mailing list
> Speakup@braille.uwo.ca
> http://speech.braille.uwo.ca/mailman/listinfo/speakup
>
-----BEGIN PGP SIGNATURE-----
Version: PGP 8.0

iQA/AwUBPnX0NM6y+CHlb+ZeEQLdKACgtJnTnFz09zMQp3abkyYVe60RkDMAoIpZ
yuJ0XSNBUqNP1Lzc+W3s73fA
=NLcV
-----END PGP SIGNATURE-----




^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
           ` ccrawford
@            ` Kenny Hitt
               ` Erik Heil
               ` William F. Acker WB2FLW +1-303-722-7209
  0 siblings, 2 replies; 26+ messages in thread
From: Kenny Hitt @  UTC (permalink / raw)
  To: speakup

Hi.  Jude DaShiell regularly posts wrong information.  People have asked
him to get his facts straight before posting and he still doesn't.
Experienced Linux users know he's full of it, but new users won't.
Linux is confusing enough for a new user without someone giving them
wrong information.

          Kenny


On Mon, Mar 17, 2003 at 07:31:01AM -0500, ccrawford@acb.org wrote:
> Now let's not get over the top with stuf like"sputing bs."
> 
> -- charlie.
> On Sun, 16 Mar 
> 2003, Kenny Hitt wrote:
> 
> > That's the first time I've heard of this.  Where's your proof?
> > Is this just another example of you spouting BS?
> > 
> >           Kenny
> > 
> > 
> > On Sun, Mar 16, 2003 at 01:00:44PM -0500, Jude DaShiell wrote:
> > > Actually, the msword format is one of the iso standards formats.  It's a
> > > long time since I've had any dealings with Microsoft so I forgot the
> > > number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford
> > > wrote:
> > > 
> > > > Date: Sun, 16 Mar 2003 11:22:05 -0500
> > > > From: Charles Crawford <CCrawford@ACB.org>
> > > > Reply-To: speakup@braille.uwo.ca
> > > > To: speakup@braille.uwo.ca
> > > > Subject: Re: word documents
> > > >
> > > > Hmmm.   I never thought about the proprietary issue with MS-Word.  I wonder
> > > > if we should not be talking with Microsoft to get at least the formatting
> > > > info available?  Oh yeah, didn't Gates give the
> > > > chinese open source Windows?  Hmmm.
> > > >
> > > > -- charlie.
> > > > At 02:41 AM 03/16/2003 -0500, you wrote:
> > > > >There are Word document viewers for Linux console.  The one I use is
> > > > >called wv.  Another is called antiword.  No doubt, there are more.
> > > > >Because Word is a proprietary format, and the specification is not
> > > > >available, the authors of programs such as wv have had to
> > > > >reverse-engineer a bit.  Because of this, certain things in the Word
> > > > >document may not decode as well as we'd like.  Nonetheless, I use wv
> > > > >and get reasonable results when converting from Word to html.  The
> > > > >resulting html source is quite bloated, but, it's there.
> > > > >
> > > > >For pdf conversion, there's pdftotext.  This is part of the xpdf
> > > > >package, and may already be on your system.  Surprise, it was already
> > > > >on my stock installation of RH 7.2.  the one thing I don't like about
> > > > >pdftotext-s rendering, is that hyperlinks get lost.  To preserve the
> > > > >navigability of pdf documents, I visit <access.adobe.com>, and submit
> > > > >the url of a pdf document (assuming I've found it on the web) to the
> > > > >form.  What comes back is a nice html rendering (links and all).
> > > > >
> > > > >
> > > > >Hope this helps,
> > > > >
> > > > >
> > > > >-Dave
> > > > >
> > > > >
> > > > >_______________________________________________
> > > > >Speakup mailing list
> > > > >Speakup@braille.uwo.ca
> > > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > > >
> > > >
> > > > _______________________________________________
> > > > Speakup mailing list
> > > > Speakup@braille.uwo.ca
> > > > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > > >
> > > 
> > > -- 
> > > Jude <dashielljt(at)gmpexpress-dot-net>
> > > 
> > > 
> > > _______________________________________________
> > > Speakup mailing list
> > > Speakup@braille.uwo.ca
> > > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > > 
> > 
> > _______________________________________________
> > Speakup mailing list
> > Speakup@braille.uwo.ca
> > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > 
> 
> 
> _______________________________________________
> Speakup mailing list
> Speakup@braille.uwo.ca
> http://speech.braille.uwo.ca/mailman/listinfo/speakup
> 


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
       ` Danny Crone
@        ` Ann Parsons
  0 siblings, 0 replies; 26+ messages in thread
From: Ann Parsons @  UTC (permalink / raw)
  To: speakup

Hi all,

No star office yet.  Be patient.

Ann P.

-- 
			Ann K. Parsons  
email:  akp@eznet.net 			ICQ Number:  33006854
WEB SITE:  http://home.eznet.net/~akp
"All that is gold does not glitter.  Not all those who wander are lost."  JRRT



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
         ` ccrawford
@          ` Joseph C. Lininger
  0 siblings, 0 replies; 26+ messages in thread
From: Joseph C. Lininger @  UTC (permalink / raw)
  To: Speakup List

The reason it is so hard to read the document is that even though there is
in fact an ISO standard for the .doc format, Microsoft Word does not
correctly implement it. This means that any program that conforms to the
standard may not be able to read .doc files generated by word.

-- 
Joseph C. Lininger
jbahm@pcdesk.net

On Mon, 17 Mar 2003 ccrawford@acb.org wrote:

> If that is the case, then why is it so difficult to read afile?  Not
> sure.
>
> -- Charlie.
> On Sun, 16 Mar 2003, Jude DaShiell wrote:
>
> > Actually, the msword format is one of the iso standards formats.  It's a
> > long time since I've had any dealings with Microsoft so I forgot the
> > number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford
> > wrote:
> >
> > > Date: Sun, 16 Mar 2003 11:22:05 -0500
> > > From: Charles Crawford <CCrawford@ACB.org>
> > > Reply-To: speakup@braille.uwo.ca
> > > To: speakup@braille.uwo.ca
> > > Subject: Re: word documents
> > >
> > > Hmmm.   I never thought about the proprietary issue with MS-Word.  I wonder
> > > if we should not be talking with Microsoft to get at least the formatting
> > > info available?  Oh yeah, didn't Gates give the
> > > chinese open source Windows?  Hmmm.
> > >
> > > -- charlie.
> > > At 02:41 AM 03/16/2003 -0500, you wrote:
> > > >There are Word document viewers for Linux console.  The one I use is
> > > >called wv.  Another is called antiword.  No doubt, there are more.
> > > >Because Word is a proprietary format, and the specification is not
> > > >available, the authors of programs such as wv have had to
> > > >reverse-engineer a bit.  Because of this, certain things in the Word
> > > >document may not decode as well as we'd like.  Nonetheless, I use wv
> > > >and get reasonable results when converting from Word to html.  The
> > > >resulting html source is quite bloated, but, it's there.
> > > >
> > > >For pdf conversion, there's pdftotext.  This is part of the xpdf
> > > >package, and may already be on your system.  Surprise, it was already
> > > >on my stock installation of RH 7.2.  the one thing I don't like about
> > > >pdftotext-s rendering, is that hyperlinks get lost.  To preserve the
> > > >navigability of pdf documents, I visit <access.adobe.com>, and submit
> > > >the url of a pdf document (assuming I've found it on the web) to the
> > > >form.  What comes back is a nice html rendering (links and all).
> > > >
> > > >
> > > >Hope this helps,
> > > >
> > > >
> > > >-Dave
> > > >
> > > >
> > > >_______________________________________________
> > > >Speakup mailing list
> > > >Speakup@braille.uwo.ca
> > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > >
> > >
> > > _______________________________________________
> > > Speakup mailing list
> > > Speakup@braille.uwo.ca
> > > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > >
> >
> >
>
>
> _______________________________________________
> Speakup mailing list
> Speakup@braille.uwo.ca
> http://speech.braille.uwo.ca/mailman/listinfo/speakup
>



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
         ` Kenny Hitt
@          ` ccrawford
             ` Kenny Hitt
  0 siblings, 1 reply; 26+ messages in thread
From: ccrawford @  UTC (permalink / raw)
  To: speakup

Now let's not get over the top with stuf like"sputing bs."

-- charlie.
On Sun, 16 Mar 
2003, Kenny Hitt wrote:

> That's the first time I've heard of this.  Where's your proof?
> Is this just another example of you spouting BS?
> 
>           Kenny
> 
> 
> On Sun, Mar 16, 2003 at 01:00:44PM -0500, Jude DaShiell wrote:
> > Actually, the msword format is one of the iso standards formats.  It's a
> > long time since I've had any dealings with Microsoft so I forgot the
> > number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford
> > wrote:
> > 
> > > Date: Sun, 16 Mar 2003 11:22:05 -0500
> > > From: Charles Crawford <CCrawford@ACB.org>
> > > Reply-To: speakup@braille.uwo.ca
> > > To: speakup@braille.uwo.ca
> > > Subject: Re: word documents
> > >
> > > Hmmm.   I never thought about the proprietary issue with MS-Word.  I wonder
> > > if we should not be talking with Microsoft to get at least the formatting
> > > info available?  Oh yeah, didn't Gates give the
> > > chinese open source Windows?  Hmmm.
> > >
> > > -- charlie.
> > > At 02:41 AM 03/16/2003 -0500, you wrote:
> > > >There are Word document viewers for Linux console.  The one I use is
> > > >called wv.  Another is called antiword.  No doubt, there are more.
> > > >Because Word is a proprietary format, and the specification is not
> > > >available, the authors of programs such as wv have had to
> > > >reverse-engineer a bit.  Because of this, certain things in the Word
> > > >document may not decode as well as we'd like.  Nonetheless, I use wv
> > > >and get reasonable results when converting from Word to html.  The
> > > >resulting html source is quite bloated, but, it's there.
> > > >
> > > >For pdf conversion, there's pdftotext.  This is part of the xpdf
> > > >package, and may already be on your system.  Surprise, it was already
> > > >on my stock installation of RH 7.2.  the one thing I don't like about
> > > >pdftotext-s rendering, is that hyperlinks get lost.  To preserve the
> > > >navigability of pdf documents, I visit <access.adobe.com>, and submit
> > > >the url of a pdf document (assuming I've found it on the web) to the
> > > >form.  What comes back is a nice html rendering (links and all).
> > > >
> > > >
> > > >Hope this helps,
> > > >
> > > >
> > > >-Dave
> > > >
> > > >
> > > >_______________________________________________
> > > >Speakup mailing list
> > > >Speakup@braille.uwo.ca
> > > >http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > >
> > >
> > > _______________________________________________
> > > Speakup mailing list
> > > Speakup@braille.uwo.ca
> > > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > >
> > 
> > -- 
> > Jude <dashielljt(at)gmpexpress-dot-net>
> > 
> > 
> > _______________________________________________
> > Speakup mailing list
> > Speakup@braille.uwo.ca
> > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> > 
> 
> _______________________________________________
> Speakup mailing list
> Speakup@braille.uwo.ca
> http://speech.braille.uwo.ca/mailman/listinfo/speakup
> 



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
       ` Jude DaShiell
         ` Janina Sajka
         ` Kenny Hitt
@        ` ccrawford
           ` Joseph C. Lininger
  2 siblings, 1 reply; 26+ messages in thread
From: ccrawford @  UTC (permalink / raw)
  To: speakup

If that is the case, then why is it so difficult to read afile?  Not 
sure.

-- Charlie.
On Sun, 16 Mar 2003, Jude DaShiell wrote:

> Actually, the msword format is one of the iso standards formats.  It's a
> long time since I've had any dealings with Microsoft so I forgot the
> number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford
> wrote:
> 
> > Date: Sun, 16 Mar 2003 11:22:05 -0500
> > From: Charles Crawford <CCrawford@ACB.org>
> > Reply-To: speakup@braille.uwo.ca
> > To: speakup@braille.uwo.ca
> > Subject: Re: word documents
> >
> > Hmmm.   I never thought about the proprietary issue with MS-Word.  I wonder
> > if we should not be talking with Microsoft to get at least the formatting
> > info available?  Oh yeah, didn't Gates give the
> > chinese open source Windows?  Hmmm.
> >
> > -- charlie.
> > At 02:41 AM 03/16/2003 -0500, you wrote:
> > >There are Word document viewers for Linux console.  The one I use is
> > >called wv.  Another is called antiword.  No doubt, there are more.
> > >Because Word is a proprietary format, and the specification is not
> > >available, the authors of programs such as wv have had to
> > >reverse-engineer a bit.  Because of this, certain things in the Word
> > >document may not decode as well as we'd like.  Nonetheless, I use wv
> > >and get reasonable results when converting from Word to html.  The
> > >resulting html source is quite bloated, but, it's there.
> > >
> > >For pdf conversion, there's pdftotext.  This is part of the xpdf
> > >package, and may already be on your system.  Surprise, it was already
> > >on my stock installation of RH 7.2.  the one thing I don't like about
> > >pdftotext-s rendering, is that hyperlinks get lost.  To preserve the
> > >navigability of pdf documents, I visit <access.adobe.com>, and submit
> > >the url of a pdf document (assuming I've found it on the web) to the
> > >form.  What comes back is a nice html rendering (links and all).
> > >
> > >
> > >Hope this helps,
> > >
> > >
> > >-Dave
> > >
> > >
> > >_______________________________________________
> > >Speakup mailing list
> > >Speakup@braille.uwo.ca
> > >http://speech.braille.uwo.ca/mailman/listinfo/speakup
> >
> >
> > _______________________________________________
> > Speakup mailing list
> > Speakup@braille.uwo.ca
> > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> >
> 
> 



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
       [not found] <20030317062559.19403.13142.Mailman@speech.braille.uwo.ca>
@  ` Thomas Ward
  0 siblings, 0 replies; 26+ messages in thread
From: Thomas Ward @  UTC (permalink / raw)
  To: speakup

Hi, Dannie. Well, in theory we can use Star Office 6.0  or Open Office 
1.0 for Linux, but Star 
Office accessability is still in it's infancy. 
For you and everyone else who is interested in X-windows accessability and 
Star Office 6 accessability I'll outline what I know, what current state 
is, and and where this is going.
First, you'll need an updated distribution of Linux that will have the 
Gnome 2.2 desktop, or update your Linux system, libraries, etc to gnome 
2.2. I know for a fact that Red Hat 8.1 and Mandrake 9.1 will be shipping 
sometime this spring with Gnome 2.2.
After obtaining Gnome 2.2 you will need to check out gnome common, 
gnome-speech libraries, and gnopernicus from cvs, and build them. This 
often the most frustrating step.
If you are using Star office 6 you will need the Java virtual machine in 
jre 1.4 or better, install it, and then install the Java accessability 
bridge for Linux.
At last you will need to install Star Office 6. Fortunately, I did not 
have to buy it because it ships with Mandrake Linux Delux already.
After all this accessability wasn't all that fantastic, or anything I 
could write home about. However, future versions may improve as 
development continues. I assume since Sun is pushing the development of 
gnopernicus, and are also the creators of Star office accessability will 
improve over time, and when they have more time to deal on specific 
applications and not have to worry about the core which is still under 
development.
I will also inform everyone here that this information is a couple of 
months old. I haven't been able to get the vary latest gnopernicus 
compiled yet, and I have not updated the java access, etc.
I'm waiting for Mandrake 9.1 to be released before I attempt to upgrade 
gnopernicus, the java access, etc.




^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
       ` Jude DaShiell
         ` Janina Sajka
@        ` Kenny Hitt
           ` ccrawford
         ` ccrawford
  2 siblings, 1 reply; 26+ messages in thread
From: Kenny Hitt @  UTC (permalink / raw)
  To: speakup

That's the first time I've heard of this.  Where's your proof?
Is this just another example of you spouting BS?

          Kenny


On Sun, Mar 16, 2003 at 01:00:44PM -0500, Jude DaShiell wrote:
> Actually, the msword format is one of the iso standards formats.  It's a
> long time since I've had any dealings with Microsoft so I forgot the
> number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford
> wrote:
> 
> > Date: Sun, 16 Mar 2003 11:22:05 -0500
> > From: Charles Crawford <CCrawford@ACB.org>
> > Reply-To: speakup@braille.uwo.ca
> > To: speakup@braille.uwo.ca
> > Subject: Re: word documents
> >
> > Hmmm.   I never thought about the proprietary issue with MS-Word.  I wonder
> > if we should not be talking with Microsoft to get at least the formatting
> > info available?  Oh yeah, didn't Gates give the
> > chinese open source Windows?  Hmmm.
> >
> > -- charlie.
> > At 02:41 AM 03/16/2003 -0500, you wrote:
> > >There are Word document viewers for Linux console.  The one I use is
> > >called wv.  Another is called antiword.  No doubt, there are more.
> > >Because Word is a proprietary format, and the specification is not
> > >available, the authors of programs such as wv have had to
> > >reverse-engineer a bit.  Because of this, certain things in the Word
> > >document may not decode as well as we'd like.  Nonetheless, I use wv
> > >and get reasonable results when converting from Word to html.  The
> > >resulting html source is quite bloated, but, it's there.
> > >
> > >For pdf conversion, there's pdftotext.  This is part of the xpdf
> > >package, and may already be on your system.  Surprise, it was already
> > >on my stock installation of RH 7.2.  the one thing I don't like about
> > >pdftotext-s rendering, is that hyperlinks get lost.  To preserve the
> > >navigability of pdf documents, I visit <access.adobe.com>, and submit
> > >the url of a pdf document (assuming I've found it on the web) to the
> > >form.  What comes back is a nice html rendering (links and all).
> > >
> > >
> > >Hope this helps,
> > >
> > >
> > >-Dave
> > >
> > >
> > >_______________________________________________
> > >Speakup mailing list
> > >Speakup@braille.uwo.ca
> > >http://speech.braille.uwo.ca/mailman/listinfo/speakup
> >
> >
> > _______________________________________________
> > Speakup mailing list
> > Speakup@braille.uwo.ca
> > http://speech.braille.uwo.ca/mailman/listinfo/speakup
> >
> 
> -- 
> Jude <dashielljt(at)gmpexpress-dot-net>
> 
> 
> _______________________________________________
> Speakup mailing list
> Speakup@braille.uwo.ca
> http://speech.braille.uwo.ca/mailman/listinfo/speakup
> 


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
     ` Charles Crawford
       ` Jude DaShiell
@      ` Danny Crone
         ` Ann Parsons
  1 sibling, 1 reply; 26+ messages in thread
From: Danny Crone @  UTC (permalink / raw)
  To: speakup

What about Star Office?  Is there a form of Star Office we could use?

At 11:22 3/16/03 -0500, you wrote:
>Hmmm.   I never thought about the proprietary issue with MS-Word.  I 
>wonder if we should not be talking with Microsoft to get at least the 
>formatting info available?  Oh yeah, didn't Gates give the
>chinese open source Windows?  Hmmm.
>
>-- charlie.
>At 02:41 AM 03/16/2003 -0500, you wrote:
>>There are Word document viewers for Linux console.  The one I use is
>>called wv.  Another is called antiword.  No doubt, there are more.
>>Because Word is a proprietary format, and the specification is not
>>available, the authors of programs such as wv have had to
>>reverse-engineer a bit.  Because of this, certain things in the Word
>>document may not decode as well as we'd like.  Nonetheless, I use wv
>>and get reasonable results when converting from Word to html.  The
>>resulting html source is quite bloated, but, it's there.
>>
>>For pdf conversion, there's pdftotext.  This is part of the xpdf
>>package, and may already be on your system.  Surprise, it was already
>>on my stock installation of RH 7.2.  the one thing I don't like about
>>pdftotext-s rendering, is that hyperlinks get lost.  To preserve the
>>navigability of pdf documents, I visit <access.adobe.com>, and submit
>>the url of a pdf document (assuming I've found it on the web) to the
>>form.  What comes back is a nice html rendering (links and all).
>>
>>
>>Hope this helps,
>>
>>
>>-Dave
>>
>>
>>_______________________________________________
>>Speakup mailing list
>>Speakup@braille.uwo.ca
>>http://speech.braille.uwo.ca/mailman/listinfo/speakup
>
>
>_______________________________________________
>Speakup mailing list
>Speakup@braille.uwo.ca
>http://speech.braille.uwo.ca/mailman/listinfo/speakup




^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
   ` Charles Crawford
@    ` Ann Parsons
  0 siblings, 0 replies; 26+ messages in thread
From: Ann Parsons @  UTC (permalink / raw)
  To: speakup

Hi all,

Charlie, you're expecting people to have common sense.  My friend
commented the other day that he didn't know why it was called "common
sense" because in all his life, he hadn't been able to find much of it
at all. 

Ann P.

-- 
			Ann K. Parsons  
email:  akp@eznet.net 			ICQ Number:  33006854
WEB SITE:  http://home.eznet.net/~akp
"All that is gold does not glitter.  Not all those who wander are lost."  JRRT



^ permalink raw reply	[flat|nested] 26+ messages in thread

* re: word documents
   Jude DaShiell
@  ` Chuck Hallenbeck
  0 siblings, 0 replies; 26+ messages in thread
From: Chuck Hallenbeck @  UTC (permalink / raw)
  To: speakup

> Before I lost that email message from George Allen at Microsoft I remember
> forwarding it along to Dr. Chuck Hallenbech since he at the time was
> trying to pull that format into his sss software.  If he no longer has
> that message then it can't be documented quickly.

I do not recall getting such a document and we never attempted to
support MS-Word documents in any of our browsers. Sorry, I cannot
help on this one, and in fact my reaction was the same as
Janina's concerning the MS-Word format being described in one of
the ISO standards. Sounds like a pipe dream to me.



>

-- 
The Moon is Waxing Gibbous (97% of Full)
So visit me sometime at http://www.mhonline.net/~chuckh
or you might reach me at chuckh on the jabber network.



^ permalink raw reply	[flat|nested] 26+ messages in thread

* re: word documents
@  Jude DaShiell
   ` Chuck Hallenbeck
  0 siblings, 1 reply; 26+ messages in thread
From: Jude DaShiell @  UTC (permalink / raw)
  To: speakup

Before I lost that email message from George Allen at Microsoft I remember
forwarding it along to Dr. Chuck Hallenbech since he at the time was
trying to pull that format into his sss software.  If he no longer has
that message then it can't be documented quickly.

-- 
Jude <dashielljt(at)gmpexpress-dot-net>


^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
     ` Igor Gueths
@      ` Dave Hunt
  0 siblings, 0 replies; 26+ messages in thread
From: Dave Hunt @  UTC (permalink / raw)
  To: speakup

In general, I don't use pdftotext.  I suggested it as an option.  I
much prefer to submit my pdfs to access.adobe.com.  If the pdf is on a
web site.  I just read the resulting html in a browser.  Otherwise, I
have the text mailed back.  My attempts to use pdftotext have, thus
far, been on small, simple files, and the resulting text has been
usable.  


I code in the power of The Source, maintaining all kernel, dependent
libraries, etc...  


-Dave



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
       ` Jude DaShiell
@        ` Janina Sajka
         ` Kenny Hitt
         ` ccrawford
  2 siblings, 0 replies; 26+ messages in thread
From: Janina Sajka @  UTC (permalink / raw)
  To: speakup

Jude DaShiell writes:
> 
> Actually, the msword format is one of the iso standards formats. 


OH? That's news to me. Care to document this assertion?



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
     ` Charles Crawford
@      ` Jude DaShiell
         ` Janina Sajka
                         ` (2 more replies)
       ` Danny Crone
  1 sibling, 3 replies; 26+ messages in thread
From: Jude DaShiell @  UTC (permalink / raw)
  To: speakup

Actually, the msword format is one of the iso standards formats.  It's a
long time since I've had any dealings with Microsoft so I forgot the
number of the standard they sent me.On Sun, 16 Mar 2003, Charles Crawford
wrote:

> Date: Sun, 16 Mar 2003 11:22:05 -0500
> From: Charles Crawford <CCrawford@ACB.org>
> Reply-To: speakup@braille.uwo.ca
> To: speakup@braille.uwo.ca
> Subject: Re: word documents
>
> Hmmm.   I never thought about the proprietary issue with MS-Word.  I wonder
> if we should not be talking with Microsoft to get at least the formatting
> info available?  Oh yeah, didn't Gates give the
> chinese open source Windows?  Hmmm.
>
> -- charlie.
> At 02:41 AM 03/16/2003 -0500, you wrote:
> >There are Word document viewers for Linux console.  The one I use is
> >called wv.  Another is called antiword.  No doubt, there are more.
> >Because Word is a proprietary format, and the specification is not
> >available, the authors of programs such as wv have had to
> >reverse-engineer a bit.  Because of this, certain things in the Word
> >document may not decode as well as we'd like.  Nonetheless, I use wv
> >and get reasonable results when converting from Word to html.  The
> >resulting html source is quite bloated, but, it's there.
> >
> >For pdf conversion, there's pdftotext.  This is part of the xpdf
> >package, and may already be on your system.  Surprise, it was already
> >on my stock installation of RH 7.2.  the one thing I don't like about
> >pdftotext-s rendering, is that hyperlinks get lost.  To preserve the
> >navigability of pdf documents, I visit <access.adobe.com>, and submit
> >the url of a pdf document (assuming I've found it on the web) to the
> >form.  What comes back is a nice html rendering (links and all).
> >
> >
> >Hope this helps,
> >
> >
> >-Dave
> >
> >
> >_______________________________________________
> >Speakup mailing list
> >Speakup@braille.uwo.ca
> >http://speech.braille.uwo.ca/mailman/listinfo/speakup
>
>
> _______________________________________________
> Speakup mailing list
> Speakup@braille.uwo.ca
> http://speech.braille.uwo.ca/mailman/listinfo/speakup
>

-- 
Jude <dashielljt(at)gmpexpress-dot-net>



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
       [not found] ` <Pine.LNX.4.33.0303160909310.714-100000@athame.gmpexpress.n et>
@    ` Charles Crawford
  0 siblings, 0 replies; 26+ messages in thread
From: Charles Crawford @  UTC (permalink / raw)
  To: speakup

Now June,

         Now there ya go giving good suggestions again.  Now I can't wine 
on about the problem of Word.  Oh well, guess I will just have to go back 
to how we don't have turkish tafffy anymore.  Smile.

-- charlie Crawford.
At 09:12 AM 03/16/2003 -0500, you wrote:
>If you can compile it, there's antiword 0.33 for linux to handle the
>Microsoft problem.  To handle the adobe problem, there's ps2ascii.On Sun,
>16 Mar 2003, Joseph C. Lininger wrote:
>
> > Date: Sun, 16 Mar 2003 00:06:05 -0700 (Mountain Standard Time)
> > From: Joseph C. Lininger <jbahm@pcdesk.net>
> > Reply-To: speakup@braille.uwo.ca
> > To: Speakup List <speakup@braille.uwo.ca>
> > Subject: word documents
> >
> > Hello all,
> > A couple of questions. Is there any way to read a word document in Linux?
> > Yes I know it's a propriotary format, but that's what my prof in class
> > is using. Second, is there a way to read a PDF?
> >
> >
>
>--
>Jude <dashielljt(at)gmpexpress-dot-net>
>
>
>_______________________________________________
>Speakup mailing list
>Speakup@braille.uwo.ca
>http://speech.braille.uwo.ca/mailman/listinfo/speakup



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
   ` Dave Hunt
     ` Igor Gueths
@    ` Charles Crawford
       ` Jude DaShiell
       ` Danny Crone
  1 sibling, 2 replies; 26+ messages in thread
From: Charles Crawford @  UTC (permalink / raw)
  To: speakup

Hmmm.   I never thought about the proprietary issue with MS-Word.  I wonder 
if we should not be talking with Microsoft to get at least the formatting 
info available?  Oh yeah, didn't Gates give the
chinese open source Windows?  Hmmm.

-- charlie.
At 02:41 AM 03/16/2003 -0500, you wrote:
>There are Word document viewers for Linux console.  The one I use is
>called wv.  Another is called antiword.  No doubt, there are more.
>Because Word is a proprietary format, and the specification is not
>available, the authors of programs such as wv have had to
>reverse-engineer a bit.  Because of this, certain things in the Word
>document may not decode as well as we'd like.  Nonetheless, I use wv
>and get reasonable results when converting from Word to html.  The
>resulting html source is quite bloated, but, it's there.
>
>For pdf conversion, there's pdftotext.  This is part of the xpdf
>package, and may already be on your system.  Surprise, it was already
>on my stock installation of RH 7.2.  the one thing I don't like about
>pdftotext-s rendering, is that hyperlinks get lost.  To preserve the
>navigability of pdf documents, I visit <access.adobe.com>, and submit
>the url of a pdf document (assuming I've found it on the web) to the
>form.  What comes back is a nice html rendering (links and all).
>
>
>Hope this helps,
>
>
>-Dave
>
>
>_______________________________________________
>Speakup mailing list
>Speakup@braille.uwo.ca
>http://speech.braille.uwo.ca/mailman/listinfo/speakup



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
   Joseph C. Lininger
                   ` (3 preceding siblings ...)
   ` Ann Parsons
@  ` Charles Crawford
     ` Ann Parsons
       [not found] ` <Pine.LNX.4.33.0303160909310.714-100000@athame.gmpexpress.n et>
  5 siblings, 1 reply; 26+ messages in thread
From: Charles Crawford @  UTC (permalink / raw)
  To: speakup

You can tell the "Prof" in class to convert the ms-word files to text.  I 
would assume someone with the brains to become a professor has the minimum 
amount of smarts to convert a file.

         Janina knows about reading PDF files.  I believe we should be 
telling purveyors of PDF to include TXT or HTML files of the same 
materials.  maybe they will get the message?  Well, maybe not.  Guess I 
have to convert the message to PDF first?  Grin.

-- charlie.

At 12:06 AM 03/16/2003 -0700, you wrote:
>Hello all,
>A couple of questions. Is there any way to read a word document in Linux?
>Yes I know it's a propriotary format, but that's what my prof in class
>is using. Second, is there a way to read a PDF?
>
>--
>Joseph C. Lininger
>jbahm@pcdesk.net
>
>
>_______________________________________________
>Speakup mailing list
>Speakup@braille.uwo.ca
>http://speech.braille.uwo.ca/mailman/listinfo/speakup



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
   ` Dave Hunt
@    ` Igor Gueths
       ` Dave Hunt
     ` Charles Crawford
  1 sibling, 1 reply; 26+ messages in thread
From: Igor Gueths @  UTC (permalink / raw)
  To: speakup

Hi Dave. I have a not so good result with pdftotext. Basically I converted
a fairly large manual written in pdf format to text, the result was a
textfile that didn't have the line rap set properly. The result was having
about half a line of text, and the other half of the line had been lost.
Needless to say the conversion was not accurate. have you been able to get
around this problem and if so how?

May you code in the power of the source,
may the kernel, libraries, and utilities be with you,
throughout all distributions until the end of the epoch.

On Sun, 16 Mar 2003, Dave Hunt wrote:

> There are Word document viewers for Linux console.  The one I use is
> called wv.  Another is called antiword.  No doubt, there are more.
> Because Word is a proprietary format, and the specification is not
> available, the authors of programs such as wv have had to
> reverse-engineer a bit.  Because of this, certain things in the Word
> document may not decode as well as we'd like.  Nonetheless, I use wv
> and get reasonable results when converting from Word to html.  The
> resulting html source is quite bloated, but, it's there.
>
> For pdf conversion, there's pdftotext.  This is part of the xpdf
> package, and may already be on your system.  Surprise, it was already
> on my stock installation of RH 7.2.  the one thing I don't like about
> pdftotext-s rendering, is that hyperlinks get lost.  To preserve the
> navigability of pdf documents, I visit <access.adobe.com>, and submit
> the url of a pdf document (assuming I've found it on the web) to the
> form.  What comes back is a nice html rendering (links and all).
>
>
> Hope this helps,
>
>
> -Dave
>
>
> _______________________________________________
> Speakup mailing list
> Speakup@braille.uwo.ca
> http://speech.braille.uwo.ca/mailman/listinfo/speakup
>



^ permalink raw reply	[flat|nested] 26+ messages in thread

* word documents
   Joseph C. Lininger
                   ` (2 preceding siblings ...)
   ` Jude DaShiell
@  ` Ann Parsons
   ` Charles Crawford
       [not found] ` <Pine.LNX.4.33.0303160909310.714-100000@athame.gmpexpress.n et>
  5 siblings, 0 replies; 26+ messages in thread
From: Ann Parsons @  UTC (permalink / raw)
  To: speakup

Hi all,

Dunnow about PDF, but you can read Word docs with antiword.  It's a
debian package, suspect that it's a red hat one too.

Ann P.

-- 
			Ann K. Parsons  
email:  akp@eznet.net 			ICQ Number:  33006854
WEB SITE:  http://home.eznet.net/~akp
"All that is gold does not glitter.  Not all those who wander are lost."  JRRT



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
   Joseph C. Lininger
   ` Aaron Howell
   ` Dave Hunt
@  ` Jude DaShiell
   ` Ann Parsons
                   ` (2 subsequent siblings)
  5 siblings, 0 replies; 26+ messages in thread
From: Jude DaShiell @  UTC (permalink / raw)
  To: Speakup List

If you can compile it, there's antiword 0.33 for linux to handle the
Microsoft problem.  To handle the adobe problem, there's ps2ascii.On Sun,
16 Mar 2003, Joseph C. Lininger wrote:

> Date: Sun, 16 Mar 2003 00:06:05 -0700 (Mountain Standard Time)
> From: Joseph C. Lininger <jbahm@pcdesk.net>
> Reply-To: speakup@braille.uwo.ca
> To: Speakup List <speakup@braille.uwo.ca>
> Subject: word documents
>
> Hello all,
> A couple of questions. Is there any way to read a word document in Linux?
> Yes I know it's a propriotary format, but that's what my prof in class
> is using. Second, is there a way to read a PDF?
>
>

-- 
Jude <dashielljt(at)gmpexpress-dot-net>



^ permalink raw reply	[flat|nested] 26+ messages in thread

* word documents
   Joseph C. Lininger
   ` Aaron Howell
@  ` Dave Hunt
     ` Igor Gueths
     ` Charles Crawford
   ` Jude DaShiell
                   ` (3 subsequent siblings)
  5 siblings, 2 replies; 26+ messages in thread
From: Dave Hunt @  UTC (permalink / raw)
  To: speakup

There are Word document viewers for Linux console.  The one I use is
called wv.  Another is called antiword.  No doubt, there are more.
Because Word is a proprietary format, and the specification is not
available, the authors of programs such as wv have had to
reverse-engineer a bit.  Because of this, certain things in the Word
document may not decode as well as we'd like.  Nonetheless, I use wv
and get reasonable results when converting from Word to html.  The
resulting html source is quite bloated, but, it's there.

For pdf conversion, there's pdftotext.  This is part of the xpdf
package, and may already be on your system.  Surprise, it was already
on my stock installation of RH 7.2.  the one thing I don't like about
pdftotext-s rendering, is that hyperlinks get lost.  To preserve the
navigability of pdf documents, I visit <access.adobe.com>, and submit
the url of a pdf document (assuming I've found it on the web) to the
form.  What comes back is a nice html rendering (links and all).  


Hope this helps,


-Dave



^ permalink raw reply	[flat|nested] 26+ messages in thread

* Re: word documents
   Joseph C. Lininger
@  ` Aaron Howell
   ` Dave Hunt
                   ` (4 subsequent siblings)
  5 siblings, 0 replies; 26+ messages in thread
From: Aaron Howell @  UTC (permalink / raw)
  To: speakup

There are a couple of ways to achieve this.
word2x or catdoc will both convert word documents to text with varying degrees of accuracy.
You can email pdf documents to pdf2txt@sun.trace.wisc.edu as an attachment and you'll get them back in plain text,
or you can install xpdf which comes with a pdftotext utility.
The advantage of the xpdf route is that its a very simple patch to make it ignore pdf encryption
so if you're ok with building a source package, you can have yourself a pdf converter that doesn't worry about silly  conversions restrictions
like acrobat reader does.
Regards
Aaron
On Sun, Mar 16, 2003 at 12:06:05AM -0700, Joseph C. Lininger wrote:
> Hello all,
> A couple of questions. Is there any way to read a word document in Linux?
> Yes I know it's a propriotary format, but that's what my prof in class
> is using. Second, is there a way to read a PDF?
> 
> -- 
> Joseph C. Lininger
> jbahm@pcdesk.net
> 
> 
> _______________________________________________
> Speakup mailing list
> Speakup@braille.uwo.ca
> http://speech.braille.uwo.ca/mailman/listinfo/speakup


^ permalink raw reply	[flat|nested] 26+ messages in thread

* word documents
@  Joseph C. Lininger
   ` Aaron Howell
                   ` (5 more replies)
  0 siblings, 6 replies; 26+ messages in thread
From: Joseph C. Lininger @  UTC (permalink / raw)
  To: Speakup List

Hello all,
A couple of questions. Is there any way to read a word document in Linux?
Yes I know it's a propriotary format, but that's what my prof in class
is using. Second, is there a way to read a PDF?

-- 
Joseph C. Lininger
jbahm@pcdesk.net



^ permalink raw reply	[flat|nested] 26+ messages in thread

end of thread, other threads:[~ UTC | newest]

Thread overview: 26+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
     [not found] <20030316235502.7926.4073.Mailman@speech.braille.uwo.ca>
 ` word documents Thomas Ward
     [not found] <20030317062559.19403.13142.Mailman@speech.braille.uwo.ca>
 ` Thomas Ward
 Jude DaShiell
 ` Chuck Hallenbeck
  -- strict thread matches above, loose matches on Subject: below --
 Joseph C. Lininger
 ` Aaron Howell
 ` Dave Hunt
   ` Igor Gueths
     ` Dave Hunt
   ` Charles Crawford
     ` Jude DaShiell
       ` Janina Sajka
       ` Kenny Hitt
         ` ccrawford
           ` Kenny Hitt
             ` Erik Heil
             ` William F. Acker WB2FLW +1-303-722-7209
       ` ccrawford
         ` Joseph C. Lininger
     ` Danny Crone
       ` Ann Parsons
 ` Jude DaShiell
 ` Ann Parsons
 ` Charles Crawford
   ` Ann Parsons
     [not found] ` <Pine.LNX.4.33.0303160909310.714-100000@athame.gmpexpress.n et>
   ` Charles Crawford

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).