public inbox for blinux-list@redhat.com
 help / color / mirror / Atom feed
* Concerning BLinux project (fwd)
@  Hans Zoebelein
   ` Concerning BLinux project T.Pospisek's MailLists
   ` Concerning BLinux project (fwd) Jude Dashiell
  0 siblings, 2 replies; 15+ messages in thread
From: Hans Zoebelein @  UTC (permalink / raw)
  To: blinux-list

Hi Blinuxers,

Aaron is looking for a software project in the context of Blinux. 
If we help him out, he might help us out. 
Wha about these CX interface libs? 

Enjoy!
Hans 


---------- Forwarded message ----------
Date: Sat, 5 Dec 1998 08:28:34 -0500 (EST)
From: Corbin <ac1164@messiah.edu>
To: zocki@goldfish.cube.net
Cc: aaron corbin <ac1164@mailman.messiah.edu>
Subject: Concerning BLinux project

Hi, I'm a senior computer science major at Messiah College.  I'm in the
stages of trying to find a senior project.  If I could write something
that would contribute to the BLinux project, that would be great.  But I
don't really know what to write (I just found the website two days ago),
and not being blind, I don't even know what would be really
useful.  Is there anything you could think of that *needs* to be written,
and (sadly) is novel enough that I could use it as a senior project?  Are
you the right person to be asking this?  Should I ask someone else?

Thanks for your time,
Aaron Corbin.    


^ permalink raw reply	[flat|nested] 15+ messages in thread
* Re: OCR software (was Re: Concerning BLinux project (fwd))
@  Lloyd G. Rasmussen
   ` Jude Dashiell
  0 siblings, 1 reply; 15+ messages in thread
From: Lloyd G. Rasmussen @  UTC (permalink / raw)
  To: blinux-list

What you ask for is not likely to be available until artificial 
intelligence goes forward much further.  You are asking a computer 
program which knows the *presentation* of a document to correctly 
infer the *structure* of that document, or at least attempt to do so.  

I recently bought Omnipage 9 for Win95 from Caere Corporation.  Among 
all its export formats, it includes an HTML export format.  From what 
I've seen so far, the objective is to make a GUI web browser display 
the page, with fonts, italics, centering, intact.  The HTML is  a 
series of <p> and <br> with Font, I, Align attributes.  No structure.  
It even claims to conform to the HTML 3.0 DTD, and tells you that the 
generator is Adobe Word for Word.  I know that HTML is not SGML.  But 
I'm not too hopeful that when OCR programs begin exporting XML, that 
they will do much better than this. 

I know that Duxbury attempts to create styles in a file which it has 
imported from ASCII, but this is usually just a beginning toward 
correctly marking up a document.  I agree that you're aiming for the 
right objective, but I don't know how we're going to get there.

On Mon, 7 Dec 1998 09:28:03 +1100 (AEDT), 
Jason White   <jasonw@ariel.ucs.unimelb.EDU.AU> wrote:

>On the subject of freely available OCR software, currently under
>development, see http://www.socr.org/
>
>What is most needed as output is not straightforward ASCII text, but
>rather a document which has been marked up in SGML, XML or a related
>language, that preserves its structure and maintains the distinctions
>necessary for the generation of high quality braille and audio output.
>
>
>---
>Send your message for blinux-list to blinux-list@redhat.com
>Blinux software archive at ftp://leb.net/pub/blinux
>Blinux web page at http://leb.net/blinux
>To unsubscribe send mail to blinux-list-request@redhat.com
>with subject line: unsubscribe
>

-- Lloyd Rasmussen
Senior Staff Engineer, Engineering Section
National Library Service for the  Blind and Physically Handicapped
Library of Congress          202-707-0535
(work)       lras@loc.gov    http://www.loc.gov/nls/
(home) lras@sprynet.com http://home.sprynet.com/sprynet/lras/      


^ permalink raw reply	[flat|nested] 15+ messages in thread

end of thread, other threads:[~ UTC | newest]

Thread overview: 15+ messages (download: mbox.gz / follow: Atom feed)
-- links below jump to the message on this page --
 Concerning BLinux project (fwd) Hans Zoebelein
 ` Concerning BLinux project T.Pospisek's MailLists
 ` Concerning BLinux project (fwd) Jude Dashiell
   ` OCR software (was Re: Concerning BLinux project (fwd)) Jason White
     ` Jude Dashiell
       ` Jack Berdeaux
       ` Ron Marriage
     ` Jack Berdeaux
       ` Jude Dashiell
   ` Concerning BLinux project (fwd) Jason White
     ` Jason White
       ` aaa
         ` Dave Mielke
 OCR software (was Re: Concerning BLinux project (fwd)) Lloyd G. Rasmussen
 ` Jude Dashiell

This is a public inbox, see mirroring instructions
for how to clone and mirror all data and code used for this inbox;
as well as URLs for read-only IMAP folder(s) and NNTP newsgroup(s).