From mboxrd@z Thu Jan 1 00:00:00 1970 Return-Path: Received: from smtp1.wa.amnet.net.au (smtp1.wa.amnet.net.au [203.161.124.50]) by speech.braille.uwo.ca (Postfix) with ESMTP id E02B7109CF for ; Sun, 13 Jul 2008 10:19:40 -0400 (EDT) Received: from localhost (localhost.localdomain [127.0.0.1]) by smtp1.wa.amnet.net.au (Postfix) with ESMTP id A1C567D55D for ; Sun, 13 Jul 2008 22:19:38 +0800 (WST) X-Virus-Scanned: amavisd-new at smtp1.wa.amnet.net.au Received: from smtp1.wa.amnet.net.au ([127.0.0.1]) by localhost (smtp1.wa.amnet.net.au [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ge6pwiRyppnG for ; Sun, 13 Jul 2008 22:19:37 +0800 (WST) Received: from gotss1.gotss.net (203.161.101.89.static.amnet.net.au [203.161.101.89]) (using TLSv1 with cipher AES256-SHA (256/256 bits)) (No client certificate requested) by smtp1.wa.amnet.net.au (Postfix) with ESMTP id EE3DA7D55C for ; Sun, 13 Jul 2008 22:19:36 +0800 (WST) Received: from bouncy.gotss.net ([192.168.24.37] helo=bouncy) by gotss1.gotss.net with smtp (Exim 4.69) (envelope-from ) id 1KI2QH-00006t-7D for speakup@braille.uwo.ca; Sun, 13 Jul 2008 22:19:33 +0800 Message-ID: <000b01c8e4f3$78af0120$2518a8c0@bouncy> From: "Kerry Hoath" To: "Speakup is a screen review system for Linux." References: <4879E36B.6010408@brailcom.org> Subject: Re: status of speakup support for espeak Date: Sun, 13 Jul 2008 22:19:33 +0800 MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit X-Priority: 3 X-MSMail-Priority: Normal X-Mailer: Microsoft Outlook Express 6.00.2900.3138 X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3198 X-BeenThere: speakup@braille.uwo.ca X-Mailman-Version: 2.1.10 Precedence: list Reply-To: "Speakup is a screen review system for Linux." List-Id: "Speakup is a screen review system for Linux." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 13 Jul 2008 14:19:41 -0000 Original text removed. Firstly let us sort out what people mean by quality. Sounds better verses sounds nicer? Nicer by whose definition? I personally find the festival voices bloody awful and the internation unpleasant. At high speeds they become unintelligible and they take a lot of processer time to synthesize speech. This is only my personal preference however. I don't want my computer to sound like a person; after all it is my computer synthesizing speech, not my wife reading to me ;-) I don't want my computer sounding like the star trek computer; as the star trek computer takes all day to say what it means. Human speech is hard to understand at speeds >400 words per minute; synthesized speech such as that found in espeak seems to work far better at these speeds. I have the same complaint regarding apple's voices; they sound natural but are barely understandable at high speeds and perform sluggishly. I am a long time user of hardware speech; accent, artic transport, doubletalk etc. I find software speech performs sluggishly in comparison especially a system like festival that seems loaded down with so much extra functionality. Certainly, festival is a flexible and configurable system; but I have no desire to learn scheme to read my mail, and the disk space footprint for festival is quite large. The higher quality the voices; the more disk space used and the more data needs to go to the soundcard. Just because I have a 2ghz processer does not mean I want to use a lion's share of it to synthesize speech. I tend to find the lag time between an application sending speech to the synthesizer setup and the speech beeing synthesized annoying on most systems, Jaws, Windoweyes and hardware speech responding as fast as i'd like. I find espeak responds quickly, and the speech is tolerable to listen to once you get used to it. Initially the default british english has far too much top end for my ears to handle, and it is so very loud. As someone who has used the echo gp and old school speech; I am perhaps more tollerant than most regarding quality. I'd much rather use espeak on the mac rather than the slow built-in voices such as fred and alex. Cepstral sounds nice; but still takes an inordinate amount of time to synthesize. we're working at getting hardware speech on the mac, and think that it will greatly increase the responsiveness of voiceover. Our doubletalk box will have open specs and will work on USB and serial, making it an alternative for modern hardware speech. Regards, Kerry.