From mboxrd@z Thu Jan  1 00:00:00 1970
Return-Path: <kerry@gotss.net>
Received: from smtp1.wa.amnet.net.au (smtp1.wa.amnet.net.au [203.161.124.50])
	by speech.braille.uwo.ca (Postfix) with ESMTP id E02B7109CF
	for <speakup@braille.uwo.ca>; Sun, 13 Jul 2008 10:19:40 -0400 (EDT)
Received: from localhost (localhost.localdomain [127.0.0.1])
	by smtp1.wa.amnet.net.au (Postfix) with ESMTP id A1C567D55D
	for <speakup@braille.uwo.ca>; Sun, 13 Jul 2008 22:19:38 +0800 (WST)
X-Virus-Scanned: amavisd-new at smtp1.wa.amnet.net.au
Received: from smtp1.wa.amnet.net.au ([127.0.0.1])
	by localhost (smtp1.wa.amnet.net.au [127.0.0.1]) (amavisd-new,
	port 10024) with ESMTP id ge6pwiRyppnG for <speakup@braille.uwo.ca>;
	Sun, 13 Jul 2008 22:19:37 +0800 (WST)
Received: from gotss1.gotss.net (203.161.101.89.static.amnet.net.au
	[203.161.101.89]) (using TLSv1 with cipher AES256-SHA (256/256 bits))
	(No client certificate requested)
	by smtp1.wa.amnet.net.au (Postfix) with ESMTP id EE3DA7D55C
	for <speakup@braille.uwo.ca>; Sun, 13 Jul 2008 22:19:36 +0800 (WST)
Received: from bouncy.gotss.net ([192.168.24.37] helo=bouncy)
	by gotss1.gotss.net with smtp (Exim 4.69)
	(envelope-from <kerry@gotss.net>) id 1KI2QH-00006t-7D
	for speakup@braille.uwo.ca; Sun, 13 Jul 2008 22:19:33 +0800
Message-ID: <000b01c8e4f3$78af0120$2518a8c0@bouncy>
From: "Kerry Hoath" <kerry@gotss.net>
To: "Speakup is a screen review system for Linux." <speakup@braille.uwo.ca>
References: <4879E36B.6010408@brailcom.org>
Subject: Re: status of speakup support for espeak
Date: Sun, 13 Jul 2008 22:19:33 +0800
MIME-Version: 1.0
Content-Type: text/plain;
	charset="iso-8859-1"
Content-Transfer-Encoding: 7bit
X-Priority: 3
X-MSMail-Priority: Normal
X-Mailer: Microsoft Outlook Express 6.00.2900.3138
X-MimeOLE: Produced By Microsoft MimeOLE V6.00.2900.3198
X-BeenThere: speakup@braille.uwo.ca
X-Mailman-Version: 2.1.10
Precedence: list
Reply-To: "Speakup is a screen review system for Linux."
	<speakup@braille.uwo.ca>
List-Id: "Speakup is a screen review system for Linux."
	<speakup.braille.uwo.ca>
List-Unsubscribe: <http://speech.braille.uwo.ca/mailman/options/speakup>,
	<mailto:speakup-request@braille.uwo.ca?subject=unsubscribe>
List-Archive: <http://speech.braille.uwo.ca/pipermail/speakup>
List-Post: <mailto:speakup@braille.uwo.ca>
List-Help: <mailto:speakup-request@braille.uwo.ca?subject=help>
List-Subscribe: <http://speech.braille.uwo.ca/mailman/listinfo/speakup>,
	<mailto:speakup-request@braille.uwo.ca?subject=subscribe>
X-List-Received-Date: Sun, 13 Jul 2008 14:19:41 -0000

Original text removed.
Firstly let us sort out what people mean by quality.

Sounds better verses sounds nicer? Nicer by whose definition?
I personally find the festival voices bloody awful and the internation 
unpleasant.
At high speeds they become unintelligible and they take a lot of processer 
time to synthesize speech.
This is only my personal preference however. I don't want my computer to 
sound like a person;
after all it is my computer synthesizing speech, not my wife reading to me 
;-)
I don't want my computer sounding like the star trek computer;
as the star trek computer takes all day to say what it means.
Human speech is hard to understand at speeds >400 words per minute;
synthesized speech such as that found in espeak seems to work far better at 
these speeds.
I have the same complaint regarding apple's voices;
they sound natural but are barely understandable at high speeds and perform
sluggishly.
I am a long time user of hardware speech; accent, artic transport, 
doubletalk etc.
I find software speech performs sluggishly in comparison especially a system
like festival that seems loaded down with so much extra functionality.
Certainly, festival is a flexible and configurable system;
but I have no desire to learn scheme to read my mail,
and the disk space footprint for festival is quite large. The higher quality 
the voices; the more disk space used and the more data needs to go to the 
soundcard.


Just because I have a 2ghz processer does not mean I want to use a lion's 
share of it to synthesize speech.
I tend to find the lag time between an application sending speech to the 
synthesizer setup and the
speech beeing synthesized annoying on most systems,
Jaws, Windoweyes and hardware speech responding as fast as i'd like.

I find espeak responds quickly, and the speech is tolerable to listen to 
once you get used to it.
Initially the default british english has far too much top end for my ears 
to handle, and it is so very loud.

As someone who has used the echo gp and old school speech;
I am perhaps more tollerant than most regarding quality.
I'd much rather use espeak on the mac rather than the slow built-in voices 
such as fred and alex.
Cepstral sounds nice; but still takes an inordinate amount of time to 
synthesize.
we're working at getting hardware speech on the mac, and think that it
will greatly increase the responsiveness of voiceover.
Our doubletalk box will have open specs and will work on USB and serial, 
making
it an alternative for modern hardware speech.
Regards, Kerry.