speechd-discuss
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Comments on the Text to Speech "algorithm"


From: Christopher Brannon
Subject: Comments on the Text to Speech "algorithm"
Date: Sun, 14 Nov 2010 18:26:24 +0000

Gilles Casse <gcasse at oralux.org> writes:

> - 78000 pre-recorded words,
> - 5GB of disk space, wav files, 44100Hz,
> - Each word is a separate wav file having the same name as the word in
>   the text.

What happens if those WAV files are encoded as 32 kbit/s OGG/Vorbis or
MP3 files?  The disk space requirement would shrink to approximately 120
megabytes.  The most-frequently-used words could be cached in memory.
Let's suppose a cache of 10000 words, for a memory footprint of 12
megabytes.  That's acceptable on modern hardware, and it should be
fairly responsive.

I still don't see the appeal of this technique, but to each his own.

-- Chris
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: not available
URL: 
<http://lists.freebsoft.org/pipermail/speechd/attachments/20101114/78857c49/attachment.pgp>


reply via email to

[Prev in Thread] Current Thread [Next in Thread]