[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Comments on the Text to Speech "algorithm"
From: |
Christopher Brannon |
Subject: |
Comments on the Text to Speech "algorithm" |
Date: |
Sun, 14 Nov 2010 18:26:24 +0000 |
Gilles Casse <gcasse at oralux.org> writes:
> - 78000 pre-recorded words,
> - 5GB of disk space, wav files, 44100Hz,
> - Each word is a separate wav file having the same name as the word in
> the text.
What happens if those WAV files are encoded as 32 kbit/s OGG/Vorbis or
MP3 files? The disk space requirement would shrink to approximately 120
megabytes. The most-frequently-used words could be cached in memory.
Let's suppose a cache of 10000 words, for a memory footprint of 12
megabytes. That's acceptable on modern hardware, and it should be
fairly responsive.
I still don't see the appeal of this technique, but to each his own.
-- Chris
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: not available
URL:
<http://lists.freebsoft.org/pipermail/speechd/attachments/20101114/78857c49/attachment.pgp>