Comments on the Text to Speech "algorithm"

speechd-discuss

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Comments on the Text to Speech "algorithm"

From:	Christopher Brannon
Subject:	Comments on the Text to Speech "algorithm"
Date:	Sun, 14 Nov 2010 18:26:24 +0000

Gilles Casse <gcasse at oralux.org> writes:

> - 78000 pre-recorded words,
> - 5GB of disk space, wav files, 44100Hz,
> - Each word is a separate wav file having the same name as the word in
>   the text.

What happens if those WAV files are encoded as 32 kbit/s OGG/Vorbis or
MP3 files?  The disk space requirement would shrink to approximately 120
megabytes.  The most-frequently-used words could be cached in memory.
Let's suppose a cache of 10000 words, for a memory footprint of 12
megabytes.  That's acceptable on modern hardware, and it should be
fairly responsive.

I still don't see the appeal of this technique, but to each his own.

-- Chris
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 197 bytes
Desc: not available
URL: 
<http://lists.freebsoft.org/pipermail/speechd/attachments/20101114/78857c49/attachment.pgp>

[Prev in Thread]

Current Thread

[Next in Thread]

Comments on the Text to Speech "algorithm", Gilles Casse, 2010/11/14
- Comments on the Text to Speech "algorithm", Christopher Brannon <=
  - Comments on the Text to Speech "algorithm", William Hubbs, 2010/11/14
  - Comments on the Text to Speech "algorithm", Bohdan R . Rau, 2010/11/14
  - Comments on the Text to Speech "algorithm", Gilles Casse, 2010/11/15
- Comments on the Text to Speech "algorithm", Marc Coevoet, 2010/11/15

Prev by Date: Comments on the Text to Speech "algorithm"
Next by Date: Comments on the Text to Speech "algorithm"
Previous by thread: Comments on the Text to Speech "algorithm"
Next by thread: Comments on the Text to Speech "algorithm"
Index(es):
- Date
- Thread