TTS algorithms (Re: Comments on the Text to Speech "algorithm")

speechd-discuss

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

TTS algorithms (Re: Comments on the Text to Speech "algorithm")

From:	marc
Subject:	TTS algorithms (Re: Comments on the Text to Speech "algorithm")
Date:	Sun, 28 Feb 2010 11:30:56 +0100

Klaus Knopper wrote:
> Hi,
> 
> Maybe I'm missing something, but as far as I understood the question,
> marc just asked about a common procedure called "unit selection" which
> is the algorithm of many text-to-speech synthesizers.
> 

Well, I know, in TTS they use some kind of FFT (Fourier Transform) etc 
that mathematically construct a soundwave.

My guess is that mp3/ogg/wav/whatever would sound better.

> Back to unit selection: Because of time-critical issues, selection and
> processing of real recordings requires a lot of IO throughput, so you
> will need a very fast harddisk (maybe raid) or database, possibly cached
> in RAM, or just accept the output data to be generated "offline" with
> playback a few seconds or even minutes after the original text was sent,
> output being in form of a WAV, Ogg or also the aforementioned MP3 if you
> don't mind using a patented format with its problematic legal issues.
> 

Well, ever heard of a tree? If a word starts with:
all words with a in a/
all words with b in b/
...
all words with aa in a/a/
all words with ac in a/c/
..
all words in aaa in /a/a/a
..
all words in prononciation in /p/r/o/n/u/n/c/i/a/t/i/o/n/.words

I suppose you have a fast access here, and after a while, you have a lot 
of these sound files in memory.

> "Mary" from the DFKI uses unit selection, it is open source and written
> in JAVA, but there is no plugin for speechd yet. It may be possible to
> use it as commandline-based external program for speechd still.
> 
> http://mary.dfki.de/

I will take a look at it.

My problem is that my own language (Dutch) does not sound very good in 
synthesised speech ..

And I know some GPSses use ogg/mp3 words (eg the Mappy.fr uses Ogg).

In Belgium, blind/handicapped people get 90% subsidies for ability 
technologies (speech etc), but would an European funding be an idea?
http://cordis.lu

Marc
-- 
What's on Shortwave guide: choose an hour, go!
http://shortwave.tk
700+ Radio Stations on SW http://swstations.tk
300+ languages on SW http://radiolanguages.tk

[Prev in Thread]

Current Thread

[Next in Thread]

Comments on the Text to Speech "algorithm", marc, 2010/02/27
- Comments on the Text to Speech "algorithm", Kenny Hitt, 2010/02/27
  - Comments on the Text to Speech "algorithm", A, 2010/02/27
    - TTS algorithms (Re: Comments on the Text to Speech "algorithm"), Klaus Knopper, 2010/02/28
    - TTS algorithms (Re: Comments on the Text to Speech "algorithm"), A, 2010/02/28
    - TTS algorithms (Re: Comments on the Text to Speech "algorithm"), marc <=
    - TTS algorithms (Re: Comments on the Text to Speech "algorithm"), Jonathan Duddington, 2010/02/28
    - TTS algorithms (Re: Comments on the Text to Speech "algorithm"), Klaus Knopper, 2010/02/28
    - TTS algorithms (Re: Comments on the Text to Speech "algorithm"), marc, 2010/02/28
    - TTS algorithms (Re: Comments on the Text to Speech "algorithm"), Bill Cox, 2010/02/28
    - Comments on the Text to Speech "algorithm", Bill Cox, 2010/02/28
- Comments on the Text to Speech "algorithm", Tim Cross, 2010/02/28
  - Comments on the Text to Speech "algorithm", Chris Brannon, 2010/02/28

Prev by Date: Comments on the Text to Speech "algorithm"
Next by Date: TTS algorithms (Re: Comments on the Text to Speech "algorithm")
Previous by thread: TTS algorithms (Re: Comments on the Text to Speech "algorithm")
Next by thread: TTS algorithms (Re: Comments on the Text to Speech "algorithm")
Index(es):
- Date
- Thread