speechd-discuss

Sound icons handling


From: Trevor Saunders
Subject: Sound icons handling
Date: Wed, 1 Sep 2010 05:26:09 -0400

Hi,

On Wed, Sep 01, 2010 at 11:04:43AM +0200, Hynek Hanke wrote:
> On 1.9.2010 10:16, Halim Sahin wrote:
> >On Wed, Sep 01, 2010 at 01:09:08AM -0400, Trevor Saunders wrote:
> >>that makes no sense, we are asking the module to play a wave file.
> 
> Hello all,
> 
> we used to do sound icons directly in Speech Dispatcher, but
> decided against it. Let me explain.
> 
> We are not asking the synthesizer to play a wave file; we are
> asking the TTS to include a sound among the speech it produces.

I was thinking only of the sound icon command sent to modules here. As far
as I know it is synchronous, so it can't be executing while sound is being
synthesized; all that happens is that the file for that sound icon is
played.  Is that the wrong understanding of that command?

Trev

> This sound can be embedded in the speech mid-sentence (see the SSML
> AUDIO element). There can also be pre-synthesized text in the
> form of audio.
> 
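
To make the mid-sentence case concrete, here is a minimal sketch of a message with an SSML audio element embedded between two pieces of text. The helper name build_ssml and the file name shift.wav are made up for illustration, and the markup is reduced to the bare minimum (a complete SSML document would also carry a namespace and language attribute).

```python
import xml.etree.ElementTree as ET


def build_ssml(before, icon_src, after, fallback=""):
    """Return an SSML string with one <audio> element between two text runs.

    build_ssml is a hypothetical helper, not part of Speech Dispatcher.
    """
    speak = ET.Element("speak", {"version": "1.0"})
    speak.text = before                      # text spoken before the icon
    audio = ET.SubElement(speak, "audio", {"src": icon_src})
    audio.text = fallback                    # spoken if the sound can't be played
    audio.tail = after                       # text spoken after the icon
    return ET.tostring(speak, encoding="unicode")


if __name__ == "__main__":
    # shift.wav is an assumed file name for the example only.
    print(build_ssml("You pressed ", "shift.wav", " and a.", fallback="shift"))
```

The point of the sketch is only that the sound sits inside the message the synthesizer receives, so the synthesizer decides how it is rendered alongside the surrounding speech.
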
> In other cases, such as the SSIP key command, the TTS might
> choose to either synthesize the key name or play
> a sound icon, or often combine the two (play a sound
> for 'shift', synthesize the letter).
> 
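
As a rough illustration of the combined case, the sketch below splits a key name into parts and decides, per part, whether to play an icon or synthesize the name. Everything here is an assumption made for the example: the function plan_key, the icon table, the file names, and the underscore-separated key name "shift_a". It is not how any particular module does it.

```python
# Hypothetical mapping from modifier names to sound icon files.
ICONS = {"shift": "shift.wav", "control": "control.wav"}


def plan_key(key_name, icons=ICONS):
    """Turn a key name such as "shift_a" into a list of output steps.

    Each step is ("icon", file) or ("speak", text). The sketch assumes
    parts are separated by underscores and does not handle a literal
    underscore key.
    """
    plan = []
    for part in key_name.split("_"):
        if part in icons:
            plan.append(("icon", icons[part]))   # play a sound for the modifier
        else:
            plan.append(("speak", part))         # synthesize the rest
    return plan


if __name__ == "__main__":
    print(plan_key("shift_a"))
```

A module that has no icon for a given part simply falls back to speaking it, which is the "either, or combine" choice described in the quoted paragraph.
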
> So the TTS, if it's good, must handle sound icons already.
> Handling the special case where the whole message is composed only
> of the sound icon itself in Speech Dispatcher would be a duplication
> of functionality.
> 
> It is also, in principle, not a task for Speech Dispatcher to
> fabricate any sound data. We can of course speak of emulations
> for those synthesizers that can't support it.
> 
> >>Which synth can play soundicons except espeak?
> 
> Festival. It's also a requirement of both TTS API and SSML.
> 
> This is unrelated to the issue of resampling, because you can
> (and will) get different sample rates from the TTS even when
> switching voices, which can also happen mid-sentence.
> 
> Best regards,
> Hynek
> 
> 

