Re: [gnuspeech-contact] Early GNUSpeech observations

gnuspeech-contact

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [gnuspeech-contact] Early GNUSpeech observations

From:	Jason White
Subject:	Re: [gnuspeech-contact] Early GNUSpeech observations
Date:	Fri, 10 Apr 2009 12:37:56 +1000
User-agent:	Mutt/1.5.18 (2008-05-17)

David Hill <address@hidden> wrote:

> This sound was put in there deliberately by Steve Nygard to make sure it 
> was clearly understood that the system was not dealing with parts of the 
> input, because (as you guess) the parser (which does all kinds of things 
> including dictionary derivatives, arranging numbers and dates to be 
> spoken in the way people speak them, and so on) is by no means completely 
> ported.  It is probably the very next job because it makes a big 
> different to the overall quality of the spoken output.

Thank you for the explanation, which is much appreciated. Now I understand why
plurals, some past tenses, and certain names are omitted from the
output. I now observe that many of the function declarations in
parser_module.m have been commented out, indicating that porting is still in
progress.

> Also, the dictionary should be expanded -- a project that got put on  
> hold when the NeXT & NeXT software disappeared.  All sorts of proper  
> names/nouns need to be added, including city and country names, people's 
> names, and so on.  It has been more important recently to get the basic 
> software up on GNU/Linux and the Mac.

Indeed so. Bringing the dictionary into accord with the phonetics of the
synthesizer would improve the spoken output markedly, I think, as exemplified
by those pronounced "r" and "l" sounds that need to be addressed.

> Again, the intonation rules, based on the M.A.K. Halliday's intonation 
> scheme for British English, were being refined.  Craig [Taube-] Schock 
> wrote his thesis on the topic under my supervision ("Intonation for 
> Computer Speech Output" -- University of Calgary Dept. of Computer 
> Science 1993) and received the Governor General's Gold Medal for it, but 
> the method had already been greatly improved when we released the new 
> articulatory synthesis software in 1994-5.

In the current, partially ported, version the intonation tends to rise sharply
until punctuation is encountered. I have noticed, however, that the parsing
code inserts markers into the phonetic string which is forwarded to the rest
of the synthesizer for processing, and I surmise that these affect, among
other parameters, the intonation. Thus I will listen again when the parser has
been ported and refined.

With thanks and regards,

Jason.

[Prev in Thread]

Current Thread

[Next in Thread]

[gnuspeech-contact] Early GNUSpeech observations, Jason White, 2009/04/08
- Re: [gnuspeech-contact] Early GNUSpeech observations, David Hill, 2009/04/08
  - Re: [gnuspeech-contact] Early GNUSpeech observations, Jason White <=
    - Re: [gnuspeech-contact] Early GNUSpeech observations, David Hill, 2009/04/21
    - Re: [gnuspeech-contact] Early GNUSpeech observations, Marcelo Yassunori Matuda, 2009/04/21

Prev by Date: [gnuspeech-contact] Re: gnuspeech
Next by Date: [gnuspeech-contact] Monet Update
Previous by thread: Re: [gnuspeech-contact] Early GNUSpeech observations
Next by thread: Re: [gnuspeech-contact] Early GNUSpeech observations
Index(es):
- Date
- Thread