|
From: | Marcelo Y. Matuda |
Subject: | Re: [gnuspeech-contact] Quickstart for the latest Gnuspeech? |
Date: | Sun, 1 Nov 2015 21:14:33 -0200 |
User-agent: | Mozilla/5.0 (X11; Linux x86_64; rv:38.0) Gecko/20100101 Thunderbird/38.3.0 |
Hi, On 11/01/2015 03:45 PM, Advrk Aplmrkt wrote:
Thanks for the links, and I agree a proper man page or quickstart guide would be super useful for end users! (and not just speech synthesis researchers) I checked out the YouTube videos, and I confess it was hard for me to understand what Gnuspeech was saying... Is there a reason why it doesn't sound nearly as natural as, say, Siri yet???
Siri uses a method called Unit Selection (AFAIK), which joins segments of recorded speech. That is why the quality can be so good.
Gnuspeech uses articulatory synthesis, which uses a mathematical model of the human vocal tract to synthesize the speech from scratch. It is very difficult to adjust the many parameters. Also GnuspeechSA is a C++ port of the original TTS_Server (for NeXTSTEP), developed a long time ago. It doesn't yet incorporate the research done in all these years. Hopefully articulatory synthesis will reach the quality of unit selection, but there is much work to do.
Regards, Marcelo
[Prev in Thread] | Current Thread | [Next in Thread] |