gnuspeech-contact
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [gnuspeech-contact] Re: Organising gnuspeech source(s) (David Hill)


From: Dalmazio Brisinda
Subject: Re: [gnuspeech-contact] Re: Organising gnuspeech source(s) (David Hill)
Date: Sun, 16 Nov 2008 16:10:30 -0600

Hello Marcelo,

[...]

I have written/modified the makefiles (in the attachment), and the sequence is:

1) Compile/install the Frameworks/Tube files as a library (libGnuSpeechTube).
2) Compile/install the Frameworks/GnuSpeech files as a library (libGnuSpeech).
3) Compile/install Applications/Monet.
4) Compile/install Applications/PreMo.

This is a manual process.

In the attachment, the file INSTALL.gnustep contains the installation
procedure, and the file modifications.txt the adjustments that I have
had to do.

Cool. Thanks for this. I'll make these changes and commit to the repository shortly, save for one. Also I'll wait a little to commit the gorm changes until we can verify nib 3.x compatibility (see below). Here's the one change which may be problematic:

- In Applications/Monet/MSynthesisController.m:
(R1/R2 were always zero before)

@@ -424,7 +424,7 @@
     setDriftGenerator([driftDeviationField floatValue], 500, [driftCutoffField floatValue]);
     //setDriftGenerator(0.5, 250, 0.5);

-    [eventList setRadiusMultiply:[radiusMultiplyField doubleValue]];
+    //[eventList setRadiusMultiply:[radiusMultiplyField doubleValue]];

I found that making this change disables the effects of the Radius Multiply slider/field in the Intonation Parameters window when the intonation check box is checked. Currently it does affect r1/r2. Can you confirm this on Linux/Gnustep? It's odd that it should give different results on Linux/Gnustep vs. OS X.




I can synthesize a speech, but with no intonation. And the speech is
very slow. It seems that the tempo is not being set. And the output
file is always in AU format, even when I select WAV.

Yes, actually, there is no intonation in the Mac version either by default (flat intonation). I don't know what's required to make this work at present, not until I can put aside some time to look more closely under the hood. It may well be a series of settings. David might know more about this. David?

I'll send you some speech rendered into the 3 supported formats in a following email, AU, AIFF, WAV as produced by Monet. My media info app tells me that they are AU, AIFF, and WAV files. Can you read and play these on your system? Could you try generating the audio and comparing to these files? The speech rendered is the "I know you believe you understand what you think I said..." sentence.




In PreMo, the dictionary is not working. You are using ndbm or gdbm in MacOSX?

I haven't made any changes to the functionality, and it looks like it uses dbm by default. When I run the application it always prints out "Using DBM dictionary."

    if ([[NSUserDefaults standardUserDefaults] boolForKey:@"ShouldUseDBMFile"]) {
        NSLog(@"Using DBM dictionary.");
        [self _createDBMFileIfNecessary];
        dictionary = [[GSDBMPronunciationDictionary mainDictionary] retain];
    } else {
        NSLog(@"Using simple dictionary.");
        dictionary = [[GSSimplePronunciationDictionary mainDictionary] retain];
        [dictionary loadDictionaryIfNecessary];
    }


The key line in the GSDBMPronunciationDictionary class (located in the GnuSpeech framework source under the TextProcessing directory) is:

    newDB = dbm_open([aFilename UTF8String], O_RDWR | O_CREAT, 0660);

Also, the GSSimplePronunciationDictionary class loads "2.0eMainDictionary.dict" located in the GnuSpeech framework source under the TextProcessing directory. 




Yes, they are. When I imported everything, all the nibs were in 2.x format,
but they wouldn't load in Xcode/Interface Builder. So I had to massage them
a bit (add template info.nib files which allowed them to open in Interface
Builder). But anyway, whatever is required for cross-platform compatibility
is easy to change, whether xib, 3.x nib, or 2.x nib. I should probably use
nib 3.x format if xib is not supported in gorm. Assuming gorm can handle nib
3.x.

You could send me an example of nib 3.x file. Then I would test it
using Gorm. If we can't convert between nib 3.x and gorm the work will
be doubled.


I've attached a gzip'd tarball of the English.lproj directory for the PreMo application which contains both a nib 3.x and a xib 3.x version of the interface.

Attachment: English.lproj.tar.gz
Description: GNU Zip compressed data


Best,
dalmazio




reply via email to

[Prev in Thread] Current Thread [Next in Thread]