VisemeSchedulerSphinx4

Overview

VisemeSchedulerSphinx4 is a tool that generates lip-sync timing data using the Sphinx-4 speech recognition library (http://cmusphinx.sourceforge.net/sphinx4).

Quick facts:

Location: /tools/VisemeSchedulerSphinx4
Language: C#
Distribution: source
Platform(s): Windows

Users

Creating .bml files for audio files

These instructions explain how to generate a SmartBody .bml lip-sync timing file from a .wav file and its .txt transcript.

This tool will do the following for each .wav file in the specified folder:

~~Run the .wav through Sox to get the audio value at each sample rate~~
~~Map the value to a viseme weight~~
~~Process the .txt transcript of the audio and do an even distribution of the word break data based on the length of the audio file~~
~~Write out the .bml file~~

Usage: ~~VisemeSchedulerSox <folder containing sounds> [<viseme to use>] [<scale factor>] [<sampleLength in ms>]~~

where

~~- folder is a relative or absolute path to the folder containing the .wav and .txt files~~
~~- (default 'Ao') viseme to use is the single viseme it will use to map the audio to~~
~~- (default 6) scale factor is a multiply factor to the viseme weight data. If your results are too low or too high, you can adjust this value.~~
~~- (default 100) sample length is how much to split the audio file up for analyzing.~~

~~Run the VisemeSchedulerSox tool:~~

~~e.g. VisemeSchedulerSox.exe ..\..\data\sounds~~

This will create a set of SmartBody .bml files to use with your audio files.
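
Because the tool works from a .wav file together with a .txt transcript, it can help to check a folder for missing transcripts before running it. The Python sketch below is not part of the tool. It assumes that each transcript shares its base name with the audio file (for example line1.wav and line1.txt), which this page does not confirm, and the function name find_pairs is hypothetical.

```python
from pathlib import Path
import sys


def find_pairs(folder):
    """Return (paired, missing) for the .wav files in `folder`.

    Assumption: a transcript has the same base name as its audio file,
    e.g. line1.wav <-> line1.txt. This naming rule is a guess, not
    something the tool documents.
    """
    paired, missing = [], []
    for wav in sorted(Path(folder).glob("*.wav")):
        txt = wav.with_suffix(".txt")
        if txt.is_file():
            paired.append((wav.name, txt.name))
        else:
            missing.append(wav.name)
    return paired, missing


if __name__ == "__main__":
    # Folder to check; defaults to the current directory.
    folder = sys.argv[1] if len(sys.argv) > 1 else "."
    paired, missing = find_pairs(folder)
    print(f"{len(paired)} .wav file(s) with a transcript")
    for name in missing:
        print(f"missing transcript for {name}")
```

Any .wav file reported as missing a transcript would need a matching .txt file added before the folder is processed.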

FAQ

See the Main FAQ. Please use the Google Groups mailing list for unlisted questions.