Overview

VisemeSchedulerSphinx4 is a tool that generates lip-sync using the Sphinx-4 library (http://cmusphinx.sourceforge.net/sphinx4).

Quick facts:

  • Location: /tools/VisemeSchedulerSphinx4
  • Language: C#
  • Distribution: source
  • Platform(s): Windows

Users

Creating .bml files for audio files

These instructions explain how to generate a SmartBody .bml lip-sync timing file from a .wav file and its .txt transcript.

For each .wav file in the specified folder, the tool does the following:

  • Runs the .wav file through Sox to get the audio value at each sample interval
  • Maps each value to a viseme weight
  • Processes the .txt transcript of the audio and distributes the word breaks evenly across the length of the audio file
  • Writes out the .bml file
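The even distribution of word breaks can be sketched as follows. This is only an illustration of the idea, not the tool's actual code: the function name `distribute_word_breaks` and the (word, start, end) output shape are hypothetical, and it assumes the transcript is split on whitespace.

```python
def distribute_word_breaks(transcript, audio_length_s):
    """Give every word in the transcript an equal share of the audio duration.

    Hypothetical helper: returns a list of (word, start_s, end_s) tuples.
    """
    words = transcript.split()
    if not words:
        return []
    slot = audio_length_s / len(words)  # equal time slot per word
    return [(word, i * slot, (i + 1) * slot) for i, word in enumerate(words)]


if __name__ == "__main__":
    for word, start, end in distribute_word_breaks("hello big world", 3.0):
        print(f"{word}: {start:.2f}s - {end:.2f}s")
```

Because the slots are equal, the word timings only approximate the real speech; they do not come from aligning the transcript against the audio.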


Usage: VisemeSchedulerSox <folder containing sounds> [<viseme to use>] [<scale factor>] [<sampleLength in ms>]

where

- folder containing sounds is a relative or absolute path to the folder containing the .wav and .txt files
- viseme to use (default 'Ao') is the single viseme that the audio is mapped to
- scale factor (default 6) is a multiplier applied to the viseme weight data. If your results are too low or too high, adjust this value.
- sampleLength (default 100) is the length, in milliseconds, of each slice the audio file is split into for analysis
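To show how the scale factor and sample length interact, here is a minimal sketch of mapping audio values to viseme weights. It is an assumption-laden illustration, not the tool's implementation: the tool gets its audio values from Sox, while this sketch takes samples already normalized to [-1, 1], uses the mean absolute amplitude per slice, and clamps the weight to 1.0. The function name `viseme_weights` and those choices are hypothetical.

```python
def viseme_weights(samples, sample_rate_hz, sample_length_ms=100, scale=6.0):
    """Return (time_s, weight) pairs, one per slice of sample_length_ms.

    Assumptions (not taken from the tool): samples are floats in [-1, 1],
    the level of a slice is its mean absolute amplitude, and the scaled
    weight is clamped to a maximum of 1.0.
    """
    window = max(1, int(sample_rate_hz * sample_length_ms / 1000))
    weights = []
    for start in range(0, len(samples), window):
        chunk = samples[start:start + window]
        level = sum(abs(s) for s in chunk) / len(chunk)
        weights.append((start / sample_rate_hz, min(1.0, level * scale)))
    return weights


if __name__ == "__main__":
    quiet_then_loud = [0.1] * 10 + [0.5] * 10
    for time_s, weight in viseme_weights(quiet_then_loud, sample_rate_hz=100):
        print(f"{time_s:.1f}s -> weight {weight:.2f}")
```

Under these assumptions, a larger scale factor raises every weight (useful for quiet recordings), and a shorter sample length produces more, finer-grained weight values.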

Run the VisemeSchedulerSox tool:

e.g. VisemeSchedulerSox.exe ..\..\data\sounds

This creates a SmartBody .bml file for each audio file in the folder.
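Since the tool expects both .wav and .txt files in the folder, it can help to check the folder before running it. The sketch below assumes that each transcript shares its base name with its .wav file (for example line01.wav and line01.txt); that naming rule and the helper name `find_missing_transcripts` are assumptions, not something this page confirms.

```python
from pathlib import Path


def find_missing_transcripts(folder):
    """Return the names of .wav files in folder that lack a same-named .txt file.

    Assumes transcripts share the .wav file's base name (hypothetical rule).
    """
    folder = Path(folder)
    return sorted(
        wav.name
        for wav in folder.glob("*.wav")
        if not wav.with_suffix(".txt").exists()
    )


if __name__ == "__main__":
    import sys

    for name in find_missing_transcripts(sys.argv[1] if len(sys.argv) > 1 else "."):
        print(f"missing transcript for {name}")
```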

FAQ

See the Main FAQ. For questions not covered there, please use the Google Groups mailing list.

 
