VisemeSchedulerSphinx4
Overview
VisemeSchedulerSphinx4 is a tool that generates lip-sync using the Sphinx-4 library (http://cmusphinx.sourceforge.net/sphinx4).
Quick facts:
- Location: /tools/VisemeSchedulerSphinx4
- Language: Java
- Distribution: source
- Platform(s): Windows
Users
Creating bmls for audio files
These instructions explain how to get a Smartbody .bml lip-sync timing file from a .wav file with a .txt transcript.
This tool will do the following for each .wav file in the specified folder
- Run the .wav file through ForceAlignment (based on sphinx4 library) with the correct transcription (.txt file with the same name present in the same folder) and get a phoneme schedule with word break information (which is loaded in memory).
- Use the phoneme schedule, the word break data and the phoneme-to-viseme mapping to create a .bml file.
- Note that the viseme curve currently generated is trapezoidal (ramp-up and ramp-down for 10% of the beginning and end respectively).
Usage: process.bat <absolute path for folder which contains the .wav / .txt files>
where process.bat is located in /tools/VisemeSchedulerSphinx4
This will create a set of smartbody .bml files to use with your audio files in the folder specified.
FAQ
See the Main FAQ. Please use the Google Groups emailing list for unlisted questions.