Components

Overview

The Toolkit is a distributed system of many components working in sync with each other. These components can be categorized into modules, tools, libraries and third party software.

Modules are run-time components essential to the functioning of the Toolkit, including perception, speech recognition, natural language processing, nonverbal behavior, animation, rendering, etc.

Tools are used to assist in configuring, running and modifying the Toolkit.

Libraries are the API's and frameworks that are used by the modules and tools. e.g. messaging API, threading etc.

Third party software are components developed outside of ICT which are essential to provide a proper framework for the Toolkit to run on.

Modules

Tools

Libraries

Third Party Software Requirements

This section lists all the 3rd party software needed to run the toolkit. This software is bundled with the toolkit.

Users

Java Runtime 6u26 (http://java.sun.com/products/archive/j2se/6u10/index.html)
Visual Studio 2008 Redistributable (http://www.microsoft.com/downloads/details.aspx?familyid=A5C84275-3B97-4AB7-A40D-3802B2AF5FC2&displaylang=en)
.NET Framework 3.5 SP1 Redistributable (http://www.microsoft.com/downloads/details.aspx?FamilyID=AB99342F-5D1A-413D-8319-81DA479AB0D7&displaylang=en)
DirectX Redistributable (August 2009) (http://www.microsoft.com/downloads/details.aspx?FamilyId=2DA43D38-DB71-4C1B-BC6A-9B6652CD92A3&displaylang=en)
ActiveMQ Server 5.4.2.0 (http://activemq.apache.org) (no windows installer is available directly through Apache, but is included with the appropriate Toolkit installers)

Developers

All user requirements (above)
Java JDK 6u26 (http://java.sun.com/products/archive/j2se/6u10/index.html)
Microsoft Visual Studio 2010 (optional, but preferred)

Third Party Alternatives

Some of the toolkit components are mere proof-of-concepts rather than state of the art solutions. This section lists alternatives to some of these components.

Speech Recognition

Conversational speech recognizers often need to be trained for use with specific domains. Untrained recognizers may perform poorly. Some recognizer options are:

Sonic, to which the AcquireSpeech client can connect, link
The toolkit has been integrated with PocketSphinx, but as of yet not optimized for performance.
Dragon, not integrated, link

If you are interested in the USC Signal Analysis and Interpretation Laboratory providing you with custom speech models for Sonic, please /wiki/spaces/VHTK/pages/14583665.

Text To Speech

The toolkit uses MS SAPI as a TTS interface, so that any SAPI compatible voice can be used with the toolkit. Using a new voice only requires downloading and installing it and changing the character's voice ID. See the FAQ for details.

Some free SAPI voices can be found here:

http://www.bytecool.com/voices.htm

ATT Natural Voice is also compatible with SAPI, but requires you to buy voices. link

Unfortunately, SAPI is not very well maintained by Microsoft; Windows XP by default contains a very outdated voice, Sam, and the installation of new voices on XP 64, Vista, and Windows 7 is either not working or requires registry hacks.

The following commercial Text To Speech engines offer higher quality voices, but will require purchasing or otherwise licensing a voice directly from the developer:

Cereproc's Cerevoice, (free research licenses are available to academic institutions for research purposes)
Cepstral

Software interfacing from the toolkit to these engines is available in the SmartBody SourceForge repository.

The toolkit is also integrated with the Festival text to speech engine. Currently the toolkit comes packaged with free diphone voices.

Festival offers other voices with better quality but might require licensing.

Renderer

SmartBody has been integrated with the following renderers:

/wiki/spaces/AG/pages/11984082 available as part of the toolkit.
Ogre, available as part of the toolkit
Gamebryo
Unreal Tournament 2004, requires license
Panda 3D, available in SmartBody SourceForge repository

Since SmartBody is open source, as a developer you can integrate it with other renderers.