Overview
The Google and AT&T ASR is a Java application that acts as a speech server, using either Google's or AT&T's speech recognition engine depending on which one you select. The Google and AT&T servers share the same code: both run the same jar, which implements the sonic protocol.
They are both web services and so depend on having an Internet connection.
Neither service provides partial interpretations in real time; each returns only the final interpretation. To use them you need acquirespeech. Audio is accumulated while the push-to-talk button is pressed and is sent to the server when the button is released. The results are then received and passed back to acquirespeech.
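The accumulate-then-send flow described above can be sketched as follows. This is a minimal illustration, not the actual implementation: the class name PushToTalkBuffer, its methods, and the recognizer function standing in for the web service call are all hypothetical.

```java
import java.io.ByteArrayOutputStream;
import java.util.function.Function;

/** Hypothetical sketch: buffers audio while push-to-talk is held, sends it on release. */
final class PushToTalkBuffer {
    private final ByteArrayOutputStream audio = new ByteArrayOutputStream();
    // Stand-in for the request to the Google or AT&T web service (assumption).
    private final Function<byte[], String> recognizer;
    private boolean pressed = false;

    PushToTalkBuffer(Function<byte[], String> recognizer) {
        this.recognizer = recognizer;
    }

    /** Button pressed: start a fresh utterance. */
    void press() {
        audio.reset();
        pressed = true;
    }

    /** Audio chunk received; it is kept only while the button is held. */
    void onAudio(byte[] chunk) {
        if (pressed) {
            audio.write(chunk, 0, chunk.length);
        }
    }

    /** Button released: send the whole utterance at once and return the final result. */
    String release() {
        pressed = false;
        byte[] utterance = audio.toByteArray();
        audio.reset();
        return recognizer.apply(utterance);
    }

    public static void main(String[] args) {
        // Fake recognizer that only reports how many bytes it received.
        PushToTalkBuffer buffer =
                new PushToTalkBuffer(bytes -> "received " + bytes.length + " bytes");
        buffer.press();
        buffer.onAudio(new byte[320]);
        buffer.onAudio(new byte[320]);
        System.out.println(buffer.release()); // prints "received 640 bytes"
    }
}
```

Because nothing is sent until release() is called, a client of this sketch only ever sees a final result, which matches the lack of partial interpretations noted above.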
Quick facts:
- Location: core/GoogleASR
- Language: Java
- Distribution: source
Users
Using the AT&T ASR
The AT&T service requires a subscription and allows for custom language models to be trained. Once you subscribe, you will receive a username and password to access your account.
To create a language model, prepare a text file containing one utterance you want recognized per line (the file needs the .train extension), then:
- Select Manage grammar files.
- Select upload and choose the text file.
- Compile it.
- You can then rename it to the my_lm ID, or to any other ID you want to use.
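As an illustration of the file format, the sketch below writes a list of utterances to a .train file, one per line. The class name TrainFileWriter, the sample utterances, and the UTF-8 encoding are assumptions; the AT&T site may expect a different encoding.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper that prepares a training file for upload. */
final class TrainFileWriter {
    /** Trims each utterance, drops blank entries, and writes one utterance per line. */
    static Path write(Path directory, String baseName, List<String> utterances)
            throws IOException {
        List<String> lines = new ArrayList<>();
        for (String utterance : utterances) {
            String trimmed = utterance.trim();
            if (!trimmed.isEmpty()) {
                lines.add(trimmed);
            }
        }
        // The upload step requires the .train extension.
        Path file = directory.resolve(baseName + ".train");
        return Files.write(file, lines, StandardCharsets.UTF_8); // encoding is an assumption
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("lm");
        Path file = write(dir, "my_lm", Arrays.asList("turn left", "turn right", "stop"));
        System.out.println("Wrote " + file);
    }
}
```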
Using the Google ASR
The Google ASR has no special requirements and cannot be customized (but it works very well with its current settings). Its major limitation is that it only processes audio clips up to about 10 seconds long.
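To check whether a clip fits within that limit, its duration can be computed from its size. The sketch below assumes raw, uncompressed PCM audio; the class name AudioLength and the 16 kHz, 16-bit, mono format in the example are illustrative assumptions, not the server's documented format.

```java
/** Hypothetical helper for estimating the duration of raw PCM audio. */
final class AudioLength {
    /** Duration in seconds: total bytes divided by bytes produced per second. */
    static double seconds(long byteCount, int sampleRateHz, int bytesPerSample, int channels) {
        long bytesPerSecond = (long) sampleRateHz * bytesPerSample * channels;
        return (double) byteCount / bytesPerSecond;
    }

    /** True if the clip is no longer than maxSeconds. */
    static boolean fits(long byteCount, int sampleRateHz, int bytesPerSample,
                        int channels, double maxSeconds) {
        return seconds(byteCount, sampleRateHz, bytesPerSample, channels) <= maxSeconds;
    }

    public static void main(String[] args) {
        long bytes = 16000L * 2 * 12; // 12 seconds of 16 kHz, 16-bit, mono audio
        System.out.println(seconds(bytes, 16000, 2, 1));    // prints 12.0
        System.out.println(fits(bytes, 16000, 2, 1, 10.0)); // prints false
    }
}
```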
Command line parameters
-h display help
-l <arg> The language model ID to be used in the AT&T ASR
-m <arg> Maximum audio length (seconds) before sending request to service
-p <arg> Port to listen for sonic activity
-t <arg> Type of the asr to run (Google or AT&T), use G for Google and A for AT&T
-u <arg> The UID to use the AT&T MashUp site
usage: [-p TCP port ][-m seconds ] [-h] [-t G|A (G is default)] [-u AT&T_MashUp_UID ] [-l Language model ID for AT&T ]
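The options above could be read along the following lines. This is a hand-rolled sketch for illustration only, not the server's actual parsing code; the class name AsrOptions and the example values (port 5000, my_uid) are hypothetical. The only default applied is G for -t, as stated in the usage line.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical sketch of parsing the documented command line options. */
final class AsrOptions {
    /** Parses "-x value" pairs and the bare "-h" flag into a map keyed by option letter. */
    static Map<String, String> parse(String[] args) {
        Map<String, String> options = new HashMap<>();
        options.put("t", "G"); // the usage line states that G is the default
        for (int i = 0; i < args.length; i++) {
            String arg = args[i];
            if (arg.equals("-h")) {
                options.put("h", "true");
            } else if (arg.matches("-[lmptu]") && i + 1 < args.length) {
                // Options that take a value consume the next argument.
                options.put(arg.substring(1), args[++i]);
            } else {
                throw new IllegalArgumentException("Unknown or incomplete option: " + arg);
            }
        }
        String type = options.get("t");
        if (!type.equals("G") && !type.equals("A")) {
            throw new IllegalArgumentException("-t must be G or A");
        }
        return options;
    }

    public static void main(String[] args) {
        Map<String, String> options = parse(
                new String[] {"-t", "A", "-p", "5000", "-u", "my_uid", "-l", "my_lm"});
        System.out.println(options.get("t") + " " + options.get("p") + " " + options.get("l"));
        // prints "A 5000 my_lm"
    }
}
```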
Known Issues
- Neither service returns partial interpretations in real time; only the final interpretation is available.
- The Google ASR only processes audio clips up to about 10 seconds long.
- Both services require an Internet connection.
FAQ
See the Main FAQ. For questions not covered there, please use the Google Groups mailing list.