This reference is based on three W3C standards: Voice Extensible Markup Language (VoiceXML) Version 2.0, Speech Synthesis Markup Language (SSML) Version 1.0, and
Speech Recognition Grammar Specification (SRGS) Version 1.0. The descriptions were extracted from these specifications.

Excerpts from the standards:
Voice Extensible Markup Language. VoiceXML is designed for creating audio dialogs that feature synthesized speech, digitized audio, recognition of spoken and DTMF key input, recording of spoken input, telephony, and mixed initiative conversations. Its major goal is to bring the advantages of Web-based development and content delivery to interactive voice response applications.
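
As an illustration of the kind of dialog described in this excerpt, here is a minimal sketch of a VoiceXML 2.0 document with one form that prompts the caller, listens for spoken input, and reads the answer back. The form id, the field name, and the grammar file name drink.grxml are assumptions made up for this example; they do not come from the specification.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<vxml version="2.0" xmlns="http://www.w3.org/2001/vxml">
  <!-- A form is the basic dialog unit; "order" is a hypothetical id. -->
  <form id="order">
    <!-- A field collects one piece of input from the caller. -->
    <field name="drink">
      <!-- The prompt is rendered as synthesized speech. -->
      <prompt>Would you like coffee, tea, or milk?</prompt>
      <!-- Hypothetical external SRGS grammar listing the accepted words. -->
      <grammar src="drink.grxml" type="application/srgs+xml"/>
      <!-- Executed once the recognizer has filled the field. -->
      <filled>
        <prompt>You chose <value expr="drink"/>.</prompt>
      </filled>
    </field>
  </form>
</vxml>
```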

SSML is part of a larger set of markup specifications for voice browsers developed through the open processes of the W3C. It is designed to provide a rich, XML-based markup language for assisting the generation of synthetic speech in Web and other applications. The essential role of the markup language is to give authors of synthesizable content a standard way to control aspects of speech output such as pronunciation, volume, pitch, rate, etc. across different synthesis-capable platforms.
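
The aspects of speech output listed in this excerpt can be sketched with a short SSML 1.0 document. The wording of the sentences is invented for illustration, and how each attribute value actually sounds depends on the synthesis platform.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis"
       xml:lang="en-US">
  <!-- prosody controls rate, pitch, and volume of the enclosed text. -->
  <prosody rate="slow" pitch="low" volume="loud">
    Please listen carefully.
  </prosody>
  <!-- A half-second pause before the next sentence. -->
  <break time="500ms"/>
  <!-- sub replaces the written form with the text to be spoken. -->
  This standard is published by the
  <sub alias="World Wide Web Consortium">W3C</sub>.
  <!-- emphasis asks the synthesizer to stress the enclosed words. -->
  It is <emphasis level="strong">not</emphasis> tied to a single vendor.
</speak>
```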

[SRGS] defines syntax for representing grammars for use in speech recognition so that developers can specify the words and patterns of words to be listened for by a speech recognizer. The syntax of the grammar format is presented in two forms, an Augmented BNF Form and an XML Form.
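
To make the two forms concrete, here is a small sketch of a grammar in the XML Form that accepts one of three words, with what should be the equivalent Augmented BNF Form shown in a comment. The rule name and the word list are assumptions chosen for the example.

```xml
<?xml version="1.0" encoding="UTF-8"?>
<!--
  Equivalent grammar in the Augmented BNF Form:

    #ABNF 1.0 UTF-8;
    language en-US;
    mode voice;
    root $drink;
    public $drink = coffee | tea | milk;
-->
<grammar version="1.0" xmlns="http://www.w3.org/2001/06/grammar"
         xml:lang="en-US" mode="voice" root="drink">
  <!-- The root rule; "drink" is a hypothetical rule name. -->
  <rule id="drink" scope="public">
    <!-- one-of lists the alternatives the recognizer listens for. -->
    <one-of>
      <item>coffee</item>
      <item>tea</item>
      <item>milk</item>
    </one-of>
  </rule>
</grammar>
```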

Use of the W3C standards is covered by the W3C Document License.

Zvon contains other VoiceXML-related materials.

Prepared by: Miloslav Nic (Mila)

