Code: MMIA214
Course: Prof.Dr.N.Tandareanu
Cycle 2 Year II;
Sem. 1: Course: 28h, Lab: 28h
Credits: 8
Profil: computer science
Type: optional
Objectives:
- Assimilation of the concepts assigned to the process of voice recognition and
voice synthesis
- Familiarization with the features concerning the implementation in Java of the
applications by voice
I. Apache Ant.
- Generalities about this product
- Instaling Apache Ant
- Projects, properties, tags
- Build files. Example.
II. Interfaces by voice
- Applications by voice
- The use of voice applications
- Designing voice applications
- Voice technology
- Speech synthesis
III. Speech synthesis by Java Speech API
- What is JSAPI?
- Speech engine, properties
- The states of a speech engine
- Locating, Selecting and Creating Engines
- Speech Events
- The synthesiser as an engine
- Speech Synthesis: javax.speech.synthesis
- Send a text to be spoken
IV. Voice recognition
- Generalities
- Architecture of Sphinx
- FrontEnd module
- Linguist module
- Recognizers. Selecting a recognizer.
V Java Speech Grammar Format
-
Introduction
-
Definitions
2.1 Grammar Names and Package Names
2.2 Rulenames
2.3 Tokens
2.4 Comments
-
Grammar Header
3.1 Self-Identifying Header
3.2 Grammar Name Declaration
3.3 Import
-
Grammar Body
4.1 Rule Definitions
4.2 Rule Expansions
4.3 Composition
4.4 Grouping
4.5 Unary Operators
4.6 Tags
4.7 Precedence
4.8 Recursion
4.9 Uses of NULL and VOID
-
Examples
Bibliography:
-
Java Speech Grammar Format Specification - JSGF documentation
-
The CMU-Cambridge Statistical Language Modeling Toolkit v2 ,
http://svr-www.eng.cam.ac.uk/~prc14/toolkit_documentation.html
- Willie Walker, Paul Lamere, Philip Kwok, Bhiksha Raj, Rita Singh, Evandro Gouvea, Peter Wolf, Joe Woelfel -
Sphinx-4: A Flexible Open Source Framework for Speech Recognition, SMLI TR2004-0811 c
2004 SUN MICROSYSTEMS INC.
|