Faculty of Mathematics and Computer Science

Speech Synthesis and Speech Recognition

Master in Computer Science
Specialization: Methods and Models in Artificial Intelligence
SYLLABUS

Academic year 2011-2012

Code: MMIA214
Course: Prof. Dr. N. Tandareanu
Cycle 2, Year II
Sem. 1: Course: 28h, Lab: 28h
Credits: 8
Profile: computer science
Type: optional
Objectives:

Understanding the concepts underlying speech recognition and speech synthesis
Familiarity with implementing voice applications in Java

I. Apache Ant

Overview of Apache Ant
Installing Apache Ant
Projects, properties, tags
Build files. Example.

II. Voice interfaces

Voice applications
The use of voice applications
Designing voice applications
Voice technology
Speech synthesis

III. Speech synthesis with the Java Speech API

What is JSAPI?
Speech engine, properties
The states of a speech engine
Locating, Selecting and Creating Engines
Speech Events
The synthesiser as an engine
Speech Synthesis: javax.speech.synthesis
Sending text to be spoken

IV. Voice recognition

Overview
Architecture of Sphinx
FrontEnd module
Linguist module
Recognizers. Selecting a recognizer.

V. Java Speech Grammar Format

1. Introduction
2. Definitions
2.1 Grammar Names and Package Names
2.2 Rulenames
2.3 Tokens
2.4 Comments
3. Grammar Header
3.1 Self-Identifying Header
3.2 Grammar Name Declaration
3.3 Import
4. Grammar Body
4.1 Rule Definitions
4.2 Rule Expansions
4.3 Composition
4.4 Grouping
4.5 Unary Operators
4.6 Tags
4.7 Precedence
4.8 Recursion
4.9 Uses of NULL and VOID
5. Examples

Bibliography:

Java Speech Grammar Format Specification - JSGF documentation
The CMU-Cambridge Statistical Language Modeling Toolkit v2, http://svr-www.eng.cam.ac.uk/~prc14/toolkit_documentation.html
Willie Walker, Paul Lamere, Philip Kwok, Bhiksha Raj, Rita Singh, Evandro Gouvea, Peter Wolf, Joe Woelfel - Sphinx-4: A Flexible Open Source Framework for Speech Recognition, SMLI TR2004-0811, © 2004 Sun Microsystems Inc.

Practical works

Documentation

Speech synthesis (Course Notes)

Speech recognition (Course Notes)

Apache Ant

FreeTTS

Sphinx 4-1.0

JSAPI.html

Last update: Sept. 2010