Voicebuilding for Text-to-speech Synthesis (WS 2014/15)

Description

This seminar will allow the participants to build their own voices for TTS synthesis. This includes designing and recording (in a professional studio!) small speech databases and processing them with open-source tools. The course will cover both unit selection and HMM-based synthesis voices.

We will be using MaryTTS. Familiarity with Linux and the shell is an advantage.

Prerequisites

To be eligible for this course, participants must have successfully completed the lecture Text-to-Speech Synthesis, i.e. passed its final exam.

Participants must subscribe to the course mailing list, which will serve as the primary means of communicating and organizing the schedule.

Schedule

Week 1 (23.02. – 27.02.2015)

Monday Tuesday Wednesday Thursday Friday
9:30–12:00: Group 1 (Studio), Group 3 (Studio), Group 5 (Studio)
13:00–16:00: Intro (slides), Group 2 (Studio), Group 4 (Studio), Group 6 (Studio)

The Intro takes place in the Seminar Room on the ground floor of Building C7.2 and may run later, until 17:00. The Studio is located in Room 1.01 of Building C7.4.

Week 2 (02.03. – 06.03.2015)

Monday Tuesday Wednesday Thursday Friday
10:00–12:00 Hacking Hacking Hacking Hacking Hacking
13:00–16:00 Hacking Hacking Hacking Hacking Hacking

All “Hacking” sessions take place in the Seminar Room, Building C7.2.