Voicebuilding for Text-to-speech Synthesis (WS 2014/15)

Description

This seminar will allow the participants to build their own voices for TTS synthesis. This includes designing and recording (in a professional studio!) small speech databases and processing them with open-source tools. The course will cover both unit selection and HMM-based synthesis voices.

We will be using MaryTTS; furthermore, familiarity with Linux and shell interaction are an advantage.

Prerequisites

To be eligible for this course, participants must have successfully taken the lecture Text-to-Speech Synthesis. This means passing the final exam.

Participants must subscribe to the course mailing list, which will serve as the primary means of communicating and organizing the schedule.

Schedule

Week 1 (23.02. – 27.02.2015)

	Tuesday	Wednesday	Thursday	Friday
9:30– 12:00		Group 1 Studio	Group 3 Studio	Group 5 Studio

13:00– 16:00	Intro (slides)	Group 2 Studio	Group 4 Studio	Group 6 Studio

The Intro takes place in the Seminar Room in the ground floor of Building C7.2 (and may run a bit later, until 17:00). The Studio is located in Room 1.01 of Building C7.4.

Week 2 (02.03. – 06.03.2015)

	Monday	Tuesday	Wednesday	Thursday	Friday
10:00– 12:00	Hacking	Hacking	Hacking	Hacking	Hacking

13:00– 16:00	Hacking	Hacking	Hacking	Hacking	Hacking

All “Hacking” sessions take place in the Seminar Room, Building C7.2.