Voicebuilding for Text-to-speech Synthesis (WS 2017/18)

Description

This seminar will allow the participants to build their own voices for TTS synthesis. This includes designing and recording (in a professional studio!) small speech databases and processing them with open-source tools.

We will be using MaryTTS; familiarity with command-line shell interaction is mandatory. Bringing a laptop running MacOS X or Linux is an advantage.

Prerequisites

To be eligible for this course, participants must have successfully taken the lecture Text-to-Speech Synthesis. This means passing the final exam.

Participants must subscribe to the course mailing list, which will serve as the primary means of communication.

Slides

Schedule

This seminar will run as a block for two weeks in February/March 2018 (26.02.–09.03.2018) and require full-time participation during that period. Each day will have a morning session, with group work in the afternoons.

The course will take place at the Computational Linguistics Department, building C7.3 (room 1.12), with the exception of the recordings, which will be made in the studio of building C7.4 (room 1.01).

Week 1 (26.02.–02.03.)

Time  | Monday  | Tuesday                | Wednesday              | Thursday               | Friday
10–13 | Lecture | Recording / Processing | Recording / Processing | Recording / Processing | Recording / Processing
14–17 | Lecture | Recording / Processing | Recording / Processing | Recording / Processing | Recording / Processing

Week 2 (05.03.–09.03.)

Time  | Monday     | Tuesday    | Wednesday  | Thursday   | Friday
10–13 | Lecture    | Lecture    | Lecture    | Lecture    | Lecture
14–17 | Group work | Group work | Group work | Group work | Group work