Voicebuilding for Text-to-speech Synthesis (WS 2017/18)
Description
This seminar will allow the participants to build their own voices for TTS synthesis. This includes designing and recording (in a professional studio!) small speech databases and processing them with open-source tools.
We will be using MaryTTS; furthermore, familiarity with command-line shell interaction are mandatory. Bringing a laptop running MacOS X or Linux is an advantage.
Prerequisites
To be eligible for this course, participants must have successfully taken the lecture Text-to-Speech Synthesis. This means passing the final exam.
Participants must subscribe to the course mailing list, which will serve as the primary means of communication.
Slides
Schedule
This seminar will run as a block for two weeks in February/March 2018 (26.02.–09.03.2018) and require full-time participation during that period. Each day will have a morning session, with group work in the afternoons.
The course will take place at the Computational Linguistics Department, building C7.3 (room 1.12), with the exception of the recordings, which will be made in the studio of building C7.4 (room 1.01).
Week 1 (26.02.–02.03.)
Time | Monday | Tuesday | Wednesday | Thursday | Friday |
---|---|---|---|---|---|
10–13 | Lecture | Recording / Processing | Recording / Processing | Recording / Processing | Recording / Processing |
14–17 | Lecture | Recording / Processing | Recording / Processing | Recording / Processing | Recording / Processing |
Week 2 (05.03.–09.03.)
Time | Monday | Tuesday | Wednesday | Thursday | Friday |
---|---|---|---|---|---|
10–13 | Lecture | Lecture | Lecture | Lecture | Lecture |
14–17 | Group work | Group work | Group work | Group work | Group work |