In our continuing effort to better understand different speech
synthesis techniques on the same data, we have
devised a challenge that will help us better compare research
techniques in building corpus-based speech synthesizers.
The Blizzard Challenge 2006 is to take the provided ATR 5000-utterance
database and build a synthetic voice. Unknown
sentences from an independent source will be generated and each
participant will synthesize them with their system. The speech will
then be put on the web for evaluation. The results were presented at
a satellite workshop of
Interspeech 2006 -- ICSLP in Pittsburgh, PA.
This year we had 14 entries from all around the world: 2 from North
America, 7 from Asia and 5 from Europe. This year the database made
available was substantially larger, though we also offered two
database sizes: the full 5000-utterance database and a subset
consisting of 1200 (ARCTIC) utterances.
Blizzard Challenge 2006 Workshop
Held in the Allegheny I, Westin Conference Hotel, Pittsburgh PA, Saturday September 16th 2006.
- 10:00-10:30 Coffee
- 10:30-10:45 "Blizzard Challenge Welcome and Introduction" Alan W Black and Keiichi Tokuda.
"The Cerevoice Blizzard Entry 2006: A prototype Database Unit Selection Engine",
Matthew P. Aylett, Christopher J. Pidcock, Mark E. Fraser, Cereproc, Edinburgh.
"Building Probabilistic Corpus-based Speech Synthesis Systems from the Blizzard
Challenge 2006 Speech Databases", Sinsuke Sakai,
Academic Center for Computing and Media Studies,
"The Nitech-NAIST HMM-based speech synthesis system
for the Blizzard Challenge 2006",
Heiga Zen, Tomoki Toda, Keiichi Tokuda, Nagoya Institute of Technology and
Nara Institute of Science and Technology.
"The Blizzard Challenge 2006 CMU Entry:
introducing hybrid trajectory-selection synthesis",
John Kominek, Alan W Black, Language Technologies Institute,
Carnegie Mellon University.
"The Jess Blizzard Challenge 2006 Entry",
Peter Cahill and Julie Carson-Berndsen, University College Dublin, Ireland.
"USTC System for Blizzard Challenge 2006:
an Improved HMM-based Speech Synthesis Method",
Zhen-Hua Ling, Yi-Jian Wu, Yu-Ping Wang, Long Qin, Ren-Hua Wang,
University of Science and Technology of China.
"Developing a Test Bed of English Text-to-Speech System XIMERA
for the Blizzard Challenge 2006",
Tomoki Toda, Hisashi Kawai, Toshio Hirai, Jinfu Ni, Nobuyuki Nishizawa,
Junichi Yamagishi, Minoru Tsuzaki, Keiichi Tokuda, and Satoshi Nakamura,
- 12:30-14:00 Lunch, Orchard Restaurant, Westin Hotel
"Multisyn Voice for the Blizzard Challenge 2006",
Robert Clark, Korin Richmond, Volker Strom and Simon King,
CSTR, University of Edinburgh.
"Open Source Unit Selection as the Basis for Research on Expressive Synthesis",
Marc Schroeder, Anna Hunecke, Sacha Krstulovic
"A Study on How Human Annotations Benefit the TTS Voice",
Min Chu, Yining Chen, Yong Zhao, Yusheng Li and Frank Soong,
Microsoft Research Asia, Beijing, China.
"IVO Blizzard 2006 Entry",
Lukasz Osowski and Michal Kaszczuk, IVO Software, Poland.
"Multi-tier Non-uniform Unit Selection for Corpus-based Speech Synthesis",
Jin-Hui Yang, Zhi-Wei Zhao, Yuan Jiang, Guo-Ping Hu, Xiao-Ru Wu,
iFLYTEK Research, Hefei, China. (presented by Zhen-Hua Ling, USTC)
"The IBM Submission to the 2006 Blizzard Text-to-Speech Challenge",
Ellen Eide, Raul Fernandez, Ron Hoory, Wael Hamza, Zvi Kons, Michael Picheny, Ariel Sagi, Slava Shechtman, Zhi Wei Shuang. IBM T.J. Watson Research Center,
IBM Haifa Research Center, and IBM China Research Lab (Two Systems IBM-Watson and IBM-Haifa).
- 15:45-16:00 Coffee
"Blizzard Challenge 2006: Results"
Christina L. Bennett and Alan W Black.
"Discussion and Future": open discussion of the challenge and next year's challenges
- 19:00- Dinner (for those interested)