Blog about speech technologies - recognition, synthesis, identification. Mostly it's about scientific part of it, the core design of the engines, the new methods, machine learning and about about technical part like architecture of the recognizer and design decisions behind it.

Magic Words of Interspeech 2011

Interspeech 2011 is coming. It going to be an amazing event I suppose. If you are interested what is going on there, let's figure that out.

To keep things simple we will use Unix command line tools. Sometimes text processing could be fun even with simple commands. Text is still most conventint form of the information presentation, way better than HTML or databases. Of course there is lack for more advanced things like stopword filtering or named entity recognition. Let's hope one day Unix command line will have them.

1. Download full printable programs of Interspeech 2010 and Interspeech 2011 with wget, dump them to text with lynx and cleanup punctuation with sed.

2. Dump word counts with SRILM tool ngram-count and cut 1000 most frequent words on list for 2011 with head and sort. Leave all words in 2010 list.

3. Figure out which of the words in 2011 list are new and do not appear in 2010 list with sort and uniq.

Suprisingly there will be only 2 new words. They are: i-vector and crowdsourcing.

1 коммент.:

Post a Comment

Blog Archive

About Me

My Photo
Moscow, Russia
Nowdays I mostly work on open source projects in speech recognition and synthesis like Festival, CMU Sphinx and Voxforge. I also support the Russian parts of those projecs, providing the leading product in ASR and TTS in Russian. In the past I used to participate in GNOME, work on embedded Linux devices and on software development technologies related to automatic software verification and modelling. If you have any questions feel free to contact me by mail nshmyrev at nexiwave dot com or find me in jabber/irc.