Archive for Voice Recognition

Chrome Voice recognition an insight !

Posted in Google, Uncategorized, Web 2.0 with tags , , , , , , , , , on June 18, 2011 by javaforweb

As I promised myself to dig more about Chrome’s Voice recognition, I have started it already after having a cup of tea and Honey Nut Cherios. By the way I found them very tasty.
Coming back to Chrome, first thing I found is their code trunk :

Here is the link to speech recognition

Mugging it more I came to know they are using FLAC – Free Lossless Audio Codec, here is the link
Checking more about FLAC I came to know it compress audio by 50% but the good thing is it does not lose a single bit during the process.

If you want to test Chrome Voice Recognition, checkout this out

Then I scratched my itching and started to search for a site where I can find some phonetic sounds that I used on HTML5Rocks.

Here is the Link

I used Phonetics in American English, Spanish and German

I tried few words like ball, taboo, rub , goose, ago, dear and much more .. here are the results

ball – lol
taboo – taboo
rub – various attempts give krupp, growth,
goose – who is
ago – google and piccola (God what is this?)
dear – dear

Results are not impressive !!! Damn………….. !

My sincere advice to them, please please please do a sincere alpha testing… get people on board from various countries and see if they can extend support for
various phonetics.. BEFORE that see what improvements are required in the original APIs.

Chrome’s Voice recognition – could it be messy?

Posted in Google, Uncategorized, Web 2.0 with tags , , , , , , , , , on June 17, 2011 by javaforweb

Now that Google Chrome is out with the voice recognition support and I am curious about many things e.g. like how they are handling accents around the world. The biggest challenge(they must be have accessed it already) could be Indian, Chinese, Spanish, German english and accent.

Google Chrome Voice Recognition Support

I tested various words with my Beats earphone/microphone and testing it over Lenovo T410 with genuine Audio driver (I am not sure if they care about all this). But the results came are not good. Few word that Chrome detected clearly on the first go were “Hello” & “John”.

One thing I am sure about is auto search suggestions below original search given by it are absolutely vague .. like below “Hello” it suggests “I will” Below “Dance” it gives “Jazz” and below that “Dancing”.
One thing is really freaky… please check the Image

I would like to know about the APIs and algorithms they are using for Voice Recognition. Weekend is coming and I will be after it. That gives me ultimate pleasure wor