Babel-Fish earbuds
Hey everyone,
Today I've got an interesting topic: something that used to show up only in sci-fi movies but has now become reality. Imagine you're in a foreign country, you don't speak the local language, and you can't understand a thing people are saying. Well, you're in good hands now: there is tech that translates what people say into the language of your choice, and THIS IS DONE IN REAL TIME. How cool is that? Two companies, Google and Baidu, have been working on this tech, and both have succeeded.
So what are these earbuds, and how do they work?
When two people who speak different languages want to talk, one wears the earbuds and the other holds the phone. As the earbud wearer speaks, the app on the phone translates the speech and plays it out loud. The person holding the phone then responds, and that response is translated and played into the earbuds. The idea is quite simple, but the data matching and the algorithms running underneath are pretty complicated.
Google already has Google Translate, which translates as we speak, and an upgrade lets it scan and translate text in real time. Now they have gone one step further with a real-time conversation translator.
They had this feature before, but it wasn't very effective because background noise confused the AI and made the translation difficult. So they designed the earbuds to sit deep in your ear canal, and the microphone is activated only when you press the button on the earbud.
Google's earbuds, known as Pixel Buds, can currently translate about 40 languages in real time, or at least fast enough to hold a conversation.
Baidu's earbuds are no less capable than Google's: they are putting up a really tough fight, and they are said to be better at this than the Pixel Buds. Baidu's AI is trained on 2 million pairs of Chinese and English sentences, and it seems pretty accurate when translating. For example, in a Chinese sentence the verb "meets" comes at the end, but in the English translation it is the third word. Currently it translates between only two languages, but soon it is expected to be language neutral.
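To make the back-and-forth concrete, here is a minimal sketch of how one turn of the conversation could be routed. Everything in it is an assumption for illustration: the `PHRASEBOOK` data, the `translate` and `relay` functions, and the device names are made up, and a tiny lookup table stands in for the real translation engine.

```python
# Toy phrasebook standing in for a real translation engine (hypothetical data).
PHRASEBOOK = {
    ("en", "es"): {"hello": "hola", "thank you": "gracias"},
    ("es", "en"): {"hola": "hello", "gracias": "thank you"},
}


def translate(text, src, dst):
    """Look the phrase up; fall back to the untranslated text."""
    return PHRASEBOOK.get((src, dst), {}).get(text.lower(), text)


def relay(turn_from, text, wearer_lang="en", other_lang="es"):
    """Route one turn of the conversation to the right output device.

    turn_from is "earbud" (the wearer spoke) or "phone" (the other person spoke).
    Returns (output_device, translated_text).
    """
    if turn_from == "earbud":
        # The wearer's speech is translated and played on the phone speaker.
        return "phone_speaker", translate(text, wearer_lang, other_lang)
    if turn_from == "phone":
        # The reply is translated and played into the earbuds.
        return "earbud", translate(text, other_lang, wearer_lang)
    raise ValueError(f"unknown source: {turn_from!r}")


print(relay("earbud", "Hello"))
print(relay("phone", "Gracias"))
```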
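The press-to-talk idea can be sketched in a few lines. This is only an illustration under assumed names (`push_to_talk`, `frames`, `button_states` are hypothetical, not Google's code): audio frames are kept only while the button is held, so background chatter outside that window never reaches the translator.

```python
def push_to_talk(frames, button_states):
    """Keep only the audio frames captured while the button was held down."""
    if len(frames) != len(button_states):
        raise ValueError("need one button state per frame")
    return [frame for frame, pressed in zip(frames, button_states) if pressed]


# Four audio frames; the button is held only during the middle two.
captured = push_to_talk(["f0", "f1", "f2", "f3"], [False, True, True, False])
print(captured)
```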
Now let's get technical about how the earbuds do what they do.
First is input conditioning. The earbuds pick up background noise and interference, effectively recording a mixture of the user's voice and other sounds. Then comes denoising, which removes sounds other than the user's voice with the help of a voice activity detector; the detector can also be triggered by a switch.
Then comes language identification, done by a machine learning algorithm that matches the speech against the languages in its database. All of this must be done within a couple of seconds or less.
Automatic speech recognition (ASR) then uses an acoustic model to convert the recorded speech into phonemes (the sound units that distinguish one word from another in a particular language). Using grammar and a pronunciation dictionary, the phonemes are then converted into text.
Next is natural language processing, and this is the hard part. Here the meaning of the speech is decoded and then encoded into another language, with all the complexities that make a second language hard for us to learn.
Last is speech synthesis, which is the opposite of ASR: here the translated text is converted back into speech in the second language.
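One very simple way to build a voice activity detector is to keep only the chunks of audio that are loud enough. The sketch below assumes that approach; the function names, the frame length, and the threshold are all made-up values for illustration, and real earbuds almost certainly use something more sophisticated.

```python
def frame_energy(frame):
    """Mean squared amplitude of one frame of samples."""
    return sum(s * s for s in frame) / len(frame)


def voiced_frames(samples, frame_len=4, threshold=0.01):
    """Split samples into frames and keep those whose energy exceeds threshold."""
    frames = [samples[i:i + frame_len] for i in range(0, len(samples), frame_len)]
    return [f for f in frames if f and frame_energy(f) > threshold]


# Quiet hiss, then a loud burst of speech, then quiet again.
audio = [0.01, -0.02, 0.01, 0.0, 0.5, -0.4, 0.6, -0.5, 0.0, 0.01, -0.01, 0.0]
print(voiced_frames(audio))
```

Only the loud middle frame survives, which is the part that would be passed on to the next stage.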
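To give a feel for "matching against the languages in the database", here is a toy sketch that works on text instead of audio. That swap is an assumption made to keep the example short: real systems classify the speech signal itself with a trained model. The `PROFILES` table and `identify_language` function are hypothetical.

```python
# Tiny "database" of common function words per language (illustrative only).
PROFILES = {
    "english": {"the", "is", "and", "you", "where"},
    "spanish": {"el", "la", "es", "y", "donde"},
    "french": {"le", "la", "est", "et", "un"},
}


def identify_language(text):
    """Return the language whose word list overlaps most with the text."""
    words = set(text.lower().split())
    scores = {lang: len(words & vocab) for lang, vocab in PROFILES.items()}
    best = max(scores, key=scores.get)
    # No overlap at all means we cannot tell.
    return best if scores[best] > 0 else None


print(identify_language("where is the station"))
print(identify_language("le chat est noir et blanc"))
```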
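The pronunciation dictionary step can be pictured as a lookup from phoneme sequences to words. The sketch below assumes ARPAbet-style phoneme symbols with stress markers dropped, a three-entry dictionary, and a greedy longest-match search; it ignores the grammar part entirely, which a real recognizer would use to pick between competing word sequences.

```python
# Hypothetical mini pronunciation dictionary: phoneme sequence -> word.
PRONUNCIATIONS = {
    ("HH", "AH", "L", "OW"): "hello",
    ("W", "ER", "L", "D"): "world",
    ("HH", "AY"): "hi",
}
MAX_LEN = max(len(key) for key in PRONUNCIATIONS)


def phonemes_to_text(phonemes):
    """Greedily match the longest known phoneme sequence at each position."""
    words, i = [], 0
    while i < len(phonemes):
        for size in range(min(MAX_LEN, len(phonemes) - i), 0, -1):
            word = PRONUNCIATIONS.get(tuple(phonemes[i:i + size]))
            if word is not None:
                words.append(word)
                i += size
                break
        else:
            # Nothing matched here: mark it unknown and skip one phoneme.
            words.append("<unk>")
            i += 1
    return " ".join(words)


print(phonemes_to_text("HH AH L OW W ER L D".split()))
```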
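Here is a toy picture of "decode the meaning, then encode it in another language". It uses an invented verb-final language with three made-up words, so none of the vocabulary is real, and the hand-written rules are an assumption for illustration only. Real translators learn this mapping from huge numbers of sentence pairs instead of fixed rules.

```python
# Invented verb-final toy language: word order is subject, object, verb.
TOY_TO_ENGLISH = {"miko": "the cat", "sano": "the fish", "tabu": "eats"}


def decode_meaning(toy_sentence):
    """Pull a language-neutral meaning out of a three-word toy sentence."""
    subject, obj, verb = toy_sentence.split()
    return {"subject": subject, "verb": verb, "object": obj}


def encode_english(meaning):
    """Re-express the meaning in English order: subject, verb, object."""
    order = ("subject", "verb", "object")
    return " ".join(TOY_TO_ENGLISH[meaning[role]] for role in order)


meaning = decode_meaning("miko sano tabu")
print(encode_english(meaning))
```

Notice that the verb moves from the end of the toy sentence to the middle of the English one, which is the kind of reordering that makes translation harder than swapping words one by one.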
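Putting the stages together, the whole thing behaves like a pipeline where each step hands its result to the next. The sketch below only shows that chaining: every stage is a stub with made-up behavior (the language is hard-coded, "recognition" just joins strings, and "speech" is a tagged string), so treat the names and data as assumptions and not as how the earbuds are actually built.

```python
def denoise(state):
    """Drop chunks marked as noise (stub for real denoising)."""
    clean = [chunk for chunk in state["audio"] if chunk != "<noise>"]
    return {**state, "audio": clean}


def identify_language(state):
    """Stub: pretend the detector always answers English."""
    return {**state, "language": "en"}


def recognize_speech(state):
    """Stub ASR: the 'audio' chunks are already words, so just join them."""
    return {**state, "text": " ".join(state["audio"])}


def translate(state):
    """Stub translation using a one-word toy dictionary."""
    toy = {"hello": "hola"}
    words = [toy.get(word, word) for word in state["text"].split()]
    return {**state, "translated": " ".join(words)}


def synthesize(state):
    """Stub speech synthesis: wrap the text in a fake audio tag."""
    return {**state, "speech": f"<audio:{state['translated']}>"}


STAGES = [denoise, identify_language, recognize_speech, translate, synthesize]


def run_pipeline(audio):
    """Pass the input through every stage in order."""
    state = {"audio": audio}
    for stage in STAGES:
        state = stage(state)
    return state


print(run_pipeline(["hello", "<noise>", "world"])["speech"])
```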
Fact Flash:
It has been announced that in a few years these earbuds will be able to translate and interpret almost all known languages with the utmost certainty, possibly with zero errors and no time lag. You can get the Pixel Buds online for $159.
YOU CAN ALSO SUGGEST ANY TOPIC YOU WANT OR SHARE SOME FEEDBACK. IT WILL BE UPDATED WITHIN 48 HOURS.
.....................Keep Calm and Love TECH...................