Nearly Instant Voice Recognition, Machine Translation, and Speech Synthesis This answer is then converted into written words, translated, and then sent to the other person as a voice signal (text display is only available on smartphones). That person can then answer questions, for example, by speaking in his or her own language. Two to three seconds later, the translated text is transferred into sound signals that the other person hears as speech in his or her own language. They are then translated and displayed in the language of the person on the other end of the call. The spoken words are first displayed on the LCD screen on the speaker's handset. To begin using the service, users press the "begin translation" button on the touch screen and then speak. The words a speaker says are displayed as text, translated, and then transmitted to the listener in his or her own spoken language. Translation accuracy is constantly improving, and the new translation service should be available to general cell phone users early in 2012. They were provided with smartphones and cell phones (FOMA phones) and given access to both Japanese and English language services. A campaign to recruit users to try out the new system began in November 2011. It translates the user's words into the language spoken by the person on the other end of the call almost instantaneously. In May 2011, NTT DOCOMO announced a Phone Translation Service for smartphones and certain cell phones. In order to perform processing at a speed equivalent to the super computers of the 1990s, the world had to wait for "cloud computing," which allows users to use all their devices-computers, cell phones, or other things-via a connection to a server on the Internet. The world moved one step closer to making the dream of "voice translation devices" practical as we entered the 21st century. Integrating Cloud Computing with Cell Phone TechnologyĪ user can view his or her spoken words and their translation on a tablet (left) or smartphone (right). Both of these efforts resulted in functional technologies in the early 1980s, but it has taken a much longer time to produce a prototype of a "voice translation device." This is because, until lately, the processing speed of a super computer was necessary to realize nearly instantaneous and highly accurate voice recognition. At the same time, a national project headed by Kyoto University was at work on Japanese-to-English and English-to-Japanese machine translation technology. In Japan, NTT and other telecommunications companies as well as university research centers worked to develop a Japanese speech synthesis technology. Work on these technologies has been steadily progressing around the world. In order for computers to translate the spoken word, three technologies are required: a voice recognition function that allows them to convert spoken words into text, a machine translation function that translates the text into another language, and a speech synthesis function that converts the text back into audible speech and sends this sound signal to a speaker.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |