A voice language translator, suitable for implementation in hand-held size, is disclosed. The voice language translator includes: a key pad (20); a display system (17); a language cartridge(s) (45); a voice recognition module (49); a voice synthesizer (47); a speaker (39); a microphone (41); and a programmed CPU (43). Prior to use as a translator, the voice language translator is trained to the voice of a user. During training, a series of words and phrases to be spoken by the user are displayed, or spoken, in the language of the user. As the user speaks the words and phrases, the voice recognition circuit produces a digitally coded voice pattern that uniquely identifies the way in which the user spoke the words and phrases. The voice patterns produced by the voice recognition circuit are analyzed and stored, preferably in the cartridge. Thereafter, during translation, when the user speaks a sentence, the voice pattern produced by the voice recognition circuit is compared with the... |
Citations|
| US4507750 | May 13, 1982 | Mar 26, 1985 | Texas Instruments Incorporated | Electronic apparatus from a host language |
Referenced by|
| US5293584 | May 21, 1992 | Mar 8, 1994 | International Business Machines Corporation | Speech recognition system for natural language translation | | US5317508 | Oct 26, 1992 | May 31, 1994 | Matsushita Electric Industrial Co., Ltd. | Image editing apparatus | | US5377303 | Dec 9, 1993 | Dec 27, 1994 | Articulate Systems, Inc. | Controlled computer interface | | US5475798 | Jan 6, 1992 | Dec 12, 1995 | Handlos, L.L.C. | Speech-to-text translator | | US5524169 | Dec 30, 1993 | Jun 4, 1996 | International Business Machines Incorporated | Method and system for location-specific speech recognition | | US5526259 | Apr 22, 1994 | Jun 11, 1996 | Hitachi, Ltd. | Method and apparatus for inputting text | | US5561736 | Jun 4, 1993 | Oct 1, 1996 | International Business Machines Corporation | Three dimensional speech synthesis | | US5615301 | Sep 28, 1994 | Mar 25, 1997 | | Automated language translation system | | US5724526 | Dec 15, 1995 | Mar 3, 1998 | Sharp Kabushiki Kaisha | Electronic interpreting machine | | US5758023 | Sep 21, 1995 | May 26, 1998 | | Multi-language speech recognition system | | US5765132 | Oct 26, 1995 | Jun 9, 1998 | Dragon Systems, Inc. | Building speech models for new words in a multi-word utterance | | US5794189 | Nov 13, 1995 | Aug 11, 1998 | Dragon Systems, Inc. | Continuous speech recognition | | US5794204 | Sep 29, 1995 | Aug 11, 1998 | Seiko Epson Corporation | Interactive speech recognition combining speaker-independent and speaker-specific word recognition, and having a response-creation capability | | US5799279 | Nov 13, 1995 | Aug 25, 1998 | Dragon Systems, Inc. | Continuous speech recognition of text and commands | | US5802251 | Sep 5, 1995 | Sep 1, 1998 | International Business Machines Corporation | Method and system for reducing perplexity in speech recognition via caller identification | | US5831518 | Feb 7, 1997 | Nov 3, 1998 | Sony Corporation | Sound producing method and sound producing apparatus | | US5842168 | Aug 20, 1996 | Nov 24, 1998 | Seiko Epson Corporation | Cartridge-based, interactive speech recognition device with response-creation capability | | US5889473 | Mar 17, 1997 | Mar 30, 1999 | Sony Corporation Sony Electronics, Inc. | Tourist information pager | | US5938593 | Nov 6, 1997 | Aug 17, 1999 | Microline Technologies, Inc. | Skin analyzer with speech capability | | US5946658 | Oct 2, 1998 | Aug 31, 1999 | Seiko Epson Corporation | Cartridge-based, interactive speech recognition method with a response creation capability | | US5956668 | Jul 18, 1997 | Sep 21, 1999 | AT&T Corp. | Method and apparatus for speech translation with unrecognized segments | | US5960393 | Jun 12, 1997 | Sep 28, 1999 | Lucent Technologies Inc. | User selectable multiple threshold criteria for voice recognition | | US5963892 | Jun 26, 1996 | Oct 5, 1999 | Sony Corporation | Translation apparatus and method for facilitating speech input operation and obtaining correct translation thereof | | US5983182 | Jan 2, 1996 | Nov 9, 1999 | | Apparatus and method for producing audible labels in multiple languages | | US6070139 | Aug 20, 1996 | May 30, 2000 | Seiko Epson Corporation | Bifurcated speaker specific and non-speaker specific speech recognition method and apparatus | | US6085162 | Oct 18, 1996 | Jul 4, 2000 | Gedanken Corporation | Translation system and method in which words are translated by a specialized dictionary and then a general dictionary | | US6088671 | Jun 17, 1998 | Jul 11, 2000 | Dragon Systems | Continuous speech recognition of text and commands | | US6104845 | May 28, 1999 | Aug 15, 2000 | Wizcom Technologies Ltd. | Hand-held scanner with rotary position detector | | US6148105 | Apr 22, 1999 | Nov 14, 2000 | Hitachi, Ltd. | Character recognizing and translating system and voice recognizing and translating system | | US6157727 | May 22, 1998 | Dec 5, 2000 | Siemens Audiologische Technik GmbH | Communication system including a hearing aid and a language translation system | | US6163768 | Jun 15, 1998 | Dec 19, 2000 | Dragon Systems, Inc. | Non-interactive enrollment in speech recognition | | US6167377 | Mar 28, 1997 | Dec 26, 2000 | Dragon Systems, Inc. | Speech recognition language models | | US6182154 | Apr 7, 1997 | Jan 30, 2001 | International Business Machines Corporation | Universal object request broker encapsulater | | US6212498 | Mar 28, 1997 | Apr 3, 2001 | Dragon Systems, Inc. | Enrollment in speech recognition | | US6223150 | Jan 29, 1999 | Apr 24, 2001 | Sony Corporation Sony Electronics, Inc. | Method and apparatus for parsing in a spoken language translation system | | US6243669 | Jan 29, 1999 | Jun 5, 2001 | Sony Corporation Sony Electronics, Inc. | Method and apparatus for providing syntactic analysis and data structure for translation knowledge in example-based language translation | | US6266642 | Jan 29, 1999 | Jul 24, 2001 | Sony Corporation Sony Electronics, Inc. | Method and portable apparatus for performing spoken language translation | | US6278968 | Jan 29, 1999 | Aug 21, 2001 | Sony Corporation Sony Electronics, Inc. | Method and apparatus for adaptive speech recognition hypothesis construction and selection in a spoken language translation system | | US6282507 | Jan 29, 1999 | Aug 28, 2001 | Sony Corporation Sony Electronics, Inc. | Method and apparatus for interactive source language expression recognition and alternative hypothesis presentation and selection | | US6356865 | Jan 29, 1999 | Mar 12, 2002 | Sony Corporation Sony Electronics, Inc. | Method and apparatus for performing spoken language translation | | US6374224 | Mar 10, 1999 | Apr 16, 2002 | Sony Corporation Sony Electronics, Inc. | Method and apparatus for style control in natural language generation | | US6377925 | Jul 7, 2000 | Apr 23, 2002 | Interactive Solutions, Inc. | Electronic translator for assisting communications | | US6424943 | Jul 24, 2000 | Jul 23, 2002 | Scansoft, Inc. | Non-interactive enrollment in speech recognition | | US6438524 | Nov 23, 1999 | Aug 20, 2002 | Qualcomm, Incorporated | Method and apparatus for a voice controlled foreign language translation device | | US6442524 | Jan 29, 1999 | Aug 27, 2002 | Sony Corporation Sony Electronics Inc. | Analyzing inflectional morphology in a spoken language translation system | | US6718308 | Jul 7, 2000 | Apr 6, 2004 | | Media presentation system controlled by voice to text commands | | US6789093 | Mar 20, 2001 | Sep 7, 2004 | Hitachi, Ltd. | Method and apparatus for language translation using registered databases | | US7080002 | Mar 26, 1998 | Jul 18, 2006 | Samsung Electronics Co., Ltd. | Bi-lingual system and method for automatically converting one language into another language | | US7113182 | Jul 17, 2003 | Sep 26, 2006 | Seiko Epson Corporation | System and method for sharing general purpose data lines between a display panel and non-display devices | | US7245903 | Jan 27, 2005 | Jul 17, 2007 | | Wireless telephone system with programming and information service | | US7286993 | Jul 23, 2004 | Oct 23, 2007 | Product Discovery, Inc. | Holographic speech translation system and method | | US7369998 | Aug 14, 2003 | May 6, 2008 | Voxtec International, Inc. | Context based language translation devices and methods | | US7467085 | Jul 27, 2004 | Dec 16, 2008 | Hitachi, Ltd. | Method and apparatus for language translation using registered databases | | US7548849 | Apr 29, 2005 | Jun 16, 2009 | Research In Motion Limited | Method for generating text that meets specified characteristics in a handheld electronic device and a handheld electronic device incorporating the same | | US7593842 | Dec 10, 2003 | Sep 22, 2009 | | Device and method for translating language | | US7912696 | Aug 31, 1999 | Mar 22, 2011 | Sony Corporation | Natural language processing apparatus and natural language processing method | | US8015016 | Oct 25, 2007 | Sep 6, 2011 | Electronics and Telecommunications Research Institute | Automatic translation method and system based on corresponding sentence pattern | | US8032384 | Mar 14, 2008 | Oct 4, 2011 | | Hand held language translation and learning device | | USD364390 | Jul 27, 1994 | Nov 21, 1995 | | Hand held translator | | USD385276 | Jan 26, 1996 | Oct 21, 1997 | | Portable language translating machine | | USD391561 | Apr 3, 1997 | Mar 3, 1998 | | Combined scanner and translator | | USD415154 | Oct 31, 1997 | Oct 12, 1999 | | Language translator | | USD425518 | Aug 12, 1999 | May 23, 2000 | | Portable electronic translator | | USD501459 | Feb 19, 2004 | Feb 1, 2005 | Marine Acoustics, Inc. | Hand-held device | | USH2098 | Feb 22, -6 | | The United States of America as represented by the Secretary of the Navy | Multilingual communications device |
Claims1. A speech translator for translating words spoken by a user in a first language into spoken words in a second language, said speech translator comprising: - language storing means for storing, in digitally coded form, voice patterns of words and phrases in first and second languages, at least some of said digitally coded voice patterns being stored in banks of related words and phrases;
- word recognition means for receiving audible words spoken by a user and creating corresponding voice patterns in digitally coded form;
- word producing means for receiving voice patterns in digitally coded form and creating corresponding audible words; and
- programmable control means connected to said language storage means, said word recognition means and said word producing means for translating words spoken by a user in said first language into spoken words in said second language by controlling the operation of said language storage means, said word recognition means and said word producing means, said programmable control means including a training mode of operation and a translate mode of operation, said training mode of operation training said speech translator to understand words spoken by a user in said first language by: (i) instructing a user to speak a series of words in said first language stored in digitally coded form in said language storage means; and (ii) storing the digitally coded voice patterns produced by said word recognition means in response to said user speaking said series of words in said first language as a series of trained voice patterns, said translate mode of operation translating words spoken by said user in said first language into said second language by: (i) comparing the digitally coded voice patterns, produced by said word recognition means when said user speaks words in said first language, with said stored series of trained voice patterns (ii) using the results of said comparison to locate digitally coded voice patterns of corresponding words in said second language stored in said language storing means; and (iii) applying said digitally coded voice patterns of said corresponding words to said word producing means, said programmable control means only accessing selected ones of said banks of related words and phrases stored in said language storing means in a logical sequence when comparing the digitally coded voice patterns produced by said word recognition means when said user speaks words in said first language with said stored series of trained voice patterns.
2. A speech translator as claimed in claim 1, wherein said voice speech translator includes a display means and wherein said programmable control means causes said display means to display said series of words in said first language when said programmable control means is in said training mode of operation. 3. A speech translator as claimed in claim 2, wherein said training mode of operation includes a TRAIN ALL words option during which a user is instructed to speak in seriatum the series of words in said first language stored in digitally coded form in said language storage means as they are displayed, and a TRAIN SELECTED words option during which a user can select which of said series of words in said first language stored in digitally coded form in said language storage means to speak. 4. A speech translator as claimed in claim 3, wherein said training mode of operation tests the way in which a user speaks a word in said first language by asking the user to repeat the word in said first language and analyzing the digitally coded voice pattern produced by said word recognition means in response to said user repeating said word in said first language to determine if the user has respoken the word in the same way. 5. A speech translator as claimed in claim 4, wherein said display means displays instructions to a user to speak a displayed word or words as the word or words are displayed when said programmable control means is in said training mode of operation. 6. A speech translator as claimed in claim 5, wherein said control means includes a talk key that enables said word recognition means to receive audible words spoken by a user and create corresponding digitally coded voice patterns when said talk key is depressed and wherein said speech translator instructs a user to depress said talk key as well as speak a word or words in said first language when said programmable control means is in said training mode of operation. 7. A speech translator as claimed in claim 6, wherein said control means includes cursor keys and wherein said cursor keys are used to scroll through words displayed by said display means when said programmable control means is in said TRAIN SELECTED words option of said training mode of operation. 8. A speech translator as claimed in claim 1, wherein said programmable controller logically combines the digitally coded voice patterns of words spoken by a user into a sentence and analyzes the sentence to determine if it is a sentence suitable for translation when said programmable control means is in said translate mode of operation. 9. A speech translator as claimed in claim 8, wherein said analysis requires that said sentence lie in a predetermined sequence of banks accessed by said programmable control means when comparing the digitally coded voice patterns produced by said word recognition means when said user speaks words in said first language with said stored series of trained voice patterns. 10. A speech translator as claimed in claim 8 or 9, wherein said analysis requires that said combined sentence terminate with a specific word that is unrelated to the content of the sentence. 11. A speech translator claimed in claim 8 or 9, wherein said digitally coded voice patterns of said spoken words are used to locate digitally coded voice patterns of corresponding words in said second language stored in digitally coded form in said language storing means and apply said digitally coded voice patterns of said corresponding words to said word producing means immediately after said sentence is determined to be suitable for translation. 12. A speech translator as claimed in claim 8 or 9, wherein said sentence is provided to said user in said first language after said sentence is determined to be suitable for translation prior to said digitally coded voice patterns of said spoken words being used to locate digitally coded voice patterns of corresponding words in said second language stored in digitally coded form in said language storing means and apply said digitally coded voice patterns of said corresponding words to said word producing means. 13. A speech translator as claimed in claim 12, wherein said sentence is provided to said user by being displayed on said display means. 14. A speech translator as claimed in claim 12, wherein said sentence is provided to said user by being uttered by said word producing means. 15. A speech translator as claimed in claim 9, wherein said programmable controller displays the words in the banks when the analysis of the words spoken by a user do not find a match. 16. A speech translator as claimed in claim 15, wherein words spoken by a user are analyzed twice and the words in a bank are displayed only if no match is found after both analyses have been completed. 17. A speech translator as claimed in claim 9, wherein said speech translator includes a display means and wherein said user can control during translation the display of words stored in said banks. 18. A speech translator as claimed in claim 8 or 9, wherein said speech translator includes a display means and wherein said programmable control means causes said display said series of words in said first language when said programmable control means is in said training mode of operation. 19. A speech translator as claimed in claims 2, 3, 4, 5, 6, 7, 8, or 9, wherein: (a) said speech translator includes a hand-sized housing; (b) said display means, word recognition means, word producing means and said programmable control means are all mounted in said hand-sized housing; and (c) said language storing means includes at least two cartridges, said cartridges being removably mounted in said hand held-housing. 20. A speech translator as claimed in claim 19, wherein said TRAIN ALL words option instructs a user to speak all of the series of words in said first language stored in digitally coded form in said language storage means as they are displayed. 21. A speech translator as claimed in claim 20, wherein said training mode of operation also includes a TEST option during which a user speaks in said first language words to be tested and the digitally coded voice pattern produced by said word recognition means in response to said user speaking is analyzed to determine if the words spoken by the user are part of a legitimate code string that includes a digitally coded voice pattern stored in said language storage means. 22. A speech translator as claimed in claim 21, wherein said displays instructions to a user to speak a displayed word or words as the word or words are displayed when said programmable control means is in said training mode of operation. 23. A speech translator as claimed in claim 22, wherein said control means includes a talk key that enables said word recognition means to receive audible words spoken by a user and create corresponding digitally coded voice patterns when said talk key is depressed and wherein said speech translator instructs a user to depress said talk key as well as speak a word or words in said first language when said programmable control means is in said training mode of operation. 24. A speech translator as claimed in claim 23, wherein said control means includes cursor keys and wherein said cursor keys are used to scroll through words displayed by said display means when said programmable control means is in said TRAIN SELECTED words option of said training mode of operation. |