|Publication number||USH2098 H1|
|Application number||US 08/200,049|
|Publication date||Mar 2, 2004|
|Filing date||Feb 22, 1994|
|Priority date||Feb 22, 1994|
|Also published as||US20030036911|
|Publication number||08200049, 200049, US H2098 H1, US H2098H1, US-H1-H2098, USH2098 H1, USH2098H1|
|Inventors||Lee M. E. Morin|
|Original Assignee||The United States Of America As Represented By The Secretary Of The Navy|
The invention described herein may be manufactured, used, and/or licensed by or for the United States Government for governmental purposes without the payment of any royalties thereon.
Field of the Invention
This invention provides a method and apparatus for providing interpretation into a chosen one of a plurality of languages for a structured interview, especially the type of interview done by a medical professional (hereafter called the physician, the operator, or the user) with a patient who does not share a common language, without the necessity of a human interpreter, and without the necessity of the person being interviewed (hereafter called the patient or the respondent) being able to read or write in any language. The terms translation and interpretation are used interchangeably herein.
Medical history taking, physical examination, diagnostic procedures, and treatment all involve verbal communication to some degree. With rapid world-wide travel now being common, patients are often presented to physicians for care who do not have a common language with the physicians. While it is in this context that the inventor approached the problem, the invention could also be used between confessor and penitent, waiter and customer, hotel desk clerk and international customers, or in other situations where multiple unknown languages must be dealt with.
The use of a human interpreter is a good solution to the physician/patient interview, but it has drawbacks. An interpreter may not be available. It may not even be initially clear what language the patient speaks. Interpreters often interfere with the interview process. They may inject their usually poor medical judgment into the interview, or they may be embarrassed by or embarrass the patient with probing personal questions. If the translator is a relative of the patient, embarrassment or outright fabrication of answers may result.
Description of the Prior Art
In the prior art, phrase books have been used, and a large set of these for many different languages have been compiled by the United States Department of Defense. These have their drawbacks. Where they are written for the physician to attempt to pronounce a transliteration into a language that the physician is not familiar with, they frequently result in lack of understanding. Pointing to a written phrase in a phrase book requires that the patient be literate, and it is often slow.
In U.S. Pat. No. 4,428,733 to Kumar-Misir, a series of question and answer sheets are provided in two languages, with answers given in one language being generally understandable by reference to sheets in the second language. This would be slow, would require a literate patient, and would not allow the physician to choose the next question based upon the response to the previous question.
There have been efforts, such as represented in U.S. Pat. No. 4,984,177 to Rondel et al., to provide a number of phrases and sentences in a single foreign language, with provision for the user to attempt in his own language to select one or more of those phrases, and if his selection is recognized as possible, to play out a recorded foreign language version of what the user selected. In Rondel et al., this selection is made by training the device to recognize the user's voice as a means for making the selection in his own language. This device can operate in only one foreign language unless restructured, and provides no means for questioning a respondent to determine what foreign language would be suitable for an interview. It is also structured to operate only with user voices that it recognizes, making it time consuming at best for a new user to begin using the translator on short notice.
Summary of the Invention
The invention provides a translating machine to enable an operator who is fluent in one language to interview a respondent using a predetermined list of available sentences, which may include questions. This assumes that the respondent speaks any one of a plurality of available languages other than the language in which the operator is fluent, and also assumes that the respondent need not be literate in any language. Translations into each of the available languages of each of the available sentences are stored in advance in a digital form which is convertible into an audio waveform. The available language to be used with a particular respondent is chosen. The user selects individual desired sentences from an alphanumerically stored list which is visually presented to the user. Then, as selected by the user, translations of the chosen sentences are played out in audio form to the respondent.
These translations into individual foreign languages were obtained and stored in advance from speakers who were fluent in the individual languages. One of the available sentences is visually presented to the speaker for translation and his spoken translation is recorded. It is then played back for the speaker's approval, and if approved is accepted for long-term storage. If not approved, the speaker is given additional opportunities for recording his spoken translation until he is satisfied.
When the device is to be used to interview a potential respondent, if the language spoken by that respondent is uncertain, the user plays samples of seemingly probable languages to the respondent to determine which language the respondent chooses. The user then can limit future translations to a given respondent to a language which the given respondent has chosen from the samples. In general, digital audio sentences sufficient to conduct a medical interview in a large number of languages, approximating 25 or 30, can be stored on one CD-ROM disk of the size currently in wide use.
Brief Description of the Drawings
FIG. 1 is a schematic block diagram of a translating machine in accordance with the present invention.
FIG. 2 is a schematic block diagram of a machine for recording a series of translations into a given foreign language.
FIG. 3 is a schematic block diagram of an element for use with the device of FIG. 1 for selecting which of a plurality of foreign languages a given respondent is familiar with.
FIG. 4 is a schematic block diagram indicating that a plurality of foreign languages can be stored on and played back from a single CD-ROM.
Description of the Preferred Embodiment
When a physician wishes to interview a patient, as in an initial examination, there is a standard list of questions, almost a script, that covers most of what has to be asked. Lists of these phrases have long been available in Department of Defense phrase books referred to above. Other than “yes” or “no” answers in a foreign language, the physician will generally have difficulty understanding responses in the foreign language and must depend upon pointing, holding up a proper number of fingers for the answer, and other non-verbal responses.
Referring to FIG. 1, which is a schematic block diagram of a translating machine in accordance with the present invention, a storage unit 2 stores an alphabetical list of available phrases in the operator/user's language, and it is possible to move about the available list through the use of a manual selector 4 which can choose among the various available phrases. The phrases available to choose from are displayed to the operator on a visual display of available phrases 6.
The precise method of manually selecting from the available phrases can be chosen from several. It is possible to do a word search by typing in a word such as “appendicitis” and have all available phrases using that word appear on the visual display in order to allow selection of a desired phrase. It is possible to choose, with a mouse or otherwise, from the available phrases being displayed on the visual display in order to select the desired phrase. It is possible to have a script containing a plurality of questions to be asked in sequence (or skipped) as desired for a particular procedure or interview, and to go down that script in order to select the desired phrase.
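The word-search style of selection can be illustrated with a short sketch. This is not the inventor's Visual Basic program; the sample phrase list and the function name `search_phrases` are hypothetical and serve only to show how typing a word could narrow the displayed phrases.

```python
import re

# Hypothetical sample of the operator-language phrase list (storage unit 2).
PHRASES = [
    "Do you have pain in your abdomen?",
    "Have you ever had appendicitis?",
    "Has anyone in your family had appendicitis?",
    "Point to where it hurts.",
]


def search_phrases(word, phrases):
    """Return the phrases that contain `word` as a whole word, ignoring case."""
    pattern = re.compile(r"\b" + re.escape(word) + r"\b", re.IGNORECASE)
    return [phrase for phrase in phrases if pattern.search(phrase)]


# The operator types a word; the matching phrases would go to visual display 6.
matches = search_phrases("appendicitis", PHRASES)
```

Whole-word matching is a design choice made here for illustration; a real phrase list might instead call for prefix or stem matching.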
For the purposes of FIG. 1, it is assumed that, by this time, the foreign language to be used has been selected by the operator, using a foreign language selector 8. This can also be operated from a keyboard or with a mouse. Selector 8 operates a logical switch 10, which chooses whether to take the stored spoken foreign language from a storage 12 for a first spoken foreign language, or a storage 14 for a second spoken foreign language.
The choice from the available phrases by the operator from selector 4 goes to a selector 16 for corresponding foreign language phrases. This selector, in connection with logical switch 10, chooses a recorded spoken phrase in the chosen foreign language (the first spoken foreign language with the switch as illustrated) and passes that recorded phrase to an audio playout device 18, where it is played out to be listened to by the respondent/patient.
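The path from phrase selection to playout in FIG. 1 amounts to a lookup keyed by the chosen language and the chosen phrase. The sketch below assumes one audio file per phrase per language; the directory layout, file naming, and the name `recording_path` are assumptions made for illustration and are not taken from the inventor's programs.

```python
from pathlib import PurePosixPath

# Operator-language phrases keyed by ID (storage unit 2). Sample data only.
PHRASES = {
    1: "Where does it hurt?",
    2: "Are you allergic to any medicine?",
}

# One directory of recordings per language (storages 12, 14, and so on).
# The layout is a hypothetical choice for this sketch.
LANGUAGE_DIRS = {
    "vietnamese": "audio/vi",
    "thai": "audio/th",
}


def recording_path(language, phrase_id):
    """Play the role of logical switch 10 plus selector 16: pick the recording
    of `phrase_id` in `language` and return the file that would be handed to
    the audio playout device."""
    if language not in LANGUAGE_DIRS:
        raise KeyError(f"no recordings stored for language {language!r}")
    if phrase_id not in PHRASES:
        raise KeyError(f"unknown phrase id {phrase_id!r}")
    return str(PurePosixPath(LANGUAGE_DIRS[language]) / f"{phrase_id:04d}.wav")
```

Keeping the phrase ID independent of the language means that adding a language only requires adding another set of recordings, which matches the remark that many more languages could be connected to the switch.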
Referring to FIG. 2, which is a schematic block diagram of a machine for recording translations of a series of phrases into a given foreign language, a storage unit 2 is provided for alphanumeric storage of available phrases in the operator's language. The phrases to be translated are presented to the person/speaker who will speak and record the translations on a visual display 6. This speaker is, of course, necessarily knowledgeable in the foreign language to be recorded, unlike the physician/user who is to be the ultimate user of the machine.
When a phrase is presented for translation on display 6, the speaker speaks the translation into microphone 30, from which it is taken and temporarily stored in a temporary storage unit 32 for equivalent spoken foreign language phrases. The recorded phrase is then played back on an audio playout device 34 for the approval of the speaker. The speaker indicates whether or not he approves the translation as played back on manual approval indicator 36. If he does not approve, a re-record control 38 causes the system to accept a new recording of the phrase from the speaker until he gets one he approves. If he does approve of the translation, a transfer control unit 40 causes the temporarily stored phrase from storage unit 32 to be transferred to long-term storage unit 42 for storage as an approved equivalent spoken foreign language phrase.
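The record, play back, approve, and transfer cycle of FIG. 2 can be sketched as a simple loop. The callables passed in stand for the microphone, playout device, and approval indicator, and are hypothetical. The `max_attempts` limit is an added assumption; the description above lets the speaker re-record until satisfied.

```python
def record_phrase(phrase_id, record, play_back, approves, long_term, max_attempts=5):
    """Record a translation until the speaker approves it, then store it.

    record()         -> audio bytes captured from the microphone (30)
    play_back(audio) -> plays the take back to the speaker (34)
    approves()       -> True if the speaker accepts the take (36)
    long_term        -> dict standing in for long-term storage unit 42

    Returns the number of takes used, or None if no take was approved.
    """
    for attempt in range(1, max_attempts + 1):
        temporary = record()          # held in temporary storage (32)
        play_back(temporary)
        if approves():
            long_term[phrase_id] = temporary  # transfer control (40)
            return attempt
        # Not approved: fall through and re-record (38).
    return None


# Demonstration with stand-ins: the first take is rejected, the second accepted.
takes = iter([b"take-1", b"take-2"])
verdicts = iter([False, True])
store = {}
attempts = record_phrase(7, lambda: next(takes), lambda audio: None,
                         lambda: next(verdicts), store)
```

Only an approved take ever reaches `long_term`, which mirrors the separation between temporary storage unit 32 and long-term storage unit 42.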
Referring to FIG. 3, which is a schematic block diagram of an element for use with the device of FIG. 1 for selecting which of a plurality of recorded foreign languages a given respondent/patient is familiar with, foreign language selector 8 is shown in more detail in FIG. 3. When a respondent/patient is first presented for interview, if it is not clear what language the respondent understands, manual control 50 is operated to cause a selector 52 to make an initial selection of samples from a plurality of foreign languages. If, for example, a Navy ship picks up a person of oriental appearance from a raft in the ocean off southeast Asia, the operator might choose a series of languages such as Vietnamese, Laotian, Thai, Burmese, etc., to use in the first attempt to find the language of the respondent. In each language in sequence, selector 52 might ask, in that language, “Do you understand this language? If so, say yes.” These questions would be played out to the respondent from the audio playout device 18 of FIG. 1. When a satisfactory language was arrived at, manual control 50 could be used to operate limiter 54 to limit future translations to the one selected foreign language which had been found satisfactory. While switch 10 is shown as a logical switch connected to sources for two foreign languages, many more foreign languages could be connected. When the foreign languages are stored on CD-ROM, as indicated in FIG. 4, phrases and sentences sufficient to conduct a medical interview in up to twenty-five or thirty different foreign languages can be stored on one CD-ROM disk 60, and, of course, a plurality of such disks can be used interchangeably.
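The language-finding procedure of FIG. 3 can be sketched as a loop over the operator's candidate languages. The function and parameter names are hypothetical; in practice the "understood" signal would come from the operator observing the respondent and operating manual control 50.

```python
def find_language(candidates, play_sample, understood):
    """Play the sample question in each candidate language in turn and return
    the first language the respondent acknowledges, or None if none is found.

    play_sample(language) -> plays "Do you understand this language?" in it
    understood(language)  -> True if the operator saw or heard a "yes"
    """
    for language in candidates:
        play_sample(language)
        if understood(language):
            return language  # later translations are limited to this one (54)
    return None


# Demonstration with stand-ins: the respondent reacts to the second sample.
played = []
chosen = find_language(
    ["vietnamese", "laotian", "thai", "burmese"],
    played.append,
    lambda language: language == "laotian",
)
```

The loop stops at the first acknowledged language, so the operator's ordering of candidates by likelihood directly determines how many samples the respondent has to hear.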
It is perfectly possible to construct a special-purpose device containing all of the digital logic to carry out the functions of this invention. However, from the point of economy and ease of operation, the preferred embodiment of the invention uses a personal computer to carry out the function. The system used by the inventor is configured as follows:
An Austin 433VLI Winstation 486 computer with 20 megabytes RAM, two Maxtor hard disk drives respectively holding 130 megabytes and 220 megabytes, a CD drive and soundboard provided by Soundblaster Pro multimedia kit, a Colorado Mountain Jumbo tape backup unit, an SVGA monitor, a Diamond Stealth video board with 1 megabyte of RAM, DOS version 5.0, Windows version 3.1, Norton Desktop version 2.0, WavaWav (Wave after Wave) version 1.5 (a shareware utility allowing sequential audio playback without using Windows) which is available from Ben Salido, 660 West Oak St., Hurst, Tex. 76053-5526, WAVE EDITOR version 1.03 (a shareware utility allowing wave editing, which displays waveform, allowing blocking of the part of a waveform to be retained, thereby reducing required memory, and also allowing amplitude adjustment) available from Keith W. Boone, 114 Broward St., Tallahassee, Fla. 32301, Sony SRS 27 speakers, ACE CAT 5-inch tablet for mouse, and Microsoft Visual Basic version 3.0.
Many variations on this configuration would be possible, but this is the configuration used by the inventor, which is known to be operable. The inventor uses computer programs in Visual Basic, operated under Windows, to run the system. Although these programs are made a part of the file of this application as originally filed, they are not considered to be essential to the invention per se. It is within the skill of those skilled in the art to write such programs as needed, and the programs themselves are not intended for printing with a patent resulting from this application.
When the foreign-language speaker is recording the initial translations, the newly recorded material is originally recorded in RAM, then after approval by the speaker is transferred to a hard disk. When the complete set of phrases for a given language is successfully recorded, the phrases are “harvested” from the hard disk and combined with sets of phrases from other languages for permanent recording on a CD-ROM disk. Eventually as many different CD-ROM disks as are needed can be used.
It may be advisable to record all the sample questions needed to find the language spoken by the respondent on one disk for all available languages, to reduce the need for frequent switching of disks as the language is located. It is also possible, when operating in an environment where perhaps five or fewer foreign languages will cover all of the potential respondents, to download those languages from a CD-ROM disk to a hard disk of perhaps 80 megabyte capacity, to avoid the necessity of carrying a CD-ROM drive in a portable computer.
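The storage figures quoted in this description can be checked with rough arithmetic. The numbers below are assumptions, not values from the patent: a CD-ROM capacity of about 650 megabytes, and uncompressed 8-bit mono audio sampled at 11,025 Hz. The actual recording format used by the inventor is not stated.

```python
CD_ROM_MB = 650          # assumed CD-ROM capacity, in megabytes
SAMPLE_RATE_HZ = 11_025  # assumed sampling rate
BYTES_PER_SAMPLE = 1     # assumed 8-bit mono, uncompressed


def mb_per_language(total_mb, languages):
    """Storage budget available to each language, in megabytes."""
    return total_mb / languages


def minutes_of_audio(megabytes):
    """Minutes of audio that fit in `megabytes` under the assumed format."""
    seconds = megabytes * 1_000_000 / (SAMPLE_RATE_HZ * BYTES_PER_SAMPLE)
    return seconds / 60


cd_budget_25 = mb_per_language(CD_ROM_MB, 25)  # budget with 25 languages on a disk
cd_budget_30 = mb_per_language(CD_ROM_MB, 30)  # budget with 30 languages on a disk
disk_budget = mb_per_language(80, 5)           # five languages on an 80 MB disk
disk_minutes = minutes_of_audio(disk_budget)   # speech per language on that disk
```

Under these assumptions each language gets roughly 22 to 26 megabytes on a CD-ROM and 16 megabytes on the 80 megabyte hard disk, the latter being about 24 minutes of recorded speech per language. The two figures are therefore of the same order, which is consistent with the description, but the estimate changes directly with the sampling rate and sample size chosen.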
It is also desirable to provide the ability to keep a medical history by recording and later printing out a record of the questions asked and the physician's contemporaneous recording of the patient's responses to those questions. The system also allows recording a series of phrases as used with one patient, then subsequently editing the phrases in the physician's language to derive a suitable set of phrases for use with later similar patients in any available language. This edited version can include comments which were later added by the editing physician to assist later users. Editing can be done by using the Windows integrated utility Notepad, or by using other word processors, or by using the program which has been written in Visual Basic.
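The record-keeping feature can be sketched as a small data structure that stores each question with the physician's note and can be rendered as printable text or reduced to a reusable script. The class and method names are hypothetical and are not drawn from the inventor's Visual Basic programs.

```python
from dataclasses import dataclass, field


@dataclass
class InterviewRecord:
    """Questions asked in one interview, with the physician's notes."""
    language: str
    entries: list = field(default_factory=list)

    def add(self, question, response_note=""):
        """Log a question and the physician's note on the patient's response."""
        self.entries.append((question, response_note))

    def to_text(self):
        """Render the record as plain text suitable for printing or editing."""
        lines = [f"Interview language: {self.language}"]
        for number, (question, note) in enumerate(self.entries, start=1):
            lines.append(f"{number}. Q: {question}")
            lines.append(f"   Response noted: {note or '(none recorded)'}")
        return "\n".join(lines)

    def script(self):
        """Return only the questions, as a starting script for later patients."""
        return [question for question, _ in self.entries]


record = InterviewRecord("thai")
record.add("Where does it hurt?", "points to lower right abdomen")
record.add("Are you allergic to any medicine?")
```

Because the questions are stored in the physician's language, the list returned by `script()` can be edited in any text editor and then reused with a patient who speaks a different available language.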
|Cited Patent||Filing date||Publication date||Applicant||Title|
|US4428733||Jul 13, 1981||Jan 31, 1984||Kumar Misir Victor||Information gathering system|
|US4493050 *||Jul 24, 1981||Jan 8, 1985||Sharp Kabushiki Kaisha||Electronic translator having removable voice data memory connectable to any one of terminals|
|US4593356 *||Jul 16, 1981||Jun 3, 1986||Sharp Kabushiki Kaisha||Electronic translator for specifying a sentence with at least one key word|
|US4613944 *||Aug 25, 1981||Sep 23, 1986||Sharp Kabushiki Kaisha||Electronic translator having removable data memory and controller both connectable to any one of terminals|
|US4843589||Sep 17, 1982||Jun 27, 1989||Sharp Kabushiki Kaisha||Word storage device for use in language interpreter|
|US4882681 *||Sep 2, 1987||Nov 21, 1989||Brotz Gregory R||Remote language translating device|
|US4984177||Feb 1, 1989||Jan 8, 1991||Advanced Products And Technologies, Inc.||Voice language translator|
|US5010495 *||Feb 2, 1989||Apr 23, 1991||American Language Academy||Interactive language learning system|
|US5056145||Jan 22, 1990||Oct 8, 1991||Kabushiki Kaisha Toshiba||Digital sound data storing device|
|US5063534 *||Oct 12, 1989||Nov 5, 1991||Canon Kabushiki Kaisha||Electronic translator capable of producing a sentence by using an entered word as a key word|
|US5065317 *||May 24, 1990||Nov 12, 1991||Sony Corporation||Language laboratory systems|
|US5091876||Dec 18, 1989||Feb 25, 1992||Kabushiki Kaisha Toshiba||Machine translation system|
|US5341291 *||Mar 8, 1993||Aug 23, 1994||Arch Development Corporation||Portable medical interactive test selector having plug-in replaceable memory|
|US5375164 *||Aug 12, 1992||Dec 20, 1994||At&T Corp.||Multiple language capability in an interactive system|
|US5384701 *||Jun 7, 1991||Jan 24, 1995||British Telecommunications Public Limited Company||Language translation system|
|US5523946 *||May 5, 1995||Jun 4, 1996||Xerox Corporation||Compact encoding of multi-lingual translation dictionaries|
|1||*||Cowart, R., "Mastering Windows 3.1" pp. 516-518 Sybex Inc. 1993.*|
|2||Operator's Guide-Morin Multimedia Medical Translator Release 2.0 (1993) (by the inventor).|
|3||*||Wurst, Brooke E. "PC Interpreter topple the tower of babble. (Evaluation)", Nov. 1992, Computer Shopper, v12, n11, p950(2).*|
|Citing Patent||Filing date||Publication date||Applicant||Title|
|US7702624||Apr 19, 2005||Apr 20, 2010||Exbiblio, B.V.||Processing techniques for visual capture data from a rendered document|
|US7706611||Aug 23, 2005||Apr 27, 2010||Exbiblio B.V.||Method and system for character recognition|
|US7707039||Dec 3, 2004||Apr 27, 2010||Exbiblio B.V.||Automatic modification of web pages|
|US7742953||Apr 1, 2005||Jun 22, 2010||Exbiblio B.V.||Adding information or functionality to a rendered document via association with an electronic counterpart|
|US7812860||Sep 27, 2005||Oct 12, 2010||Exbiblio B.V.||Handheld device for capturing text from both a document printed on paper and a document displayed on a dynamic display device|
|US7818215||May 17, 2005||Oct 19, 2010||Exbiblio, B.V.||Processing techniques for text capture from a rendered document|
|US7831912||Apr 1, 2005||Nov 9, 2010||Exbiblio B. V.||Publishing techniques for adding value to a rendered document|
|US7949538 *||Mar 14, 2007||May 24, 2011||A-Life Medical, Inc.||Automated interpretation of clinical encounters with cultural cues|
|US7990556||Feb 28, 2006||Aug 2, 2011||Google Inc.||Association of a portable scanner with input/output and storage devices|
|US8005720||Aug 18, 2005||Aug 23, 2011||Google Inc.||Applying scanned information to identify content|
|US8019648||Apr 1, 2005||Sep 13, 2011||Google Inc.||Search engines and systems with handheld document data capture devices|
|US8081849||Feb 6, 2007||Dec 20, 2011||Google Inc.||Portable scanning and memory device|
|US8179563||Sep 29, 2010||May 15, 2012||Google Inc.||Portable scanning device|
|US8214387||Apr 1, 2005||Jul 3, 2012||Google Inc.||Document enhancement system and method|
|US8244222||May 2, 2005||Aug 14, 2012||Stephen William Anthony Sanders||Professional translation and interpretation facilitator system and method|
|US8261094||Aug 19, 2010||Sep 4, 2012||Google Inc.||Secure data gathering from rendered documents|
|US8346620||Sep 28, 2010||Jan 1, 2013||Google Inc.||Automatic modification of web pages|
|US8418055||Feb 18, 2010||Apr 9, 2013||Google Inc.||Identifying a document by performing spectral analysis on the contents of the document|
|US8423370||Apr 19, 2011||Apr 16, 2013||A-Life Medical, Inc.||Automated interpretation of clinical encounters with cultural cues|
|US8442331||Aug 18, 2009||May 14, 2013||Google Inc.||Capturing text from rendered documents using supplemental information|
|US8447066||Mar 12, 2010||May 21, 2013||Google Inc.||Performing actions based on capturing information from rendered documents, such as documents under copyright|
|US8489624||Jan 29, 2010||Jul 16, 2013||Google, Inc.||Processing techniques for text capture from a rendered document|
|US8505090||Feb 20, 2012||Aug 6, 2013||Google Inc.||Archive of text captures from rendered documents|
|US8515816||Apr 1, 2005||Aug 20, 2013||Google Inc.||Aggregate analysis of text captures performed by multiple users from rendered documents|
|US8600196||Jul 6, 2010||Dec 3, 2013||Google Inc.||Optical scanners, such as hand-held optical scanners|
|US8620083||Oct 5, 2011||Dec 31, 2013||Google Inc.||Method and system for character recognition|
|US8638363||Feb 18, 2010||Jan 28, 2014||Google Inc.||Automatically capturing information, such as capturing information using a document-aware device|
|US8639829 *||May 19, 2010||Jan 28, 2014||Ebay Inc.||System and method to facilitate translation of communications between entities over a network|
|US8655668||Mar 15, 2013||Feb 18, 2014||A-Life Medical, Llc||Automated interpretation and/or translation of clinical encounters with cultural cues|
|US8682823||Apr 13, 2007||Mar 25, 2014||A-Life Medical, Llc||Multi-magnitudinal vectors with resolution based on source vector features|
|US8713418||Apr 12, 2005||Apr 29, 2014||Google Inc.||Adding value to a rendered document|
|US8731954||Mar 27, 2007||May 20, 2014||A-Life Medical, Llc||Auditing the coding and abstracting of documents|
|US8799099||Sep 13, 2012||Aug 5, 2014||Google Inc.||Processing techniques for text capture from a rendered document|
|US8874504||Mar 22, 2010||Oct 28, 2014||Google Inc.||Processing techniques for visual capture data from a rendered document|
|US8914395||Jan 3, 2013||Dec 16, 2014||Uptodate, Inc.||Database query translation system|
|US8953886||Aug 8, 2013||Feb 10, 2015||Google Inc.||Method and system for character recognition|
|US8990235||Mar 12, 2010||Mar 24, 2015||Google Inc.||Automatically providing content associated with captured information, such as information captured in real-time|
|US9008447||Apr 1, 2005||Apr 14, 2015||Google Inc.||Method and system for character recognition|
|US9030699||Aug 13, 2013||May 12, 2015||Google Inc.||Association of a portable scanner with input/output and storage devices|
|US9063924||Jan 28, 2011||Jun 23, 2015||A-Life Medical, Llc||Mere-parsing with boundary and semantic driven scoping|
|US9075779||Apr 22, 2013||Jul 7, 2015||Google Inc.||Performing actions based on capturing information from rendered documents, such as documents under copyright|
|US9081799||Dec 6, 2010||Jul 14, 2015||Google Inc.||Using gestalt information to identify locations in printed information|
|US9116890||Jun 11, 2014||Aug 25, 2015||Google Inc.||Triggering actions in response to optically or acoustically capturing keywords from a rendered document|
|US9143638||Apr 29, 2013||Sep 22, 2015||Google Inc.||Data capture from rendered documents using handheld device|
|US9268852||Sep 13, 2012||Feb 23, 2016||Google Inc.||Search engines and systems with handheld document data capture devices|
|US9275051||Nov 7, 2012||Mar 1, 2016||Google Inc.||Automatic modification of web pages|
|US9323784||Dec 9, 2010||Apr 26, 2016||Google Inc.||Image search using text-based elements within the contents of images|
|US9411793||Jul 13, 2011||Aug 9, 2016||Motionpoint Corporation||Dynamic language translation of web site content|
|US9418062 *||Jan 21, 2009||Aug 16, 2016||Geacom, Inc.||Method and system for situational language interpretation|
|US9465782||Jul 13, 2011||Oct 11, 2016||Motionpoint Corporation||Dynamic language translation of web site content|
|US9514134||Jul 15, 2015||Dec 6, 2016||Google Inc.||Triggering actions in response to optically or acoustically capturing keywords from a rendered document|
|US9633013||Mar 22, 2016||Apr 25, 2017||Google Inc.||Triggering actions in response to optically or acoustically capturing keywords from a rendered document|
|US20020111791 *||Feb 15, 2001||Aug 15, 2002||Sony Corporation And Sony Electronics Inc.||Method and apparatus for communicating with people who speak a foreign language|
|US20030065504 *||Oct 2, 2001||Apr 3, 2003||Jessica Kraemer||Instant verbal translator|
|US20030146926 *||Jan 22, 2003||Aug 7, 2003||Wesley Valdes||Communication system|
|US20030200088 *||Apr 18, 2002||Oct 23, 2003||Intecs International, Inc.||Electronic bookmark dictionary|
|US20040172236 *||Feb 27, 2003||Sep 2, 2004||Fraser Grant E.||Multi-language communication system|
|US20070226211 *||Mar 27, 2007||Sep 27, 2007||Heinze Daniel T||Auditing the Coding and Abstracting of Documents|
|US20080208596 *||Mar 14, 2007||Aug 28, 2008||A-Life Medical, Inc.||Automated interpretation of clinical encounters with cultural cues|
|US20080256329 *||Apr 13, 2007||Oct 16, 2008||Heinze Daniel T||Multi-Magnitudinal Vectors with Resolution Based on Source Vector Features|
|US20090070140 *||Aug 4, 2008||Mar 12, 2009||A-Life Medical, Inc.||Visualizing the Documentation and Coding of Surgical Procedures|
|US20100228536 *||May 19, 2010||Sep 9, 2010||Steve Grove||System and method to facilitate translation of communications between entities over a network|
|US20110196665 *||Apr 19, 2011||Aug 11, 2011||Heinze Daniel T||Automated Interpretation of Clinical Encounters with Cultural Cues|
|US20110246174 *||Jan 21, 2009||Oct 6, 2011||Geacom, Inc.||Method and system for situational language interpretation|
|US20120017146 *||Jul 13, 2011||Jan 19, 2012||Enrique Travieso||Dynamic language translation of web site content|
|Mar 10, 1994||AS||Assignment|
Owner name: UNITED STATES OF AMERICA, THE, AS REPRESENTED BY T
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MORIN, LEE M. E.;REEL/FRAME:006878/0620
Effective date: 19940217