|Publication number||US5995590 A|
|Application number||US 09/035,493|
|Publication date||Nov 30, 1999|
|Filing date||Mar 5, 1998|
|Priority date||Mar 5, 1998|
|Publication number||035493, 09035493, US 5995590 A, US 5995590A, US-A-5995590, US5995590 A, US5995590A|
|Inventors||Peter Thomas Brunet, Abraham P. Ittycheriah, Chandrasekhar Narayanaswami, Michael Alan Picheny, Bhuvana Ramabhadran|
|Original Assignee||International Business Machines Corporation|
|Export Citation||BiBTeX, EndNote, RefMan|
|Patent Citations (13), Referenced by (64), Classifications (10), Legal Events (5)|
|External Links: USPTO, USPTO Assignment, Espacenet|
1. Field of the Invention
The present invention relates to telephone communication systems, and more particularly to integrated messaging and voice free telephony.
2. Background Art
In telephone communication there are times at which people desire to talk normally into the telephone mouthpiece but are inhibited to do so because they. will disturb other people who are nearby.
With presently available technology, when a telephone call needs to be made, the speaker has to either disturb others or leave the room and handle the call in a more private setting.
The present invention provides a method and apparatus that allows people to carry on unobtrusive phone conversations in business or other settings where it is either not possible or impolite to talk. Some examples would be in business meetings, large conference rooms, game stadiums, airplanes and the like. The present invention is particularly significant with the use of cellular telephones which may be operated in any setting.
The present invention also allows a hearing impaired or deaf person to communicate using the telephone. The apparatus can be used either in isolation or as a service to this community. The apparatus allows both calls to be made as well as received. Currently, this community uses a relay service to communicate with others. This type of service is done with human intervention and is slow and rather expensive. Our apparatus allows the hearing impaired person more control over the call and reduces his long term cost of using relay services.
Some examples of presently available background technology include "text to speech" (TTS) systems that convert text to sound such as IBM's "Eloquent TTS", IBM's "Virtual Voices", and Bell Laboratories "Text to Speech Synthesis". This technology is presently used to read text passages aloud. For example, using this technology, a person can listen to his e-mail while he is driving. The text to speech conversion can be done in real time in software with presently available computer CPUs such as the 166 MHz Pentium processors.
Another technology referred to as "speech recognition" is also presently available. IBM's "ViaVoice" is an example of this technology. Speech recognition systems technology convert spoken words into text.
An object of the present invention is to provide a telephone communication system that permits flexible communication.
Another object of the present invention is to provide an asymmetric telephone system wherein the user can communicate without disturbing others who are nearby.
A further object of the present invention is to provide a telephone system that can be used by the hearing impaired without involving a third party human transcriber.
Other and further features, advantages and benefits of the invention will become apparent in the following description taken in conjunction with the following drawings. It is to be understood that the foregoing general description and the following detailed description are exemplary and explanatory but are not to be restrictive of the invention. The accompanying drawings which are incorporated in and constitute a part of this invention and, together with the description, serve to explain the principles of the invention in general terms. Like numerals refer to like parts throughout the disclosure.
FIG. 1 is a schematic block diagram of an asymmetric telephone system according to the principles of the present invention.
FIGS. 2-13 are illustrations of icons representing commonly used speech phrases.
FIG. 14 is a schematic illustration of an integrated messaging system according to the principles of the present invention.
The system of the present invention utilizes the fact that keyboards are very silent and non-intrusive. Referring to FIG. 1 an embodiment of an asymmetric form of communication is shown. In the system of FIG. 1, a telephone, user one will listen in the same manner as with a regular telephone. However, he will not speak into the telephone microphone. User one instead employs unit 10 including keyboard 12 to enter the text corresponding to what he wants to say. The text is converted into a synthesized speech using TTS apparatus 14 and a voice output 16 is sent to the microphone 18 of the phone apparatus 20. Telephone apparatus 20 transmits the synthesized voice signal over a standard telephone line 30 to unit 24 including a conventional telephone speaker 28 and telephone microphone 26.
User two, the party using the telephone at the other end, listens to a synthesized voice, but user one listens to the actual voice of user two with telephone speaker 22, unless user two is also using a system similar to that of FIG. 1.
Unit 10 also includes a speech recognizer means 13 and a display means 15 connected to the incoming speech signals from telephone microphone 26 employed by user two.
Handwritten text may also be used in the system of FIG. 1 by employing a computer with a character recognition program as a means of input in unit 10. In such a case handwriting is converted into synthesized sound and inputted into the telephone microphone.
Thus, in the system of FIG. 1, a signal cable from the keyboard 12 or a computer carries synthesized sound signals to the phone. The text to speech conversion may be done by the user's portable computer as the TTS apparatus 14 for example. The text to speech conversion may be done by any other suitable device as well. For example, the text may be transmitted and the text to speech conversion may be done by locating apparatus 14 at the other end of the line. Other system variations are also possible. For example, the internet may be used for the phone connection between instead of a regular phone line using available internet telephony.
The text to speech converter 14 of the present invention is somewhat different from present TTS systems. The TTS system of the present invention analyzes each word and converts it to sound after a possible spelling correction and sends the voice signal to the telephone microphone as soon as it detects a word boundary instead of waiting for complete sentences or paragraphs. Words that are miss-typed at keyboard 12 and then corrected by user one are converted to speech and sent again with an indication of the correction to user two. Retransmission begins from the word that was corrected.
In the TTS text to speech apparatus 14, the voice synthesis is tailored to the user's voice characteristics as closely as possible. This means that intonation, pitch, breathings, etc., can be altered as desired. In addition, security code words may be used to authenticate to user two that it is indeed the correct user one who is making the call and not an impersonator.
The present invention may be useful even when people are not using wireless phones. For example, a group of people in a shared office may use the same idea. In another example, the invention may be employed by doctors in an operating room that requires a low noise level. Passengers in an airplane is yet another example.
The present invention is also useful to the hearing impaired. The hearing impaired person can either receive an incoming call or make an outgoing call by keying in the phone numbers to be called. When the system connects the other party to the call, the system announces that the call is using a special device. The system then asks the other party to the call to say some sample sentences in order to enable the speech recognition system to adapt to the other party. This is not necessary if a speaker independent type of speech recognition system is being used. The system then allows the caller to speak a sentence. The sentence is transcribed by the speech recognition engine and the hearing impaired person may then choose to listen to the audio as many times as required, or may speak a sentence to the audio card in his device. The system records his speech and sends it to the caller over the telephone. Alternatively, the hearing impaired person may choose to type his response into the GUI interface and the text-to-speech system converts it to audio and then transmits it to the caller over the telephone. The device may also work as an answering machine, in the event the hearing impaired person does not take the call. In this case, a transcript of the external caller's message is saved for later retrieval by the hearing impaired person.
The present invention provides an advantage over other forms of communication such as e-mail and pagers because it provides immediate and interactive response.
To simplify the task of typing or writing with apparatus 12, the system of the present invention may provide several preselected words or phrases for the user to employ. FIGS. 2-13 illustrate this concept. An example would be a salutation such as "Good morning, this is John Smith calling from New York". Parts of the phrase may be generated from available information. Such words as morning, afternoon, etc, can be derived from the present time at the user's location. The user's location may be derived from the latitude, longitude and area code of his location. In addition these phrases may be tailored to the recipient of the call. For example, the user's administrative assistant may receive one prolog whereas the user's spouse may receive another prolog. As shown in FIGS. 2-13, icons are used to indicate commonly used phrases. For example, FIG. 2 can be used for the phrase "make travel arrangements for me". FIG. 3 is used for "we were right on target." Likewise, FIG. 4 means "it is dinner time." FIG. 5 means "I will call you when I get back to the office." FIG. 6 means "it is time to celebrate." FIG. 7 means "how hot is it there." FIG. 8 means "I have an idea!" FIG. 10 means "read me my calendar." FIG. 11 means "it is raining here." FIG. 12 means "lot more work to do" and FIG. 13 means "good work". These icons are fully programmable and reconfigurable. This facilitates a short hand notation and reduces the amount of typing necessary. Previously composed text passages can also be played by marking them as preselected phrases. These techniques allow for more interactivity during the call and can also reduce the duration of the call.
Integrated messaging systems according to the present invention may be embodied by using the technologies previously described. One embodiment of an integrated messaging system is shown in FIG. 14. In the system of FIG. 14, speech recognition technology may be used to convert voice into text that is delivered in a message that can later be either viewed as text or read out as voice mail by the user, who may be a hearing impaired, mute or deaf person. Such systems may also convert the whole conversation to text and make a text transcript of the duplex conversation. If the receiver of a telephone call is not present to receive the call, the call can be directed to either voice mail or e-mail with the aid of the voice to text converter.
More particularly, the system of FIG. 14 includes a text or handwriting converter means 14A having inputs from a keyboard or stylus 12A and pager sender 32A. A wired or wireless connection 44 is provided between converter means 14A and a voice to text converter means 46B that also has inputs from a voice input device 34B and a phone sender 36B. Inputs from the keyboard or stylus 12A and the pager sender 32A from a first user A are connected to text to speech converter means 14A and transmitted via connection 44 and voice to text converter means 46B and pager receiver, e-mail device 38B to a second user B. The voice to text converter 42B accepts audio message inputs from second user B via voice input 34B (such as a microphone) and phone sender 36B and transmits through text to speech converter 14A to first user A via audio receiver 40A and phone receiver 42A.
Likewise, inputs from the keyboard or stylus 12B and pager sender 32B from second user B are connected via text to speech conversion means 14B and wired or wireless connection 44, through which to text conversion means 46 to pager receiver E-mail device 38A. The voice to text conversion means 46A also accepts audio messages from voice input 34A (i.e., microphone) and phone sender 36A and transmits them via connection 44 through text to speech converter means 14B to second user B via audio receiver 40B and phone receiver 42B.
One skilled in the art will appreciate two components such as the keyboard, the pager (E-mail) sender, the audio receiver (sound card), phone sender and receive and voice input are all available as elements of and may be embodied in a personal computer.
While the invention has been described in connection with a preferred embodiment, it is not intended to limit the scope of the invention to the particular form set form, but, on the contrary, it is intended to cover such alternatives, modifications, and equivalence as may be included within the spirit and scope of the invention as defined in the appended claims.
|Cited Patent||Filing date||Publication date||Applicant||Title|
|US4918723 *||Oct 7, 1988||Apr 17, 1990||Jerry R. Iggulden||Keyboard to facsimile machine transmission system|
|US5253285 *||May 22, 1992||Oct 12, 1993||Alheim Curtis C||Automated interactive telephone communication system for TDD users|
|US5581593 *||Jan 5, 1995||Dec 3, 1996||Ultratec, Inc.||Combination telephone and alphanumeric entry device|
|US5809425 *||Jun 7, 1995||Sep 15, 1998||Ultratec, Inc.||Gateway for low cost alphanumeric paging entry system|
|DE3620938A1 *||Jun 23, 1986||Jan 28, 1988||Siemens Ag||Method for user guidance and transmission of writing or drawings using a digital telephone station|
|JPH036163A *||Title not available|
|JPH066436A *||Title not available|
|JPH0463044A *||Title not available|
|JPH0465960A *||Title not available|
|JPH03283852A *||Title not available|
|JPS58131857A *||Title not available|
|JPS58134568A *||Title not available|
|JPS63242054A *||Title not available|
|Citing Patent||Filing date||Publication date||Applicant||Title|
|US6198916 *||May 5, 1999||Mar 6, 2001||Lucent Technologies Inc.||Call screening in wireless systems|
|US6836668 *||Sep 1, 2000||Dec 28, 2004||Nec Corporation||Portable communication apparatus with voice/character conversion direction switch|
|US6879669 *||Sep 30, 2003||Apr 12, 2005||International Business Machines Corporation||Method and apparatus for enabling a privacy feature in a communications network|
|US6944591 *||Jul 27, 2000||Sep 13, 2005||International Business Machines Corporation||Audio support system for controlling an e-mail system in a remote computer|
|US7003082 *||Aug 5, 2003||Feb 21, 2006||Ultratec, Inc.||System for text assisted telephony|
|US7054819 *||Feb 11, 2000||May 30, 2006||Microsoft Corporation||Voice print access to computer resources|
|US7170977 *||Mar 31, 2004||Jan 30, 2007||Fairleigh Dickinson University||Telephone interface for a handicapped individual|
|US7277855 *||Feb 26, 2001||Oct 2, 2007||At&T Corp.||Personalized text-to-speech services|
|US7286649 *||Sep 8, 2000||Oct 23, 2007||Fuji Xerox Co., Ltd.||Telecommunications infrastructure for generating conversation utterances to a remote listener in response to a quiet selection|
|US7305068||Feb 25, 2005||Dec 4, 2007||Hewlett-Packard Development Company, L.P.||Telephone communication with silent response feature|
|US7346919 *||Aug 13, 2001||Mar 18, 2008||Integra5 Communications, Inc.||Television messaging vocal response generation|
|US7489767 *||Oct 29, 2002||Feb 10, 2009||Nec Corporation||Terminal device and communication control method|
|US7555104||Feb 24, 2006||Jun 30, 2009||Ultratec, Inc.||Relay for personal interpreter|
|US7649877||Sep 20, 2007||Jan 19, 2010||Braintexter, Inc||Mobile device for sending text messages|
|US7660398||Feb 18, 2005||Feb 9, 2010||Ultratec, Inc.||Captioned telephone service|
|US7689248||Sep 27, 2005||Mar 30, 2010||Nokia Corporation||Listening assistance function in phone terminals|
|US7702792 *||Jan 8, 2004||Apr 20, 2010||Cisco Technology, Inc.||Method and system for managing communication sessions between a text-based and a voice-based client|
|US7706511||Sep 20, 2007||Apr 27, 2010||Braintexter, Inc.||System and method for sending text messages converted into speech through an internet connection|
|US7865365||Aug 5, 2004||Jan 4, 2011||Nuance Communications, Inc.||Personalized voice playback for screen reader|
|US7881441||Mar 28, 2006||Feb 1, 2011||Ultratec, Inc.||Device independent text captioned telephone service|
|US8156005||Sep 9, 2008||Apr 10, 2012||Braintexter, Inc.||Systems and methods of contextual advertising|
|US8213578||Dec 13, 2007||Jul 3, 2012||Ultratec, Inc.||System for text assisted telephony|
|US8224647 *||Oct 3, 2005||Jul 17, 2012||Nuance Communications, Inc.||Text-to-speech user's voice cooperative server for instant messaging clients|
|US8229091||Apr 9, 2009||Jul 24, 2012||At&T Intellectual Property I, L.P.||Interactive voice response to short message service text messenger|
|US8280434||Feb 27, 2009||Oct 2, 2012||Research In Motion Limited||Mobile wireless communications device for hearing and/or speech impaired user|
|US8321202 *||Mar 26, 2012||Nov 27, 2012||Hoshiko Llc||Method and system for candidate matching|
|US8359234||Jan 24, 2008||Jan 22, 2013||Braintexter, Inc.||System to generate and set up an advertising campaign based on the insertion of advertising messages within an exchange of messages, and method to operate said system|
|US8416925||Apr 14, 2008||Apr 9, 2013||Ultratec, Inc.||Device independent text captioned telephone service|
|US8423412||Mar 9, 2012||Apr 16, 2013||Braintexter, Inc.||Systems and methods of contextual advertising|
|US8428952||Jun 12, 2012||Apr 23, 2013||Nuance Communications, Inc.||Text-to-speech user's voice cooperative server for instant messaging clients|
|US8463611||Oct 13, 2004||Jun 11, 2013||Hewlett-Packard Development Company, L.P.||Method and system for improving the fidelity of a dialog system|
|US8515024||Jan 13, 2010||Aug 20, 2013||Ultratec, Inc.||Captioned telephone service|
|US8892446||Dec 21, 2012||Nov 18, 2014||Apple Inc.||Service orchestration for intelligent automated assistant|
|US8903716||Dec 21, 2012||Dec 2, 2014||Apple Inc.||Personalized vocabulary for digital assistant|
|US8908838||Jun 9, 2014||Dec 9, 2014||Ultratec, Inc.||System for text assisted telephony|
|US8909545||Dec 10, 2012||Dec 9, 2014||Braintexter, Inc.||System to generate and set up an advertising campaign based on the insertion of advertising messages within an exchange of messages, and method to operate said system|
|US8917821||Mar 8, 2013||Dec 23, 2014||Ultratec, Inc.||Device independent text captioned telephone service|
|US8917822||Jun 9, 2014||Dec 23, 2014||Ultratec, Inc.||System for text assisted telephony|
|US8918322 *||Jun 20, 2007||Dec 23, 2014||At&T Intellectual Property Ii, L.P.||Personalized text-to-speech services|
|US8930191||Mar 4, 2013||Jan 6, 2015||Apple Inc.||Paraphrasing of user requests and results by automated digital assistant|
|US8942986||Dec 21, 2012||Jan 27, 2015||Apple Inc.||Determining user intent based on ontologies of domains|
|US9026445||Mar 20, 2013||May 5, 2015||Nuance Communications, Inc.||Text-to-speech user's voice cooperative server for instant messaging clients|
|US9117447||Dec 21, 2012||Aug 25, 2015||Apple Inc.||Using event alert text as input to an automated assistant|
|US9131045||Jun 1, 2012||Sep 8, 2015||Ultratec, Inc.||System for text assisted telephony|
|US20050025290 *||Mar 31, 2004||Feb 3, 2005||Eamon Doherty||Telephone interface for a handicapped individual|
|US20050048992 *||Apr 17, 2004||Mar 3, 2005||Alcatel||Multimode voice/screen simultaneous communication device|
|US20050049879 *||Aug 28, 2003||Mar 3, 2005||Alcatel||Communication device capable of interworking between voice communications and text communications|
|US20050069106 *||Sep 30, 2003||Mar 31, 2005||International Business Machines Corporation||Method and apparatus for enabling a privacy feature in a communications network|
|US20050115947 *||Nov 10, 2004||Jun 2, 2005||Dainippon Screen Mfg. Co., Ltd.||Thermal processing apparatus and thermal processing method|
|US20050141680 *||Feb 25, 2005||Jun 30, 2005||Hewlett-Packard Development Company, L.P.||Telephone communication with silent response feature|
|US20050198096 *||Jan 8, 2004||Sep 8, 2005||Cisco Technology, Inc.:||Method and system for managing communication sessions between a text-based and a voice-based client|
|US20060026001 *||Sep 29, 2005||Feb 2, 2006||Communication Service For The Deaf, Inc.||Enhanced communications services for the deaf and hard of hearing cross-reference to related applications|
|US20060080102 *||Oct 13, 2004||Apr 13, 2006||Sumit Roy||Method and system for improving the fidelity of a dialog system|
|US20060140354 *||Feb 24, 2006||Jun 29, 2006||Engelke Robert M||Relay for personal interpreter|
|US20070078656 *||Oct 3, 2005||Apr 5, 2007||Niemeyer Terry W||Server-provided user's voice for instant messaging clients|
|US20110205849 *||Aug 25, 2011||Sony Corporation, A Japanese Corporation||Digital calendar device and methods|
|US20120185251 *||Jul 19, 2012||Hoshiko Llc||Method and system for candidate matching|
|CN102074150A *||Jan 12, 2011||May 25, 2011||无锡工艺职业技术学院||Sentence and speech conversion device for the deaf to communicate with outside world|
|EP1469691A2 *||Feb 13, 2004||Oct 20, 2004||Evolium S.A.S.||Mobile phone system|
|EP1874010A1 *||Feb 12, 2007||Jan 2, 2008||Samsung Electronics Co., Ltd.||Video communication apparatus using VoIP and method of operating the same|
|EP2224427A1||Feb 27, 2009||Sep 1, 2010||Research In Motion Limited||Mobile wireless communications device for hearing and/or speech impaired user|
|WO2001069895A1 *||Mar 12, 2001||Sep 20, 2001||Giuseppe Baldino||Telephonic device for deaf-mutes|
|WO2003019924A1 *||Aug 22, 2002||Mar 6, 2003||Ultratec Inc||System for text assisted telephony|
|WO2006044253A1 *||Oct 7, 2005||Apr 27, 2006||Hewlett Packard Development Co||Method and system for improving the fidelity of a dialog system|
|U.S. Classification||379/52, 379/93.24, 379/93.18, 340/4.13, 340/4.14|
|International Classification||G10L13/00, H04M1/247|
|Cooperative Classification||G10L13/00, H04M1/2475|
|Jun 29, 1998||AS||Assignment|
Owner name: IBM CORPORATION, NEW YORK
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:BRUNET, PETER T.;ITTYCHERIAH, ABRAHAM P.;NARAYANASWANI, CHANDRASEKHAR;AND OTHERS;REEL/FRAME:009290/0064;SIGNING DATES FROM 19980305 TO 19980326
|Dec 19, 2002||FPAY||Fee payment|
Year of fee payment: 4
|Jun 18, 2007||REMI||Maintenance fee reminder mailed|
|Nov 30, 2007||LAPS||Lapse for failure to pay maintenance fees|
|Jan 22, 2008||FP||Expired due to failure to pay maintenance fee|
Effective date: 20071130