US3855418A - Method and apparatus for phonation analysis leading to valid truth/lie decisions by vibratto component assessment

Info

Publication number
US3855418A
Authority
US
United States
Prior art keywords
speech
emotional stress
electrical signal
amplitude
emphasizing
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US00311392A
Inventor
F Fuller
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1972-12-01
Filing date
1972-12-01
Publication date
1974-12-17

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00: Speaker identification or verification
    • G10L17/26: Recognition of special voice characteristics, e.g. for use in lie detectors; Recognition of animal voices

Definitions

  • the modulation is emphasized by rectification, smoothing, and time and amplitude discrimination of the speech wave form and is compared to a selected voltage level to produce a series of uniform pulses, the number of which is indicative of the magnitude of vibratto content.
  • the present invention relates generally to voice signal analysis systems and more specifically to a method and apparatus for detecting emotional stress within a voice pattern. The presence of an emotional state will be used to determine the truthfulness of a response to questions asked by a skilled interrogator.
  • Speech is the acoustic energy response of: (a) the voluntary motions of the vocal cords and the vocal tract, which consists of the throat, the nose, the mouth, the tongue, the lips, and the pharynx, and (b) the resonances of the various openings and cavities of the human head.
  • the primary source of speech energy is excess air under pressure, contained in the lungs. This air pressure is allowed to flow out of the mouth and nose, under muscular control which produces modulation. This flow is controlled or modulated by the human speaker in a variety of ways.
  • the major source of modulation is the vibration of the vocal cords. This vibration produces the major component of the voiced speech sounds, such as those required when pronouncing the vowel sounds in a normal manner. These voiced sounds, formed by the buzzing action of the vocal cords, contrast to the voiceless sounds such as the letter s or the letter f produced by the nose, tongue and lips. This action of voicing is known as phonation.
  • the basic buzz or pitch frequency which establishes phonation is different for men and women.
  • the basic pitch pulses of phonation contain many harmonics and overtones of the fundamental rate, in both men and women.
  • the vocal cords are capable of a variety of shapes and motions. During the process of simple breathing, they are involuntarily held open and during phonation, they are brought together. As air is expelled from the lungs, at the onset of phonation, the cords vibrate back and forth, alternately closing and opening. Current physiological authorities hold that the muscular tension and the effective mass of the cords are varied by learned muscular action. These changes strongly influence the oscillating or vibrating system.
  • phonation is established by or governed by two different structures in the pharynx; the vocal cord muscles and a mucous membrane called the conus elasticus. These two structures are acoustically coupled together at a mutual edge, within the pharynx, and cooperate to produce two different modes of vibration.
  • a pitch cycle begins with a subglottal closure of the conus elasticus.
  • This membrane is forced upward toward the coupled edge of the vocal cord muscle in a wave-like fashion, by air pressure being expelled from the lungs.
  • a small puff of air explosively occurs, giving rise to the open phase of vocal cord motion.
  • the subglottal closure is pulled shut by a suction which results from the aspiration of air through the glottis.
  • Shortly after this, the vocal cord muscles also close.
  • the two masses tend to vibrate in opposite phase. The result is a relatively long closed time alternated with short sharp air pulses which may produce numerous overtones and harmonics.
  • the balance of the respiratory tract and the nasal and cranial cavities give rise to a variety of resonances, known as Formants in the physiology of speech.
  • the lowest frequency formant can be approximately identified with the pharyngeal cavity, resonating as a closed pipe.
  • the second formant arises in the mouth cavity.
  • the third formant is often considered related to the second resonance of the pharyngeal cavity.
  • the modes of the higher order formants are too complex to be very simply identified.
  • the frequencies of the various formants vary greatly with the production of the various voiced sounds.
  • amplitude and frequency variations in the fundamental voiced pitch energy appear to be an acoustic correlate of emotional content, transmitted through speech.
  • Other parameters thought to be related to the emotional transmission of information include: Phonetic Content, Gross Changes in Fundamental Frequency, Relative Energy Levels in Various Frequency Bands, and the Speech Envelope Amplitude. These parameters all contribute to the conveyance of emotion or a stressful condition existing in the speaker.
  • Speech analysis and the equipment for accomplishing the same have been developed for a variety of loosely related purposes.
  • One of the primary concerns is the transmission of speech with a high order of intelligibility and presence over a very reduced bandwidth.
  • the applicability of this particular art becomes obvious in civil and military communications.
  • Other fields in which speech analysis equipment is used are the voice operated printing or recording device, such as a typewriter, and systems, equipment and devices that command and control the spoken word or phrase. While these activities are interesting and valuable in themselves, they do not relate to the detection of emotional content of a speech wave nor to its use to determine the veracity of the speaker.
  • the amplitude variations of the basic phonation may be assessed and quantified by measurement of the amount of rapid aperiodic amplitude modulation on the speech signal envelope of a spoken word.
  • This rapid variation of the amplitude of the speech signal envelope is called Vibratto for the purposes of this invention.
  • This invention discloses a means whereby the measure of vibratto in the speech envelope of a person under interrogation may be meaningfully quantified in real time, so that a Truth/Lie decision can be made.
  • Research into the vibratto component of the speech wave has conclusively demonstrated that the amount of vibratto correlates well with stress or emotional involvement which leads to the Truth/Lie decision.
  • Frequency fluctuation in the basic pitch frequency could be quantified with the aid of a frequency discriminator, for example.
  • variability of time between successive pitch pulses could be obtained by conventional zero crossing analysis.
  • the present vibratto quantification method and apparatus provides means for identifying and selecting speech signal envelope amplitude excursions or modulations in excess of a selected value and means for displaying, counting and recording the number of these amplitude excursions.
  • This speech signal is rectified and the envelope is smoothed.
  • the envelope is time and amplitude discriminated by a differentiator and a DC base line restorer to emphasize the amplitude excursions or vibratto modulation.
  • the resulting pulses are applied to a level detector and then processed into a pulse counter which drives the display to indicate the amount of modulation or vibratto in the speaker's speech.
  • a simple oscillographic recorder readout may be used so that the over-threshold envelope amplitude excursions could be visually counted and recorded in such a manner as to allow comparison between successive responses during interrogation.
  • Other comparison or threshold selection techniques can be employed too.
  • the output pulses of such circuits may be counted in a digital manner or in a simple integrating analog diode circuit.
  • the specific embodiment of the invention includes a comparator circuit with a variable voltage level to be selected after observation by a trained operator. The value selected determines the level beyond which the stressed phonation pulses may be considered statistically significant in making the Truth/Lie decision.
  • the group of pulses is digitally counted for each proper utterance of the person being interrogated.
  • a digital display is employed to indicate the number of pulses that exceeded the selected threshold of the comparator circuit. This digital measure of a proper utterance is available to the interrogator so that he (or she) can intelligently quantify the veracity of the answers to the selected questions during the interrogation process. Statistical data has revealed that this technique allows the Truth/Lie decision to be made with a high degree of accuracy.
  • An additional object of this invention is to detect this emotional or stressful condition while the person who is speaking is under direct and skillful interrogation.
  • a further object of this invention is to provide means whereby a valid Truth/Lie decision can be rendered by direct observations of the data readout of a voice or speech analysis system.
  • a still further object of this invention is to detect the emotional or stressful condition by analysis of the rapid amplitude modulation of the fundamental phonation of the speaker using an electronic signal analysis system.
  • FIG. 1 is an oscillograph of a male voice responding with the word yes in the English language, in answer to a direct question at a bandwidth of 5 kHz.
  • FIG. 2 is an oscillograph of a male voice responding with the word no in the English language, in answer to a direct question at a bandwidth of 5 kHz.
  • FIGS. 3a and 3b are typical graphs of a portion of a yes response with and without emotional stress, respectively.
  • FIGS. 4a and 4b are typical graphs of a portion of a no response with and without emotional stress, respectively.
  • FIG. 5 is a block diagram of the vibratto signal processing circuit.
  • FIG. 6 is a detailed block schematic of the vibratto pulse level selecting and counting circuit with display.
  • FIG. 1 shows an oscillograph of a male voice responding with the word yes in the English language in answer to a direct question at a bandwidth of 5 kHz.
  • the wave form contains two distinct envelopes, the first being for the ye sound and the second being for the harsh s sound. Since the first envelope of the yes signal wave form is a mellower sound being produced primarily by the vocal cords and conus elasticus, this envelope will be processed to detect emotional stress content or modulations.
  • the male voice responding with the word no in the English language at a bandwidth of 5 kHz is shown in FIG. 2. This response has a single envelope which will be analyzed by the present device to detect the presence of rapid modulation of the phonation constituent of the speech signal.
  • FIG. 3a is a drawn replica of a portion of the response yes, delivered under emotional stress.
  • the rapid modulation or vibratto pulses can be seen extending above and below the normal envelope. These additional excursions occur as the result of non-symmetric action between the vocal cords and the conus elasticus.
  • the basic repetition period of this male voice is about 8.3 milliseconds.
  • FIG. 3b is a drawn replica of a portion of a male voice responding yes delivered under conditions of no emotional stress.
  • the smooth regular features of the pitch pulses can be easily seen.
  • FIG. 4a is a drawn replica of a portion of the same male voice responding no under a condition of emotional stress.
  • the vibratto modulations appear as distortions near the axis of averages and as excessively high peaks in the positive direction. This non-regularity is the result of interaction in the pharynx between the vocal cords and the conus elasticus leading to an "explosive" type of formant excitation.
  • FIG. 4b is a drawn replica of a portion of the same male voice answering no to a non-stressful question. The smoothness and regularity of the response can be readily seen.
  • the present invention will emphasize rapid amplitude modulations in excess of a selected level within the envelope in order to distinguish them. After this emphasis, the signal will be analyzed by comparison with a selected voltage level above which the pulses will be counted. It is the registration of the number of pulses which will indicate the presence of emotional stress in the speech of the individual under interrogation.
  • A block diagram of the vibratto signal processing circuit is shown in FIG. 5 as having an acoustical transducer 2 at its input.
  • the acoustical transducer 2 is a microphone type of device which converts the acoustical utterance of the speaker into alternating current energy.
  • a tape recorder 4 may be used as a source of electrical signal energy instead of direct transduction by means of a microphone. In either case, the microphone used to record the information into the tape recorder (or as an input directly into the system) should be able to convert the acoustical utterances into electrical energy with a minimum of frequency and amplitude distortion.
  • Electrical signals representing the speech wave are amplified in operational amplifier 10, which provides linear amplification and isolation of the input from the remainder of the system.
  • the amplified speech signal is then rectified in a unipolar process 14 to provide an electrical signal having only one polarity.
  • the rectified signal is again amplified and isolated from the remainder of the circuit by operational amplifier 24.
  • Electrical signals representing the speech envelope of a single polarity are then smoothed in filter 28,32 by integration.
  • the smoothing filter 28,32 removes the high frequency energy of the phonation and extracts a signal which is representative of the envelope of the speech wave.
  • the smoothed signal is again amplified and isolated from the remainder of the circuit by operational amplifier 38.
  • the smoothed envelope is then differentiated in time and compared with the envelope amplitude in its level (to be determined by resistor 48).
  • the interrogator determines the statistical weight to be given to various amplitude levels of modulation.
  • Voltage comparator 50 produces a series of uniform pulses, one for each pulse it receives that is greater than the voltage level set by resistor 48. These pulses are counted in pulse counter 52 and displayed in a numerical indicator 54.
  • the present invention provides a rapidly observable indication to a trained interrogator of the truthfulness of the subject's response by mere observation of the numerical indicator 54.
  • the interrogator would initially ask the subject a series of questions to which he knows the answers and about which the applicant would not lie. These questions would include "Are you wearing a specific color shirt?" and the response would be yes or no.
  • the interrogator would adjust the voltage level of voltage comparator 50 so that the number appearing on the numerical indicator would be minimal or approximately under 10. It should be noted that the count of a number in response to a yes is different from the count of a number in response to a no.
  • the interrogator may proceed to ask questions for which he is not sure of the answers. Upon monitoring various responses, the interrogator may determine which questions the applicant answered with various degrees of emotional stress. By comparing the number in the numerical indicator 54 with the number determined for truthful responses, the interrogator can determine when emotional stress is present, which would correspond to when the applicant is lying. The number on the numerical indicator for an untruth, or the presence of emotional stress, will normally exceed twice the value that would be recorded for truthful responses to a yes or no.
  • A more detailed schematic of the present invention is shown in FIG. 6.
  • the electrical signal from either the transducer 2 or the tape recorder 4 enters the system at input port 5.
  • An operational amplifier 10 with its gain and performance determining resistors 6, 8 and 12 is used to provide isolation and linear amplification of the input signal.
  • This isolated and amplified signal is conducted to a unipolar processor or diode 14 where one polarity of the signal is allowed to pass into the following circuitry.
  • a diode connected in the opposite polarity 16 could be used equally as well.
  • a full wave rectification or bridge rectification circuit (not shown) could be used as well with a small additional complication of the circuit.
  • the electrical energy out of the diode, at the input of the following circuitry is therefore primarily and predominantly of one polarity.
  • the DC energy return resistance 20 prevents a residual charge from building up on the input of the following circuit.
  • Operational amplifier 24 with its gain and performance determining resistors 18, 22 and 26, is used to isolate the diode circuit from the follow-on circuitry.
  • the follow-on circuitry consists of a smoothing filter in the form of an R/C integrator having a variable resistor 28 and a fixed capacitor 32. It can be seen by those versed in the art that a variety of different active and passive smoothing filters could be used to remove the high frequency energy of the phonation and to extract a signal which is representative of the envelope of the speech wave.
  • the R/C integrator which is used in the present embodiment, functions quite well and is simple to employ. The time constant is variable to afford adjustment for voices of various fundamental frequencies.
  • the R/C integrator is followed by a further operational amplifier 38 with its gain and performance determining resistors 30, 34 and 36. This operational amplifier isolates the processing of the R/C integrator 28 and 32 from the subsequent circuit.
  • Following the isolation amplifier 38 is a special time and amplitude discriminator having a differentiator circuit involving the variable capacitor 42 and the fixed resistor 40. These two components perform the time differentiation function.
  • the potentiometer 41 provides a measure of the undifferentiated signal envelope which is used to null out residual envelope energy. This component, connected as it is, performs the envelope amplitude discrimination function.
  • An operational amplifier 44 with its gain and performance determining resistances 43,45 and 46 accepts the time derivative signal and the amplitude discrimination signal and provides effective base line restoration for most typical types of phonation.
  • Base line restoration can be accomplished in a variety of ways, for example, clamping and DC restoration. Irrespective of the circuit used, the output of the amplifier 44 is a series of varying amplitude pulses that comprise the variable modulation of the phonation which represents the vibratto to be quantified. This circuitry emphasizes the modulation with respect to the normally present phonation.
  • the present invention provides an electronic system to provide digital results.
  • the preferred embodiment of the invention provides a comparator 50 by which the level of significant output pulses may be adjusted by a knowledgeable operator of the equipment.
  • Potentiometer 48 is the control means for this level adjustment. This control is shown to function at either a positive or a negative voltage level. When the polarity of the diode 14 or 16 is selected, the comparator voltage level will take the polarity that selects either excess positive or excess negative peaks.
  • the potentiometer 48 may be set at zero volts, at which time the circuit becomes a conventional zero-crossing detection device. It has been found that the statistical significance of the Truth/Lie decision process will improve if a level away from the baseline is selected for the functioning of the comparator 50.
  • the comparator may be a simple diode circuit or it may be a Schmitt trigger circuit with suitable voltage supplies, passive and active components. However, for simplicity and economy, a differential voltage comparator such as the Motorola MC1710 has been used for the circuit function. When the differential comparator is used, the output of amplifier 44 is brought into the comparator 50 on the signal input lead 49 while the voltage that the input pulses are being differentially compared to is brought into the comparator at lead 51. The output of the comparator is a series of pulses of constant amplitude that are related to the vibratto component in stressed and unstressed phonation.
  • pulses may be counted in a variety of ways. They could be simply recorded on a chart recorder and manually and visually counted. They could also be put into an integrating diode counter and the resultant DC voltage at the output of the counter would be directly proportional to the number of pulses of interest. A digital counter could obviously be used as well.
  • the pulses at the output of the comparator 50 are fed into a digital counter that counts and registers, in decimal digits, the exact number of pulses at its input.
  • the digital counter 52 takes the input pulses in at terminal 53 and registers the count at digital indicator 54.
  • the present device and method provide a readily identifiable numerical indication which will give an interrogator an instantaneous indication of the veracity or the presence of emotional stress in the subject's responses.
  • the invention by electrical analysis of the phonation speech envelope and emphasization of modulation produces a uniform pulse train which can be monitored to provide the data needed to detect the emotional stress.
  • a method to detect emotional stress in the speech of an individual comprising:
  • a method as in claim 2 including the step of rectifying said electrical signal before smoothing and wherein said smoothing comprises integrating said rectified electrical signal.
  • a device for measuring the emotional stress produced variations in a speech sound comprising:
  • a device as in claim 4 wherein said emphasizing means includes an integrating means and a differentiating means connected in series.
  • a device as in claim 5 wherein said emphasizing means includes a rectifying means connected to the input of said integrating means.
  • a device as in claim 6 including amplifiers connected between said converting means and said rectifying means, between said rectifying means and said integrating means, and between said integrating means and said differentiating means.
  • said differentiating means produces a series of varying amplitude pulses and said detecting means includes a voltage comparing means for producing a series of uniform amplitude pulses for each varying amplitude pulse above a predetermined level.
  • said indicating means includes a counting means for counting the uniform amplitude pulses, whereby the number of uniform amplitude pulses indicates the degree of emotional stress produced variations present.
  • a device for determining emotional stress by speech wave analysis comprising:
  • a device as in claim 10 wherein said emphasizing means comprises a differentiator means and a baseline restoration means for producing a varying amplitude pulse train.
  • said detecting means comprises a voltage comparator means for producing a uniform amplitude pulse for each varying amplitude pulse above said predetermined level.
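The interrogation procedure described in the list above (control questions with known answers, separate baselines for yes and no, and a count that normally exceeds twice the truthful value under stress) can be sketched as follows. This is only an illustration: the function names, the averaging of control counts, and the sample counts are assumptions and are not taken from the patent. The result flags emotional stress as the patent defines it, nothing more.

```python
def baseline_counts(control_responses):
    """Average the pulse counts of control answers, separately for each word.

    control_responses is a list of (word, pulse_count) pairs taken from
    questions whose truthful answers are already known.
    Averaging is an assumption made for this sketch.
    """
    grouped = {}
    for word, count in control_responses:
        grouped.setdefault(word, []).append(count)
    return {word: sum(counts) / len(counts) for word, counts in grouped.items()}


def shows_stress(word, count, baselines, ratio=2.0):
    """Return True when a count exceeds `ratio` times the baseline for that word."""
    return count > ratio * baselines[word]


# Hypothetical counts read from the numerical indicator during control questions.
controls = [("yes", 4), ("yes", 6), ("no", 7), ("no", 9)]
baselines = baseline_counts(controls)

print(shows_stress("yes", 12, baselines))  # 12 compared against 2 x the yes baseline
print(shows_stress("no", 12, baselines))   # 12 compared against 2 x the no baseline
```

Keeping one baseline per word follows the remark that the count for a yes differs from the count for a no.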

Abstract

A method and apparatus for indicating emotional stress in speech by detecting the presence of vibratto or rapid modulation of the phonation constituent within the speech signal envelope. The modulation is emphasized by rectification, smoothing, and time and amplitude discrimination of the speech wave form and is compared to a selected voltage level to produce a series of uniform pulses, the number of which is indicative of the magnitude of vibratto content.
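The chain summarized in the abstract (rectification, smoothing, time discrimination, comparison with a selected level, pulse counting) can be imitated in software. The sketch below is a digital stand-in for the analog circuit, not the patented circuit itself. The 12 kHz sample rate, the 10 ms time constant, the 0.85 threshold, and the synthetic test signal with its `sin(cycle * cycle)` amplitude wobble are all assumptions chosen for illustration. The envelope-nulling potentiometer and the base line restorer are omitted.

```python
import math


def synth_response(stress_depth, sample_rate=12000, pitch_hz=120.0, duration_s=0.5):
    """Synthetic voiced response: a pitch tone under a slow rise/fall envelope.

    stress_depth scales an irregular cycle-to-cycle amplitude change that
    stands in for rapid modulation; 0.0 gives perfectly regular pitch pulses.
    """
    samples = []
    for n in range(int(duration_s * sample_rate)):
        t = n / sample_rate
        slow = min(1.0, t / 0.1, (duration_s - t) / 0.1)        # 100 ms rise and fall
        cycle = int(t * pitch_hz + 1e-9)                        # index of the pitch cycle
        wobble = 1.0 + stress_depth * math.sin(cycle * cycle)   # deterministic, irregular
        samples.append(slow * wobble * math.sin(2.0 * math.pi * pitch_hz * t))
    return samples


def count_vibratto_pulses(samples, sample_rate, threshold, smoothing_tau=0.010):
    """Rectify, smooth, differentiate and threshold; return the pulse count."""
    dt = 1.0 / sample_rate
    alpha = dt / (smoothing_tau + dt)      # coefficient of a discrete RC low-pass
    envelope = 0.0
    above = False
    pulses = 0
    for x in samples:
        rectified = max(x, 0.0)                                  # keep one polarity
        previous = envelope
        envelope += alpha * (rectified - envelope)               # RC smoothing
        emphasized = smoothing_tau * (envelope - previous) / dt  # scaled time derivative
        if emphasized > threshold and not above:
            pulses += 1                                          # one uniform pulse per crossing
        above = emphasized > threshold
    return pulses


calm = count_vibratto_pulses(synth_response(0.0), 12000, threshold=0.85)
stressed = count_vibratto_pulses(synth_response(0.5), 12000, threshold=0.85)
print("regular signal  :", calm)
print("perturbed signal:", stressed)
```

With these hypothetical settings the regular signal stays below the threshold, while the perturbed signal produces pulses on the cycles whose peaks are unusually high. Lowering the threshold toward zero makes the counter respond to ordinary pitch pulses as well.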

Description

United States Patent
Fuller
Dec. 17, 1974

Inventor: Fred Fuller, 4450 Park St., Chevy Chase, Md. 20014
[22] Filed: Dec. 1, 1972
[21] Appl. No.: 311,392
[52] U.S. Cl.: 179/1 SA, 179/1 SP
[51] Int. Cl.: G10l 1/04
[58] Field of Search: 179/1 SA, 1 SB, 1 VS, 15.55 R, 179N555, 1 SP; 128/206; 35/21
Primary Examiner: David L. Stewart
Attorney, Agent, or Firm: Fidelman, Wolffe, Leitner & Hiney

[56] References Cited
UNITED STATES PATENTS
3,268,661 8/1966 COUllel 179/1 SA
3,346,694 10/1967 Brady 179/1 SA
3,387,090 6/1968 Bridges 179/1 SA
3,549,806 12/1970 Wood 179/1 SA
3,592.96) 7/1971 Yoshino 179/1 SA
3,688,126 8/1972 Klein 179/1 SA

OTHER PUBLICATIONS
Philip Lieberman, Perturbations in Vocal Pitch, J.A.S.A. Vol. 33, 5/1961, p. 597-603.
Philip Lieberman, Some Acoustic Correlates of Word Stress in American English, J.A.S.A. Vol. 32, April 1960, p. 451-454.

13 Claims, 8 Drawing Figures

[Drawing sheets 1 to 3, figures not reproduced. Recoverable labels: AXIS OF AVERAGES on the waveform plots (FIG. 2, FIG. 4a); block diagram blocks AMPLIFIER, UNIPOLAR PROCESSOR, AMPLIFIER, SMOOTHING FILTER, AMPLIFIER, TIME AND AMPLITUDE DISCRIMINATOR, VOLTAGE COMPARATOR, PULSE COUNTER, NUMERICAL INDICATOR.]

METHOD AND APPARATUS FOR PHONATION ANALYSIS LEADING TO VALID TRUTH/LIE DECISIONS BY VIBRATTO COMPONENT ASSESSMENT

BACKGROUND OF THE INVENTION
The present invention relates generally to voice signal analysis systems and more specifically to a method and apparatus for detecting emotional stress within a voice pattern. The presence of an emotional state will be used to determine the truthfulness of a response to questions asked by a skilled interrogator.
DESCRIPTION OF THE PRIOR ART
It has long been known that the voice may be, and often is, used to convey the emotions of the speaker. The emotional state of the speaker produces readily observable variation in the measurable parameters of the voice.
Speech is the acoustic energy response of: (a) the voluntary motions of the vocal cords and the vocal tract, which consists of the throat, the nose, the mouth, the tongue, the lips, and the pharynx, and (b) the resonances of the various openings and cavities of the human head. The primary source of speech energy is excess air under pressure, contained in the lungs. This air pressure is allowed to flow out of the mouth and nose, under muscular control which produces modulation. This flow is controlled or modulated by the human speaker in a variety of ways.
The major source of modulation is the vibration of the vocal cords. This vibration produces the major component of the voiced speech sounds, such as those required when pronouncing the vowel sounds in a normal manner. These voiced sounds, formed by the buzzing action of the vocal cords, contrast to the voiceless sounds such as the letter s or the letter f produced by the nose, tongue and lips. This action of voicing is known as phonation.
The basic buzz or pitch frequency, which establishes phonation, is different for men and women. The vocal cords of a typical adult male vibrate or buzz at a frequency of about 120 Hz, whereas for women, this basic rate is approximately an octave higher, near 250 Hz. The basic pitch pulses of phonation contain many harmonics and overtones of the fundamental rate, in both men and women.
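The pitch figures quoted above translate directly into pitch periods, since the period is the reciprocal of the frequency. The short sketch below works through that arithmetic; the function and constant names are made up for this illustration.

```python
def pitch_period_ms(frequency_hz):
    """Return the duration of one pitch cycle in milliseconds."""
    if frequency_hz <= 0:
        raise ValueError("frequency must be positive")
    return 1000.0 / frequency_hz


MALE_PITCH_HZ = 120.0     # typical adult male figure quoted in the text
FEMALE_PITCH_HZ = 250.0   # typical adult female figure quoted in the text

male_period = pitch_period_ms(MALE_PITCH_HZ)
female_period = pitch_period_ms(FEMALE_PITCH_HZ)
ratio = FEMALE_PITCH_HZ / MALE_PITCH_HZ

print(f"male period   : {male_period:.1f} ms")    # 8.3 ms
print(f"female period : {female_period:.1f} ms")  # 4.0 ms
print(f"pitch ratio   : {ratio:.2f}")             # 2.08, an exact octave would be 2.00
```

The ratio of about 2.08 is why the text can call the female rate approximately an octave higher.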
The vocal cords are capable of a variety of shapes and motions. During the process of simple breathing, they are involuntarily held open and during phonation, they are brought together. As air is expelled from the lungs, at the onset of phonation, the cords vibrate back and forth, alternately closing and opening. Current physiological authorities hold that the muscular tension and the effective mass of the cords are varied by learned muscular action. These changes strongly influence the oscillating or vibrating system.
Certain physiologists consider that phonation is established by or governed by two different structures in the pharynx; the vocal cord muscles and a mucous membrane called the conus elasticus. These two structures are acoustically coupled together at a mutual edge, within the pharynx, and cooperate to produce two different modes of vibration.
In one mode, which seems to be an emotionally stable or non-stressful timbre of voice, the conus elasticus and the vocal cord muscle vibrate as a unit, in synchronism. Phonation in this mode sounds soft or "mellow" and few overtones are present.
In the second mode, a pitch cycle begins with a subglottal closure of the conus elasticus. This membrane is forced upward toward the coupled edge of the vocal cord muscle in a wave-like fashion, by air pressure being expelled from the lungs. When the closure reaches the coupled edge, a small puff of air explosively occurs, giving rise to the open phase of vocal cord motion. After the explosive puff of air has been released, the subglottal closure is pulled shut by a suction which results from the aspiration of air through the glottis. Shortly after this, the vocal cord muscles also close. Thus, in this mode, the two masses tend to vibrate in opposite phase. The result is a relatively long closed time alternated with short sharp air pulses which may produce numerous overtones and harmonics.
The balance of the respiratory tract and the nasal and cranial cavities give rise to a variety of resonances, known as Formants in the physiology of speech. The lowest frequency formant can be approximately identified with the pharyngeal cavity, resonating as a closed pipe. The second formant arises in the mouth cavity. The third formant is often considered related to the second resonance of the pharyngeal cavity. The modes of the higher order formants are too complex to be very simply identified. The frequencies of the various formants vary greatly with the production of the various voiced sounds.
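The closed-pipe picture above can be made concrete with the textbook quarter-wave relation for an ideal uniform pipe closed at one end, f_n = (2n - 1) c / (4 L). The patent gives no dimensions, so the cavity length and the speed of sound below are assumptions chosen only for illustration, and a real pharynx is not an ideal pipe.

```python
SPEED_OF_SOUND_M_S = 343.0   # assumption: air at roughly 20 degrees C
CAVITY_LENGTH_M = 0.09       # hypothetical cavity length, for illustration only


def closed_pipe_resonances(length_m, count, speed_m_s=SPEED_OF_SOUND_M_S):
    """Resonant frequencies (Hz) of an ideal pipe closed at one end.

    Uses the quarter-wave relation f_n = (2n - 1) * c / (4 * L), n = 1, 2, ...
    """
    if length_m <= 0 or count < 1:
        raise ValueError("length must be positive and count at least 1")
    return [(2 * n - 1) * speed_m_s / (4.0 * length_m) for n in range(1, count + 1)]


first, second = closed_pipe_resonances(CAVITY_LENGTH_M, 2)
print(f"first resonance : {first:.0f} Hz")
print(f"second resonance: {second:.0f} Hz")
print(f"ratio           : {second / first:.1f}")
```

Whatever length is assumed, the second resonance of such a pipe is three times the first, so in this idealized model a formant tied to the second resonance of the cavity would sit well above the lowest formant.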
Certain investigators and researchers in the field have determined that amplitude and frequency variations in the fundamental voiced pitch energy (which is often termed the fine structure) appear to be an acoustic correlate of emotional content, transmitted through speech. Other parameters thought to be related to the emotional transmission of information include: Phonetic Content, Gross Changes in Fundamental Frequency, Relative Energy Levels in Various Frequency Bands, and the Speech Envelope Amplitude. These parameters all contribute to the conveyance of emotion or a stressful condition existing in the speaker.
Speech analysis and the equipment for accomplishing the same has been developed for a variety of loosely related purposes. One of the primary concerns is the transmission of speech with a high order of intelligibility and presence over a very reduced bandwidth. The applicability of this particular art becomes obvious in civil and military communications. Other fields in which speech analysis equipment is used are the voice operated printing or recording device, such as a typewriter, and systems, equipment and devices that command and control the spoken word or phrase. While these activities are interesting and valuable in themselves, they do not relate to the detection of emotional content of a speech wave nor to its use to determine the veracity of the speaker.
According to the present invention, the amplitude variations of the basic phonation may be assessed and quantified by measurement of the amount of rapid aperiodic amplitude modulation on the speech signal envelope of a spoken word. This rapid variation of the amplitude of the speech signal envelope is called Vibratto for the purposes of this invention.
This invention discloses a means whereby the measure of vibratto in the speech envelope of a person under interrogation may be meaningfully quantified in real time, so that a Truth/Lie decision can be made. Research into the vibratto component of the speech wave has conclusively demonstrated that the amount of vibratto correlates well with stress or emotional involvement which leads to the Truth/Lie decision.
There are many ways to detect and measure the amount of vibratto in the phonation of an emotionally involved person under interrogation. Frequency fluctuation in the basic pitch frequency could be quantified with the aid of a frequency discriminator, for example. In addition, variability of time between successive pitch pulses could be obtained by conventional zero crossing analysis.
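The zero crossing alternative mentioned above can be sketched digitally as follows. This is a minimal illustration only, assuming a sampled signal; the function names, the 8 kHz sampling rate and the 120 Hz test tone are hypothetical and do not come from the disclosure.

```python
import math
import statistics

def zero_crossing_times(samples, sample_rate):
    """Return interpolated times (s) of negative-to-positive zero crossings."""
    times = []
    for i in range(1, len(samples)):
        prev, cur = samples[i - 1], samples[i]
        if prev < 0.0 <= cur:
            frac = -prev / (cur - prev)  # linear interpolation between samples
            times.append((i - 1 + frac) / sample_rate)
    return times

def period_variability(samples, sample_rate):
    """Return (mean period, standard deviation of the period) in seconds."""
    times = zero_crossing_times(samples, sample_rate)
    periods = [b - a for a, b in zip(times, times[1:])]
    return statistics.mean(periods), statistics.pstdev(periods)

# Hypothetical test tone: a steady 120 Hz sine sampled at 8 kHz for 0.25 s.
RATE = 8000
tone = [math.sin(2 * math.pi * 120 * n / RATE + 0.1) for n in range(RATE // 4)]
mean_period, spread = period_variability(tone, RATE)
```

For a perfectly steady tone the spread is close to zero; a larger spread between successive pitch pulses would be the quantity of interest under this alternative.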
SUMMARY OF THE INVENTION
The present vibratto quantification method and apparatus provides means for identifying and selecting speech signal envelope amplitude excursions or modulations in excess of a selected value and means for displaying, counting and recording the number of these amplitude excursions. The speech signal is rectified and the envelope is smoothed. The envelope is time and amplitude discriminated by a differentiator and a DC base line restorer to emphasize the amplitude excursions or vibratto modulation. The resulting pulses are applied to a level detector and then processed into a pulse counter which drives the display to indicate the amount of modulation or vibratto in the speaker's speech.
A simple oscillographic recorder readout may be used so that the over-threshold envelope amplitude excursions could be visually counted and recorded in such a manner as to allow comparison between successive responses during interrogation. Other comparison or threshold selection techniques can be employed too. The output pulses of such circuits may be counted in a digital manner or in a simple integrating analog diode circuit. The specific embodiment of the invention includes a comparator circuit with a variable voltage level to be selected after observation by a trained operator. The value selected determines the level beyond which the stressed phonation pulses may be considered statistically significant in making the Truth/Lie decision. The group of pulses is digitally counted for each proper utterance of the person being interrogated. A digital display is employed to indicate the number of pulses that exceeded the selected threshold of the comparator circuit. This digital measure of a proper utterance is available to the interrogator so that he (or she) can intelligently quantify the veracity of the answers to the selected questions during the interrogation process. Statistical data has revealed that this technique allows the Truth/Lie decision to be made with a high degree of accuracy.
OBJECTS OF THE INVENTION
It is an object of the present invention to provide a means for detecting a stressful or emotional condition in a human being who is speaking.
An additional object of this invention is to detect this emotional or stressful condition while the person who is speaking is under direct and skillful interrogation.
A further object of this invention is to provide means whereby a valid Truth/Lie decision can be rendered by direct observations of the data readout of a voice or speech analysis system.
A still further object of this invention is to detect the emotional or stressful condition by analysis of the rapid amplitude modulation of the fundamental phonation of the speaker using an electronic signal analysis system.
Other objects, advantages and novel features of the present invention will become apparent from the following detailed description of the invention when considered in conjunction with the accompanying drawings in which:
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is an oscillograph of a male voice responding with the word "yes" in the English language, in answer to a direct question at a bandwidth of 5 kHz.
FIG. 2 is an oscillograph of a male voice responding with the word "no" in the English language, in answer to a direct question at a bandwidth of 5 kHz.
FIGS. 3a and 3b are typical graphs of a portion of a "yes" response with and without emotional stress, respectively.
FIGS. 4a and 4b are typical graphs of a portion of a "no" response with and without emotional stress, respectively.
FIG. 5 is a block diagram of the vibratto signal processing circuit.
FIG. 6 is a detailed block schematic of the vibratto pulse level selecting and counting circuit with display.
DESCRIPTION OF PREFERRED EMBODIMENTS
FIG. 1 shows an oscillograph of a male voice responding with the word "yes" in the English language in answer to a direct question at a bandwidth of 5 kHz. The wave form contains two distinct envelopes, the first being for the "ye" sound and the second being for the harsh "s" sound. Since the first envelope of the "yes" signal wave form is a mellower sound being produced primarily by the vocal cords and conus elasticus, this envelope will be processed to detect emotional stress content or modulations. The male voice responding with the word "no" in the English language at a bandwidth of 5 kHz is shown in FIG. 2. This response has a single envelope which will be analyzed by the present device to detect the presence of rapid modulation of the phonation constituent of the speech signal.
FIG. 3a is a drawn replica of a portion of the response "yes," delivered under emotional stress. The rapid modulation or vibratto pulses can be seen extending above and below the normal envelope. These additional excursions occur as the result of non-symmetric action between the vocal cords and the conus elasticus. The basic repetition period of this male voice is about 8.3 milliseconds.
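As a worked check on the figure quoted above, a repetition period of about 8.3 milliseconds corresponds to a fundamental pitch frequency of roughly 120 Hz. The symbols T_0 (pitch period) and f_0 (fundamental frequency) are introduced here for illustration only.

```latex
f_0 = \frac{1}{T_0} = \frac{1}{8.3 \times 10^{-3}\,\mathrm{s}} \approx 120\ \mathrm{Hz}
```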
FIG. 3b is a drawn replica of a portion of a male voice responding yes delivered under conditions of no emotional stress. The smooth regular features of the pitch pulses can be easily seen.
FIG. 4a is a drawn replica of a portion of the same male voice responding "no" under a condition of emotional stress. The vibratto modulations appear as distortions near the axis of averages and as excessively high peaks in the positive direction. This non-regularity is the result of interaction in the pharynx between the vocal cords and the conus elasticus leading to an "explosive" type of formant excitation.
FIG. 4b is a drawn replica of a portion of the same male voice answering no to a non-stressful question. The smoothness and regularity of the response can be readily seen.
Thus, it is an object of the present invention to isolate the rapid modulation of the phonation constituent of the speech signal envelope in order to detect the presence of emotional stress in the speaker.
The present invention will emphasize the rapid modulations whose amplitude exceeds a selected level within the envelope in order to distinguish them. After this emphasis, the signal will be analyzed by comparison with a selected voltage level above which the pulses will be counted. It is the registration of the number of pulses which will indicate the presence of emotional stress in the speech of the individual under interrogation.
Experimentation with the present invention has shown that the difference in count between a nonemotional response and an emotional response is readily evident. Though the count varies for degree of emotional stress and for various individuals, the number of pulses counted in the emotional state is usually greater than twice the number of pulses counted where no emotional stress is present. It is this type of comparison that will present the interrogator with an instantaneous and readily observable deviation which can be correlated with the questions asked to determine the Truth/Lie of the subject.
A block diagram of the vibratto signal processing circuit is shown in FIG. 5 as having an acoustical transducer 2 at its input. The acoustical transducer 2 is a microphone type of device which converts the acoustical utterance of the speaker into alternating current energy. As shown in FIG. 6, a tape recorder 4 may be used as a source of electrical signal energy instead of direct transduction by means of a microphone. In either case, the microphone used to record the information into the tape recorder (or as an input directly into the system) should have the property to transfer the acoustical utterances into electrical energy with a minimum of frequency and amplitude distortion.
Electrical signals representing the speech wave are amplified in operational amplifier 10, which provides linear amplification and isolation of the input from the remainder of the system. The amplified speech signal is then rectified in a unipolar processor 14 to provide an electrical signal having only one polarity. The rectified signal is again amplified and isolated from the remainder of the circuit by operational amplifier 24. The single-polarity electrical signal representing the speech envelope is then smoothed in filter 28, 32 by integration. The smoothing filter 28, 32 removes the high frequency energy of the phonation and extracts a signal which is representative of the envelope of the speech wave. The smoothed signal is again amplified and isolated from the remainder of the circuit by operational amplifier 38. The smoothed envelope is then differentiated in time and discriminated in amplitude, and the resulting pulses are compared with a selected voltage level (determined by resistor 48). By proper selection of the variable voltage level by the interrogator, analysis of the speech wave can be adapted to the specific person being interrogated. By varying the voltage level for comparator 50, the interrogator determines the statistical weight to be given to various amplitude levels of modulation. Voltage comparator 50 produces a series of uniform pulses indicative of the number of pulses it has received which are greater than the voltage level set by resistor 48. These pulses are counted in pulse counter 52 and displayed in a numerical indicator 54.
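The signal chain described above (rectification, smoothing, differentiation, comparison and counting) might be approximated in software as sketched below. This is a digital analogy to the analog circuit, not the disclosed apparatus: the function name, the time constant, the threshold and the synthetic burst input are all assumptions made for illustration, and the amplifier stages and base line restoration are omitted.

```python
def count_vibratto_pulses(samples, sample_rate, tau, threshold):
    """Count upward threshold crossings of the differentiated envelope.

    tau       -- smoothing time constant in seconds (assumed value)
    threshold -- comparator level in envelope units per second (assumed value)
    """
    dt = 1.0 / sample_rate
    alpha = dt / (tau + dt)          # discrete first-order RC smoothing factor
    envelope = 0.0
    above = False
    count = 0
    for x in samples:
        rectified = max(x, 0.0)      # half-wave rectification (one polarity)
        previous = envelope
        envelope += alpha * (rectified - envelope)   # RC-style smoothing
        slope = (envelope - previous) / dt           # time differentiation
        if slope > threshold and not above:
            count += 1               # one uniform pulse per upward crossing
        above = slope > threshold
    return count

# Synthetic input (not speech): three 10 ms bursts separated by 100 ms of silence.
RATE = 8000
burst = [1.0] * 80 + [0.0] * 800
pulses = count_vibratto_pulses(burst * 3, RATE, tau=0.004, threshold=50.0)
```

With these assumed settings each burst produces exactly one counted pulse, so `pulses` is 3. Raising `threshold` plays the role of adjusting resistor 48: smaller envelope excursions stop being counted.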
With the brief description of the block diagram of the present invention, it is obvious that the present invention provides a rapidly observable indication to a trained interrogator of the truthfulness of the subject's response by mere observation of the numerical indicator 54. The interrogator would initially ask the subject a series of questions for which he knows the answers and about which the applicant would not lie. These questions would include "are you wearing a specific color shirt?" and the response would be "yes" or "no." After observing the number on numerical indicator 54, the interrogator would adjust the voltage level of voltage comparator 50 so that the number appearing on the numerical indicator would be minimal or approximately under 10. It should be noted that the count of a number in response to a "yes" is different from the count of a number in response to a "no." Once the initial adjustment of the system has been accomplished, the interrogator may proceed to ask questions for which he is not sure of the answers. Upon monitoring various responses, the interrogator may determine which questions the applicant answered with various degrees of emotional stress. By comparing the number in the numerical indicator 54 with the number determined for truthful responses, the interrogator can determine when emotional stress is present which would correspond to when the applicant is lying. The number on the numerical indicator for an untruth, or presence of an emotional stress, will normally exceed twice the value that would be recorded for truthful responses to a "yes" or "no."
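The comparison step in the procedure above can be sketched as a simple rule. The function name and the counts are hypothetical, invented purely for illustration; the factor of 2.0 mirrors the "twice the value" guideline in the text and is not a validated decision threshold.

```python
from statistics import mean

def flag_stressed_responses(baseline_counts, test_counts, ratio=2.0):
    """Flag each test count that exceeds `ratio` times the baseline mean.

    baseline_counts -- counts from control questions with known answers
    test_counts     -- counts from the questions under examination
    ratio           -- 2.0 mirrors the "twice the value" guideline in the text
    """
    reference = mean(baseline_counts)
    return [count > ratio * reference for count in test_counts]

# Hypothetical counts for one response word, invented for illustration.
baseline = [6, 8, 7]
flags = flag_stressed_responses(baseline, [7, 19, 12])
```

Because the text notes that the count for a "yes" differs from the count for a "no," a separate baseline would presumably be kept for each response word.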
A more detailed schematic of the present invention is shown in FIG. 6. As described in reference to FIG. 5, the electrical signal from either the transducer 2 or the tape recorder 4 enters the system at input port 5. An operational amplifier 10 with its gain and performance determining resistors 6, 8 and 12 is used to provide isolation and linear amplification of the input signal.
This isolated and amplified signal, at the output of the operational amplifier 10, is conducted to a unipolar processor or diode 14 where one polarity of the signal is allowed to pass into the following circuitry. A diode connected in the opposite polarity 16 could be used equally as well. A full wave rectification or bridge rectification circuit (not shown) could be used as well with a small additional complication of the circuit. The electrical energy out of the diode, at the input of the following circuitry is therefore primarily and predominantly of one polarity. The DC energy return resistance 20 prevents a residual charge from building up on the input of the following circuit.
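The rectifier options named above can be compared with a short sketch. It assumes ideal diodes with no forward voltage drop, and the function names and sample values are illustrative only.

```python
def half_wave(samples, positive=True):
    """Pass one polarity only, like diode 14 (or the reversed diode 16)."""
    if positive:
        return [x if x > 0.0 else 0.0 for x in samples]
    return [x if x < 0.0 else 0.0 for x in samples]

def full_wave(samples):
    """Fold both polarities onto one side, like a bridge rectifier."""
    return [abs(x) for x in samples]

# Hypothetical sample values.
wave = [0.5, -1.0, 0.25, -0.75]
positive_only = half_wave(wave)
negative_only = half_wave(wave, positive=False)
folded = full_wave(wave)
```

Full wave rectification keeps the energy of both half cycles, which is the benefit gained for the small additional complication mentioned in the text.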
Operational amplifier 24 with its gain and performance determining resistors 18, 22 and 26, is used to isolate the diode circuit from the follow-on circuitry. The follow-on circuitry consists of a smoothing filter in the form of an R/C integrator having a variable resistor 28 and a fixed capacitor 32. It can be seen by those versed in the art that a variety of different active and passive smoothing filters could be used to remove the high frequency energy of the phonation and to extract a signal which is representative of the envelope of the speech wave. The R/C integrator, which is used in the present embodiment, functions quite well and is simple to employ. The time constant is variable to afford adjustment for voices of various fundamental frequencies.
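The effect of the adjustable time constant can be illustrated with the standard first-order RC relations. The component values below (40 kilohms and 0.1 microfarad) are hypothetical, since the disclosure gives no values for resistor 28 or capacitor 32, and the discrete filter is only an approximation of the analog integrator.

```python
import math

def rc_cutoff_hz(resistance_ohms, capacitance_farads):
    """Cutoff frequency (-3 dB) of a first-order RC low-pass filter."""
    return 1.0 / (2.0 * math.pi * resistance_ohms * capacitance_farads)

def rc_smooth(samples, sample_rate, tau):
    """Discrete approximation of an RC integrator with time constant tau (s)."""
    dt = 1.0 / sample_rate
    alpha = dt / (tau + dt)
    out, y = [], 0.0
    for x in samples:
        y += alpha * (x - y)
        out.append(y)
    return out

# Hypothetical component values: 40 kilohms and 0.1 microfarad give tau = 4 ms.
tau = 40e3 * 0.1e-6
cutoff = rc_cutoff_hz(40e3, 0.1e-6)          # roughly 40 Hz
step = rc_smooth([1.0] * 8000, 8000, tau)    # response to a unit step
```

In this sketch the step response reaches about 63 percent of its final value after one time constant (32 samples at 8 kHz). Increasing the resistance lengthens the time constant and lowers the cutoff, which is one way to picture the adjustment for voices of lower fundamental frequency.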
The R/C integrator is followed by a further operational amplifier 38 with its gain and performance determining resistors 30, 34 and 36. This operational amplifier isolates the processing of the R/C integrator 28 and 32 from the subsequent circuit.
Following the isolation amplifier 38 is a special time and amplitude discriminator having a differentiator circuit involving the variable capacitor 42 and the fixed resistor 40. These two components perform the time differentiation function. The potentiometer 41 provides a measure of the undifferentiated signal envelope which is used to null out residual envelope energy. This component, connected as it is, performs the envelope amplitude discrimination function. An operational amplifier 44 with its gain and performance determining resistances 43, 45 and 46 accepts the time derivative signal and the amplitude discrimination signal and provides effective base line restoration for most typical types of phonation.
Base line restoration can be accomplished in a variety of ways, for example, clamping and DC restoration. Irrespective of the circuit used, the output of the amplifier 44 is a series of varying amplitude pulses that comprise the variable modulation of the phonation which represents the vibratto to be quantified. This circuitry emphasizes the modulation with respect to the normally present phonation.
Statistical analysis of the series of output pulses at the output of amplifier 44 employing manual means, has been used to derive the validity of Truth/Lie decision assessment of the vibratto quantification. The present invention provides an electronic system to provide digital results.
The preferred embodiment of the invention provides a comparator 50 by which the level of significant output pulses may be adjusted by a knowledgeable operator of the equipment. Potentiometer 48 is the control means for this level adjustment. This control is shown to function at either a positive or a negative voltage level. When the polarity of the diode 14 or 16 is selected, the comparator voltage level will become of the polarity that will select either excess positive or excess negative peaks. The potentiometer 48 may be set at 0 volts, at which time the circuit becomes a conventional zero-crossing detection device. It has been found that the statistical significance of the Truth/Lie decision process will improve if a level away from the baseline is selected for the functioning of the comparator 50. The comparator may be a simple diode circuit or it may be a Schmitt trigger circuit with suitable voltage supplies, passive and active components. However, for simplicity and economy, a differential voltage comparator such as the Motorola MC1710 has been used for the circuit function. When the differential comparator is used, the output of amplifier 44 is brought into the comparator 50 on the signal input lead 49 while the voltage that the input pulses are being differentially compared to is brought into the comparator at lead 51. The output of the comparator is a series of pulses of constant amplitude that are related to the vibratto component in stressed and unstressed phonation.
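The comparator alternatives named above can be contrasted in a short sketch. Both functions are software analogies with hypothetical names and input values; they do not model the MC1710 or any particular circuit.

```python
def comparator_pulses(signal, level):
    """Count upward crossings of `level`; level 0.0 gives zero-crossing detection."""
    count = 0
    for prev, cur in zip(signal, signal[1:]):
        if prev <= level < cur:
            count += 1
    return count

def schmitt_pulses(signal, low, high):
    """Count pulses with hysteresis: fire above `high`, re-arm below `low`."""
    armed, count = True, 0
    for x in signal:
        if armed and x > high:
            count += 1
            armed = False
        elif not armed and x < low:
            armed = True
    return count

# Hypothetical differentiator output with a small wiggle around the 1.0 level.
trace = [0.0, 1.1, 0.9, 1.2, 0.2, -0.5, 1.5, 0.0]
simple = comparator_pulses(trace, 1.0)
zero_level = comparator_pulses(trace, 0.0)
with_hysteresis = schmitt_pulses(trace, low=0.5, high=1.0)
```

On this trace the simple comparator at a level of 1.0 counts the wiggle as an extra pulse (3 pulses), while the hysteresis version counts 2. Setting the level to 0.0 reduces the simple comparator to zero-crossing detection.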
These pulses may be counted in a variety of ways. They could be simply recorded on a chart recorder and manually and visually counted. They could also be put into an integrating diode counter and the resultant DC voltage at the output of the counter would be directly proportional to the number of pulses of interest. A digital counter could obviously be used as well. In the chosen embodiment of the invention, the number of pulses at the output of the comparator 50 are fed into a digital counter that counts and registers in decimal digits, the exact number of pulses at its input. The digital counter 52 takes the input pulses in at terminal 53 and registers the count at digital indicator 54.
Thus the present device and method provides a readily identifiable numerical indication which will provide an interrogator with instantaneous indication of the veracity or the presence of emotional stress in the subject's responses. The invention by electrical analysis of the phonation speech envelope and emphasization of modulation produces a uniform pulse train which can be monitored to provide the data needed to detect the emotional stress. Although the invention has been described and illustrated in detail, it is to be clearly understood that the same is by way of illustration and example only and is not to be taken by way of limitation, the spirit and scope of the invention being limited only by the terms of the appended claims.
What is claimed:
1. A method to detect emotional stress in the speech of an individual comprising:
converting said speech to an electrical signal;
smoothing said electrical signal to produce an envelope;
isolating any rapid aperiodic amplitude modulations present on said smoothed envelope;
counting the number of said rapid aperiodic modulations; and
indicating the count per utterance which is indicative of emotional stress.
2. A method as in claim 1 wherein said isolating includes:
differentiating said smoothed electrical signal; and
comparing said differentiated signal with a selected voltage level to produce a pulse for each differentiated signal above said selected voltage.
3. A method as in claim 2 including the step of rectifying said electrical signal before smoothing and wherein said smoothing comprises integrating said rectified electrical signal.
4. A device for measuring the emotional stress produced variations in a speech sound comprising:
means for converting speech sounds into electrical signals;
means connected to said converting means for emphasizing an emotional stress produced variation segment of said electrical signals, by time and amplitude discrimination, by integration followed by differentiation and baseline restoration;
means connected to said emphasizing means for detecting said emotional stress produced variation segment; and
means connected to said detecting means for indicating the degree of emotional stress produced variations detected.
5. A device as in claim 4 wherein said emphasizing means includes an integrating means and a differentiating means connected in series.
6. A device as in claim 5 wherein said emphasizing means includes a rectifying means connected to the input of said integrating means.
7. A device as in claim 6 including amplifiers connected between said converting means and said rectifying means, between said rectifying means and said integrating means, and between said integrating means and said differentiating means.
8. A device as in claim 5 wherein said differentiating means produces a series of varying amplitude pulses and said detecting means includes a voltage comparing means for producing a series of uniform amplitude pulses for each varying amplitude pulse above a predetermined level.
9. A device as in claim 8 wherein said indicating means includes a counting means for counting the uniform amplitude pulses, whereby the number of uniform amplitude pulses indicates the degree of emotional stress produced variations present.
10. A device for determining emotional stress by speech wave analysis comprising:
means for producing an electrical signal representative of said speech wave;
means connected to said producing means for amplifying and shaping said electrical signal to form an electrical signal envelope;
means connected to said amplifying and shaping means for emphasizing rapid aperiodic amplitude modulation on said electrical signal envelope;
means connected to said emphasizing means for detecting amplitudes of said emphasized rapid aperiodic amplitude modulation above a predetermined level; and
means connected to said detecting means for indicating the number of detected modulations whereby emotional stress is determined by the value indicated.
11. A device as in claim 10 wherein said emphasizing means comprises a differentiator means and a baseline restoration means for producing a varying amplitude pulse train.
12. A device as in claim 11 wherein said detecting means comprises a voltage comparator means for producing a uniform amplitude pulse for each varying amplitude pulse above said predetermined level.
13. A device as in claim 12 wherein said shaping means includes a rectifying means and an integrating means connected in series.

US00311392A 1972-12-01 1972-12-01 Method and apparatus for phonation analysis leading to valid truth/lie decisions by vibratto component assessment Expired - Lifetime US3855418A (en)


Publications (1)

Publication Number Publication Date
US3855418A true US3855418A (en) 1974-12-17


Cited By (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4276445A (en) * 1979-09-07 1981-06-30 Kay Elemetrics Corp. Speech analysis apparatus
US4292469A (en) * 1979-06-13 1981-09-29 Scott Instruments Company Voice pitch detector and display
US4444199A (en) * 1981-07-21 1984-04-24 William A. Shafer Method and apparatus for monitoring physiological characteristics of a subject
US4675904A (en) * 1983-08-11 1987-06-23 Compusonics, Inc. Method for detecting suicidal predisposition
US5148483A (en) * 1983-08-11 1992-09-15 Silverman Stephen E Method for detecting suicidal predisposition
EP0572531A1 (en) * 1991-02-22 1993-12-08 Seaway Technologies, Inc. Acoustic method and apparatus for identifying human sonic sources
WO1995020216A1 (en) * 1994-01-21 1995-07-27 Wizsoft Inc. Method and apparatus for indicating the emotional state of a person
US5884260A (en) * 1993-04-22 1999-03-16 Leonhard; Frank Uldall Method and system for detecting and generating transient conditions in auditory signals
WO1999031653A1 (en) * 1997-12-16 1999-06-24 Carmel, Avi Apparatus and methods for detecting emotions
US5976081A (en) * 1983-08-11 1999-11-02 Silverman; Stephen E. Method for detecting suicidal predisposition
US6006188A (en) * 1997-03-19 1999-12-21 Dendrite, Inc. Speech signal processing for determining psychological or physiological characteristics using a knowledge base
US20020077825A1 (en) * 2000-08-22 2002-06-20 Silverman Stephen E. Methods and apparatus for evaluating near-term suicidal risk using vocal parameters
US20030182116A1 (en) * 2002-03-25 2003-09-25 Nunally Patrick O?Apos;Neal Audio psychlogical stress indicator alteration method and apparatus
US6719707B1 (en) 2001-06-15 2004-04-13 Nathan Montgomery Apparatus and method for performing musical perception sound analysis on a system
US6724887B1 (en) 2000-01-24 2004-04-20 Verint Systems, Inc. Method and system for analyzing customer communications with a contact center
AU2004200002B2 (en) * 1997-12-16 2006-04-13 Amir Liberman Apparatus and methods for detecting emotions
US7139699B2 (en) 2000-10-06 2006-11-21 Silverman Stephen E Method for analysis of vocal jitter for near-term suicidal risk assessment
US7165033B1 (en) 1999-04-12 2007-01-16 Amir Liberman Apparatus and methods for detecting emotions in the human voice
USRE40634E1 (en) 1996-09-26 2009-02-10 Verint Americas Voice interaction analysis module
US7511606B2 (en) 2005-05-18 2009-03-31 Lojack Operating Company Lp Vehicle locating unit with input voltage protection
US20100060461A1 (en) * 2008-09-08 2010-03-11 Sprague Phillip R Psychophysiological Touch Screen Stress Analyzer
US20100070283A1 (en) * 2007-10-01 2010-03-18 Yumiko Kato Voice emphasizing device and voice emphasizing method
US20100090834A1 (en) * 2008-10-13 2010-04-15 Sandisk Il Ltd. Wearable device for adaptively recording signals
US7869586B2 (en) 2007-03-30 2011-01-11 Eloyalty Corporation Method and system for aggregating and analyzing data relating to a plurality of interactions between a customer and a contact center and generating business process analytics
US7995717B2 (en) 2005-05-18 2011-08-09 Mattersight Corporation Method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
US8023639B2 (en) 2007-03-30 2011-09-20 Mattersight Corporation Method and system determining the complexity of a telephonic communication received by a contact center
US8094790B2 (en) 2005-05-18 2012-01-10 Mattersight Corporation Method and software for training a customer service representative by analysis of a telephonic interaction between a customer and a contact center
US8094803B2 (en) 2005-05-18 2012-01-10 Mattersight Corporation Method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
US8718262B2 (en) 2007-03-30 2014-05-06 Mattersight Corporation Method and system for automatically routing a telephonic communication base on analytic attributes associated with prior telephonic communication
US9083801B2 (en) 2013-03-14 2015-07-14 Mattersight Corporation Methods and system for analyzing multichannel electronic communication data
US10419611B2 (en) 2007-09-28 2019-09-17 Mattersight Corporation System and methods for determining trends in electronic communications

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3268661A (en) * 1962-04-09 1966-08-23 Melpar Inc System for determining consonant formant loci
US3346694A (en) * 1965-06-02 1967-10-10 Bell Telephone Labor Inc Speech level measuring apparatus
US3387090A (en) * 1964-09-11 1968-06-04 Tracor Method and apparatus for displaying speech
US3549806A (en) * 1967-05-05 1970-12-22 Gen Electric Fundamental pitch frequency signal extraction system for complex signals
US3592969A (en) * 1968-07-24 1971-07-13 Matsushita Electric Ind Co Ltd Speech analyzing apparatus
US3688126A (en) * 1971-01-29 1972-08-29 Paul R Klein Sound-operated, yes-no responsive switch

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
Philip Lieberman, Perturbations in Vocal Pitch, J.A.S.A. Vol. 33, May 1961, pp. 597-603. *
Philip Lieberman, Some Acoustic Correlates of Word Stress in American English, J.A.S.A. Vol. 32, April 1960, pp. 451-454. *

Cited By (70)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4292469A (en) * 1979-06-13 1981-09-29 Scott Instruments Company Voice pitch detector and display
US4276445A (en) * 1979-09-07 1981-06-30 Kay Elemetrics Corp. Speech analysis apparatus
US4444199A (en) * 1981-07-21 1984-04-24 William A. Shafer Method and apparatus for monitoring physiological characteristics of a subject
US5976081A (en) * 1983-08-11 1999-11-02 Silverman; Stephen E. Method for detecting suicidal predisposition
US4675904A (en) * 1983-08-11 1987-06-23 Compusonics, Inc. Method for detecting suicidal predisposition
US5148483A (en) * 1983-08-11 1992-09-15 Silverman Stephen E Method for detecting suicidal predisposition
US6591238B1 (en) * 1983-08-11 2003-07-08 Stephen E. Silverman Method for detecting suicidal predisposition
EP0572531A1 (en) * 1991-02-22 1993-12-08 Seaway Technologies, Inc. Acoustic method and apparatus for identifying human sonic sources
EP0572531A4 (en) * 1991-02-22 1995-03-22 Seaway Technologies Inc Acoustic method and apparatus for identifying human sonic sources.
US5884260A (en) * 1993-04-22 1999-03-16 Leonhard; Frank Uldall Method and system for detecting and generating transient conditions in auditory signals
WO1995020216A1 (en) * 1994-01-21 1995-07-27 Wizsoft Inc. Method and apparatus for indicating the emotional state of a person
USRE43255E1 (en) 1996-09-26 2012-03-20 Verint Americas, Inc. Machine learning based upon feedback from contact center analysis
USRE43386E1 (en) 1996-09-26 2012-05-15 Verint Americas, Inc. Communication management system for network-based telephones
USRE43324E1 (en) 1996-09-26 2012-04-24 Verint Americas, Inc. VOIP voice interaction monitor
USRE43183E1 (en) * 1996-09-26 2012-02-14 Verint Americas, Inc. Signal monitoring apparatus analyzing voice communication content
USRE40634E1 (en) 1996-09-26 2009-02-10 Verint Americas Voice interaction analysis module
USRE41608E1 (en) 1996-09-26 2010-08-31 Verint Americas Inc. System and method to acquire audio data packets for recording and analysis
USRE41534E1 (en) 1996-09-26 2010-08-17 Verint Americas Inc. Utilizing spare processing capacity to analyze a call center interaction
US6006188A (en) * 1997-03-19 1999-12-21 Dendrite, Inc. Speech signal processing for determining psychological or physiological characteristics using a knowledge base
US6638217B1 (en) 1997-12-16 2003-10-28 Amir Liberman Apparatus and methods for detecting emotions
AU770410B2 (en) * 1997-12-16 2004-02-19 Amir Liberman Apparatus and methods for detecting emotions
WO1999031653A1 (en) * 1997-12-16 1999-06-24 Carmel, Avi Apparatus and methods for detecting emotions
AU2004200002B2 (en) * 1997-12-16 2006-04-13 Amir Liberman Apparatus and methods for detecting emotions
US7165033B1 (en) 1999-04-12 2007-01-16 Amir Liberman Apparatus and methods for detecting emotions in the human voice
US6724887B1 (en) 2000-01-24 2004-04-20 Verint Systems, Inc. Method and system for analyzing customer communications with a contact center
US20020077825A1 (en) * 2000-08-22 2002-06-20 Silverman Stephen E. Methods and apparatus for evaluating near-term suicidal risk using vocal parameters
US7062443B2 (en) 2000-08-22 2006-06-13 Silverman Stephen E Methods and apparatus for evaluating near-term suicidal risk using vocal parameters
US7139699B2 (en) 2000-10-06 2006-11-21 Silverman Stephen E Method for analysis of vocal jitter for near-term suicidal risk assessment
US7565285B2 (en) 2000-10-06 2009-07-21 Marilyn K. Silverman Detecting near-term suicidal risk utilizing vocal jitter
US6719707B1 (en) 2001-06-15 2004-04-13 Nathan Montgomery Apparatus and method for performing musical perception sound analysis on a system
US7191134B2 (en) * 2002-03-25 2007-03-13 Nunally Patrick O'neal Audio psychological stress indicator alteration method and apparatus
US20030182116A1 (en) * 2002-03-25 2003-09-25 Nunally Patrick O'Neal Audio psychological stress indicator alteration method and apparatus
US10129402B1 (en) 2005-05-18 2018-11-13 Mattersight Corporation Customer satisfaction analysis of caller interaction event data system and methods
US9692894B2 (en) 2005-05-18 2017-06-27 Mattersight Corporation Customer satisfaction system and method based on behavioral assessment data
US10104233B2 (en) 2005-05-18 2018-10-16 Mattersight Corporation Coaching portal and methods based on behavioral assessment data
US9571650B2 (en) 2005-05-18 2017-02-14 Mattersight Corporation Method and system for generating a responsive communication based on behavioral assessment data
US8094790B2 (en) 2005-05-18 2012-01-10 Mattersight Corporation Method and software for training a customer service representative by analysis of a telephonic interaction between a customer and a contact center
US8094803B2 (en) 2005-05-18 2012-01-10 Mattersight Corporation Method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
US8781102B2 (en) 2005-05-18 2014-07-15 Mattersight Corporation Method and system for analyzing a communication by applying a behavioral model thereto
US7995717B2 (en) 2005-05-18 2011-08-09 Mattersight Corporation Method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
US10021248B2 (en) 2005-05-18 2018-07-10 Mattersight Corporation Method and system for analyzing caller interaction event data
US7511606B2 (en) 2005-05-18 2009-03-31 Lojack Operating Company Lp Vehicle locating unit with input voltage protection
US9432511B2 (en) 2005-05-18 2016-08-30 Mattersight Corporation Method and system of searching for communications for playback or analysis
US9357071B2 (en) 2005-05-18 2016-05-31 Mattersight Corporation Method and system for analyzing a communication by applying a behavioral model thereto
US9225841B2 (en) 2005-05-18 2015-12-29 Mattersight Corporation Method and system for selecting and navigating to call examples for playback or analysis
US8594285B2 (en) 2005-05-18 2013-11-26 Mattersight Corporation Method and system for analyzing separated voice data of a telephonic communication between a customer and a contact center by applying a psychological behavioral model thereto
US8023639B2 (en) 2007-03-30 2011-09-20 Mattersight Corporation Method and system determining the complexity of a telephonic communication received by a contact center
US7869586B2 (en) 2007-03-30 2011-01-11 Eloyalty Corporation Method and system for aggregating and analyzing data relating to a plurality of interactions between a customer and a contact center and generating business process analytics
US8891754B2 (en) 2007-03-30 2014-11-18 Mattersight Corporation Method and system for automatically routing a telephonic communication
US8983054B2 (en) 2007-03-30 2015-03-17 Mattersight Corporation Method and system for automatically routing a telephonic communication
US9124701B2 (en) 2007-03-30 2015-09-01 Mattersight Corporation Method and system for automatically routing a telephonic communication
US8718262B2 (en) 2007-03-30 2014-05-06 Mattersight Corporation Method and system for automatically routing a telephonic communication base on analytic attributes associated with prior telephonic communication
US9270826B2 (en) 2007-03-30 2016-02-23 Mattersight Corporation System for automatically routing a communication
US10129394B2 (en) 2007-03-30 2018-11-13 Mattersight Corporation Telephonic communication routing system based on customer satisfaction
US9699307B2 (en) 2007-03-30 2017-07-04 Mattersight Corporation Method and system for automatically routing a telephonic communication
US10601994B2 (en) 2007-09-28 2020-03-24 Mattersight Corporation Methods and systems for determining and displaying business relevance of telephonic communications between customers and a contact center
US10419611B2 (en) 2007-09-28 2019-09-17 Mattersight Corporation System and methods for determining trends in electronic communications
US8311831B2 (en) * 2007-10-01 2012-11-13 Panasonic Corporation Voice emphasizing device and voice emphasizing method
US20100070283A1 (en) * 2007-10-01 2010-03-18 Yumiko Kato Voice emphasizing device and voice emphasizing method
US8264364B2 (en) 2008-09-08 2012-09-11 Phillip Roger Sprague Psychophysiological touch screen stress analyzer
US20100060461A1 (en) * 2008-09-08 2010-03-11 Sprague Phillip R Psychophysiological Touch Screen Stress Analyzer
US8031075B2 (en) 2008-10-13 2011-10-04 Sandisk Il Ltd. Wearable device for adaptively recording signals
US8258964B2 (en) 2008-10-13 2012-09-04 Sandisk Il Ltd. Method and apparatus to adaptively record data
US20100090834A1 (en) * 2008-10-13 2010-04-15 Sandisk Il Ltd. Wearable device for adaptively recording signals
US9191510B2 (en) 2013-03-14 2015-11-17 Mattersight Corporation Methods and system for analyzing multichannel electronic communication data
US9942400B2 (en) 2013-03-14 2018-04-10 Mattersight Corporation System and methods for analyzing multichannel communications including voice data
US9667788B2 (en) 2013-03-14 2017-05-30 Mattersight Corporation Responsive communication system for analyzed multichannel electronic communication
US10194029B2 (en) 2013-03-14 2019-01-29 Mattersight Corporation System and methods for analyzing online forum language
US9407768B2 (en) 2013-03-14 2016-08-02 Mattersight Corporation Methods and system for analyzing multichannel electronic communication data
US9083801B2 (en) 2013-03-14 2015-07-14 Mattersight Corporation Methods and system for analyzing multichannel electronic communication data

Similar Documents

Publication Publication Date Title
US3855418A (en) Method and apparatus for phonation analysis leading to valid truth/lie decisions by vibratto component assessment
US3855416A (en) Method and apparatus for phonation analysis leading to valid truth/lie decisions by fundamental speech-energy weighted vibratto component assessment
US3971034A (en) Physiological response analysis method and apparatus
US6697457B2 (en) Voice messaging system that organizes voice messages based on detected emotion
US6427137B2 (en) System, method and article of manufacture for a voice analysis system that detects nervousness for preventing fraud
Kozhevnikov et al. Speech: Articulation and perception
US6480826B2 (en) System and method for a telephonic emotion detection that provides operator feedback
US3855417A (en) Method and apparatus for phonation analysis lending to valid truth/lie decisions by spectral energy region comparison
EP1222448B1 (en) System, method, and article of manufacture for detecting emotion in voice signals by utilizing statistics for voice signal parameters
Lieberman Some acoustic measures of the fundamental periodicity of normal and pathologic larynges
Ladefoged et al. Loudness, sound pressure, and subglottal pressure in speech
US4862503A (en) Voice parameter extractor using oral airflow
Horii An accelerometric approach to nasality measurement: a preliminary report
US4817155A (en) Method and apparatus for speech analysis
Ramig et al. Acoustic analysis of voice in amyotrophic lateral sclerosis: A longitudinal case study
US4335276A (en) Apparatus for non-invasive measurement and display nasalization in human speech
Pickett et al. Communication of speech sounds by a tactual vocoder
US7191134B2 (en) Audio psychological stress indicator alteration method and apparatus
Sundberg Phonatory vibrations in singers: A critical review
US3245403A (en) System for acoustic detection of pathologic larynges
US3925616A (en) Apparatus for determining the glottal waveform
Alpert Feedback effects of audition and vocal effort on intensity of voice
Hamlet Vocal compensation: An ultrasonic study of vocal fold vibration in normal and nasal vowels
US3387090A (en) Method and apparatus for displaying speech
Badin et al. A model of frication noise source based on data from fricative consonants in vowel context