WO2000077537A1 - Procede et appareil de determination d'une source sonore - Google Patents
Procede et appareil de determination d'une source sonore Download PDFInfo
- Publication number
- WO2000077537A1 WO2000077537A1 PCT/JP2000/003695 JP0003695W WO0077537A1 WO 2000077537 A1 WO2000077537 A1 WO 2000077537A1 JP 0003695 W JP0003695 W JP 0003695W WO 0077537 A1 WO0077537 A1 WO 0077537A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- sound
- sound source
- information
- position information
- processing means
- Prior art date
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/15—Conference systems
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S5/00—Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
- G01S5/18—Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
Definitions
- the present invention relates to a sound source identification apparatus and method for individually identifying each sound source based on image information and acoustic information from a plurality of sound sources.
- a method has been proposed in which sound sources are identified based on acoustic information from each sound collecting microphone by using the same number of sound collecting microphones as the number of sound sources. This is to identify the sound intensity and the position of the sound source, but the frequency information is spread along the azimuth axis, and it is difficult to identify a good sound source. Furthermore, such a method can increase the recognition rate of the sound sources, but the costs are high because each sound source is independent and the number of microphones required is the same as the number of sound sources.
- a first object of the present invention is to use acoustic information and image information to identify the position of a sound source object with higher accuracy. It is an object of the present invention to provide a sound source identification device capable of separating each sound from a mixed sound with high accuracy using position information. Further, as a second object of the present invention, the sound information and the image information are used to identify the position of the sound source object with higher accuracy, and the position information is used to obtain each sound with high accuracy from the mixed sound. It is an object of the present invention to provide an identification method which enables to separate S. Disclosure of the invention
- a sound collecting means including two sound collecting microphones arranged at a predetermined interval with respect to a plurality of sound sources; Both of the imaging means for continuous imaging and the sensing means for detecting an object, and / or both, and a sound source from either or both of the image captured by the imaging means and the directional information of the object detected by the sensing means.
- Image processing means for selecting position information on an object to be obtained; sound processing means for identifying the position of a sound source based on the sound information collected by the sound collecting means and the position information selected by the image processing means; And a control means for controlling the sound collecting means, the imaging means, the sensing means, the image processing means, and the sound processing means.
- the sound processing means includes a direction filter for extracting only sound information at a specific time.
- the sound processing means preferably has a function of selecting approximate position information of the sound source.
- the sensing means preferably detects based on magnetism or infrared light of an object that can be a sound source.
- the object that can be a sound source is provided with a magnetic device.
- the sound source identification device of the present invention when identifying the position of the sound source based on the acoustic information obtained from the sound collecting microphone, the image information captured by the imaging means and the direction information obtained by the sensing means are used.
- the direction of the sound source is narrowed down by referring to the positional information based on the above.
- an object that can be a sound source is specified using the moving image and the direction information of the object, and the sound source separation can be reliably performed using the position information and the acoustic information.
- the sound source identification method of the present invention preferably includes a fifth step of roughly selecting the position information of the sound source based only on the sound information collected in the first step.
- the third step based on the approximate position information selected in the fifth step, the direction of the sound source is narrowed down in advance to select the position information on the object that can be the sound source.
- the fifth step roughly selects a direction of the sound source based on a phase difference and an intensity difference of acoustic information obtained by the two sound collecting microphones.
- the sound source identification method of the present invention is preferably selected based on one or both of the position information color and the shape of the object that can be the sound source in the third step.
- the sound source identification method of the present invention is preferably arranged such that the fourth step selects a preset directional filter based on the position information selected in the third step.
- the sound information from each sound source is extracted, and the position of each sound source is identified.
- the fourth step or the fifth step is based on the acoustic information obtained in the first step, and arbitrarily divided signals in each frequency band as a reference. Select the position of the sound source.
- positional information relating to an object that can be the sound source it may also be configured to select a motion of an object as a reference Les, 0
- the direction can be detected based on magnetism or infrared rays.
- sound information is obtained for a plurality of sound sources by a sound collecting means including two sound collecting microphones, and these sound sources are imaged by an image pickup means. Obtain image information. Furthermore, the direction of the sound source is detected based on magnetism and infrared rays, and direction detection information is obtained.
- the sound processing unit identifies the position of the sound source based on the sound information, for example, based on the phase difference and the intensity difference of each sound information acquired by the sound collecting microphone, the image information obtained by the imaging unit Based on one or both of the direction detection information and the direction detection information, the direction of the sound source is narrowed down by referring to the position information on the object that can be the sound source selected by the image processing means based on, for example, its color, shape, movement, etc.
- the position of the sound source is identified based on the band signal, for example, the harmonic structure. Therefore, it is not necessary to process sound information in all directions for identifying the position of the sound source, and it is possible to more accurately identify the position of the sound source, to reduce the amount of processing information, and to shorten the processing time. .
- two or more sound collecting microphones of the sound collecting means can identify the positions of three or more sound sources, so that it is possible to accurately identify the positions of the sound sources with a simple configuration. .
- the method includes a fifth step of roughly selecting the position information of the sound source based only on the sound information collected in the first step, and the third step is selected by the fifth step. If the direction of the sound source is narrowed down in advance and the position information on the object that can be the sound source is selected based on the approximated position information obtained, the object that can be the sound source based on the image information in the third stage is selected. Since the amount of information to be processed in the selection of the location information on the object is reduced, the processing can be performed easily.
- the fourth step extracts the acoustic information from each sound source by selecting a preset directional filter based on the position information selected in the third step, and identifies the position of each sound source In this case, since a direction filter for extracting acoustic information from a sound source in a certain direction is set in advance, processing for identifying the position of the sound source can be performed smoothly.
- FIG. 1 is a schematic diagram showing a configuration of a first embodiment of a sound source identification device according to the present invention.
- FIG. 2 is a schematic diagram showing an example of an image screen by an imaging means in the sound source identification device of FIG.
- FIG. 3A and 3B are explanatory diagrams for an image screen in the sound source identification device of FIG. 1, wherein FIG. 3A shows schematic directions AO, BO, and CO by sound processing means, and FIG. 1 and C1, and (C) shows position information A3, B3, and C3 of an object that can be a sound source by the image processing means, respectively.
- FIG. 4 is an explanatory diagram showing a distance difference between two sound collecting microphones of the sound collecting means and the sound source in the sound source identification device of FIG.
- FIG. 5 is a graph showing the effect of the directional filter in the sound processing means in the sound source identification device of FIG.
- FIG. 6 is a graph showing extraction of two pieces of sound information from the same sound source by the sound processing means in the sound source identification device of FIG.
- FIG. 7 is an explanatory diagram showing the extraction of sound information from each sound source by the direction filter in the sound processing means in the sound source identification device of FIG.
- FIG. 8 is a flowchart showing an operation method in the sound source identification device of FIG.
- FIG. 9 is a diagram showing a part of a continuous imaging screen by the imaging means in the sound source identification device of FIG.
- FIG. 10 is a graph showing positional information of an object that can be a sound source on various references by the image processing means in the sound source identification device of FIG. BEST MODE FOR CARRYING OUT THE INVENTION
- FIG. 1 shows an embodiment of the sound source identification device of the present invention.
- the sound source identification device 10 includes a sound collection unit 11, an imaging unit 12, an image processing unit 13, a sound processing unit 14, and a control unit 15.
- the sound collecting means 11 captures sounds from a plurality of sound sources (for example, three speakers) with two sound collecting microphones 11a and 11b arranged at a predetermined interval D (see FIG. 1). Processing.
- the arrangement of these sound-collecting microphones is a force that can be determined as appropriate. In the example shown in FIG.
- the imaging means 12 is composed of, for example, a CCD (solid-state imaging device) camera. As shown in FIG. 2, an image including the plurality of sound sources (three speakers A, B, and C) is continuously output. It is for imaging.
- CCD solid-state imaging device
- the image processing means 13 is for selecting position information on an object that can be a sound source based on an image taken by the imaging means 12, for example, a color, shape, or motion in the image. In addition, movement includes vibration.
- the image processing means 13 performs three operations on the image captured by the imaging means 12 based on the color (for example, the color of human skin) and the height.
- the frames A1, B1, and C1 are set for the speakers A, B, and C, respectively, and the center position A of these frames A1, B1, and C1 is set as shown in FIG.
- the horizontal coordinates A 3, B 3, and C 3 of 2, 2, 2, and 2 are used as positional information about the object that can be a sound source. Select.
- the term “object that can be a sound source” is used because it is not always possible to determine whether or not the sound source is based on image recognition alone.
- the image processing means 13 preferably includes, in order to simplify the image processing, the general directions AO, BO of the sound sources selected by the sound processing means 14 as described later before the above-described image processing.
- CO see Fig. 3 (A)
- the above image processing is performed in a state where it is narrowed down to the general directions AO, BO, C0, that is, within the range of these general directions A0, BO, CO.
- position information A3, B3, C3 relating to an object that can be a sound source is selected.
- the sound processing means 14 is a sound source based on the sound information collected by the microphone of the sound collection means 11, for example, based on the sound information and the position information A 3, B 3, C 3 selected by the image processing means 13. It identifies the position.
- the identification of the position of the sound source is performed based on the phase difference and the intensity difference between the sound information of the left and right sound collecting microphones 11a and 11 with respect to the sound information.
- the sound processing means 14 performs the above-described processing over the entire angle range of ⁇ 90 degrees ⁇ ⁇ ⁇ + 90 degrees.
- the processing may be performed at regular intervals, for example, at 0, for example, at intervals of 5 degrees.
- the sound processing means 14 first selects the approximate directions AO, B O, and CO of the sound source based on the left and right sound information from the sound collection means 11. This is the same as the conventional sound source identification, and has an accuracy of about ⁇ 10 degrees.
- the sound processing means 14 outputs the general directions A 0, B O, and C O to the image processing means 13.
- the sound processing means 14 refers to the position information A 3, B 3, C 3 inputted from the image processing means 13, and narrows down the range of the position information A 3, B 3, C 3. In the prone position, that is, near the position information A3, B3, and C3, the position of the sound source is identified again based on the acoustic information.
- the sound processing unit 14 identifies the position of the sound source by selecting an appropriate so-called directional filter for each of the sound sources A, B, and C.
- the directional filter is created as shown in FIG. 5 to extract only the acoustic information at the specific time t0, and is used as a reference table for the direction of the sound source in the auxiliary storage means in the control means 15 (not shown).
- the sound processing means 14 selects an appropriate direction file based on the position information A3, B3, C3 from the image processing means 13 and reads it from the auxiliary storage means. .
- the time t 2 (t 2 t 1 + ⁇ t) after the delay time ⁇ t due to the phase difference with respect to the right acoustic information at a certain time t 1 in by retrieving the audio information of the left, to obtain the acoustic information collected emitted by the respective sound collecting microphone 1 1 a, 1 1 b simultaneously from the sound source.
- a t Can be negative.
- the sound processing means 14 selects a direction filter, and as shown in FIG. Can be obtained.
- the sound processing means 14 uses the position information A 3, B 3, and C 3 to narrow down the direction of the sound source to some extent, so that the entire angle range of 0 ( ⁇ 90 degrees ⁇ 0 ⁇ + 90) It is not necessary to perform the processing for (degree), and it is sufficient to perform the processing for the position information A3, B3, and C3 within a predetermined angle range.
- the control means 15 is composed of, for example, a computer or the like.
- the control means 15 controls the sound collection means 11, the imaging means 12, the image processing means 13 and the sound processing means 14, as described above.
- the preset direction field is stored in an auxiliary storage means (not shown).
- the sound source identification device 10 according to the embodiment of the present invention is configured as described above, and operates as described below according to the flowchart shown in FIG.
- step ST1 the control means 15 controls the sound collecting means 11 so that the sound collecting microphones 11a, 11 of the sound collecting means 11 generate the sound sources A, At the same time that the sounds from B and C are collected, in step ST2, the control means 15 controls the imaging means 12 to continuously capture the image of the sound source.
- step ST3 the control means 15 controls the sound processing means 14 so that the sound processing means 14 can determine the phase difference between the same two sounds of the same sound source obtained by the sound collecting means 11
- the approximate directions AO, BO, and CO of the sound source are selected based on the acoustic information of the intensity difference.
- all harmonic structures with a phase difference are examined and sound source separation is performed. Note that the harmonic structure was used as a reference as an example of the signal in each frequency band that was arbitrarily divided.
- step ST4 the control means 15 controls the image processing means 13 so that the image processing means 13 and the imaging means 12 can output the sound processing means 14 based on the imaging screen.
- position information A3, B3, C3 (see Fig. 3 (C)) about the object that can be a sound source depending on the color, shape, etc. of the image is selected.
- step ST5 the control means 15 controls the sound processing means 14.
- the sound processing means 14 generates the sound sources A, B, C Identify the location.
- the sound processing means 14 selects a directional filter, and extracts only sound information including a specific time delay of the same sound from the same sound source. At this time, the acoustic information of the erroneous other harmonic structure is not processed, so that the error is reduced and the sound source separation rate is increased.
- the sound source identification device 10 not only the sound processing unit 14, the sound information from the force collecting unit 11, but also the image captured by the imaging unit 12.
- the position of the sound source is identified by referring to the position information A 3, B 3, and C 3 of the object that can be the sound source by the image processing means 13, so that only the sound information from the conventional sound collecting means 11 is used.
- the sound source identification device 10 can more accurately identify the position of the sound source.
- the sound source information obtained by roughly separating the sound source in advance can reliably identify the sound source even if the sound source is close to the sound source.
- FIG. 9 shows the seventh, 51st, 78th, and 158th frames of the continuously captured images.
- each speaker is located near 0 degrees, ⁇ 30 degrees, 0 degrees, and +20 degrees.
- the image processing means 13 performs the image processing based on only the color and selects the position of the object which can be a sound source, as shown in the graph of FIG.
- the misidentification decreases as shown in the graph of FIG. 10 (C).
- FIG. 10 (E) As shown in the graph, it is clear that compared to the accurate face position shown in FIG. 10 (A), that is, it is possible to select a rather accurate sound source position information.
- the image processing means 13 determines the center positions A 2, B 2, and C 2 of the frames A 1, B 1, and C 1 of the object that can be a sound source based on the captured continuous images.
- the horizontal coordinates A3, B3, and C3 are used as the position information regarding the object that can be the sound source, the horizontal and vertical coordinates may be used as the position information regarding the object that can be the sound source.
- the image processing means 13 selects position information of an object that can be a sound source on the basis of a color, a shape (for example, height), etc., based on a captured continuous image. ing.
- the image processing means 13 performs the image processing with reference to the general directions AO, BO, and C0 from the sound processing means 14, but is not limited thereto. Only the image information from 12 may be used to select the positional information of the object that can be a sound source.
- an active badge made of magnetic equipment may be attached to each sound source, and the direction in which magnetism is emitted may be selected using a magnetic detection device as a sensing means. This may be fed back to the sound processing means, and the sound processing means may create a directional filter using the direction obtained from the magnetic detection device to separate the sound source.
- the sound source is, for example, a human, it emits heat rays, so the direction of the sound source may be detected by an infrared sensor.
- the direction of the sound source is determined based on the image information and the direction detection information with reference to positional information on an object that can be a sound source. Since there is no need to process sound information in all directions for sound source identification, more accurate sound source identification can be performed, and the amount of processing information can be reduced and processing time can be reduced. Can be. It Therefore, according to the present invention, there is provided an extremely excellent sound control apparatus and method capable of identifying a plurality of sound sources with high accuracy using two microphones.
- the sound source identification device and the identification method of the present invention identify the position of the sound source object with higher accuracy based on the acoustic information and the image information, and use the position information to separate each sound from the mixed sound. It is extremely useful as a sound source identification device and a method for identifying the same that can be separated with high accuracy.
Description
Claims
Priority Applications (3)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/926,673 US7035418B1 (en) | 1999-06-11 | 2000-06-07 | Method and apparatus for determining sound source |
DE60036216T DE60036216T2 (de) | 1999-06-11 | 2000-06-07 | Verfahren und gerät zur bestimmung einer tonquelle |
EP00935570A EP1205762B1 (en) | 1999-06-11 | 2000-06-07 | Method and apparatus for determining sound source |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP11/165182 | 1999-06-11 | ||
JP16518299A JP3195920B2 (ja) | 1999-06-11 | 1999-06-11 | 音源同定・分離装置及びその方法 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2000077537A1 true WO2000077537A1 (fr) | 2000-12-21 |
Family
ID=15807412
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2000/003695 WO2000077537A1 (fr) | 1999-06-11 | 2000-06-07 | Procede et appareil de determination d'une source sonore |
Country Status (6)
Country | Link |
---|---|
US (1) | US7035418B1 (ja) |
EP (1) | EP1205762B1 (ja) |
JP (1) | JP3195920B2 (ja) |
DE (1) | DE60036216T2 (ja) |
ES (1) | ES2292441T3 (ja) |
WO (1) | WO2000077537A1 (ja) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105388478A (zh) * | 2014-09-03 | 2016-03-09 | 计算机科学应用促进会 | 用于检测声学和光学信息的方法和装置、以及对应的计算机程序和对应的计算机可读存储介质 |
WO2019080705A1 (zh) * | 2017-10-23 | 2019-05-02 | 京东方科技集团股份有限公司 | 采集设备、声音采集方法、声源跟踪系统及其方法 |
Families Citing this family (31)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
SE517765C2 (sv) * | 2000-11-16 | 2002-07-16 | Ericsson Telefon Ab L M | Registrering av rörliga bilder medelst en portabel kommunikationsenhet samt en tillbehörsanordning vilken är samlokaliserad med objektet |
JP2004266343A (ja) * | 2003-02-05 | 2004-09-24 | Matsushita Electric Ind Co Ltd | 画像サーバーと画像サーバーシステム、そのプログラム及び記録媒体 |
US20080120100A1 (en) * | 2003-03-17 | 2008-05-22 | Kazuya Takeda | Method For Detecting Target Sound, Method For Detecting Delay Time In Signal Input, And Sound Signal Processor |
JP4269883B2 (ja) * | 2003-10-20 | 2009-05-27 | ソニー株式会社 | マイクロホン装置、再生装置及び撮像装置 |
US20090018828A1 (en) * | 2003-11-12 | 2009-01-15 | Honda Motor Co., Ltd. | Automatic Speech Recognition System |
GB0330253D0 (en) | 2003-12-31 | 2004-02-04 | Mitel Networks Corp | Self-discovery method |
JP2006245725A (ja) * | 2005-03-01 | 2006-09-14 | Yamaha Corp | マイクロフォンシステム |
JP4441879B2 (ja) * | 2005-06-28 | 2010-03-31 | ソニー株式会社 | 信号処理装置および方法、プログラム、並びに記録媒体 |
JP4757786B2 (ja) * | 2006-12-07 | 2011-08-24 | Necアクセステクニカ株式会社 | 音源方向推定装置、音源方向推定方法、及びロボット装置 |
IL188156A0 (en) * | 2007-12-16 | 2008-11-03 | Maly Edelman | A method and system for protecting an area |
US20100098258A1 (en) * | 2008-10-22 | 2010-04-22 | Karl Ola Thorn | System and method for generating multichannel audio with a portable electronic device |
US20100123785A1 (en) * | 2008-11-17 | 2010-05-20 | Apple Inc. | Graphic Control for Directional Audio Input |
WO2010149823A1 (en) * | 2009-06-23 | 2010-12-29 | Nokia Corporation | Method and apparatus for processing audio signals |
TWI402531B (zh) * | 2009-06-29 | 2013-07-21 | Univ Nat Cheng Kung | 音源辨位方法與應用此音源辨位方法之音源辨位系統和電腦程式產品 |
US9094645B2 (en) * | 2009-07-17 | 2015-07-28 | Lg Electronics Inc. | Method for processing sound source in terminal and terminal using the same |
TWI417563B (zh) * | 2009-11-20 | 2013-12-01 | Univ Nat Cheng Kung | 遠距離音源定位晶片裝置及其方法 |
US9955209B2 (en) | 2010-04-14 | 2018-04-24 | Alcatel-Lucent Usa Inc. | Immersive viewer, a method of providing scenes on a display and an immersive viewing system |
US9294716B2 (en) | 2010-04-30 | 2016-03-22 | Alcatel Lucent | Method and system for controlling an imaging system |
US8754925B2 (en) * | 2010-09-30 | 2014-06-17 | Alcatel Lucent | Audio source locator and tracker, a method of directing a camera to view an audio source and a video conferencing terminal |
US8185387B1 (en) | 2011-11-14 | 2012-05-22 | Google Inc. | Automatic gain control |
US9008487B2 (en) | 2011-12-06 | 2015-04-14 | Alcatel Lucent | Spatial bookmarking |
JP6216169B2 (ja) * | 2012-09-26 | 2017-10-18 | キヤノン株式会社 | 情報処理装置、情報処理方法 |
JP2014143678A (ja) * | 2012-12-27 | 2014-08-07 | Panasonic Corp | 音声処理システム及び音声処理方法 |
CN103902963B (zh) * | 2012-12-28 | 2017-06-20 | 联想(北京)有限公司 | 一种识别方位及身份的方法和电子设备 |
KR101997449B1 (ko) * | 2013-01-29 | 2019-07-09 | 엘지전자 주식회사 | 이동 단말기 및 이의 제어 방법 |
EP2879047A3 (en) * | 2013-11-28 | 2015-12-16 | LG Electronics Inc. | Mobile terminal and controlling method thereof |
CN104683933A (zh) | 2013-11-29 | 2015-06-03 | 杜比实验室特许公司 | 音频对象提取 |
JP6297858B2 (ja) * | 2014-02-25 | 2018-03-20 | 株式会社熊谷組 | 音源推定用画像の作成装置 |
CN104914409B (zh) * | 2014-03-10 | 2017-11-07 | 李文嵩 | 智能住宅定位装置 |
CN105070304B (zh) * | 2015-08-11 | 2018-09-04 | 小米科技有限责任公司 | 实现对象音频录音的方法及装置、电子设备 |
JP6589041B1 (ja) * | 2018-01-16 | 2019-10-09 | ハイラブル株式会社 | 音声分析装置、音声分析方法、音声分析プログラム及び音声分析システム |
Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH05244587A (ja) * | 1992-02-26 | 1993-09-21 | Mitsubishi Electric Corp | テレビ会議用カメラ制御装置 |
JPH06105306A (ja) * | 1992-09-16 | 1994-04-15 | Funai Denki Kenkyusho:Kk | テレビ会議システム |
JPH0739000A (ja) * | 1992-12-05 | 1995-02-07 | Kazumoto Suzuki | 任意の方向からの音波の選択的抽出法 |
JPH08251561A (ja) * | 1995-03-09 | 1996-09-27 | Nec Corp | 画像通信端末装置のユーザインタフェース |
JPH08286680A (ja) * | 1995-02-17 | 1996-11-01 | Takenaka Komuten Co Ltd | 音抽出装置 |
JPH0933330A (ja) * | 1995-07-17 | 1997-02-07 | Nippon Telegr & Teleph Corp <Ntt> | 音響信号分離方法およびこの方法を実施する装置 |
JPH1051889A (ja) * | 1996-08-05 | 1998-02-20 | Toshiba Corp | 音声収集装置及び音声収集方法 |
JPH10313497A (ja) * | 1996-09-18 | 1998-11-24 | Nippon Telegr & Teleph Corp <Ntt> | 音源分離方法、装置及び記録媒体 |
JPH1141577A (ja) * | 1997-07-18 | 1999-02-12 | Fujitsu Ltd | 話者位置検出装置 |
Family Cites Families (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP3232608B2 (ja) * | 1991-11-25 | 2001-11-26 | ソニー株式会社 | 収音装置、再生装置、収音方法および再生方法、および、音信号処理装置 |
US5402499A (en) * | 1992-08-07 | 1995-03-28 | Lsi Logic Corporation | Multimedia controller |
JP2937009B2 (ja) * | 1994-03-30 | 1999-08-23 | ヤマハ株式会社 | 音像定位制御装置 |
CA2148631C (en) | 1994-06-20 | 2000-06-13 | John J. Hildin | Voice-following video system |
GB2309105A (en) * | 1996-01-12 | 1997-07-16 | Ibm | Intuitive GUI in the form of a representation of a physical environment |
AUPN988996A0 (en) * | 1996-05-16 | 1996-06-06 | Unisearch Limited | Compression and coding of audio-visual services |
EP1016002A4 (en) * | 1996-09-04 | 2000-11-15 | David A Goldberg | METHOD AND DEVICE FOR PRODUCING PERSONAL-SPECIFIC IMAGES IN A PUBLIC SPACE |
US6021206A (en) * | 1996-10-02 | 2000-02-01 | Lake Dsp Pty Ltd | Methods and apparatus for processing spatialised audio |
TW379309B (en) * | 1997-05-16 | 2000-01-11 | Samsung Electronics Co Ltd | Signal management apparatus and method using on screen display |
US6072522A (en) * | 1997-06-04 | 2000-06-06 | Cgc Designs | Video conferencing apparatus for group video conferencing |
JP3541339B2 (ja) | 1997-06-26 | 2004-07-07 | 富士通株式会社 | マイクロホンアレイ装置 |
US6192134B1 (en) * | 1997-11-20 | 2001-02-20 | Conexant Systems, Inc. | System and method for a monolithic directional microphone array |
US5940118A (en) * | 1997-12-22 | 1999-08-17 | Nortel Networks Corporation | System and method for steering directional microphones |
US6005610A (en) * | 1998-01-23 | 1999-12-21 | Lucent Technologies Inc. | Audio-visual object localization and tracking system and method therefor |
US6311155B1 (en) * | 2000-02-04 | 2001-10-30 | Hearing Enhancement Company Llc | Use of voice-to-remaining audio (VRA) in consumer applications |
-
1999
- 1999-06-11 JP JP16518299A patent/JP3195920B2/ja not_active Expired - Fee Related
-
2000
- 2000-06-07 US US09/926,673 patent/US7035418B1/en not_active Expired - Lifetime
- 2000-06-07 ES ES00935570T patent/ES2292441T3/es not_active Expired - Lifetime
- 2000-06-07 DE DE60036216T patent/DE60036216T2/de not_active Expired - Lifetime
- 2000-06-07 EP EP00935570A patent/EP1205762B1/en not_active Expired - Lifetime
- 2000-06-07 WO PCT/JP2000/003695 patent/WO2000077537A1/ja active IP Right Grant
Patent Citations (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH05244587A (ja) * | 1992-02-26 | 1993-09-21 | Mitsubishi Electric Corp | テレビ会議用カメラ制御装置 |
JPH06105306A (ja) * | 1992-09-16 | 1994-04-15 | Funai Denki Kenkyusho:Kk | テレビ会議システム |
JPH0739000A (ja) * | 1992-12-05 | 1995-02-07 | Kazumoto Suzuki | 任意の方向からの音波の選択的抽出法 |
JPH08286680A (ja) * | 1995-02-17 | 1996-11-01 | Takenaka Komuten Co Ltd | 音抽出装置 |
JPH08251561A (ja) * | 1995-03-09 | 1996-09-27 | Nec Corp | 画像通信端末装置のユーザインタフェース |
JPH0933330A (ja) * | 1995-07-17 | 1997-02-07 | Nippon Telegr & Teleph Corp <Ntt> | 音響信号分離方法およびこの方法を実施する装置 |
JPH1051889A (ja) * | 1996-08-05 | 1998-02-20 | Toshiba Corp | 音声収集装置及び音声収集方法 |
JPH10313497A (ja) * | 1996-09-18 | 1998-11-24 | Nippon Telegr & Teleph Corp <Ntt> | 音源分離方法、装置及び記録媒体 |
JPH1141577A (ja) * | 1997-07-18 | 1999-02-12 | Fujitsu Ltd | 話者位置検出装置 |
Non-Patent Citations (1)
Title |
---|
See also references of EP1205762A4 * |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105388478A (zh) * | 2014-09-03 | 2016-03-09 | 计算机科学应用促进会 | 用于检测声学和光学信息的方法和装置、以及对应的计算机程序和对应的计算机可读存储介质 |
CN105388478B (zh) * | 2014-09-03 | 2019-10-18 | 计算机科学应用促进会 | 用于检测声学和光学信息的方法和装置、以及对应的计算机可读存储介质 |
WO2019080705A1 (zh) * | 2017-10-23 | 2019-05-02 | 京东方科技集团股份有限公司 | 采集设备、声音采集方法、声源跟踪系统及其方法 |
US11525883B2 (en) | 2017-10-23 | 2022-12-13 | Beijing Boe Technology Development Co., Ltd. | Acquisition equipment, sound acquisition method, and sound source tracking system and method |
Also Published As
Publication number | Publication date |
---|---|
US7035418B1 (en) | 2006-04-25 |
EP1205762B1 (en) | 2007-08-29 |
EP1205762A1 (en) | 2002-05-15 |
DE60036216D1 (de) | 2007-10-11 |
EP1205762A4 (en) | 2005-07-06 |
DE60036216T2 (de) | 2008-05-15 |
ES2292441T3 (es) | 2008-03-16 |
JP3195920B2 (ja) | 2001-08-06 |
JP2000356674A (ja) | 2000-12-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2000077537A1 (fr) | Procede et appareil de determination d'une source sonore | |
US9595259B2 (en) | Sound source-separating device and sound source-separating method | |
JP6289121B2 (ja) | 音響信号処理装置、動画撮影装置およびそれらの制御方法 | |
JP4896838B2 (ja) | 撮像装置、画像検出装置及びプログラム | |
JP2009141555A (ja) | 音声入力機能付き撮像装置及びその音声記録方法 | |
US20100302401A1 (en) | Image Audio Processing Apparatus And Image Sensing Apparatus | |
JP5597956B2 (ja) | 音声データ合成装置 | |
JP5618043B2 (ja) | 映像音響処理システム、映像音響処理方法及びプログラム | |
JP2001243466A (ja) | 顔認識装置およびその方法 | |
JP4595364B2 (ja) | 情報処理装置および方法、プログラム、並びに記録媒体 | |
JP6216169B2 (ja) | 情報処理装置、情報処理方法 | |
JP3714706B2 (ja) | 音抽出装置 | |
JP4968346B2 (ja) | 撮像装置、画像検出装置及びプログラム | |
JP2009239348A (ja) | 撮影装置 | |
KR101542647B1 (ko) | 화자 검출을 이용한 오디오 신호 처리 방법 및 장치 | |
JP2009177480A (ja) | 撮影装置 | |
JP2003078988A (ja) | 収音装置、方法及びプログラム、記録媒体 | |
JP6881267B2 (ja) | 制御装置、変換装置、制御方法、変換方法、およびプログラム | |
JP6835205B2 (ja) | 撮影収音装置、収音制御システム、撮影収音装置の制御方法、及び収音制御システムの制御方法 | |
JP2009239349A (ja) | 撮影装置 | |
JP6456171B2 (ja) | 情報処理装置、情報処理方法及びプログラム | |
JP7111202B2 (ja) | 収音制御システム及び収音制御システムの制御方法 | |
WO2021020197A1 (ja) | 映像生成方法 | |
JPH11341592A (ja) | 撮像装置に同調する録音装置 | |
JPH10191498A (ja) | 音信号処理装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AK | Designated states |
Kind code of ref document: A1 Designated state(s): US |
|
AL | Designated countries for regional patents |
Kind code of ref document: A1 Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE |
|
DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | ||
121 | Ep: the epo has been informed by wipo that ep was designated in this application | ||
WWE | Wipo information: entry into national phase |
Ref document number: 09926673 Country of ref document: US |
|
WWE | Wipo information: entry into national phase |
Ref document number: 2000935570 Country of ref document: EP |
|
WWP | Wipo information: published in national office |
Ref document number: 2000935570 Country of ref document: EP |
|
WWG | Wipo information: grant in national office |
Ref document number: 2000935570 Country of ref document: EP |