WO1998044436A1 - Natural language labeling - Google Patents

Natural language labeling Download PDF

Info

Publication number
WO1998044436A1
WO1998044436A1 PCT/JP1998/001391 JP9801391W WO9844436A1 WO 1998044436 A1 WO1998044436 A1 WO 1998044436A1 JP 9801391 W JP9801391 W JP 9801391W WO 9844436 A1 WO9844436 A1 WO 9844436A1
Authority
WO
WIPO (PCT)
Prior art keywords
words
user
label
letters
prompting
Prior art date
Application number
PCT/JP1998/001391
Other languages
French (fr)
Inventor
Jeffrey Brian Sampsell
Original Assignee
Sharp Kabushiki Kaisha
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sharp Kabushiki Kaisha filed Critical Sharp Kabushiki Kaisha
Publication of WO1998044436A1 publication Critical patent/WO1998044436A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N1/00Scanning, transmission or reproduction of documents or the like, e.g. facsimile transmission; Details thereof
    • H04N1/32Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
    • H04N1/32101Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
    • H04N1/32128Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title attached to the image data, e.g. file header, transmitted message header, information on the same page or in the same computer file as the image
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/50Information retrieval; Database structures therefor; File system structures therefor of still image data
    • G06F16/58Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/19Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier
    • G11B27/28Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording
    • G11B27/30Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording on the same track as the main recording
    • G11B27/3027Indexing; Addressing; Timing or synchronising; Measuring tape travel by using information detectable on the record carrier by using information signals recorded by the same method as the main recording on the same track as the main recording used signal is digitally coded
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/32Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
    • H04N2201/3201Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
    • H04N2201/3225Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document
    • H04N2201/3226Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of data relating to an image, a page or a document of identification information or the like, e.g. ID code, index, title, part of an image, reduced-size image
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/32Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
    • H04N2201/3201Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
    • H04N2201/3261Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of multimedia information, e.g. a sound signal
    • H04N2201/3266Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title of multimedia information, e.g. a sound signal of text or character information, e.g. text accompanying an image
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N2201/00Indexing scheme relating to scanning, transmission or reproduction of documents or the like, and to details thereof
    • H04N2201/32Circuits or arrangements for control or supervision between transmitter and receiver or between image input and image output device, e.g. between a still-image camera and its memory or between a still-image camera and a printer device
    • H04N2201/3201Display, printing, storage or transmission of additional information, e.g. ID code, date and time or title
    • H04N2201/3274Storage or retrieval of prestored additional information
    • H04N2201/3277The additional information being stored in the same storage device as the image data

Definitions

  • the present invention relates to a natural language labeling system for video and photographic images.
  • Video recorders typically include buttons that permit the user to enter a text label which is overlaid on a small portion of the video images on a tape as the video images are obtained.
  • the user Scrolls through the alphabet in a letter-by- letter process until the desired letter appears.
  • the desired letter appears it may be selected so that it becomes a part of the text label .
  • This letter by letter process is repeated until the desired text label is completely entered.
  • camcorders are small so as to be handheld, they are not suitable for an additional alpha-numeric keypad from which to enter text.
  • Photographic items such as photos, slides, and digital image files have other problems.
  • individuals may take hundreds to thousands of photo- graphic pictures (prints or transparencies) every year.
  • every good picture is stored in a photo album together with its negative, or a slide tray or cassette in the case of transparencies.
  • organizing photos in a photo album requires considerable effort and most often individuals merely look at the pictures once or twice and then place them in a box with other pictures. Over time negatives tend to become separated from their respective pictures making it difficult to obtain a duplicate print of a desired picture because the negative cannot be located.
  • each picture is not manually labeled with a label relating to its subject, then over time the photographer may not be able to recall the subject matter of the picture, the people shown in the picture, and the date that the picture was taken.
  • Several film developing services now scan negatives (or positives) to create a digital photographic image file of each picture. The digital photographic image file is then provided to the customer either on storage media such as a diskette or over a network such as the internet. Similar labeling, identification, locating, and storing problems exist with digital photographic image files, as with traditional photographic pictures and slides.
  • U.S. Patent No. 5,555,408 disclose a knowledge based information retrieval system suitable to query existing databases for desired information.
  • the natural language portion of the retrieval system permits users to enter an English sentence query, as opposed to cryptic database syntax query, to search for desired information within the database.
  • the natural language interface is intuitive for the user and alleviates the need for the user to learn the cryptic database query syntax, thus making the system faster to learn.
  • Such systems are generally referred to as natural language query systems.
  • the present invention overcomes the aforementioned drawbacks of the prior art by providing a system for labeling video or photographic images on a portable handheld device, such as a camcorder or camera, that includes a language interface.
  • the user is prompted through the language interface with a first plurality of first words , each of the first words including a plurality of letters, from which the user selects at least one of the first words.
  • the user is then prompted through the interface with a second plurality of second words, each of the second words including a plurality of letters, from which the user selects at least one of the second words.
  • the selected at least one of the first words and the selected at least one of the second words are combined to create a label relating to subject matter obtained by the portable handheld device.
  • the label is either overlaid on or attached to the video clips (digital and analog) or photographic images (digital and film based) depending on the nature of the portable handheld device and the system configuration.
  • the system includes search tools that use the language interface to locate video clips (digital and analog) or photographic images (digital and film based) .
  • the language interface permits the user to create a label using a word-by-word process so that the video clips and photographic images are easily identified later.
  • the language interface permits the user to select entire words which allows for the quick creation of the label. Since labels are easier to create, it is more likely that the user will actually label his video and photographic images. Also, by labeling the video and photographic images, the user will be able to search for desired video or photographic images by using electronic search tools.
  • the language interface is especially suitable for portable handheld devices, such as cameras and camcorders, where space limitations exist that prohibit the use of an alphanumeric keyboard. As such, the language interface only requires a few controls, such as buttons or touch- sensitive points on a display, to be used effectively.
  • FIG. 1 is a block diagram of an exemplary embodiment of a natural language labeling system that includes a natural language interface of the present invention.
  • FIG. 2 is a block diagram of the natural language interface of the labeling system of FIG. 1.
  • FIG. 3 is a schematic view of a camcorder and a viewfinder including the natural language interface of FIG. 2.
  • FIG. 4 is a schematic view of a digital camcorder including the natural language interface of FIG. 2.
  • FIG. 5 is a simplified pictorial view of a camera including the natural language interface of FIG. 2.
  • FIG. 6 is a block diagram of a natural language labeling system suitable for use with film developing services.
  • the present inventor came to the realization that the text created using a natural language query system, which was previously designed for and specifically used to query existing databases for information, may be used in a new manner to label video and photographic images.
  • the user selects the video or photographic image to label at block 10.
  • the user uses a natural language interface 12 , described in detail below, to create a suitable label that relates to and describes the subject matter of the video or photographic image previously selected at block 10.
  • the label created using the natural language interface 12 is attached to or overlaid on the selected video or photographic image 10, as described below.
  • the natural language interface 12 permits the user to create a label using a word-by-word process for video or photographic images so that they are easily identified later.
  • the natural language interface 12 permits the user to select entire words, as opposed to individual letters, which allows for the quick creation of the label. Since labels are easier to create, it is more likely that the user will actually label his video and photographic images. Also, by labeling the video and photographic images, the user will be able to search for desired video or photographic images by using electronic search tools that search for and compare the natural language query text with labels attached to or overlaid on the video or photographic images.
  • the natural language interface 12 is especially suitable for portable handheld devices, such as cameras and camcorders, where space limitations exist that prohibit the use of an alphanumeric keyboard.
  • the natural language interface 12 allows the user to create the label by following a simple process of presenting different lists of words to the user.
  • the user is first presented with a list of people nouns 20 from which to select the desired person, if any.
  • the people nouns 20 may include, for example, mom, dad, sister, brother, grandma, grandpa, son, daughter, friend, family, soccer club, Tom, Smith, Jeffrey, Susan, and Kevin.
  • Additional people nouns 20 may be added to the list by entering them into the natural language interface 12 by a manual letter-by- letter process, downloaded to the natural language interface 12 through a communication channel, PCMCIA card connected to the natural language interface 12 , or any other suitable method. After selecting the first people noun 20 the user has the option at block 22 to select additional people nouns 20.
  • the user is presented with a list of things 24, such as, for example, car, truck, tree, shoe, computer, desk, ice ax, and ball, from which to select appropriate things 24, if any.
  • things 24, such as, for example, car, truck, tree, shoe, computer, desk, ice ax, and ball from which to select appropriate things 24, if any.
  • the user has the option at block 26 to select additional things 24.
  • the user is presented with a list of prepositions 28 to select from, such as, for example, and, on, from, at, and of.
  • the user is presented with a list of places 30, such as, for example, beach, town, mountain, home, work Portland, and Vancouver, from which to select appropriate places 30, if any.
  • places 30, such as, for example, beach, town, mountain, home, work Portland, and Vancouver, from which to select appropriate places 30, if any.
  • the user has the option at block 32 to select additional places 30.
  • the user is again presented with a list of people nouns 34, such as, for example, mom, dad, sister, brother, grandma, grandpa, son, daughter, friend, Jon, Tom, Smith, Jeffrey, Susan, and Kevin, from which to select appropriate people nouns 34, if any.
  • the user has the option at block 36 to select additional people nouns 34.
  • the user is presented with a list of events 38, such as, for example, birthday, wedding, party, Christmas, marathon, and vacation, from which to select appropriate events 38, if any.
  • events 38 such as, for example, birthday, wedding, party, Christmas, marathon, and vacation, from which to select appropriate events 38, if any.
  • the user has the option at block 40 to select additional events 38, if any.
  • the video or camera device normally includes an internal clock so the selection of "time" actually provides the current time.
  • the natural language interface 12 permits the user to create a label that relates to and identifies the subject matter of the video or photographic image. The particular order of different categories of words and the particular order of words within each category may be modified by the user to accommodate trends in use.
  • the words within each category may be changed, as desired, by any suitable manner such as by manual entry, downloading through a communication channel, and PCMCIA cards. Additionally, other suitable categories of words may likewise be added.
  • the user's selection of any particular individual word within each category may change the words that the natural language interface 12 presents to the user within later selections. This allows the natural language interface 12 to present words to the user that are more likely to accurately describe the video or photographic image. For example, if the thing 24 selected by the user was a beach ball then the list of places 30 may be modified to include words related to beaches and oceans. This helps maintain relatively short lists of words presented to the user which decreases the time required to create a label. In addi- tion, the user should be able to skip particular categories, if desired.
  • a camcorder 52 and in particular a camcorder viewfinder 50, may be used in conjunction with the natural language interface 12 to create a label.
  • the camcorder 52 may include a set of buttons 54a-54f each of which corresponds to respective virtual buttons 56a-56f displayed on the display 58 within the viewfinder 50.
  • a list of selections may be displayed on the display 58 in a box 60.
  • the user may scroll through the available selections to select the desired word, as indicated with the highlight bar 62.
  • the select button 54a is used to select the highlighted word and add it to the label 64 being created, as shown in the lower portion of the display 58. Any necessary prepositions, pronouns, indefinite articles, and definite articles may be automatically added, as needed.
  • the forward button 54d can be used to proceed to the next set of words.
  • the back button 54e can be used to return to and modify items already selected.
  • the stop button 54f is selected to exit the natural language interface 12.
  • the system then records the label at an appropriate location on the video, such as on the lower portion of each video.
  • the label can be selected to appear for a limited number of frames, a single frame, or continuously.
  • the virtual keys 56a-56f may be redefined, as desired, to provide additional functions. Labeling only one or a limited number of frames of the video reduces the time that the label obscures the video image during playback. However, the label may still be searched for by a video search interface to locate the particular video clips associated with the jLabel, as described later.
  • One system suitable to attach or encode text on video is described in Eisen et al. U.S. Patent No. 5,440,678 incorporated herein by reference.
  • a digital camcorder 110 includes a CCD camera 120 that senses the scene in the view of the camera.
  • An analog-to-digital (A/D) converter 124 converts the analog output of the CCD camera 120 to a digital signal.
  • a compressor 126 compresses the digital output from the A/D converter 124 in a manner similar to MPEG-2.
  • the compressed digital data is then stored on the video portion of a tape 128 at 20 Mbits/sec.
  • the tape 128 also includes a digital data track that is used to store additional information thereon. Similar to the analog-based camcorder described in relation to FIG. 3, the digital camcorder may include the natural language interface 12 and store the label either in the video portion or on the data track of the tape 128.
  • a digital camera 68 preferably includes a lens 71, a viewfinder (not shown) , and a set of buttons 70a-70f that may have the same functions as the buttons 54a-54f and 56a-56f previously described in relation to the camcorder 52 of FIG. 3.
  • the words for the label are viewed and selected by use of the viewfinder and buttons 70a-70j , similar to that of the camcorder 52.
  • the digital camera 68 may include a mini- disc 72, built-in memory, or memory card 74 upon which captured images are stored.
  • the natural language labels are preferably overlaid on the digital image obtained by the camera 68. Alternatively, the natural language labels may be electronically attached to one (or more) digital photographic image file without actually altering the image. As such, the label is associated with the file but this image file is not altered.
  • a traditional film camera 80 may be used together with the natural language interface 12.
  • the film from the camera 80 is sent to a film developing service 82 that develops the negative (or positive) and scans each image on the film to obtain digital photographic image files.
  • the digital photographic image files are electronically transferred to the customer 84 through a computer network such as the internet.
  • the digital photographic image files may be recorded onto storage media, such as floppy discs, and mailed to the customer 84.
  • the customer 84 uses a personal computer 86 that includes software with the natural language interface 12 to label each of the digital photographic image files. The labels may be overlaid on the image or attached to the image file without modifying the actual image.
  • An Advanced Photo System (APS) camera uses a film that includes a generally transparent thin layer of magnetic material over either a portion of or all of the film.
  • the magnetic material is suitable to encode digital information therein.
  • the magnetic material records conditions that exist when the respective photo was taken, such as lighting and camera settings, that are used to improve the quality of subsequent film developing.
  • the camera may include the natural language interface 12 with the label stored in a digital format in the magnetic material.
  • the natural language interface 12 also includes a search query function. The user builds a query of words, in a manner similar to creating a label as previously described, that the user wants to locate in previously completed labels using the natural language interface 12.
  • the system searches through all video (digital or analog) and photographic images (digital or film based) to locate video clips or photographs that contain one or more of the keywords.
  • video digital or analog
  • photographic images digital or film based
  • the natural language interface 12 is suitable for use with any type of selection device, such as, for example, a touch-sensitive overlay on the display, a light pen, a mouse, a joystick, a plurality of buttons, and a pointer stick.
  • selection device such as, for example, a touch-sensitive overlay on the display, a light pen, a mouse, a joystick, a plurality of buttons, and a pointer stick.

Abstract

A system for labeling video or photographic images in a portable handheld device, such as a camcorder or camera, includes a language interface. The user is prompted through the language interface with a first plurality of first words, each of the first words including a plurality of letters, from which the user selects at least one of the first words. The user is then prompted through the interface with a second plurality of second words, each of the second words including a plurality of letters, from which the user selects at least one of the second words. The selected at least one of the first words and the selected at least one of the second words are combined to create a label relating to subject matter obtained by the portable handheld device. The label is overlaid on or attached to the video clips (digital and analog) or photographic images (digital and film based) depending on the nature of the portable handheld device.

Description

DESCRIPTION
NATURAL LANGUAGE LABELING
BACKGROUND OF THE INVENTION
The present invention relates to a natural language labeling system for video and photographic images.
Video recorders, and in particular handheld portable camcorders, typically include buttons that permit the user to enter a text label which is overlaid on a small portion of the video images on a tape as the video images are obtained. To enter the desired text, the user scrolls through the alphabet in a letter-by- letter process until the desired letter appears. When the desired letter appears it may be selected so that it becomes a part of the text label . This letter by letter process is repeated until the desired text label is completely entered. Unfortunately, this process is time consuming and therefore infre uently done by users. Because camcorders are small so as to be handheld, they are not suitable for an additional alpha-numeric keypad from which to enter text. However, it is desirable to label individual video clips to assist a user's recollection of the taped event. Also, over time the user may accumulate hundreds of video tapes, with each video tape including hundreds of different video clips. Without accurately labeling the exterior label of a video tape with an indication of all the video clips contained therein, locating the desired video clip among many tapes becomes a nightmarish task. This task is even more difficult for somebody who has not previously viewed the video clip or the video tape.
Photographic items such as photos, slides, and digital image files have other problems. For example, individuals may take hundreds to thousands of photo- graphic pictures (prints or transparencies) every year. Ideally every good picture is stored in a photo album together with its negative, or a slide tray or cassette in the case of transparencies. However, organizing photos in a photo album requires considerable effort and most often individuals merely look at the pictures once or twice and then place them in a box with other pictures. Over time negatives tend to become separated from their respective pictures making it difficult to obtain a duplicate print of a desired picture because the negative cannot be located. In addition, if each picture is not manually labeled with a label relating to its subject, then over time the photographer may not be able to recall the subject matter of the picture, the people shown in the picture, and the date that the picture was taken. There are similar problems relating to locating and identifying slides. Several film developing services now scan negatives (or positives) to create a digital photographic image file of each picture. The digital photographic image file is then provided to the customer either on storage media such as a diskette or over a network such as the internet. Similar labeling, identification, locating, and storing problems exist with digital photographic image files, as with traditional photographic pictures and slides.
Fujisawa et al . U.S. Patent No. 5,555,408 disclose a knowledge based information retrieval system suitable to query existing databases for desired information. The natural language portion of the retrieval system permits users to enter an English sentence query, as opposed to cryptic database syntax query, to search for desired information within the database. The natural language interface is intuitive for the user and alleviates the need for the user to learn the cryptic database query syntax, thus making the system faster to learn. Such systems are generally referred to as natural language query systems.
What is desired, therefore, is a system for efficiently labeling video and photographic images that is suitable for portable handheld devices. Also, the system should permit the efficient categorization and retrieval of video clips and photographic images.
SUMMARY OF THE PRESENT INVENTION
The present invention overcomes the aforementioned drawbacks of the prior art by providing a system for labeling video or photographic images on a portable handheld device, such as a camcorder or camera, that includes a language interface. The user is prompted through the language interface with a first plurality of first words , each of the first words including a plurality of letters, from which the user selects at least one of the first words. The user is then prompted through the interface with a second plurality of second words, each of the second words including a plurality of letters, from which the user selects at least one of the second words. The selected at least one of the first words and the selected at least one of the second words are combined to create a label relating to subject matter obtained by the portable handheld device. The label is either overlaid on or attached to the video clips (digital and analog) or photographic images (digital and film based) depending on the nature of the portable handheld device and the system configuration. Preferably, the system includes search tools that use the language interface to locate video clips (digital and analog) or photographic images (digital and film based) .
The language interface permits the user to create a label using a word-by-word process so that the video clips and photographic images are easily identified later. In addition, the language interface permits the user to select entire words which allows for the quick creation of the label. Since labels are easier to create, it is more likely that the user will actually label his video and photographic images. Also, by labeling the video and photographic images, the user will be able to search for desired video or photographic images by using electronic search tools. Further, the language interface is especially suitable for portable handheld devices, such as cameras and camcorders, where space limitations exist that prohibit the use of an alphanumeric keyboard. As such, the language interface only requires a few controls, such as buttons or touch- sensitive points on a display, to be used effectively. The foregoing and other objectives, features, and advantages of the invention will be more readily understood upon consideration of the following detailed description of the invention, taken in conjunction with the accompanying drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a block diagram of an exemplary embodiment of a natural language labeling system that includes a natural language interface of the present invention. FIG. 2 is a block diagram of the natural language interface of the labeling system of FIG. 1.
FIG. 3 is a schematic view of a camcorder and a viewfinder including the natural language interface of FIG. 2. FIG. 4 is a schematic view of a digital camcorder including the natural language interface of FIG. 2.
FIG. 5 is a simplified pictorial view of a camera including the natural language interface of FIG. 2.
FIG. 6 is a block diagram of a natural language labeling system suitable for use with film developing services.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
Referring to FIG. 1, the present inventor came to the realization that the text created using a natural language query system, which was previously designed for and specifically used to query existing databases for information, may be used in a new manner to label video and photographic images. First, the user selects the video or photographic image to label at block 10. Then, the user uses a natural language interface 12 , described in detail below, to create a suitable label that relates to and describes the subject matter of the video or photographic image previously selected at block 10. Thereafter, the label created using the natural language interface 12 is attached to or overlaid on the selected video or photographic image 10, as described below. By connecting the selected video or photographic image with the label, the video or photographic image can later be located with electronic search tools and the subject matter of the video or photographic image will not be forgotten over time.
The natural language interface 12 permits the user to create a label using a word-by-word process for video or photographic images so that they are easily identified later. In addition, the natural language interface 12 permits the user to select entire words, as opposed to individual letters, which allows for the quick creation of the label. Since labels are easier to create, it is more likely that the user will actually label his video and photographic images. Also, by labeling the video and photographic images, the user will be able to search for desired video or photographic images by using electronic search tools that search for and compare the natural language query text with labels attached to or overlaid on the video or photographic images. Further, the natural language interface 12 is especially suitable for portable handheld devices, such as cameras and camcorders, where space limitations exist that prohibit the use of an alphanumeric keyboard. As such, the user can enter the label by selecting entire words presented to the user that are suitable for each location within a sentence. As such, the natural language interface 12 only requires a few controls, such as buttons or touch-sensitive points on a display, to be used effectively. Referring to FIG. 2, the natural language interface 12 allows the user to create the label by following a simple process of presenting different lists of words to the user. The user is first presented with a list of people nouns 20 from which to select the desired person, if any. The people nouns 20 may include, for example, mom, dad, sister, brother, grandma, grandpa, son, daughter, friend, family, soccer club, Tom, Smith, Jeffrey, Susan, and Kevin. Additional people nouns 20 may be added to the list by entering them into the natural language interface 12 by a manual letter-by- letter process, downloaded to the natural language interface 12 through a communication channel, PCMCIA card connected to the natural language interface 12 , or any other suitable method. After selecting the first people noun 20 the user has the option at block 22 to select additional people nouns 20.
Next, the user is presented with a list of things 24, such as, for example, car, truck, tree, shoe, computer, desk, ice ax, and ball, from which to select appropriate things 24, if any. After selection of one of the things 24, if any, the user has the option at block 26 to select additional things 24.
Next, the user is presented with a list of prepositions 28 to select from, such as, for example, and, on, from, at, and of.
Next, the user is presented with a list of places 30, such as, for example, beach, town, mountain, home, work Portland, and Vancouver, from which to select appropriate places 30, if any. After selection of one of the places 30, if any, the user has the option at block 32 to select additional places 30. Next, the user is again presented with a list of people nouns 34, such as, for example, mom, dad, sister, brother, grandma, grandpa, son, daughter, friend, Jon, Tom, Smith, Jeffrey, Susan, and Kevin, from which to select appropriate people nouns 34, if any. After selection of one of the people nouns 34, if any, the user has the option at block 36 to select additional people nouns 34.
Next, the user is again presented with a list of prepositions 37 from which to select one.
Next, the user is presented with a list of events 38, such as, for example, birthday, wedding, party, Christmas, marathon, and vacation, from which to select appropriate events 38, if any. After selection of one of the events 38, the user has the option at block 40 to select additional events 38, if any.
Next, the user is presented with a list of dates and times 42, such as, for example, Monday, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday, January, February, morning, afternoon, evening, and time, from which to select appropriate dates and times 42, if any. The video or camera device normally includes an internal clock so the selection of "time" actually provides the current time. The natural language interface 12 permits the user to create a label that relates to and identifies the subject matter of the video or photographic image. The particular order of different categories of words and the particular order of words within each category may be modified by the user to accommodate trends in use.
Further, the words within each category may be changed, as desired, by any suitable manner such as by manual entry, downloading through a communication channel, and PCMCIA cards. Additionally, other suitable categories of words may likewise be added. The user's selection of any particular individual word within each category may change the words that the natural language interface 12 presents to the user within later selections. This allows the natural language interface 12 to present words to the user that are more likely to accurately describe the video or photographic image. For example, if the thing 24 selected by the user was a beach ball then the list of places 30 may be modified to include words related to beaches and oceans. This helps maintain relatively short lists of words presented to the user which decreases the time required to create a label. In addi- tion, the user should be able to skip particular categories, if desired. Also, the user should be able to vary the order in which the selections are presented to create a label with a different structure. For example, people nouns may be switched with things. Further, suit- able prepositions, pronouns (which, who, that) , indefinite articles (a, an) , and definite articles (the) may be automatically added by the interface 12 in appropriate locations to create a more grammatically correct label. Referring to FIG. 3, a camcorder 52, and in particular a camcorder viewfinder 50, may be used in conjunction with the natural language interface 12 to create a label. The camcorder 52 may include a set of buttons 54a-54f each of which corresponds to respective virtual buttons 56a-56f displayed on the display 58 within the viewfinder 50. A list of selections may be displayed on the display 58 in a box 60. By using the up button 54b and the down button 54c the user may scroll through the available selections to select the desired word, as indicated with the highlight bar 62. The select button 54a is used to select the highlighted word and add it to the label 64 being created, as shown in the lower portion of the display 58. Any necessary prepositions, pronouns, indefinite articles, and definite articles may be automatically added, as needed. To proceed to the next set of words the "none" selection is highlighted and selected. Alternatively, the forward button 54d can be used to proceed to the next set of words. The back button 54e can be used to return to and modify items already selected. After the desired label 64 is created, the stop button 54f is selected to exit the natural language interface 12. The system then records the label at an appropriate location on the video, such as on the lower portion of each video. The label can be selected to appear for a limited number of frames, a single frame, or continuously. The virtual keys 56a-56f may be redefined, as desired, to provide additional functions. Labeling only one or a limited number of frames of the video reduces the time that the label obscures the video image during playback. However, the label may still be searched for by a video search interface to locate the particular video clips associated with the jLabel, as described later. One system suitable to attach or encode text on video is described in Eisen et al. U.S. Patent No. 5,440,678 incorporated herein by reference. Referring to FIG. 4, a digital camcorder 110 includes a CCD camera 120 that senses the scene in the view of the camera. An analog-to-digital (A/D) converter 124 converts the analog output of the CCD camera 120 to a digital signal. A compressor 126 compresses the digital output from the A/D converter 124 in a manner similar to MPEG-2. The compressed digital data is then stored on the video portion of a tape 128 at 20 Mbits/sec. The tape 128 also includes a digital data track that is used to store additional information thereon. Similar to the analog-based camcorder described in relation to FIG. 3, the digital camcorder may include the natural language interface 12 and store the label either in the video portion or on the data track of the tape 128.
Referring to FIG. 5, a digital camera 68 preferably includes a lens 71, a viewfinder (not shown) , and a set of buttons 70a-70f that may have the same functions as the buttons 54a-54f and 56a-56f previously described in relation to the camcorder 52 of FIG. 3. The words for the label are viewed and selected by use of the viewfinder and buttons 70a-70j , similar to that of the camcorder 52. The digital camera 68 may include a mini- disc 72, built-in memory, or memory card 74 upon which captured images are stored. The natural language labels are preferably overlaid on the digital image obtained by the camera 68. Alternatively, the natural language labels may be electronically attached to one (or more) digital photographic image file without actually altering the image. As such, the label is associated with the file but this image file is not altered.
Referring to FIG. 6, a traditional film camera 80 may be used together with the natural language interface 12. The film from the camera 80 is sent to a film developing service 82 that develops the negative (or positive) and scans each image on the film to obtain digital photographic image files. The digital photographic image files are electronically transferred to the customer 84 through a computer network such as the internet. Alternatively, the digital photographic image files may be recorded onto storage media, such as floppy discs, and mailed to the customer 84. The customer 84 uses a personal computer 86 that includes software with the natural language interface 12 to label each of the digital photographic image files. The labels may be overlaid on the image or attached to the image file without modifying the actual image.
An Advanced Photo System (APS) camera uses a film that includes a generally transparent thin layer of magnetic material over either a portion of or all of the film. The magnetic material is suitable to encode digital information therein. Traditionally, the magnetic material records conditions that exist when the respective photo was taken, such as lighting and camera settings, that are used to improve the quality of subsequent film developing. The camera may include the natural language interface 12 with the label stored in a digital format in the magnetic material. The natural language interface 12 also includes a search query function. The user builds a query of words, in a manner similar to creating a label as previously described, that the user wants to locate in previously completed labels using the natural language interface 12. The system then searches through all video (digital or analog) and photographic images (digital or film based) to locate video clips or photographs that contain one or more of the keywords. In the case where the labels are recorded on the video or photographic image this may involve the analysis of the content of the image (s) itself.
It is to be understood that many electronic devices, and particularly camcorders, may include larger displays for viewing the images. The natural language interface 12 is suitable for use with any type of selection device, such as, for example, a touch-sensitive overlay on the display, a light pen, a mouse, a joystick, a plurality of buttons, and a pointer stick. The terms and expressions which have been employed in the foregoing specification are used therein as terms of description and not of limitation, and there is no intention, in the use of such terms and expressions, of excluding equivalents of the features shown and described or portions thereof, it being recognized that the scope of the invention is defined and limited only by the claims which follow.

Claims

1. A method of labeling video comprising the steps of: (a) providing a portable handheld device for recording a video onto a tape that includes a language interface;
(b) prompting a user through said language interface with a first plurality of first words, each of said first words including a plurality of letters, from which said user selects at least one of said first words ;
(c) prompting said user through said interface with a second plurality of second words, each of said second words including a plurality of letters, from which said user selects at least one of said second words;
(d) combining said selected at least one of said first words and said selected at least one of said second words to create a label relating to at least one of subject matter of a video clip recorded on said tape and of subject matter of a video clip to be recorded on said tape; and
(e) recording said label on said tape by said portable handheld device such that said label is observable by said user while viewing said video clip.
2. The method of claim 1 wherein said portable handheld device is a camcorder.
3. The method of claim 1 wherein said first words and said second words include at least one identical word.
4. The method of claim 1 further comprising the steps of:
(a) prompting said user through said language interface with a third plurality of third words, each of said third words including a plurality of letters, from which said user selects at least one of said third words ;
(b) prompting said user through said interface with a fourth plurality of fourth words, each of said fourth words including a plurality of letters, from which said user selects at least one of said fourth words; and (c) searching said tape by said portable handheld device to locate at least one said label previously recorded on said tape that matches at least one of said at least one of said third words and said at least one of said fourth words.
5. The method of claim 4 wherein said third words and said fourth words include at least one identical word.
6. The method of claim 1 wherein said recording of said label is on the video portion of said tape.
7. A method of labeling video comprising the steps of:
(a) providing a portable handheld device for recording a video onto a tape that includes a language interface; (b) prompting a user through said language interface with a first plurality of first words, each of said first words including a plurality of letters, from which said user selects at least one of said first words ;
(c) prompting said user through said interface with a second plurality of second words, each of said second words including a plurality of letters, from which said user selects at least one of said second words;
(d) combining said selected at least one of said first words and said selected at least one of said second words to create a label relating to at least one of subject matter of a video clip recorded on said tape and subject matter of a video clip to be recorded on said tape; and
(e) recording said label by said portable handheld device on a limited number of frames of said tape such that said label is not clearly noticeable to the human eye while viewing said video clip.
8. The method of claim 7 wherein said limited number of frames is one frame.
9. The method of claim 7 wherein said portable handheld device is a camcorder.
10. The method of claim 7 wherein said first words and said second words include at least one identical word.
11. The method of claim 7, further comprising the steps of:
(a) prompting said user through said language interface with a third plurality of third words, each of said third words including a plurality of letters, from which said user selects at least one of said third words ;
(b) prompting said user through said interface with a fourth plurality of fourth words, each of said fourth words including a plurality of letters, from which said user selects at least one of said fourth words; and
(c) searching said tape by said portable handheld device to locate at least one said label previously recorded on said tape that matches at least one of said at least one of said third words and said at least one of said fourth words.
12. The method of claim 11 wherein said third words and said fourth words include at least one identical word.
13. A method of labeling a digital photographic image comprising the steps of:
(a) providing a portable handheld device for obtaining a digital photographic image of a scene that includes a language interface;
(b) prompting a user through said language interface with a first plurality of first words, each of said first words including a plurality of letters from which said user selects at least one of said first words;
(c) prompting said user through said interface with a second plurality of second words, each of said second words including a plurality of letters from which said user selects at least one of said second words; (d) combining said selected at least one of said first words and said selected at least one of said second words to create a label relating to the subject matter depicted in said digital photographic image; and
(e) overlaying said label on said digital photographic image by said portable handheld device such that said label is observable by said user while viewing said digital photographic image.
14. The method of claim 13 wherein said portable handheld device is a camera.
15. The method of claim 13 wherein said first words and said second words include at least one identical word.
16. The method of claim 13 further comprising the steps of:
(a) prompting said user through said language interface with a third plurality of third words, each of said third words including a plurality of letters, from which said user selects at least one of said third words ;
(b) prompting said user through said interface with a fourth plurality of fourth words, each of said fourth words including a plurality of letters, from which said user selects at least one of said fourth words; and
(c) searching a plurality of said digital photographic images obtained by said portable handheld device upon which each digital photographic image has been overlaid with a said label in order to locate at least one said label previously overlaid on one of said digital photographic images that matches at least one of said at least one of said third words and said at least one of said fourth words .
17. The method of claim 16 wherein said third words and said fourth words include at least one identical word.
18. The method of claim 13 wherein said overlaying said label modifies the digital photographic image such that said label is incorporated within said digital photographic image.
19. The method of claim 13 wherein said overlaying said label does not alter said digital photographic image.
20. A method of labeling a digital photographic image comprising the steps of:
(a) providing a portable handheld device for obtaining a digital photographic image of a scene that includes a language interface;
(b) prompting a user through said language interface with a first plurality of first words, each of said first words including a plurality of letters from which said user selects at least one of said first words ;
(c) prompting said user through said language interface with a second plurality of second words, each of said second words including a plurality of letters from which said user selects at least one of said second words ;
(d) combining said selected at least one of said first words and said selected at least one of said second words to create a label relating to the subject matter depicted in said digital photographic image; and
(e) attaching said label to said digital photographic image by said portable handheld device such that said label does not obscure viewing said digital photographic image.
21. The method of claim 20 wherein said portable handheld device is a camera.
22. The method of claim 20 wherein said first words and said second words include at least one identical word.
23. The method of claim 20, further comprising the steps of:
(a) prompting said user through said language interface with a third plurality of third words, each of said third words including a plurality of letters, from which said user selects at least one of said third words ; (b) prompting said user through said interface with a fourth plurality of fourth words, each of said fourth words including a plurality of letters, from which said user selects at least one of said fourth words; and
(c) searching a plurality of said digital photographic images obtained by said portable handheld device upon which each digital photographic image has been attached with a said label in order to locate at least one said label previously attached to one of said digital photographic images that matches at least one of said at least one of said third words and said at least one of said fourth words.
24. The method of claim 23 wherein said third words and said fourth words include at least one identical word.
25. The method of claim 20 wherein said attaching said label does not alter said digital photographic image .
26. A method of labeling a digital image comprising the steps of:
(a) providing a portable handheld device for obtaining a film-based image of a scene;
(b) scanning said film to obtain a digital photographic image of said film-based photographic image;
(c) prompting a user through a language interface on a computer with a first plurality of first words, each of said first words including a plurality of letters from which said user selects at least one of said first words;
(d) prompting said user through said language interface on said computer with a second plurality of second words, each of said second words including a plurality of letters from which said user selects at least one of said second words; (e) combining on said computer said selected at least one of said first words and said selected at least one of said second words to create a label relating to the subject matter depicted in said digital photographic image; and
(f) attaching said label to said digital photographic image in a manner such that said label is at least one of, observable by said user while viewing said digital photographic image, and not obscure viewing said digital photographic image while viewing said digital photographic image
27. The method of claim 26 wherein said portable handheld device is a camera.
28. The method of claim 26 wherein said first words and said second words include at least one identical word.
29. The method of claim 26 further comprising the steps of:
(a) prompting said user through said language interface with a third plurality of third words, each of said third words including a plurality of letters, from which said user selects at least one of said third words ; (b) prompting said user through said interface with a fourth plurality of fourth words, each of said fourth words including a plurality of letters, from which said user selects at least one of said fourth words; and
(c) searching on said computer a plurality of said digital photographic images obtained by said portable handheld device upon which each digital photographic image has been attached with a said label in order to locate at least one said label previously attached to one of said digital photographic images that matches at least one of said at least one of said third words and said at least one of said fourth words .
30. A method of labeling a photographic image comprising the steps of:
(a) providing a portable handheld device for obtaining a photographic image of a scene on film that includes a language interface;
(b) prompting a user through said language interface with a first plurality of first words, each of said first words including a plurality of letters from which said user selects at least one of said first words ;
(c) prompting said user through said language interface with a second plurality of second words, each of said second words including a plurality of letters from which said user selects at least one of said second words;
(d) combining said selected at least one of said first words and said selected at least one of said second words to create a label relating to the subject matter depicted in said photographic image on said film; and (e) recording said label in a digital format on a magnetic layer overlaying said film by said portable handheld device.
31. The method of claim 30 wherein said portable handheld device is a camera.
32. The method of claim 30 wherein said first words and said second words include at least one identical word.
33. A method of labeling video comprising the steps of: (a) providing a portable handheld device for recording a video onto a tape in a digital format that includes a language interface;
(b) prompting a user through said language interface with a first plurality of first words, each of said first words including a plurality of letters, from which said user selects at least one of said second words ;
(c) prompting said user through said interface with a second plurality of second words, each of said first words including a plurality of letters, from which said user selects at least one of said second words;
(d) combining said selected at least one of said first words and said selected at least one of said second words to create a label relating to at least one of subject matter of a video clip recorded on said tape and subject matter of a video clip to be recorded on said tape; and
(e) recording said label on said tape in a digital format by said portable handheld device.
34. The method of claim 33 wherein said label is observable by said user while viewing said video clip.
35. The method of claim 33 wherein said label is not observable by said user while viewing said video clip.
36. The method of claim 33 wherein said label is recorded on a video portion of said tape.
37. The method of claim 33 wherein said label is recorded on a data track portion of said tape.
38. The method of claim 33 wherein said portable handheld device is a camcorder.
39. The method of claim 33 wherein said first words and said second words include at least one identical word.
40. The method of claim 33, further comprising the steps of: (a) prompting said user through said language interface with a third plurality of third words, each of said third words including a plurality of letters, from which said user selects at least one of said third words;
(b) prompting said user through said interface with a fourth plurality of fourth words, each of said fourth words including a plurality of letters, from which said user selects at least one of said fourth words; and
(c) searching said tape by said portable handheld device to locate at least one said label previously recorded on said tape that matches at least one of said at least one of said third words and said at least one of said fourth words.
41. The method of claim 40 wherein said third words and said fourth words include at least one identical word.
PCT/JP1998/001391 1997-03-28 1998-03-27 Natural language labeling WO1998044436A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/825,140 US6614988B1 (en) 1997-03-28 1997-03-28 Natural language labeling of video using multiple words
US08/825,140 1997-03-28

Publications (1)

Publication Number Publication Date
WO1998044436A1 true WO1998044436A1 (en) 1998-10-08

Family

ID=25243218

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP1998/001391 WO1998044436A1 (en) 1997-03-28 1998-03-27 Natural language labeling

Country Status (2)

Country Link
US (2) US6614988B1 (en)
WO (1) WO1998044436A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1286516A2 (en) * 2001-08-08 2003-02-26 Ricoh Company, Ltd. Digest transmission system for mobile telephones

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7992163B1 (en) 1999-06-11 2011-08-02 Jerding Dean F Video-on-demand navigational system
US6817028B1 (en) 1999-06-11 2004-11-09 Scientific-Atlanta, Inc. Reduced screen control system for interactive program guide
US7010801B1 (en) 1999-06-11 2006-03-07 Scientific-Atlanta, Inc. Video on demand system with parameter-controlled bandwidth deallocation
US8516525B1 (en) 2000-06-09 2013-08-20 Dean F. Jerding Integrated searching system for interactive media guide
US7975277B1 (en) 2000-04-03 2011-07-05 Jerding Dean F System for providing alternative services
US7200857B1 (en) 2000-06-09 2007-04-03 Scientific-Atlanta, Inc. Synchronized video-on-demand supplemental commentary
US7934232B1 (en) 2000-05-04 2011-04-26 Jerding Dean F Navigation paradigm for access to television services
US8069259B2 (en) 2000-06-09 2011-11-29 Rodriguez Arturo A Managing removal of media titles from a list
US7962370B2 (en) 2000-06-29 2011-06-14 Rodriguez Arturo A Methods in a media service system for transaction processing
US7489853B2 (en) * 2000-08-29 2009-02-10 Panasonic Corporation Auxiliary information generation method, auxiliary information generation apparatus, video data generation method, video data playback method, video data playback apparatus, and data storage medium
US7340759B1 (en) 2000-11-10 2008-03-04 Scientific-Atlanta, Inc. Systems and methods for adaptive pricing in a digital broadband delivery system
US7512964B2 (en) 2001-06-29 2009-03-31 Cisco Technology System and method for archiving multiple downloaded recordable media content
US7526788B2 (en) 2001-06-29 2009-04-28 Scientific-Atlanta, Inc. Graphic user interface alternate download options for unavailable PRM content
US7496945B2 (en) 2001-06-29 2009-02-24 Cisco Technology, Inc. Interactive program guide for bidirectional services
US8006262B2 (en) 2001-06-29 2011-08-23 Rodriguez Arturo A Graphic user interfaces for purchasable and recordable media (PRM) downloads
JP2003224750A (en) * 2002-01-29 2003-08-08 Ricoh Co Ltd Digital camera and image edit system
US7334251B2 (en) 2002-02-11 2008-02-19 Scientific-Atlanta, Inc. Management of television advertising
US20030189642A1 (en) * 2002-04-04 2003-10-09 Bean Heather N. User-designated image file identification for a digital camera
JP4329893B2 (en) * 2002-07-15 2009-09-09 株式会社リコー Digital camera device
JP2004187269A (en) * 2002-11-20 2004-07-02 Sanyo Electric Co Ltd Portable device
US20070206921A1 (en) * 2003-12-15 2007-09-06 Matsushita Information Systems Research Laboratory Recording Apparatus for Supporting Titling Image, and Method and Control Program for the Same
US8161388B2 (en) 2004-01-21 2012-04-17 Rodriguez Arturo A Interactive discovery of display device characteristics
FI122372B (en) * 2004-02-13 2011-12-30 Futurice Oy Data Processing system
US8189472B2 (en) 2005-09-07 2012-05-29 Mcdonald James F Optimizing bandwidth utilization to a subscriber premises
US7966024B2 (en) 2008-09-30 2011-06-21 Microsoft Corporation Virtual skywriting
US20160259888A1 (en) * 2015-03-02 2016-09-08 Sony Corporation Method and system for content management of video images of anatomical regions
TWI553494B (en) * 2015-11-04 2016-10-11 創意引晴股份有限公司 Multi-modal fusion based Intelligent fault-tolerant video content recognition system and recognition method

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6466888A (en) * 1987-09-07 1989-03-13 Canon Kk Magnetic tape recording and reproducing method
EP0437533A1 (en) * 1988-10-07 1991-07-24 Eastman Kodak Co Photofinishing process with film-to-video player using dedicated magnetic tracks on film.
US5097348A (en) * 1988-02-29 1992-03-17 Casio Computer Co., Ltd. Image data recording/reproducing apparatus including superimposing function
US5296884A (en) * 1990-02-23 1994-03-22 Minolta Camera Kabushiki Kaisha Camera having a data recording function
EP0678816A2 (en) * 1994-04-21 1995-10-25 Canon Kabushiki Kaisha Information processing method and apparatus therefor

Family Cites Families (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4714962A (en) 1976-08-27 1987-12-22 Levine Alfred B Dual electronic camera, previewing, and control
US4641203A (en) * 1981-03-13 1987-02-03 Miller Richard L Apparatus for storing and relating visual data and computer information
US4908713A (en) * 1981-12-14 1990-03-13 Levine Michael R VCR Programmer
US4482924A (en) 1982-09-29 1984-11-13 Eastman Kodak Company Video player, film medium, and photographic printer for automatic cropping
US4635136A (en) * 1984-02-06 1987-01-06 Rochester Institute Of Technology Method and apparatus for storing a massive inventory of labeled images
US5404506A (en) 1985-03-27 1995-04-04 Hitachi, Ltd. Knowledge based information retrieval system
EP0218218A3 (en) 1985-10-07 1989-11-08 Sharp Kabushiki Kaisha An inputting system and an editing system in an inquiry-and-answer system
JP2829958B2 (en) * 1988-01-27 1998-12-02 ソニー株式会社 Title image insertion device
US5003396A (en) 1988-07-30 1991-03-26 Goldstar Co., Ltd. Black and white monitoring system for use in a remote controller
US5515101A (en) * 1989-04-28 1996-05-07 Minolta Co., Ltd. Title generator for a video camera
JPH03147556A (en) 1989-11-01 1991-06-24 Sharp Corp Magnetic recording and reproducing device
JP3318897B2 (en) 1991-01-29 2002-08-26 ソニー株式会社 Remote controller with video monitor
KR940001789B1 (en) * 1991-01-31 1994-03-05 삼성전자 주식회사 Device for displaying information data and accessing method
CA2094526C (en) 1992-07-22 1998-05-05 Ivan Eisen Method and apparatus for creating a multi-media footnote control in a video data
GB9217886D0 (en) * 1992-08-21 1992-10-07 Canon Res Ct Europe Ltd Method and apparatus for parsing natural language
US5561470A (en) 1992-11-30 1996-10-01 U.S. Philips Corporation Method and system for entering a plurality of data into an apparatus
US5613032A (en) * 1994-09-02 1997-03-18 Bell Communications Research, Inc. System and method for recording, playing back and searching multimedia events wherein video, audio and text can be searched and retrieved

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6466888A (en) * 1987-09-07 1989-03-13 Canon Kk Magnetic tape recording and reproducing method
US5097348A (en) * 1988-02-29 1992-03-17 Casio Computer Co., Ltd. Image data recording/reproducing apparatus including superimposing function
EP0437533A1 (en) * 1988-10-07 1991-07-24 Eastman Kodak Co Photofinishing process with film-to-video player using dedicated magnetic tracks on film.
US5296884A (en) * 1990-02-23 1994-03-22 Minolta Camera Kabushiki Kaisha Camera having a data recording function
EP0678816A2 (en) * 1994-04-21 1995-10-25 Canon Kabushiki Kaisha Information processing method and apparatus therefor

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
KIM Y -B ET AL: "CONTENT-BASED VIDEO INDEXING AND RETRIEVAL - A NATURAL LANGUAGE APPROACH", IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, vol. E79-D, no. 6, 1 June 1996 (1996-06-01), pages 695 - 705, XP000595174 *
PATENT ABSTRACTS OF JAPAN vol. 013, no. 279 (P - 891) 27 June 1989 (1989-06-27) *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1286516A2 (en) * 2001-08-08 2003-02-26 Ricoh Company, Ltd. Digest transmission system for mobile telephones
EP1286516A3 (en) * 2001-08-08 2003-03-26 Ricoh Company, Ltd. Digest transmission system for mobile telephones

Also Published As

Publication number Publication date
US6614988B1 (en) 2003-09-02
US6850695B1 (en) 2005-02-01

Similar Documents

Publication Publication Date Title
US6614988B1 (en) Natural language labeling of video using multiple words
US7414651B2 (en) Efficient image categorization
US6629104B1 (en) Method for adding personalized metadata to a collection of digital images
US7734654B2 (en) Method and system for linking digital pictures to electronic documents
US8010579B2 (en) Bookmarking and annotating in a media diary application
US6128446A (en) Method and apparatus for annotation of photographic film in a camera
US7127164B1 (en) Method for rating images to facilitate image retrieval
US20110200980A1 (en) Information processing device operation control system and operation control method
EP1557035B1 (en) Method and apparatus for transmitting a digital picture with textual material
US20110199511A1 (en) Image photographing system and image photographing method
US20020140820A1 (en) Calendar based photo browser
US20160132534A1 (en) Information processing system, information processing device, inofrmation processing method, and computer readable recording medium
US20050108644A1 (en) Media diary incorporating media and timeline views
US20100121852A1 (en) Apparatus and method of albuming content
US20080256488A1 (en) Method and Apparatus for Enabling Browsing of Images
JP2003298991A (en) Image arranging method and apparatus, and program
AU735499B2 (en) Picture print generating method and system, and recording medium
JP2005174308A (en) Method and apparatus for organizing digital media by face recognition
US20050018057A1 (en) Image capture device loaded with image metadata
JP2001297090A (en) Image data retrieval method, image display method, data retrieval system, image editing device and computer readable storage medium
JP2002010178A (en) Image managing system and method for managing image as well as storage medium
US20020143762A1 (en) Envelope printing feature for photo filing system
JP2004120420A (en) Image adjusting device and program
JP2002185908A (en) Computer-readable recording medium recording image extract program, image extract device and image extract method
US20060117008A1 (en) File management apparatus and file management program

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): JP

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LU MC NL PT SE

DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: JP

Ref document number: 1998541434

Format of ref document f/p: F

122 Ep: pct application non-entry in european phase