CA2244127A1 - Phrase recognition method and apparatus - Google Patents

Phrase recognition method and apparatus

Info

Publication number
CA2244127A1
CA2244127A1 CA002244127A CA2244127A CA2244127A1 CA 2244127 A1 CA2244127 A1 CA 2244127A1 CA 002244127 A CA002244127 A CA 002244127A CA 2244127 A CA2244127 A CA 2244127A CA 2244127 A1 CA2244127 A1 CA 2244127A1
Authority
CA
Canada
Prior art keywords
text
chunks
phrases
phrase recognition
selects
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CA002244127A
Other languages
French (fr)
Other versions
CA2244127C (en
Inventor
Xin Allan Lu
David James Miller
John Richard Wassum
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
LEXIX-NEXIS A DIVISION OF REED ELSEVIER Inc
Original Assignee
LexisNexis Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by LexisNexis Inc filed Critical LexisNexis Inc
Publication of CA2244127A1 publication Critical patent/CA2244127A1/en
Application granted granted Critical
Publication of CA2244127C publication Critical patent/CA2244127C/en
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/912Applications of a database
    • Y10S707/917Text
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/964Database arrangement
    • Y10S707/966Distributed
    • Y10S707/967Peer-to-peer
    • Y10S707/968Partitioning
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99934Query formulation, input preparation, or translation
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99931Database or file accessing
    • Y10S707/99933Query processing, i.e. searching
    • Y10S707/99935Query augmenting and refining, e.g. inexact access

Abstract

A phrase recognition method breaks streams of text into text "chunks" and selects certain chunks as "phrases" useful for automated full text searching.
The phrase recognition method uses a carefully assembled list of partition elements (10) to partition the text into chunks, and selects phrases from the chunks according to a small number of frequency based definitions (20). The method can also incorporate additional processes such as categorization of proper names (30) to enhance phrase recognition. The method selects phrases quickly and efficiently, referring simply to the phrases themselves and the frequency with which they are encountered, rather than relying on complex, time-consuming, resource-consuming grammatical analysis, or on collocation schemes of limited applicability, or on heuristical text analysis of limited reliability or utility.
CA002244127A 1996-01-22 1997-01-21 Phrase recognition method and apparatus Expired - Lifetime CA2244127C (en)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US08/589,468 1996-01-22
US08/589,468 US5819260A (en) 1996-01-22 1996-01-22 Phrase recognition method and apparatus
PCT/US1997/000212 WO1997027552A1 (en) 1996-01-22 1997-01-21 Phrase recognition method and apparatus

Publications (2)

Publication Number Publication Date
CA2244127A1 true CA2244127A1 (en) 1997-07-31
CA2244127C CA2244127C (en) 2002-04-09

Family

ID=24358144

Family Applications (1)

Application Number Title Priority Date Filing Date
CA002244127A Expired - Lifetime CA2244127C (en) 1996-01-22 1997-01-21 Phrase recognition method and apparatus

Country Status (5)

Country Link
US (1) US5819260A (en)
EP (1) EP0883846A4 (en)
AU (1) AU1528897A (en)
CA (1) CA2244127C (en)
WO (1) WO1997027552A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11144994B1 (en) 2014-08-18 2021-10-12 Street Diligence, Inc. Computer-implemented apparatus and method for providing information concerning a financial instrument

Families Citing this family (129)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0841624A1 (en) * 1996-11-08 1998-05-13 Softmark Limited Input and output communication in a data processing system
US6498921B1 (en) 1999-09-01 2002-12-24 Chi Fai Ho Method and system to answer a natural-language question
US5836771A (en) 1996-12-02 1998-11-17 Ho; Chi Fai Learning method and system based on questioning
US5893102A (en) * 1996-12-06 1999-04-06 Unisys Corporation Textual database management, storage and retrieval system utilizing word-oriented, dictionary-based data compression/decompression
US6122632A (en) * 1997-07-21 2000-09-19 Convergys Customer Management Group Inc. Electronic message management system
US6353827B1 (en) * 1997-09-04 2002-03-05 British Telecommunications Public Limited Company Methods and/or systems for selecting data sets
US7529756B1 (en) 1998-07-21 2009-05-05 West Services, Inc. System and method for processing formatted text documents in a database
US7778954B2 (en) 1998-07-21 2010-08-17 West Publishing Corporation Systems, methods, and software for presenting legal case histories
US6549897B1 (en) * 1998-10-09 2003-04-15 Microsoft Corporation Method and system for calculating phrase-document importance
US6411950B1 (en) * 1998-11-30 2002-06-25 Compaq Information Technologies Group, Lp Dynamic query expansion
US6981217B1 (en) * 1998-12-08 2005-12-27 Inceptor, Inc. System and method of obfuscating data
US8327265B1 (en) * 1999-04-09 2012-12-04 Lucimedia Networks, Inc. System and method for parsing a document
US6424982B1 (en) * 1999-04-09 2002-07-23 Semio Corporation System and method for parsing a document using one or more break characters
US6665681B1 (en) * 1999-04-09 2003-12-16 Entrieva, Inc. System and method for generating a taxonomy from a plurality of documents
US20020032564A1 (en) * 2000-04-19 2002-03-14 Farzad Ehsani Phrase-based dialogue modeling with particular application to creating a recognition grammar for a voice-controlled user interface
JP3791877B2 (en) * 1999-06-15 2006-06-28 富士通株式会社 An apparatus for searching information using the reason for referring to a document
US6438543B1 (en) * 1999-06-17 2002-08-20 International Business Machines Corporation System and method for cross-document coreference
US6785869B1 (en) 1999-06-17 2004-08-31 International Business Machines Corporation Method and apparatus for providing a central dictionary and glossary server
US6829348B1 (en) 1999-07-30 2004-12-07 Convergys Cmg Utah, Inc. System for customer contact information management and methods for using same
US6519586B2 (en) * 1999-08-06 2003-02-11 Compaq Computer Corporation Method and apparatus for automatic construction of faceted terminological feedback for document retrieval
US6772149B1 (en) * 1999-09-23 2004-08-03 Lexis-Nexis Group System and method for identifying facts and legal discussion in court case law documents
US7725307B2 (en) * 1999-11-12 2010-05-25 Phoenix Solutions, Inc. Query engine for processing voice based queries including semantic decoding
US6651058B1 (en) 1999-11-15 2003-11-18 International Business Machines Corporation System and method of automatic discovery of terms in a document that are relevant to a given target topic
US6519602B2 (en) 1999-11-15 2003-02-11 International Business Machine Corporation System and method for the automatic construction of generalization-specialization hierarchy of terms from a database of terms and associated meanings
US6505197B1 (en) * 1999-11-15 2003-01-07 International Business Machines Corporation System and method for automatically and iteratively mining related terms in a document through relations and patterns of occurrences
US6651059B1 (en) 1999-11-15 2003-11-18 International Business Machines Corporation System and method for the automatic recognition of relevant terms by mining link annotations
US6539376B1 (en) 1999-11-15 2003-03-25 International Business Machines Corporation System and method for the automatic mining of new relationships
US6385629B1 (en) * 1999-11-15 2002-05-07 International Business Machine Corporation System and method for the automatic mining of acronym-expansion pairs patterns and formation rules
US7475334B1 (en) * 2000-01-19 2009-01-06 Alcatel-Lucent Usa Inc. Method and system for abstracting electronic documents
US6571240B1 (en) * 2000-02-02 2003-05-27 Chi Fai Ho Information processing for searching categorizing information in a document based on a categorization hierarchy and extracted phrases
AU2001249240A1 (en) * 2000-03-17 2001-10-03 Jeffrey Williams Dahms Method and system for accessing medical information
US7007066B1 (en) * 2000-05-04 2006-02-28 Bellsouth Intellectual Property Corp. Method and apparatus for configuring electronic mail according to a user-selected type
US7752275B2 (en) * 2000-05-04 2010-07-06 At&T Intellectual Property I, L.P. Method and apparatus for configuring electronic mail for delivery of electronic services
US7016937B1 (en) * 2000-05-04 2006-03-21 Bellsouth Intellectual Property Corporation Method and apparatus for generating reminders to transmit electronic mail attachments by parsing e-mail message text
US7219136B1 (en) * 2000-06-12 2007-05-15 Cisco Technology, Inc. Apparatus and methods for providing network-based information suitable for audio output
US20020049792A1 (en) * 2000-09-01 2002-04-25 David Wilcox Conceptual content delivery system, method and computer program product
DE10057634C2 (en) * 2000-11-21 2003-01-30 Bosch Gmbh Robert Process for processing text in a computer unit and computer unit
US6697793B2 (en) * 2001-03-02 2004-02-24 The United States Of America As Represented By The Administrator Of The National Aeronautics And Space Administration System, method and apparatus for generating phrases from a database
US20020152064A1 (en) * 2001-04-12 2002-10-17 International Business Machines Corporation Method, apparatus, and program for annotating documents to expand terms in a talking browser
US20060253784A1 (en) * 2001-05-03 2006-11-09 Bower James M Multi-tiered safety control system and methods for online communities
US7536413B1 (en) 2001-05-07 2009-05-19 Ixreveal, Inc. Concept-based categorization of unstructured objects
USRE46973E1 (en) 2001-05-07 2018-07-31 Ureveal, Inc. Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
US6970881B1 (en) 2001-05-07 2005-11-29 Intelligenxia, Inc. Concept-based method and system for dynamically analyzing unstructured information
US7194483B1 (en) 2001-05-07 2007-03-20 Intelligenxia, Inc. Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
US7627588B1 (en) 2001-05-07 2009-12-01 Ixreveal, Inc. System and method for concept based analysis of unstructured data
US7475006B2 (en) * 2001-07-11 2009-01-06 Microsoft Corporation, Inc. Method and apparatus for parsing text using mutual information
US6966030B2 (en) * 2001-07-18 2005-11-15 International Business Machines Corporation Method, system and computer program product for implementing acronym assistance
AU2002339615A1 (en) * 2001-08-27 2003-03-10 E-Base Ltd. Methodology for constructing and optimizing a self-populating directory
US6978274B1 (en) * 2001-08-31 2005-12-20 Attenex Corporation System and method for dynamically evaluating latent concepts in unstructured documents
US20030105622A1 (en) * 2001-12-03 2003-06-05 Netbytel, Inc. Retrieval of records using phrase chunking
US7356527B2 (en) * 2001-12-19 2008-04-08 International Business Machines Corporation Lossy index compression
US6941293B1 (en) 2002-02-01 2005-09-06 Google, Inc. Methods and apparatus for determining equivalent descriptions for an information need
US8589413B1 (en) 2002-03-01 2013-11-19 Ixreveal, Inc. Concept-based method and system for dynamically analyzing results from search engines
US20040205660A1 (en) * 2002-04-23 2004-10-14 Joe Acton System and method for generating and displaying attribute-enhanced documents
US7003511B1 (en) * 2002-08-02 2006-02-21 Infotame Corporation Mining and characterization of data
US7236923B1 (en) 2002-08-07 2007-06-26 Itt Manufacturing Enterprises, Inc. Acronym extraction system and method of identifying acronyms and extracting corresponding expansions from text
CA2496567A1 (en) * 2002-09-16 2004-03-25 The Trustees Of Columbia University In The City Of New York System and method for document collection, grouping and summarization
US7151864B2 (en) * 2002-09-18 2006-12-19 Hewlett-Packard Development Company, L.P. Information research initiated from a scanned image media
US7346511B2 (en) * 2002-12-13 2008-03-18 Xerox Corporation Method and apparatus for recognizing multiword expressions
US7552051B2 (en) * 2002-12-13 2009-06-23 Xerox Corporation Method and apparatus for mapping multiword expressions to identifiers using finite-state networks
US7412392B1 (en) * 2003-04-14 2008-08-12 Sprint Communications Company L.P. Conference multi-tasking system and method
US20050033750A1 (en) * 2003-08-06 2005-02-10 Sbc Knowledge Ventures, L.P. Rhetorical content management system and methods
US7296027B2 (en) * 2003-08-06 2007-11-13 Sbc Knowledge Ventures, L.P. Rhetorical content management with tone and audience profiles
US20050131935A1 (en) * 2003-11-18 2005-06-16 O'leary Paul J. Sector content mining system using a modular knowledge base
US7191175B2 (en) 2004-02-13 2007-03-13 Attenex Corporation System and method for arranging concept clusters in thematic neighborhood relationships in a two-dimensional visual display space
US20050203924A1 (en) * 2004-03-13 2005-09-15 Rosenberg Gerald B. System and methods for analytic research and literate reporting of authoritative document collections
US20050234724A1 (en) * 2004-04-15 2005-10-20 Andrew Aaron System and method for improving text-to-speech software intelligibility through the detection of uncommon words and phrases
WO2005119356A2 (en) 2004-05-28 2005-12-15 Erik Jan Banning Interactive direct-pointing system and calibration method
US20060025091A1 (en) * 2004-08-02 2006-02-02 Matsushita Electric Industrial Co., Ltd Method for creating and using phrase history for accelerating instant messaging input on mobile devices
CA2577376C (en) 2004-08-23 2012-02-21 Lexisnexis, A Division Of Reed Elsevier Inc. Point of law search system and method
CA2577450C (en) 2004-08-23 2014-10-07 Lexisnexis, A Division Of Reed Elsevier Inc. Landmark case identification system and method
JP4189369B2 (en) * 2004-09-24 2008-12-03 株式会社東芝 Structured document search apparatus and structured document search method
US7783633B2 (en) * 2004-11-19 2010-08-24 International Business Machines Corporation Display of results of cross language search
US7689557B2 (en) * 2005-06-07 2010-03-30 Madan Pandit System and method of textual information analytics
US9285897B2 (en) 2005-07-13 2016-03-15 Ultimate Pointer, L.L.C. Easily deployable interactive direct-pointing system and calibration method therefor
US7747937B2 (en) * 2005-08-16 2010-06-29 Rojer Alan S Web bookmark manager
US20070067320A1 (en) * 2005-09-20 2007-03-22 International Business Machines Corporation Detecting relationships in unstructured text
US8874477B2 (en) 2005-10-04 2014-10-28 Steven Mark Hoffberg Multifactorial optimization system and method
JP5368100B2 (en) 2005-10-11 2013-12-18 アイエックスリビール インコーポレイテッド System, method, and computer program product for concept-based search and analysis
US7814102B2 (en) * 2005-12-07 2010-10-12 Lexisnexis, A Division Of Reed Elsevier Inc. Method and system for linking documents with multiple topics to related documents
US7676485B2 (en) 2006-01-20 2010-03-09 Ixreveal, Inc. Method and computer program product for converting ontologies into concept semantic networks
US20070226321A1 (en) * 2006-03-23 2007-09-27 R R Donnelley & Sons Company Image based document access and related systems, methods, and devices
US7735010B2 (en) * 2006-04-05 2010-06-08 Lexisnexis, A Division Of Reed Elsevier Inc. Citation network viewer and method
US20070271136A1 (en) * 2006-05-19 2007-11-22 Dw Data Inc. Method for pricing advertising on the internet
US20080065666A1 (en) * 2006-09-08 2008-03-13 Battelle Memorial Institute, A Part Interest Apparatuses, data structures, and methods for dynamic information analysis
CA2675216A1 (en) * 2007-01-10 2008-07-17 Nick Koudas Method and system for information discovery and text analysis
WO2008120030A1 (en) * 2007-04-02 2008-10-09 Sobha Renaissance Information Latent metonymical analysis and indexing [lmai]
US20080288488A1 (en) * 2007-05-15 2008-11-20 Iprm Intellectual Property Rights Management Ag C/O Dr. Hans Durrer Method and system for determining trend potentials
US9251137B2 (en) * 2007-06-21 2016-02-02 International Business Machines Corporation Method of text type-ahead
AU2008311937B2 (en) 2007-10-15 2014-06-12 Lexisnexis Group System and method for searching for documents
US20090113002A1 (en) * 2007-10-30 2009-04-30 At&T Bls Intellectual Property, Inc. Electronic Message Attachment Options
US10013536B2 (en) * 2007-11-06 2018-07-03 The Mathworks, Inc. License activation and management
US8812508B2 (en) * 2007-12-14 2014-08-19 Hewlett-Packard Development Company, L.P. Systems and methods for extracting phases from text
US7895205B2 (en) * 2008-03-04 2011-02-22 Microsoft Corporation Using core words to extract key phrases from documents
US8219397B2 (en) * 2008-06-10 2012-07-10 Nuance Communications, Inc. Data processing system for autonomously building speech identification and tagging data
US8984398B2 (en) * 2008-08-28 2015-03-17 Yahoo! Inc. Generation of search result abstracts
US10346879B2 (en) * 2008-11-18 2019-07-09 Sizmek Technologies, Inc. Method and system for identifying web documents for advertisements
CN101859309A (en) * 2009-04-07 2010-10-13 慧科讯业有限公司 System and method for identifying repeated text
US9245243B2 (en) 2009-04-14 2016-01-26 Ureveal, Inc. Concept-based analysis of structured and unstructured data using concept inheritance
US8635223B2 (en) 2009-07-28 2014-01-21 Fti Consulting, Inc. System and method for providing a classification suggestion for electronically stored information
EP2471009A1 (en) 2009-08-24 2012-07-04 FTI Technology LLC Generating a reference set for use during document review
US9652802B1 (en) 2010-03-24 2017-05-16 Consumerinfo.Com, Inc. Indirect monitoring and reporting of a user's credit data
US9582575B2 (en) 2010-07-09 2017-02-28 Lexisnexis, A Division Of Reed Elsevier Inc. Systems and methods for linking items to a matter
US9262390B2 (en) 2010-09-02 2016-02-16 Lexis Nexis, A Division Of Reed Elsevier Inc. Methods and systems for annotating electronic documents
CA2827478C (en) 2011-02-18 2020-07-28 Csidentity Corporation System and methods for identifying compromised personally identifiable information on the internet
US9760566B2 (en) 2011-03-31 2017-09-12 Microsoft Technology Licensing, Llc Augmented conversational understanding agent to identify conversation context between two humans and taking an agent action thereof
US9842168B2 (en) 2011-03-31 2017-12-12 Microsoft Technology Licensing, Llc Task driven user intents
US9858343B2 (en) 2011-03-31 2018-01-02 Microsoft Technology Licensing Llc Personalization of queries, conversations, and searches
US9244984B2 (en) 2011-03-31 2016-01-26 Microsoft Technology Licensing, Llc Location based conversational understanding
US10642934B2 (en) 2011-03-31 2020-05-05 Microsoft Technology Licensing, Llc Augmented conversational understanding architecture
US9064006B2 (en) 2012-08-23 2015-06-23 Microsoft Technology Licensing, Llc Translating natural language utterances to keyword search queries
US11030562B1 (en) 2011-10-31 2021-06-08 Consumerinfo.Com, Inc. Pre-data breach monitoring
US20140012859A1 (en) * 2012-07-03 2014-01-09 AGOGO Amalgamated, Inc. Personalized dynamic content delivery system
US9201969B2 (en) 2013-01-31 2015-12-01 Lexisnexis, A Division Of Reed Elsevier Inc. Systems and methods for identifying documents based on citation history
US8812387B1 (en) 2013-03-14 2014-08-19 Csidentity Corporation System and method for identifying related credit inquiries
US20150074202A1 (en) * 2013-09-10 2015-03-12 Lenovo (Singapore) Pte. Ltd. Processing action items from messages
US9886422B2 (en) * 2014-08-06 2018-02-06 International Business Machines Corporation Dynamic highlighting of repetitions in electronic documents
US10474702B1 (en) 2014-08-18 2019-11-12 Street Diligence, Inc. Computer-implemented apparatus and method for providing information concerning a financial instrument
US10339527B1 (en) 2014-10-31 2019-07-02 Experian Information Solutions, Inc. System and architecture for electronic fraud detection
US10331782B2 (en) 2014-11-19 2019-06-25 Lexisnexis, A Division Of Reed Elsevier Inc. Systems and methods for automatic identification of potential material facts in documents
US11151468B1 (en) 2015-07-02 2021-10-19 Experian Information Solutions, Inc. Behavior analysis using distributed representations of event data
AU2017274558B2 (en) 2016-06-02 2021-11-11 Nuix North America Inc. Analyzing clusters of coded documents
US10275444B2 (en) 2016-07-15 2019-04-30 At&T Intellectual Property I, L.P. Data analytics system and methods for text data
US10699028B1 (en) 2017-09-28 2020-06-30 Csidentity Corporation Identity security architecture systems and methods
US10896472B1 (en) 2017-11-14 2021-01-19 Csidentity Corporation Security and identity verification system and architecture
CN108664468A (en) * 2018-05-02 2018-10-16 武汉烽火普天信息技术有限公司 A kind of name recognition methods and device based on dictionary and semantic disambiguation
US11544300B2 (en) * 2018-10-23 2023-01-03 EMC IP Holding Company LLC Reducing storage required for an indexing structure through index merging
US11687710B2 (en) * 2020-04-03 2023-06-27 Braincat, Inc. Systems and methods for cloud-based productivity tools
US11687733B2 (en) * 2020-06-25 2023-06-27 Sap Se Contrastive self-supervised machine learning for commonsense reasoning

Family Cites Families (23)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5225981A (en) * 1986-10-03 1993-07-06 Ricoh Company, Ltd. Language analyzer for morphemically and syntactically analyzing natural languages by using block analysis and composite morphemes
US5123103A (en) * 1986-10-17 1992-06-16 Hitachi, Ltd. Method and system of retrieving program specification and linking the specification by concept to retrieval request for reusing program parts
US4868750A (en) * 1987-10-07 1989-09-19 Houghton Mifflin Company Collocational grammar system
US4864502A (en) * 1987-10-07 1989-09-05 Houghton Mifflin Company Sentence analyzer
US4931936A (en) * 1987-10-26 1990-06-05 Sharp Kabushiki Kaisha Language translation system with means to distinguish between phrases and sentence and number discrminating means
US5146405A (en) * 1988-02-05 1992-09-08 At&T Bell Laboratories Methods for part-of-speech determination and usage
US4994966A (en) * 1988-03-31 1991-02-19 Emerson & Stern Associates, Inc. System and method for natural language parsing by initiating processing prior to entry of complete sentences
US4914590A (en) * 1988-05-18 1990-04-03 Emhart Industries, Inc. Natural language understanding system
JPH0743717B2 (en) * 1989-02-06 1995-05-15 株式会社テレマティーク国際研究所 Abstract sentence generator
JPH077419B2 (en) * 1989-06-30 1995-01-30 シャープ株式会社 Abbreviated proper noun processing method in machine translation device
JPH03105566A (en) * 1989-09-20 1991-05-02 Hitachi Ltd Summary preparing system
JPH03122770A (en) * 1989-10-05 1991-05-24 Ricoh Co Ltd Method for retrieving keyword associative document
US5289375A (en) * 1990-01-22 1994-02-22 Sharp Kabushiki Kaisha Translation machine
US5481742A (en) * 1990-05-04 1996-01-02 Reed Elsevier Inc. Printer control apparatus for remotely modifying local printer by configuration signals from remote host to produce customized printing control codes
JPH04235672A (en) * 1991-01-10 1992-08-24 Sharp Corp Translation machine
US5251316A (en) * 1991-06-28 1993-10-05 Digital Equipment Corporation Method and apparatus for integrating a dynamic lexicon into a full-text information retrieval system
US5265065A (en) * 1991-10-08 1993-11-23 West Publishing Company Method and apparatus for information retrieval from a database by replacing domain specific stemmed phases in a natural language to create a search query
US5488725A (en) * 1991-10-08 1996-01-30 West Publishing Company System of document representation retrieval by successive iterated probability sampling
US5287278A (en) * 1992-01-27 1994-02-15 General Electric Company Method for extracting company names from text
JP3270783B2 (en) * 1992-09-29 2002-04-02 ゼロックス・コーポレーション Multiple document search methods
US5410475A (en) * 1993-04-19 1995-04-25 Mead Data Central, Inc. Short case name generating method and apparatus
JPH07182370A (en) * 1993-12-22 1995-07-21 Canon Inc Text retrieval device
US5745602A (en) * 1995-05-01 1998-04-28 Xerox Corporation Automatic method of selecting multi-word key phrases from a document

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11144994B1 (en) 2014-08-18 2021-10-12 Street Diligence, Inc. Computer-implemented apparatus and method for providing information concerning a financial instrument

Also Published As

Publication number Publication date
CA2244127C (en) 2002-04-09
WO1997027552A1 (en) 1997-07-31
US5819260A (en) 1998-10-06
AU1528897A (en) 1997-08-20
EP0883846A1 (en) 1998-12-16
EP0883846A4 (en) 1999-04-14

Similar Documents

Publication Publication Date Title
CA2244127A1 (en) Phrase recognition method and apparatus
WO1999066378A3 (en) Method and apparatus for knowledgebase searching
WO2002050662A3 (en) Apparatus and method of video program classification based on syntax of transcript information
MX222918B (en) PROCEDURE FOR THE REACTION OF AN ORGANIC COMPOUND WITH A HYDROPEROXIDE.
WO2004025391A3 (en) System and method of searching data utilizing automatic categorization
GB2330655A (en) Method and system for performing a binding assay
AU6456800A (en) Music database searching
WO2003012679A1 (en) Data processing method, data processing system, and program
WO2004021147A3 (en) Decision analysis system and method
ES2145580T3 (en) METHODS AND COMPOSITIONS FOR ANALYZING NUCLEIC ACID MOLECULES USING TECHNIQUES BASED ON SIZE.
CA2179979A1 (en) Method and apparatus for multiuser-interference reduction
WO1997007469A3 (en) Text abstraction method and apparatus
CA2302264A1 (en) Methods and/or systems for selecting data sets
EP1246430A3 (en) A communication terminal having a predictive text editor application
SE0004043D0 (en) Method and apparatus for document indexing and searching
ATE382963T1 (en) FUEL CELL SYSTEM AND FUEL CELL FOR SUCH SYSTEM
WO2001042981A3 (en) Natural english language search and retrieval system and method
EP1083746A3 (en) Signal processing apparatus , method and system comprising a printer
EP1830313A4 (en) Unoccupied seat route search system, unoccupied seat route search device, and terminal
MX9806168A (en) Database access.
WO2000055842A3 (en) Speech synthesis
WO2000018142A3 (en) Apparatus and method of searching for pn sequence phase in multi-carrier cdma mobile communication system
EP0813160A3 (en) Apparatus for and method of accessing a database
EP1016985A3 (en) Method and system for topic based cross indexing of text and audio
EP1492036A3 (en) Database search system

Legal Events

Date Code Title Description
EEER Examination request
MKEX Expiry

Effective date: 20170123