WO2006001906A3 - Graph-based ranking algorithms for text processing - Google Patents

Graph-based ranking algorithms for text processing Download PDF

Info

Publication number
WO2006001906A3
WO2006001906A3 PCT/US2005/015630 US2005015630W WO2006001906A3 WO 2006001906 A3 WO2006001906 A3 WO 2006001906A3 US 2005015630 W US2005015630 W US 2005015630W WO 2006001906 A3 WO2006001906 A3 WO 2006001906A3
Authority
WO
WIPO (PCT)
Prior art keywords
graph
text
natural language
determining
text processing
Prior art date
Application number
PCT/US2005/015630
Other languages
French (fr)
Other versions
WO2006001906A2 (en
Inventor
Rada Mihalcea
Paul Tarau
Original Assignee
Univ North Texas
Rada Mihalcea
Paul Tarau
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Univ North Texas, Rada Mihalcea, Paul Tarau filed Critical Univ North Texas
Publication of WO2006001906A2 publication Critical patent/WO2006001906A2/en
Publication of WO2006001906A3 publication Critical patent/WO2006001906A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/30Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
    • G06F16/33Querying
    • G06F16/3331Query processing
    • G06F16/334Query execution
    • G06F16/3344Query execution using natural language analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/30Semantic analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/20Natural language analysis
    • G06F40/279Recognition of textual entities
    • G06F40/284Lexical analysis, e.g. tokenisation or collocates

Abstract

The present invention provides a method of processing at least one natural language text using a graph. The method includes determining a plurality of text units based upon the natural language text, associating the plurality of text units with a plurality of graph nodes, and determining at least one connecting relation between at least two of the plurality of text units. The method also includes associating the at least one connecting relation with at least one graph edge connecting at least two of the plurality of graph nodes and determining a plurality of rankings associated with the plurality of graph nodes based upon the at least one graph edge. The method can also include a graphical visualization of at least one important text unit in a natural language text or collection of texts. Methods for word sense disambiguation, keyword extraction, and sentence extraction are also provided.
PCT/US2005/015630 2004-06-14 2005-05-05 Graph-based ranking algorithms for text processing WO2006001906A2 (en)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
US57937204P 2004-06-14 2004-06-14
US60/579,372 2004-06-14
US11/075,625 2005-03-09
US11/075,625 US7809548B2 (en) 2004-06-14 2005-03-09 Graph-based ranking algorithms for text processing

Publications (2)

Publication Number Publication Date
WO2006001906A2 WO2006001906A2 (en) 2006-01-05
WO2006001906A3 true WO2006001906A3 (en) 2006-09-08

Family

ID=35427495

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/015630 WO2006001906A2 (en) 2004-06-14 2005-05-05 Graph-based ranking algorithms for text processing

Country Status (2)

Country Link
US (1) US7809548B2 (en)
WO (1) WO2006001906A2 (en)

Families Citing this family (156)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040243531A1 (en) 2003-04-28 2004-12-02 Dean Michael Anthony Methods and systems for representing, using and displaying time-varying information on the Semantic Web
US7698267B2 (en) * 2004-08-27 2010-04-13 The Regents Of The University Of California Searching digital information and databases
US7328136B2 (en) * 2004-09-15 2008-02-05 Council Of Scientific & Industrial Research Computer based method for finding the effect of an element in a domain of N-dimensional function with a provision for N+1 dimensions
US7777125B2 (en) * 2004-11-19 2010-08-17 Microsoft Corporation Constructing a table of music similarity vectors from a music similarity graph
EP1846815A2 (en) * 2005-01-31 2007-10-24 Textdigger, Inc. Method and system for semantic search and retrieval of electronic documents
US20060200464A1 (en) * 2005-03-03 2006-09-07 Microsoft Corporation Method and system for generating a document summary
JP2008537225A (en) 2005-04-11 2008-09-11 テキストディガー,インコーポレイテッド Search system and method for queries
WO2006113970A1 (en) * 2005-04-27 2006-11-02 The University Of Queensland Automatic concept clustering
EP1746521A1 (en) * 2005-07-22 2007-01-24 France Telecom Method of sorting a set of electronic documents of a type which may contain hypertext links to other electronic documents
US8024653B2 (en) * 2005-11-14 2011-09-20 Make Sence, Inc. Techniques for creating computer generated notes
US8429184B2 (en) 2005-12-05 2013-04-23 Collarity Inc. Generation of refinement terms for search queries
US8903810B2 (en) * 2005-12-05 2014-12-02 Collarity, Inc. Techniques for ranking search results
DE112006003614T5 (en) * 2005-12-29 2008-12-18 Leibniz-Institut für Pflanzengenetik Und Kulturpflanzenforschung (IPK) Device and method for evaluating a network-related relevance of a network element, program element and machine-readable data carrier connected to one or more further network elements in a network
US8694530B2 (en) 2006-01-03 2014-04-08 Textdigger, Inc. Search system with query refinement and search method
WO2007114932A2 (en) 2006-04-04 2007-10-11 Textdigger, Inc. Search system and method with text function tagging
US8583634B2 (en) * 2006-12-05 2013-11-12 Avaya Inc. System and method for determining social rank, relevance and attention
US7769762B2 (en) * 2006-12-19 2010-08-03 Sap Ag Method and system for consolidating data type repositories
US7630981B2 (en) * 2006-12-26 2009-12-08 Robert Bosch Gmbh Method and system for learning ontological relations from documents
US8930178B2 (en) 2007-01-04 2015-01-06 Children's Hospital Medical Center Processing text with domain-specific spreading activation methods
US8131536B2 (en) 2007-01-12 2012-03-06 Raytheon Bbn Technologies Corp. Extraction-empowered machine translation
US8112402B2 (en) 2007-02-26 2012-02-07 Microsoft Corporation Automatic disambiguation based on a reference resource
US20080215571A1 (en) * 2007-03-01 2008-09-04 Microsoft Corporation Product review search
US7702620B2 (en) * 2007-03-29 2010-04-20 International Business Machines Corporation System and method for ranked keyword search on graphs
US8190422B2 (en) * 2007-05-20 2012-05-29 George Mason Intellectual Properties, Inc. Semantic cognitive map
US7890539B2 (en) * 2007-10-10 2011-02-15 Raytheon Bbn Technologies Corp. Semantic matching using predicate-argument structure
US8136034B2 (en) * 2007-12-18 2012-03-13 Aaron Stanton System and method for analyzing and categorizing text
US8756527B2 (en) * 2008-01-18 2014-06-17 Rpx Corporation Method, apparatus and computer program product for providing a word input mechanism
US8290975B2 (en) * 2008-03-12 2012-10-16 Microsoft Corporation Graph-based keyword expansion
US20090313243A1 (en) * 2008-06-13 2009-12-17 Siemens Aktiengesellschaft Method and apparatus for processing semantic data resources
US20110087670A1 (en) * 2008-08-05 2011-04-14 Gregory Jorstad Systems and methods for concept mapping
US8112269B2 (en) * 2008-08-25 2012-02-07 Microsoft Corporation Determining utility of a question
WO2010040125A1 (en) 2008-10-03 2010-04-08 Beliefnetworks, Inc. Systems and methods for automatic creation of agent-based systems
US8346534B2 (en) * 2008-11-06 2013-01-01 University of North Texas System Method, system and apparatus for automatic keyword extraction
US8463808B2 (en) * 2008-11-07 2013-06-11 Raytheon Company Expanding concept types in conceptual graphs
KR101045955B1 (en) * 2008-11-14 2011-07-04 한국과학기술정보연구원 Method for extracting semantic correlation of context, and recording device storing device and program source thereof
US9158838B2 (en) * 2008-12-15 2015-10-13 Raytheon Company Determining query return referents for concept types in conceptual graphs
US8577924B2 (en) * 2008-12-15 2013-11-05 Raytheon Company Determining base attributes for terms
US9087293B2 (en) * 2008-12-23 2015-07-21 Raytheon Company Categorizing concept types of a conceptual graph
US8095546B1 (en) * 2009-01-09 2012-01-10 Google Inc. Book content item search
US8316032B1 (en) 2009-01-09 2012-11-20 Google Inc. Book content item search
US8768960B2 (en) * 2009-01-20 2014-07-01 Microsoft Corporation Enhancing keyword advertising using online encyclopedia semantics
US20100185943A1 (en) * 2009-01-21 2010-07-22 Nec Laboratories America, Inc. Comparative document summarization with discriminative sentence selection
US20100208984A1 (en) * 2009-02-13 2010-08-19 Microsoft Corporation Evaluating related phrases
US20100306166A1 (en) * 2009-06-01 2010-12-02 Yahoo! Inc. Automatic fact validation
US20110029926A1 (en) * 2009-07-30 2011-02-03 Hao Ming C Generating a visualization of reviews according to distance associations between attributes and opinion words in the reviews
US20110041075A1 (en) * 2009-08-12 2011-02-17 Google Inc. Separating reputation of users in different roles
US20110123967A1 (en) * 2009-11-24 2011-05-26 Xerox Corporation Dialog system for comprehension evaluation
KR101306667B1 (en) * 2009-12-09 2013-09-10 한국전자통신연구원 Apparatus and method for knowledge graph stabilization
US8875038B2 (en) * 2010-01-19 2014-10-28 Collarity, Inc. Anchoring for content synchronization
WO2011137386A1 (en) * 2010-04-30 2011-11-03 Orbis Technologies, Inc. Systems and methods for semantic search, content correlation and visualization
US9600566B2 (en) 2010-05-14 2017-03-21 Microsoft Technology Licensing, Llc Identifying entity synonyms
US8375061B2 (en) * 2010-06-08 2013-02-12 International Business Machines Corporation Graphical models for representing text documents for computer analysis
US20130197900A1 (en) * 2010-06-29 2013-08-01 Springsense Pty Ltd Method and System for Determining Word Senses by Latent Semantic Distance
US20120016661A1 (en) * 2010-07-19 2012-01-19 Eyal Pinkas System, method and device for intelligent textual conversation system
US8572760B2 (en) 2010-08-10 2013-10-29 Benefitfocus.Com, Inc. Systems and methods for secure agent information
US8977538B2 (en) 2010-09-13 2015-03-10 Richard Salisbury Constructing and analyzing a word graph
US8560477B1 (en) * 2010-10-08 2013-10-15 Google Inc. Graph-based semi-supervised learning of structured tagging models
US20120197993A1 (en) * 2011-01-27 2012-08-02 Linkedln Corporation Skill ranking system
KR101290439B1 (en) * 2011-04-15 2013-07-26 경북대학교 산학협력단 Method for summerizing meeting minutes based on sentence network
CN102831116A (en) * 2011-06-14 2012-12-19 国际商业机器公司 Method and system for document clustering
WO2013043159A1 (en) * 2011-09-20 2013-03-28 Hewlett-Packard Development Company, L.P. Document analysis
WO2013043160A1 (en) * 2011-09-20 2013-03-28 Hewlett-Packard Development Company, L.P. Text summarization
US9305082B2 (en) * 2011-09-30 2016-04-05 Thomson Reuters Global Resources Systems, methods, and interfaces for analyzing conceptually-related portions of text
US20130204883A1 (en) * 2012-02-02 2013-08-08 Microsoft Corporation Computation of top-k pairwise co-occurrence statistics
CN104246775B (en) * 2012-04-26 2018-04-17 日本电气株式会社 Text Mining System, text mining methods and procedures
CN103473217B (en) * 2012-06-08 2016-08-03 华为技术有限公司 The method and apparatus of extracting keywords from text
US10032131B2 (en) 2012-06-20 2018-07-24 Microsoft Technology Licensing, Llc Data services for enterprises leveraging search system data assets
US9594831B2 (en) 2012-06-22 2017-03-14 Microsoft Technology Licensing, Llc Targeted disambiguation of named entities
US20150227592A1 (en) * 2012-09-18 2015-08-13 Hewlett-Packard Development Company, L.P. Mining Questions Related To An Electronic Text Document
WO2014059491A1 (en) * 2012-10-19 2014-04-24 Patent Analytics Holding Pty Ltd A system and method for presentation and visual navigation of network data sets
CN104871151A (en) * 2012-10-26 2015-08-26 惠普发展公司,有限责任合伙企业 Method for summarizing document
US9654592B2 (en) 2012-11-08 2017-05-16 Linkedin Corporation Skills endorsements
US10810193B1 (en) 2013-03-13 2020-10-20 Google Llc Querying a data graph using natural language queries
US9224103B1 (en) 2013-03-13 2015-12-29 Google Inc. Automatic annotation for training and evaluation of semantic analysis engines
US9514191B2 (en) * 2013-03-14 2016-12-06 Microsoft Technology Licensing, Llc Visualizing ranking factors for items in a search result list
EP2973025A1 (en) * 2013-03-15 2016-01-20 Mark, Bobick Method for resource decomposition and related devices
US9286289B2 (en) * 2013-04-09 2016-03-15 Softwin Srl Romania Ordering a lexicon network for automatic disambiguation
US9727641B2 (en) * 2013-04-25 2017-08-08 Entit Software Llc Generating a summary based on readability
US10019531B2 (en) * 2013-05-19 2018-07-10 Carmel Kent System and method for displaying, connecting and analyzing data in an online collaborative webpage
US20140372102A1 (en) * 2013-06-18 2014-12-18 Xerox Corporation Combining temporal processing and textual entailment to detect temporally anchored events
US9697472B2 (en) 2013-09-20 2017-07-04 Linkedin Corporation Skills ontology creation
US11188543B2 (en) 2013-10-14 2021-11-30 International Business Machines Corporation Utilizing social information for recommending an application
US11238056B2 (en) 2013-10-28 2022-02-01 Microsoft Technology Licensing, Llc Enhancing search results with social labels
US9542440B2 (en) 2013-11-04 2017-01-10 Microsoft Technology Licensing, Llc Enterprise graph search based on object and actor relationships
US9471561B2 (en) * 2013-12-26 2016-10-18 International Business Machines Corporation Adaptive parser-centric text normalization
US9436755B1 (en) * 2014-01-26 2016-09-06 Google Inc. Determining and scoring task indications
US11645289B2 (en) 2014-02-04 2023-05-09 Microsoft Technology Licensing, Llc Ranking enterprise graph queries
US9870432B2 (en) 2014-02-24 2018-01-16 Microsoft Technology Licensing, Llc Persisted enterprise graph queries
US11657060B2 (en) 2014-02-27 2023-05-23 Microsoft Technology Licensing, Llc Utilizing interactivity signals to generate relationships and promote content
US9531793B2 (en) * 2014-02-28 2016-12-27 Microsoft Technology Licensing, Llc Displaying and navigating implicit and explicit enterprise people relationships
US10757201B2 (en) 2014-03-01 2020-08-25 Microsoft Technology Licensing, Llc Document and content feed
US10394827B2 (en) 2014-03-03 2019-08-27 Microsoft Technology Licensing, Llc Discovering enterprise content based on implicit and explicit signals
US10255563B2 (en) 2014-03-03 2019-04-09 Microsoft Technology Licensing, Llc Aggregating enterprise graph content around user-generated topics
US10169457B2 (en) 2014-03-03 2019-01-01 Microsoft Technology Licensing, Llc Displaying and posting aggregated social activity on a piece of enterprise content
AU2015201364A1 (en) * 2014-03-17 2015-10-01 Accenture Global Services Limited Generating a semantic network based on semantic connections between subject-verb-object units
US9251470B2 (en) 2014-05-30 2016-02-02 Linkedin Corporation Inferred identity
US9946808B2 (en) 2014-07-09 2018-04-17 International Business Machines Corporation Using vertex self-information scores for vertices in an entity graph to determine whether to perform entity resolution on the vertices in the entity graph
US10061826B2 (en) 2014-09-05 2018-08-28 Microsoft Technology Licensing, Llc. Distant content discovery
WO2016068955A1 (en) * 2014-10-30 2016-05-06 Hewlett Packard Enterprise Development Lp Data entries having values for features
US20160162464A1 (en) 2014-12-09 2016-06-09 Idibon, Inc. Techniques for combining human and machine learning in natural language processing
US10176228B2 (en) * 2014-12-10 2019-01-08 International Business Machines Corporation Identification and evaluation of lexical answer type conditions in a question to generate correct answers
WO2016099422A2 (en) 2014-12-17 2016-06-23 Bogazici Universitesi Content sensitive document ranking method by analyzing the citation contexts
KR101668725B1 (en) * 2015-03-18 2016-10-24 성균관대학교산학협력단 Latent keyparase generation method and apparatus
US20160299881A1 (en) * 2015-04-07 2016-10-13 Xerox Corporation Method and system for summarizing a document
WO2016188591A1 (en) * 2015-05-22 2016-12-01 Longsand Limited Semantic consolidation of data received from customers and enterprises
WO2016200359A1 (en) * 2015-06-06 2016-12-15 Hewlett-Packard Development Company, L.P Term scores
US20160364733A1 (en) * 2015-06-09 2016-12-15 International Business Machines Corporation Attitude Inference
US9436760B1 (en) * 2016-02-05 2016-09-06 Quid, Inc. Measuring accuracy of semantic graphs with exogenous datasets
WO2017156399A1 (en) * 2016-03-11 2017-09-14 Cameron Nathan R Systems, methods, and user interfaces for evaluating quality, health, safety, and environment data
CN107291723B (en) * 2016-03-30 2021-04-30 阿里巴巴集团控股有限公司 Method and device for classifying webpage texts and method and device for identifying webpage texts
RU2628436C1 (en) * 2016-04-12 2017-08-16 Общество с ограниченной ответственностью "Аби Продакшн" Classification of texts on natural language based on semantic signs
US10089761B2 (en) * 2016-04-29 2018-10-02 Hewlett Packard Enterprise Development Lp Graph processing using a shared memory
US9881082B2 (en) 2016-06-20 2018-01-30 International Business Machines Corporation System and method for automatic, unsupervised contextualized content summarization of single and multiple documents
US9886501B2 (en) 2016-06-20 2018-02-06 International Business Machines Corporation Contextual content graph for automatic, unsupervised summarization of content
US10331788B2 (en) 2016-06-22 2019-06-25 International Business Machines Corporation Latent ambiguity handling in natural language processing
US9645999B1 (en) * 2016-08-02 2017-05-09 Quid, Inc. Adjustment of document relationship graphs
US10380552B2 (en) 2016-10-31 2019-08-13 Microsoft Technology Licensing, Llc Applicant skills inference for a job
CN106503255B (en) * 2016-11-15 2020-05-12 科大讯飞股份有限公司 Method and system for automatically generating article based on description text
CN106372064B (en) * 2016-11-18 2019-04-19 北京工业大学 A kind of term weight function calculation method of text mining
US10255269B2 (en) 2016-12-30 2019-04-09 Microsoft Technology Licensing, Llc Graph long short term memory for syntactic relationship discovery
US10043511B2 (en) 2017-01-06 2018-08-07 International Business Machines Corporation Domain terminology expansion by relevancy
US10032448B1 (en) 2017-01-06 2018-07-24 International Business Machines Corporation Domain terminology expansion by sensitivity
US10326863B2 (en) 2017-01-21 2019-06-18 Adp, Llc Speed and accuracy of computers when resolving client queries by using graph database model
CN107153641B (en) * 2017-05-08 2021-01-12 北京百度网讯科技有限公司 Comment information determination method, comment information determination device, server and storage medium
US10810472B2 (en) * 2017-05-26 2020-10-20 Oracle International Corporation Techniques for sentiment analysis of data using a convolutional neural network and a co-occurrence network
CN109255118B (en) * 2017-07-11 2023-08-08 普天信息技术有限公司 Keyword extraction method and device
US11238095B1 (en) * 2017-09-19 2022-02-01 Goldman Sachs & Co. LLC Determining relatedness of data using graphs to support machine learning, natural language parsing, search engine, or other functions
EP3528144A1 (en) 2018-02-20 2019-08-21 INESC TEC - Instituto de Engenharia de Sistemas e Computadores, Tecnologia e Ciência Device and method for keyword extraction from a text stream
US10685050B2 (en) * 2018-04-23 2020-06-16 Adobe Inc. Generating a topic-based summary of textual content
KR102060486B1 (en) * 2018-07-12 2019-12-30 주식회사 아카인텔리전스 Method for generating chatbot utterance based on the semantic graph database
CN109189828A (en) * 2018-08-16 2019-01-11 国云科技股份有限公司 A method of data value is assessed between the business department based on complex network
CN109255073B (en) * 2018-08-28 2022-03-29 麒麟合盛网络技术股份有限公司 Personalized recommendation method and device and electronic equipment
KR102086248B1 (en) * 2018-09-19 2020-03-06 충북대학교 산학협력단 Method and system for detecting graph based event in social networks
US20220164678A1 (en) * 2018-09-26 2022-05-26 Entigenlogic Llc Curing a deficiency of a knowledge database
CN109739973A (en) * 2018-12-20 2019-05-10 北京奇安信科技有限公司 Text snippet generation method, device, electronic equipment and storage medium
US11507608B2 (en) * 2019-01-24 2022-11-22 Dell Products L.P. System for improving search engine ranking of a landing page using automated analysis of landing pages of third-party entities
US10387575B1 (en) * 2019-01-30 2019-08-20 Babylon Partners Limited Semantic graph traversal for recognition of inferred clauses within natural language inputs
US10936796B2 (en) * 2019-05-01 2021-03-02 International Business Machines Corporation Enhanced text summarizer
US11645479B1 (en) 2019-11-07 2023-05-09 Kino High Coursey Method for AI language self-improvement agent using language modeling and tree search techniques
US11948560B1 (en) 2019-11-07 2024-04-02 Kino High Coursey Method for AI language self-improvement agent using language modeling and tree search techniques
CN111062574B (en) * 2019-11-20 2023-04-18 南昌大学 Method for measuring similarity of manufacturing process
US11366964B2 (en) * 2019-12-04 2022-06-21 International Business Machines Corporation Visualization of the entities and relations in a document
CN111581952B (en) * 2020-05-20 2023-10-03 长沙理工大学 Large-scale replaceable word library construction method for natural language information hiding
US11461539B2 (en) 2020-07-29 2022-10-04 Docusign, Inc. Automated document highlighting in a digital management platform
CN111753498B (en) * 2020-08-10 2024-01-26 腾讯科技(深圳)有限公司 Text processing method, device, equipment and storage medium
US11263407B1 (en) * 2020-09-01 2022-03-01 Rammer Technologies, Inc. Determining topics and action items from conversations
US11093718B1 (en) * 2020-12-01 2021-08-17 Rammer Technologies, Inc. Determining conversational structure from speech
US20220237384A1 (en) * 2021-01-22 2022-07-28 Thomson Reuters Enterprise Centre Gmbh System and method for automated hashtag hierarchical ontology generation from social media data
US11669680B2 (en) 2021-02-02 2023-06-06 International Business Machines Corporation Automated graph based information extraction
US11164153B1 (en) * 2021-04-27 2021-11-02 Skyhive Technologies Inc. Generating skill data through machine learning
KR102345818B1 (en) * 2021-06-23 2021-12-31 주식회사 아티피셜 소사이어티 System and method of generating the mind map of the structure of thought with targeted part-of-speech words from text data
US11361571B1 (en) 2021-06-28 2022-06-14 International Business Machines Corporation Term extraction in highly technical domains
US11302314B1 (en) 2021-11-10 2022-04-12 Rammer Technologies, Inc. Tracking specialized concepts, topics, and activities in conversations
US20230177256A1 (en) * 2021-12-07 2023-06-08 International Business Machines Corporation Role-Based Cross Data Source Actionable Conversation Summarizer
US11599713B1 (en) 2022-07-26 2023-03-07 Rammer Technologies, Inc. Summarizing conversational speech
US20240069870A1 (en) * 2022-08-23 2024-02-29 International Business Machines Corporation Computer-based software development and product management

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002010985A2 (en) * 2000-07-28 2002-02-07 Tenara Limited Method of and system for automatic document retrieval, categorization and processing

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6285999B1 (en) * 1997-01-10 2001-09-04 The Board Of Trustees Of The Leland Stanford Junior University Method for node ranking in a linked database
US6112203A (en) * 1998-04-09 2000-08-29 Altavista Company Method for ranking documents in a hyperlinked environment using connectivity and selective content analysis
US6480843B2 (en) * 1998-11-03 2002-11-12 Nec Usa, Inc. Supporting web-query expansion efficiently using multi-granularity indexing and query processing
US7286977B1 (en) * 2000-09-05 2007-10-23 Novell, Inc. Intentional-stance characterization of a general content stream or repository
US7403890B2 (en) * 2002-05-13 2008-07-22 Roushar Joseph C Multi-dimensional method and apparatus for automated language interpretation
US7167871B2 (en) * 2002-05-17 2007-01-23 Xerox Corporation Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections
US7664735B2 (en) * 2004-04-30 2010-02-16 Microsoft Corporation Method and system for ranking documents of a search result to improve diversity and information richness
US7519613B2 (en) * 2006-02-28 2009-04-14 International Business Machines Corporation Method and system for generating threads of documents

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2002010985A2 (en) * 2000-07-28 2002-02-07 Tenara Limited Method of and system for automatic document retrieval, categorization and processing

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
BRIN S ET AL: "The anatomy of a large-scale hypertextual Web search engine", April 1998, COMPUTER NETWORKS AND ISDN SYSTEMS, NORTH HOLLAND PUBLISHING. AMSTERDAM, NL, PAGE(S) 107-117, ISSN: 0169-7552, XP004121435 *
G. RAMAKRISHNAN, P. BHATTACHARYYA: "Text Representation with WordNet Synsets using Soft Sense Disambiguation", June 2003, NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS, 8TH INTERNATIONAL CONFERENCE ON APPLICATIONS OF NATURAL LANGUAGE TO INFORMATION SYSTEMS, BURG (SPREEWALD), GERMANY, ISBN: 3-88579-358-X, XP002383976 *
K. FRAGOS, Y. MAISTROS, C. SKOURLAS: "Word Sense Disambiguation using WORDNET relations", October 2003, PROCEEDINGS OF 1ST BALKAN CONFERENCE IN INFORMATICS, THESSALONIKI GREECE, XP002383979 *
LESK M: "Automatic Sense Disambiguation Using Machine Readable Dictionaries: How to Tell a Pine Cone from an Ice Cream Cone", 1986, PROCEEDINGS OF THE ANNUAL INTERNATIONAL CONFERENCE ON SYSTEMS DOCUMENTATION, PAGE(S) 24-26, XP002224563 *
M. GALLEY, K. MCKEOWN: "Improving Word Sense Disambiguation in Lexical Chaining", August 2003, PROCEEDINGS OF THE 18TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE (IJCAI-03), ACAPULCO, MEXICO, XP002383977 *
M. SUSSNA: "Word sense disambiguation for free-text indexing using a massive semantic network", 1993, ACM PRESS, PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, PAGES 67-74, WASHINGTON, D.C., USA, XP002383978 *
RADA MIHALCEA, PAUL TARAU, ELIZABETH FIGA: "PageRank on Semantic Networks, with application to Word Sense Disambiguation", August 2004, PROCEEDINGS OF THE 20ST INTERNATIONAL CONFERENCE ON COMPUTATIONAL LINGUISTICS (COLING 2004), GENEVA, SWITZERLAND, XP002383980 *

Also Published As

Publication number Publication date
WO2006001906A2 (en) 2006-01-05
US20050278325A1 (en) 2005-12-15
US7809548B2 (en) 2010-10-05

Similar Documents

Publication Publication Date Title
WO2006001906A3 (en) Graph-based ranking algorithms for text processing
Peersman et al. Predicting age and gender in online social networks
Al-Kabi et al. A novel root based Arabic stemmer
WO2006115598A3 (en) Method and system for generating spelling suggestions
WO2008070877A3 (en) Online computer-aided translation
WO2007055821A3 (en) Defining ontologies and word disambiguation
EP2511832A3 (en) Method, system and computer program product for selecting a language for text segmentation
WO2008075161A3 (en) Method, apparatus and computer program product for providing flexible text based language identification
TW200709120A (en) Systems and methods for semantic knowledge assessment, instruction, and acquisition
WO2008157021A3 (en) Text prediction with partial selection in a variety of domains
WO2007026365A3 (en) Decision-support expert system and methods for real-time exploitation of documents in non-english languages
WO2008085857A3 (en) Processing text with domain-specific spreading activation methods
EP1577793A3 (en) Systems and methods for spell checking
WO2008095162A3 (en) Method and system for fast, generic, online and offline, multi-source text analysis and visualization
WO2008107305A3 (en) Search-based word segmentation method and device for language without word boundary tag
WO2007130544A3 (en) Method for domain identification of documents in a document database
KR20100035940A (en) System for extraction and analysis of opinion in web documents and method thereof
WO2012082886A3 (en) Sender-based ranking of person profiles and multi-person automatic suggestions
WO2009066501A1 (en) Information search method, device, and program, and computer-readable recording medium
WO2008003095A3 (en) Recognizing text in images
WO2004072757A3 (en) Text and attribute searches of data stores that include business object
WO2007048607A3 (en) Automatic, computer-based similarity calculation system for quantifying the similarity of text expressions
JP2009503739A5 (en)
WO2008093569A1 (en) Information extraction rule making support system, information extraction rule making support method, and information extraction rule making support program
WO2014210387A3 (en) Concept extraction

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

DPEN Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed from 20040101)
NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Country of ref document: DE

122 Ep: pct application non-entry in european phase

Ref document number: 05743577

Country of ref document: EP

Kind code of ref document: A2