US7693825B2 - Systems and methods for ranking implicit search results - Google Patents
Systems and methods for ranking implicit search results Download PDFInfo
- Publication number
- US7693825B2 US7693825B2 US10/813,875 US81387504A US7693825B2 US 7693825 B2 US7693825 B2 US 7693825B2 US 81387504 A US81387504 A US 81387504A US 7693825 B2 US7693825 B2 US 7693825B2
- Authority
- US
- United States
- Prior art keywords
- article
- user
- ranking
- keyword
- content
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/951—Indexing; Web crawling techniques
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
- G06F16/3332—Query translation
- G06F16/3334—Selection or weighting of terms from queries, including natural language queries
Definitions
- the present invention relates generally to methods and systems for information retrieval.
- the present invention relates particularly to methods and systems for ranking implicit search results.
- Conventional search engines receive a search query from a user and execute a search against a global index. Such conventional search engines typically use one or more conventional methods for performing a search. For example, one known method, described in an article entitled “The Anatomy of a Large-Scale Hypertextual Search Engine,” by Sergey Brin and Lawrence Page, assigns a degree of importance to a document, such as a web page, based on the link structure of the web.
- the search results are often presented in a list format, comprising article identifiers and brief snippets about the documents in a web page that can be resized.
- the user has access to other information stored on the user's local machine or on other storage media accessible via a network that is relevant to the user's current contextual state. For example, if a user is working on a document regarding a particular subject, information about the subject may be stored on the user's hard drive or in a global index accessible to the user. In order to access this information, the user issues an explicit search query in an application, such as a web search page. The information is provided to the user as a result set. Thus, the user shifts focus from the document that the user is working on to perform the search.
- the user may be unaware or may not remember that information is available regarding a particular subject. In such a case, the user may not perform an explicit search and thus, will not have access to the potentially relevant information.
- Embodiments of the present invention provide systems and methods for ranking implicit search results.
- a method comprising receiving an event, the event comprising user interaction with an article on a client device, wherein the article is capable of being associated with at least one of a plurality of client applications, extracting at least one keyword from the event, generating a query based at least in part on at least that one keyword, performing a search based at least in part on the query to determine a result set, wherein the result set comprises one or more article identifiers associated with articles comprising the at least one keyword, and determining a ranking for each of the one or more article identifiers comprising the result set is described.
- Another embodiment of the present invention comprises receiving an event, the event comprising user interaction with an article on a client device, wherein the article is capable of being associated with at least one of a plurality of client applications, extracting at least one keyword from the event, generating a query based at least in part on the at least one keyword, performing a search based at least in part on the query to determine a result set, wherein the result set comprises one or more article identifiers associated with articles comprising the at least one keyword, filtering the article identifiers in the result set based on a threshold, and causing the display of the result set.
- FIG. 1 is a block diagram illustrating an exemplary environment in which one embodiment of the present invention may operate
- FIG. 2 is a flowchart illustrating a method in accordance with one embodiment of the present invention.
- Embodiments of the present invention provide systems and methods for ranking implicit search results.
- FIG. 1 is a block diagram illustrating an exemplary environment for implementation of an embodiment of the present invention. While the environment shown reflects a client-side search engine architecture embodiment, other embodiments are possible.
- the system 100 shown in FIG. 1 includes multiple client devices 102 a - n in communication with a server device 150 over a wired or wireless network 106 .
- the network 106 shown comprises the Internet. In other embodiments, other networks, such as an intranet, may be used instead. Moreover, methods according to the present invention may operate within a single client device.
- the client devices 102 a - n shown each includes a computer-readable medium 108 .
- the embodiment shown includes a random access memory (RAM) 108 coupled to a processor 110 .
- the processor 110 executes computer-executable program instructions stored in memory 108 .
- Such processors may include a microprocessor, an ASIC, a state machine, or other processor, and can be any of a number of computer processors, such as processors from Intel Corporation of Santa Clara, Calif. and Motorola Corporation of Schaumburg, Ill.
- Such processors include, or may be in communication with, media, for example computer-readable media, which stores instructions that, when executed by the processor, cause the processor to perform the steps described herein.
- Embodiments of computer-readable media include, but are not limited to, an electronic, optical, magnetic, or other storage or transmission device capable of providing a processor, such as the processor 110 of client 102 a , with computer-readable instructions.
- suitable media include, but are not limited to, a floppy disk, CD-ROM, DVD, magnetic disk, memory chip, ROM, RAM, an ASIC, a configured processor, all optical media, all magnetic tape or other magnetic media, or any other medium from which a computer processor can read instructions.
- various other forms of computer-readable media may transmit or carry instructions to a computer, including a router, private or public network, or other transmission device or channel, both wired and wireless.
- the instructions may comprise code from any suitable computer-programming language, including, for example, C, C++, C#, Visual Basic, Java, Python, Perl, and JavaScript.
- Client devices 102 a - n can be connected to a network 106 as shown, or can be stand-alone machines. Client devices 102 a - n may also include a number of external or internal devices such as a mouse, a CD-ROM, DVD, a keyboard, a display, or other input or output devices. Examples of client devices 102 a - n are personal computers, digital assistants, personal digital assistants, cellular phones, mobile phones, smart phones, pagers, digital tablets, laptop computers, Internet appliances, and other processor-based devices. In general, the client devices 102 a - n may be any type of processor-based platform that operates on any operating system, such as Microsoft® Windows® or Linux, capable of supporting one or more client application programs.
- any operating system such as Microsoft® Windows® or Linux
- the client device 102 a shown comprises a personal computer executing client application programs, also known as client applications 120 .
- the client applications 120 can be contained in memory 108 and can include, for example, a word processing application, a spreadsheet application, an e-mail application, an instant messenger application, a presentation application, an Internet browser application, a calendar/organizer application, and any other application capable of being executed by a client device.
- the user 112 a can interact with the various client applications 120 and articles associated with the client applications 120 via various input and output devices of the client device 102 a .
- Articles include, for example, word processor, spreadsheet, presentation, e-mail, instant messenger, database, and other client application program content files or groups of files, web pages of various formats, such as HTML, XML, XHTML, Portable Document Format (PDF) files, and audio files, video files, or any other documents or groups of documents or information of any type whatsoever.
- PDF Portable Document Format
- the memory 108 of the client device 102 a shown also contains a capture processor 124 , a queue 126 , and a search engine 122 .
- the client device 102 a shown also contains or is in communication with a data store 140 .
- the search engine 122 can receive an explicit query from the user 112 a or generate an implicit query and retrieve information from the data store 140 in response to the query.
- the search engine 122 shown contains an indexer 130 , a query system 132 , and a formatter 134 . Events, real-time and historical, contextual and indexable, and performance data can be sent by the queue 126 to the query system 132 to provide the query system 132 with information concerning current user context. The query system 132 can use this information to generate an implicit query. The query system 132 can also receive and process explicit queries from the user 112 a.
- the user context attribute may comprise, for example, the current word in a buffer, the last n words received from the user (e.g., the last 10 words the user typed), the text nearby the cursor (e.g., the text up to x words before and y words after), the current sentence, the current paragraph, an entire buffer (e.g., an entire word-processing document), the selected or highlighted buffer, the buffer currently in the clipboard, a term measure, such as a term frequency or inverse document frequency measure, an identified term, such as an e-mail address, the name of a person, or an instant messaging buddy name, a previously copied term, a prior implicit or explicit search term, a user identifier, or a word determined by rules specific to the application that generated the event, such as a web page URL for a web browser application.
- a term measure such as a term frequency or inverse document frequency measure
- an identified term such as an e-mail address, the name of a person, or an instant messaging buddy name, a previously copied
- the data store 140 can be any type of computer-readable media and can be integrated with the client device 102 a , such as a hard drive, or external to the client device 102 a , such as an external hard drive or on another data storage device accessed through the network 106 .
- the data store 140 may include any one or combination of methods for storing data, including without limitation, arrays, hash tables, lists, and pairs.
- a user 112 a can input an explicit query into a search engine interface displayed on the client device 102 a , which is received by the search engine 122 .
- the search engine 122 can also generate an implicit query based on a current user context or state, which can be determined by the query system 132 from contextual real time events. Based on the query, the query system 132 can locate relevant information in the data store 140 and provide a result set.
- the result set comprises article identifiers identifying articles associated with the client applications 120 or client articles.
- Client articles stored in the data store 140 include articles associated with the user 112 a or client device 102 a , such as the word processing documents, previously viewed web pages and any other article associated with the client device 102 a or user 112 a .
- the result set also comprises identifiers identifying articles located on the network 106 or network articles located by a search engine on a server device.
- Network articles include articles located on the network 106 not previously viewed or otherwise referenced by the user 112 a , such as web pages not previously viewed by the user 112 a.
- the result sets comprise one or more article identifiers.
- An article identifier may be, for example, a Uniform Resource Locator (URL), a file name, a link, an icon, a path for a local file, or any other suitable item that identifies an article.
- an article identifier comprises a URL associated with an article.
- Messaging articles stored in the data store 140 include user's e-mails, chat messages, and instant messaging messages. Each time a message is received, sent, modified, printed, or otherwise accessed, a record is stored in the data store 140 . This information can later be searched to identify messages that should be displayed in a user interface element.
- An embodiment of the present invention may also store message threads in the data store 140 .
- messages are related together by various attributes, including, for example, the sender, recipient, date/time sent and received, the subject, the content, a window identifier of the display window in which the messages were displayed, or any other attribute of the message.
- the related messages can then be retrieved as a thread, which may be treated as a document by the display processor 128 .
- the formatter 134 can receive the search result set from the query system 132 of the search engine 122 and can format the results for output to a display processor 128 .
- the formatter 134 formats the results in XML or HTML.
- the formatter 134 displays the results as strings on user interface components such as, for example, labels.
- the display processor 128 can be contained in memory 108 and can control the display of the result set on a display device associated with the client device 102 a .
- the display processor 128 may comprise various components.
- the display processor 128 comprises a Hypertext Transfer Protocol (HTTP) server that receives requests for information and responds by constructing and transmitting Hypertext Markup Language (HTML) pages.
- the HTTP server comprises a scaled-down version of the Apache Web server.
- the functions described herein may be performed by various other components and devices.
- a server device 150 is also coupled to the network 106 .
- the search engine 122 can transmit a search query comprised of an explicit or implicit query or both to the server device 150 .
- the user 112 a can also enter a search query in a search engine interface, which can be transmitted to the server device 150 .
- the query signal may instead be sent to a proxy server (not shown), which then transmits the query signal to server device 150 .
- Other configurations are also possible.
- the server device 150 shown includes a server executing a search engine application program, such as the GoogleTM search engine. Similar to the client devices 102 a - n , the server device 150 shown includes a processor 160 coupled to a computer-readable memory 162 .
- Server device 150 depicted as a single computer system, may be implemented as a network of computer processors. Examples of a server device 150 are servers, mainframe computers, networked computers, a processor-based device, and similar types of systems and devices.
- the server processor 160 can be any of a number of computer processors, such as processors from Intel Corporation of Santa Clara, Calif. and Motorola Corporation of Schaumburg, Ill.
- Memory 162 contains the search engine application program, also known as a search engine 170 .
- the search engine 170 locates relevant information in response to a search query from a client device 102 a .
- the search engine 122 then provides the result set to the client device 102 a via the network 106 .
- the result set 134 comprises one or more article identifiers.
- An article identifier may be, for example, a uniform resource locator (URL), a file name, a link, an icon, a path for a local file, or anything else that identifies an article.
- an article identifier comprises a URL associated with an article.
- the server device 150 has previously performed a crawl of the network 106 to locate articles, such as web pages, stored at other devices or systems connected to the network 106 , and indexed the articles in memory 162 or on another data storage device.
- server device 104 may comprise a single physical or logical server.
- the system 100 shown in FIG. 1 is merely exemplary, and is used to explain the exemplary methods shown in FIG. 2 .
- Methods according to the present invention may be implemented by, for example, a processor-executable program code stored on a computer-readable medium.
- Embodiments of the present invention are capable of generating implicit queries based on a user's contextual state.
- the results of an implicit query are displayed to the user in a content display window.
- the results may be updated periodically as the user's contextual state changes.
- the user is working on a word document concerning budgeting.
- a query implicit builder (“QUIB”) one component of the query system 132 shown in FIG. 1 , requests and receives events related to the document.
- the QUIB generates queries from the events and presents the results of the queries to the user.
- Events comprise historical, contextual, and real-time events.
- contextual events are time sensitive and may be of higher significance even after an elapsed period of time.
- Contextual events relate to actions that are occurring now or have occurred within a short time frame, e.g., the last ten words that the user typed.
- real-time events are less time-sensitive, e.g., the user printed or opened a file.
- Events may be tracked over multiple sessions. For example, in one embodiment, if a user has opened a web page repeatedly during the last several times the user has used a client machine, the query system 132 tracks the usage for each of those sessions by tracking the events associated with the usage. In one such embodiment, access during a particular session is down-weighted or promoted based on the period of time that has elapsed since the session. In other words, events associated with more recent accesses of a specific article are weighted more heavily than those occurring less recently.
- the events may include information, such as the last twenty words the user typed, the last sentence the user typed, the text nearby the cursor (e.g. the text up to x words before and y words after), the currently active buffer (e.g., the entire active document), the selected or highlighted buffer, the buffer in the clipboard, or other information relevant to the user's context.
- the query system 132 extracts keywords from the information and generates a search query to be submitted to a search engine.
- the query system 132 creates and executes the query as if the user had explicitly typed the keywords in a search interface.
- the query system 132 learns from a user's behavior whether or not certain data streams or keywords are particularly relevant.
- the query system 132 may rely on click-throughs within the content display window to determine results in which the user exhibits particular interest. For example, if the content display includes a link that has been shown to a user multiple times but has not been clicked, the link may be eliminated from the content display.
- the data streams, query types, or keywords that resulted in the link being displayed may be down-weighted in subsequent analysis.
- the user clicks the link this typically indicates that the user is interested in the article, and can result in promoting the data streams, query types, or keywords that resulted in the link being displayed.
- click-through data can be used to identify a type preference for the user 112 a .
- a type preference can comprise, for example, a file format preferred by the user 112 a .
- the query system 132 can promote future identifiers associated with articles in HTML format and down-weight articles in PDF format.
- Click-through data can also be used to identify a preference for a particular method of generating keywords.
- the query system 132 can promote future identifiers associated with articles generated from the most recently typed 10 words, and down-weight articles associated with text from the clipboard.
- the query system 132 shown in FIG. 1 utilizes multiple data streams as sources for generating search queries. For example, if the user is editing a document, the query system 132 may use the last 20 words that were typed, as well as the entire document to extract keywords and generate search queries. The query system 132 generates a search query for each data stream and combines the result sets corresponding to each search query for display to the user.
- one embodiment comprises receiving an event, the event comprising user interaction with an article on a client device, wherein the article is capable of being associated with at least one of a plurality of client applications, extracting at least one keyword from the event generating a query based at least in part on the at least one keyword performing a search based at least in part on the query to determine a result set, wherein the result set comprises one or more article identifiers associated with articles comprising the at least one keyword, and determining a ranking for each of the one or more article identifiers comprising the result set.
- ranking the article identifiers can be based at least in part on a user preference.
- the user preference can be based at least in part on click-through data or file type.
- ranking the article identifiers can be based at least in part on meta-data.
- the meta-data can comprise at least one of bolding, highlighting, italicizing, font color, or heading data.
- ranking the article identifiers is based at least in part on a term frequency and a document frequency.
- the ranking can be proportional to the log of the sum of a first constant plus the term frequency and inversely proportional to the log of the sum of a second constant plus the document frequency.
- both the first and second constants have the value one. In another embodiment, they have different values.
- the document frequency is not used directly but is hashed into a pre-defined table which maps ranges of document frequency into constants used for ranking article identifiers.
- the ranking is based at least in part on a number data.
- the number data can comprise a number of letters in the keyword or whether a keyword comprises numbers.
- the ranking is based at least in part on capitalization data.
- the ranking is based at least in part on source data.
- the keywords can be associated with keyword ranking scores.
- the ranking of article identifiers can be based at least in part on the keyword ranking scores.
- ranking the article identifiers can comprise assigning a higher ranking to article identifiers associated with articles containing higher ranked keywords.
- extracting at least one keyword from an event comprises extracting a keyword from at least one of recently typed words, an entire document, a selected portion of a document, or words surrounding a cursor.
- extracting at least one keyword from an event comprises determining names. Determining names can comprise crawling at least one article.
- a method comprises receiving an event, the event comprising user interaction with an article on a client device, wherein the article is capable of being associated with at least one of a plurality of client applications, extracting at least one keyword from the event, generating a query based at least in part on the at least one keyword, performing a search based at least in part on the query to determine a result set, wherein the result set comprises one or more article identifiers associated with articles comprising the at least one keyword, filtering the article identifiers in the result set based on a threshold, and causing the display of the result set.
- the threshold can comprise a number of keywords or a minimum weighting score.
- the minimum weighting score can be based at least in part on a number of keywords multiplier, a source multiplier, and a time multiplier.
- FIG. 2 is a flowchart illustrating a method 200 for processing an implicit query.
- the method 200 begins in block 202 , wherein the query system 132 receives a contextual event 202 .
- the contextual event is an occurrence that is captured by the capture processor 124 and can be used to update the user's contextual state and can be indexed and stored in the event database in data store 140 to provide information for future queries.
- the method 200 proceeds to block 204 , wherein the query system 132 extracts keywords from the event in order to generate one or more search queries.
- the keywords may comprise, for example, words that the user has recently typed, words that occur in a document or buffer, words that are highlighted or selected, words placed into the clipboard, words that are identified as proper names, words that are typed as explicit queries by the user, or may comprise any other type of keyword that the system is able to identify.
- the keywords may comprise all of the words in the event.
- the query system 132 may extract keywords from any of a number of data streams.
- Data streams can comprise, for example, sources of implicit query keywords including one or more of the following: the most recently typed n words where n is on the order of ten; the n words around the user's cursor where n is around ten; words in the current selection; words from the current document (e.g., one such method selects the most frequently occurring words); previous explicit queries executed by the user or submitted by the user; clipboard content; and a list of all the names of people with which the user has communicated; a list of e-mail addresses and/or instant messenger “buddy names”; and a list of important terms or phrases for the user.
- sources of implicit query keywords including one or more of the following: the most recently typed n words where n is on the order of ten; the n words around the user's cursor where n is around ten; words in the current selection; words from the current document (e.g., one such method selects the most frequently occurring words); previous explicit queries executed by the user or submitted by the user; clipboard content; and
- Words from a current document can comprise, for example, words from an entire buffer, e.g., an entire Microsoft Word document.
- the query system extracts keywords from explicit queries that are captured by an application on the client 102 a , such as a Winsock Layered Service Provider (“LSP”).
- LSP Winsock Layered Service Provider
- the Winsock LSP captures the query as an event and provides a query, either the original or a modified version, to another search engine application, such as search engine 122 on the client 102 a .
- the local search engine 122 processes the query substantially simultaneously with the global search engine.
- the query system 132 may use identified terms to generate search queries.
- An identified term is a term which the user uses in a manner that has been noted as being particularly relevant to the user's contextual state.
- an identified term may comprise the name of a person to which the user recently directed an e-mail.
- the names need not be recent or popular; for example, the names may include all e-mail addresses, etc. captured for a user. Even old, rare names may be useful to identify. For example, if a user has only sent or received a single message to a particular person several years ago, it may still be desirable to recall the message when the sender/recipient e-mail address is recognized.
- the names are limited to recent and/or popular names to limit the amount of data required to store the names.
- the query system 132 can examine the user's e-mail system and determine the names of users to which the user recently or often sends e-mail messages. The query system can extract all names associated with the user's e-mail system, or can extract names based on recipients of an e-mail or names appearing in the e-mail, for example. In another embodiment, the query system also correlates this information with the subject and/or text of e-mail or other correspondence.
- the query system can identify the organization and content of interest to the person.
- the query system 132 can extract names from a list of contacts comprising, for example, a set of names and associated telephone numbers and e-mails.
- the query system 132 can extract keywords based on identified proper names.
- the query system 132 can identify proper names, for example, by identifying capitalized words not at the beginning of a sentence.
- the query system can also search for proper names by crawling articles located on the client device 102 a or on the network 106 . After determining proper names by crawling articles, the query system 132 can store a list of proper names in the data store 140 or other suitable location. The names can then be used by the query system 132 to identify keywords to extract from an article.
- the query system 132 may also extract keywords from a selection or from a clipboard buffer.
- a selection can comprise, for example, the text or objects that are highlighted in the currently active application.
- the user 112 a can select a portion of text to modify and the query system 132 can extract keywords from the selected or highlighted portion of text.
- the clipboard buffer can comprise, for example, information that was previously selected and copied or cut by the user 112 a.
- the query system 132 can also extract keywords based on a list of common words. For example, the query system 132 can extract the following sentence from a text document: “What is the budget for the second quarter of 2003?” Not all the words that appear in this sentence are necessary for a search query. For example, many of the words in the sentence are filler words. Filler words include words such as “the” which are determiners and are not necessarily relevant to any particular query. These words are filtered out before the search query is submitted to the search engine 122 . The original sentence may be maintained to compare to future content extracts. According to some embodiments, filtering words can comprise, for example, comparing words to a list of common words.
- the list of common words can comprise, for example, a list of words determined to appear frequently and be of little value in ranking search results.
- a list of common words can comprise the words “is,” “of,” “to,” “it,” and other common words.
- the query system 132 can compare words extracted from a string or document to the list of common words and filter out words that appear in the list.
- a list can contain common words which are not be excluded as keywords, but which are down-weighted. For example, such words can be made less likely to appear as keywords, but may still be selected as keywords if they appear frequently within an article.
- keywords can be associated with keyword ranking scores. Keyword ranking scores can reflect, for example, the relative importance or lack of importance of keywords.
- common keywords can have low keyword ranking scores associated with them while proper name keywords can have high keyword ranking scores associated with them.
- the keyword ranking scores can be used in ranking an article containing the keyword ranking scores. For example, articles containing keywords associated with high keyword ranking scores can receive high ranking scores themselves. Likewise, articles containing keywords associated with low keyword ranking scores can receive low ranking scores themselves.
- the method 200 proceeds to block 206 , wherein the query system 132 generates a search query 206 .
- the search query that the query system 132 generates may comprise keywords extracted from a single data stream or may comprise keywords extracted from multiple streams.
- the query system 132 can extract keywords from a selected portion of text within a document and from the entire contents of the document. Whether a word extracted from more than one source continues to be used in an implicit query may be determined in various ways. For example, if the word “budget” occurs with some frequency (e.g. fifty times) in a document but the user has not recently typed the word budget, budget may continue to be included in a query generated by the query system 132 .
- the method 200 proceeds to block 208 , wherein the query system 132 transmits the search query to a search engine, for example, search engine 122 .
- the query system 132 transmits the query to other search engines, for example, a search engine running on a server device 150 , such as the GoogleTM search engine.
- the search engine 122 performs a search of one or more indices, either local or global, and provides at least one article identifier associated with a relevant article as a result set.
- the method 200 proceeds to block 210 , wherein the query system 132 ranks the article identifiers in the result set based on ranking scores.
- the ranking scores may be related to previous events that were recorded by the query system 132 or another component or may be based on other criteria.
- the query system 132 can determine ranking scores based at least in part on meta-data associated with articles in the result set. Meta-data can include, for example, bolding, highlighting, underlining, italicizing, font color, heading data, or any other formatting or meta-data associated with a portion of an article. Heading data can comprise, for example, whether a portion of an article is designated as a heading in a text document.
- the query system 132 can determine the meta-data associated with an article in the result set by determining the meta-data associated with the keywords in the search query. For example, if the search query comprises the terms “budgeting meeting” the query system can identify a result set containing articles comprising the words “budgeting meeting.” One such article can be, for example, a spreadsheet with a title “budgeting meeting” appearing in bold. A second such article can be an e-mail with the words “budgeting meeting” appearing in the text. The query system 132 can determine meta-data associated with the keywords “budgeting meeting” in the spreadsheet indicating that the words are bolded. The query system can then boost a ranking score associated with the spreadsheet to reflect the likelihood that the spreadsheet titled “budgeting meeting” is more responsive to the search query than the e-mail simply containing these words in the body of the e-mail.
- the query system 132 can further rank the article identifiers based at least in part on capitalization data associated with the articles in the result set.
- Capitalization data can comprise, for example, data indicating whether one or more letters in a word are capitalized. For example, if the words “budgeting meeting” in the spreadsheet from the example above are capitalized, this is a further indication that they are of greater significance in the article and thus that the article is more closely related to the search query “budgeting meeting.” Additionally, capitalized letters can indicate the proper names of people and places. Keywords associated with names and places can be a better indicator that an article containing such keywords is responsive to a search query.
- the query system 132 can determine key words “meet,” “with,” “Bob,” “Jones,” and “lunch” from the sentence. The query system 132 can then identify an article containing the keywords “lunch” and “with” and an article containing the keywords “Bob” and “Jones.” The article containing the keywords “Bob” and “Jones” can be more likely to interest the user 112 a , and so the query system 132 can rank the identifier associated with the article containing the capitalized words “Bob” and “Jones” higher based at least in part on the capitalization. According to some embodiments, the query system can assign a higher ranking to capitalized keywords that do not begin a sentence as these more likely reflect proper names or places.
- the query system 132 can determine a ranking score based at least in part on term frequency (TF) and a document frequency (DF) or an inverse document frequency (IDF) associated with a key word.
- TF can comprise, for example, the frequency with which a keyword appears in a single article.
- a DF can comprise, for example, the frequency with which a keyword appears in all documents, and an IDF can comprise, for example, the inverse of the frequency with which the keyword appears in all documents.
- TF can comprise, for example, the frequency with which a keyword appears in a single article.
- a DF can comprise, for example, the frequency with which a keyword appears in all documents
- an IDF can comprise, for example, the inverse of the frequency with which the keyword appears in all documents.
- a common keyword can appear frequently within any one particular document and thus have a high TF. The same common keyword can also appear frequently in all documents and thus have a high DF and consequently a low IDF.
- the query system can compensate for keywords appearing frequently in one document when the keywords also appear frequently in all documents.
- a unique keyword that appears a few times in one particular document may have a relatively low TF but can have a very high IDF and thus the composite for such a keyword can be high.
- the query system can determine a ranking score for an identifier in the result set proportional to: Log(TF+A)/log(DF+B) Where TF denotes the term frequency of a term, DF denotes the document frequency of a term, A denotes a first constant, and B denotes a second constant.
- A can have the value of 1, and B can have the value of 1.
- A can have the value of 0.5, and B can have the value of 0.
- the logarithm of the DF may not be used, and the DF may be hashed into a lookup table which maps ranges of DF values into constants.
- the ranking score can be proportional to: Log(TF+A)/mapping function(DF)
- the query system 132 can further determine a ranking score based at least in part on number data associated with articles in the result set.
- Number data can comprise, for example, whether a keyword comprises numbers. For example if the user 112 a types a date into a document, a keyword “2004” can be determined by the query processor 132 .
- the query processor can further determine number data indicating that the keyword “2004” comprises numbers and determine a ranking score for the article containing the keyword “2004” based at least in part on the number data. For example, keywords containing numbers can be less likely to indicate important portions of an article and thus less likely to be associated with search results of interest to the user 112 a .
- number data can comprise, for example, a number of letters comprising a keyword.
- the query system 132 can determine that a keyword “the” comprises three letters and that a keyword “antidisestablishmentarianism” contains 28 letters.
- a keyword containing a high number of letters can be more likely to be unique and thus more likely to indicate unique results interesting to the user 112 a.
- the query system 132 can further determine a ranking score based at least in part on preference data.
- Preference data can comprise, for example, data indicating the user's 112 a preference for a particular article or for a particular file type.
- the query system 132 can receive click-through data indicating the user 112 a has selected an article identifier displayed in a content display window.
- the query system 132 evaluates the article identifier to determine a content type associated with the article identifier.
- the file type may be a web page, e-mail, text file, image, or any other content type.
- the user 112 a can be presented with multiple article identifiers of different types as the result of an implicit query.
- the user can be presented with e-mails, web pages, and text documents.
- the user can demonstrate a preference by selecting a particular article type more frequently than any other.
- the user 112 a can select e-mails when presented and ignore results associated with text documents.
- the query system 132 can rank subsequent e-mail articles higher to reflect the user's 112 a preference for e-mail documents.
- the query system 132 can use the click-through data to adjust the ranking scores both within and across result sets before displaying the combined result set to the user.
- the present invention utilizes content type, source, keyword, and other data related to items that the user did not click on. The query system 132 of one such embodiment reduces the relevancy score of article identifiers corresponding to content types and sources that the user has not clicked as frequently as other types of content.
- the query system 132 can rank article identifiers based on the number of results sets in which the articles are located.
- the user 112 a can view a web page and edit a text document.
- Four queries are generated from the user context.
- the first query comprises information from the web page.
- the second query comprises the last ten words that the user types.
- the third query comprises the sentence that the user just pasted in the document.
- the fourth query comprises the words that the user is currently selecting with the mouse.
- the query system 132 can submit the queries to one or more search engines and receive four result sets in response.
- the query system 132 can merge the results and can present the first five article identifiers from the merged result set to the user 112 a in a contextual display window for example.
- the first query can produce a results set comprising articles A, B and C.
- the second query can produce a result set comprising articles C, D, and E. Because article C appears in both result sets, it can receive a higher ranking score when displayed in the
- the query system 132 can further determine a ranking score based at least in part on source data.
- Source data can comprise, for example, data indicating the source of keywords contained in an article.
- query results based on keywords extracted from recently typed words receive a higher ranking score than results based on keywords extracted from an entire document.
- Source data may further include data indicating the relevancy of a source of keywords.
- a ranking score can be based on a how frequently the keywords appear in a document, the document frequency of the keywords, or how long an application from which the keywords are extracted has been in the foreground.
- the method 200 proceeds to block 212 , wherein the query system 132 transmits the result set to the display processor 128 and the display processor 128 causes the output of the article identifiers.
- the display processor 128 may output the result set in a format similar to a format used for global result sets such as those provided by a search engine utilizing a global index, e.g., GoogleTM search engine.
- the display processor 128 may alternatively output the result sets in a small window superimposed over another application that the user is currently using.
- the display processor 128 creates a window based on the amount of available screen space on the user's 112 a display and outputs the result sets from the query system 132 in the window that it created.
- the window of an active application may be modified to include the result set.
- the results can be stored in memory and the query system informs the display processor 128 .
- the query system 132 can execute additional queries to retrieve results until the minimum threshold of results has been exceeded.
- the query system 132 may execute a single query or may execute multiple queries based on multiple data streams in order to return result sets that are relevant to the current user context.
- article identifiers can be presented to the user 112 a based on a threshold determined for occurrences of keywords in an article associated with the article identifier.
- a threshold can be determined to exclude articles from the result set that contain fewer than three occurrences of one or more keywords.
- the display processor 128 can present only those results above a weighted score threshold.
- the query system 132 can determine a weighted score for each article in a result set.
- the weighted score can comprise, for example, number of keywords multiplier, a source multiplier, and a time multiplier.
- the number of keywords multiplier can comprise, for example, a weighting factor based on the number of keywords within a result and a normalizing factor based on a total number of keywords.
- the normalizing factor can be used to compare results associated with different numbers of keywords.
- the source multiplier can comprise, for example, a weighting factor based on the source of a keyword.
- the source multiplier can boost a ranking score for the first article.
- the query system 132 can compare the weighted score to a threshold and the display processor 128 can receive this data and present only results exceeding the threshold. For example, the query system can determine two articles associated with a search query and can further determine a weighted score for each article.
- the query system 132 can transmit this data to the display processor 128 and the display processor 128 can present to the user 112 a an article identifier associated with the first article and not present an article identifier associated with the second article. Once the article identifiers are presented to the user 112 a , the method 200 ends.
Abstract
Description
Log(TF+A)/log(DF+B)
Where TF denotes the term frequency of a term, DF denotes the document frequency of a term, A denotes a first constant, and B denotes a second constant.
Log(TF+A)/mapping function(DF)
Claims (56)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/813,875 US7693825B2 (en) | 2004-03-31 | 2004-03-31 | Systems and methods for ranking implicit search results |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US10/813,875 US7693825B2 (en) | 2004-03-31 | 2004-03-31 | Systems and methods for ranking implicit search results |
Publications (2)
Publication Number | Publication Date |
---|---|
US20070276829A1 US20070276829A1 (en) | 2007-11-29 |
US7693825B2 true US7693825B2 (en) | 2010-04-06 |
Family
ID=38750730
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
US10/813,875 Active 2025-10-11 US7693825B2 (en) | 2004-03-31 | 2004-03-31 | Systems and methods for ranking implicit search results |
Country Status (1)
Country | Link |
---|---|
US (1) | US7693825B2 (en) |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20060271520A1 (en) * | 2005-05-27 | 2006-11-30 | Ragan Gene Z | Content-based implicit search query |
US20080059187A1 (en) * | 2006-08-31 | 2008-03-06 | Roitblat Herbert L | Retrieval of Documents Using Language Models |
US20080228548A1 (en) * | 2007-03-12 | 2008-09-18 | Mcbrearty Gerald F | System and method for selecting calendar events by examining content of user's recent e-mail activity |
US20080294619A1 (en) * | 2007-05-23 | 2008-11-27 | Hamilton Ii Rick Allen | System and method for automatic generation of search suggestions based on recent operator behavior |
US20090055426A1 (en) * | 2007-08-20 | 2009-02-26 | Samsung Electronics Co., Ltd. | Method and system for generating playlists for content items |
US20100094831A1 (en) * | 2008-10-14 | 2010-04-15 | Microsoft Corporation | Named entity resolution using multiple text sources |
US20100145939A1 (en) * | 2008-12-05 | 2010-06-10 | Yahoo! Inc. | Determining related keywords based on lifestream feeds |
US20110238671A1 (en) * | 2010-03-23 | 2011-09-29 | Research In Motion Limited | Method, system and apparatus for efficiently determining priority of data in a database |
US20130080424A1 (en) * | 2004-11-22 | 2013-03-28 | Facebook, Inc. | Systems and methods for sorting search results |
US9098543B2 (en) | 2013-03-14 | 2015-08-04 | Wal-Mart Stores, Inc. | Attribute detection |
US9111289B2 (en) * | 2011-08-25 | 2015-08-18 | Ebay Inc. | System and method for providing automatic high-value listing feeds for online computer users |
US9436744B2 (en) | 2014-05-08 | 2016-09-06 | Accenture Global Services Limited | Combining internal and external search results |
US9838348B2 (en) | 2014-12-31 | 2017-12-05 | Yahoo Holdings, Inc. | Electronic message search system and method |
US20230281257A1 (en) * | 2022-01-31 | 2023-09-07 | Walmart Apollo, Llc | Systems and methods for determining and utilizing search token importance using machine learning architectures |
US11809432B2 (en) | 2002-01-14 | 2023-11-07 | Awemane Ltd. | Knowledge gathering system based on user's affinity |
Families Citing this family (108)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8590013B2 (en) | 2002-02-25 | 2013-11-19 | C. S. Lee Crawford | Method of managing and communicating data pertaining to software applications for processor-based devices comprising wireless communication circuitry |
US8572104B2 (en) * | 2003-04-18 | 2013-10-29 | Kaleidescape, Inc. | Sales of collections excluding those already purchased |
US8631001B2 (en) | 2004-03-31 | 2014-01-14 | Google Inc. | Systems and methods for weighting a search query result |
US9009153B2 (en) * | 2004-03-31 | 2015-04-14 | Google Inc. | Systems and methods for identifying a named entity |
US8041713B2 (en) | 2004-03-31 | 2011-10-18 | Google Inc. | Systems and methods for analyzing boilerplate |
US8131754B1 (en) * | 2004-06-30 | 2012-03-06 | Google Inc. | Systems and methods for determining an article association measure |
US7606793B2 (en) | 2004-09-27 | 2009-10-20 | Microsoft Corporation | System and method for scoping searches using index keys |
US7739277B2 (en) | 2004-09-30 | 2010-06-15 | Microsoft Corporation | System and method for incorporating anchor text into ranking search results |
US7827181B2 (en) | 2004-09-30 | 2010-11-02 | Microsoft Corporation | Click distance determination |
US7761448B2 (en) | 2004-09-30 | 2010-07-20 | Microsoft Corporation | System and method for ranking search results using click distance |
US7716198B2 (en) * | 2004-12-21 | 2010-05-11 | Microsoft Corporation | Ranking search results using feature extraction |
US7792833B2 (en) | 2005-03-03 | 2010-09-07 | Microsoft Corporation | Ranking search results using language types |
US20060218115A1 (en) * | 2005-03-24 | 2006-09-28 | Microsoft Corporation | Implicit queries for electronic documents |
US8135728B2 (en) * | 2005-03-24 | 2012-03-13 | Microsoft Corporation | Web document keyword and phrase extraction |
JP2007072646A (en) * | 2005-09-06 | 2007-03-22 | Internatl Business Mach Corp <Ibm> | Retrieval device, retrieval method, and program therefor |
US9703892B2 (en) | 2005-09-14 | 2017-07-11 | Millennial Media Llc | Predictive text completion for a mobile communication facility |
US20110313853A1 (en) | 2005-09-14 | 2011-12-22 | Jorey Ramer | System for targeting advertising content to a plurality of mobile communication facilities |
US9471925B2 (en) | 2005-09-14 | 2016-10-18 | Millennial Media Llc | Increasing mobile interactivity |
US7769764B2 (en) | 2005-09-14 | 2010-08-03 | Jumptap, Inc. | Mobile advertisement syndication |
US7577665B2 (en) | 2005-09-14 | 2009-08-18 | Jumptap, Inc. | User characteristic influenced search results |
US7702318B2 (en) | 2005-09-14 | 2010-04-20 | Jumptap, Inc. | Presentation of sponsored content based on mobile transaction event |
US7660581B2 (en) | 2005-09-14 | 2010-02-09 | Jumptap, Inc. | Managing sponsored content based on usage history |
US10592930B2 (en) | 2005-09-14 | 2020-03-17 | Millenial Media, LLC | Syndication of a behavioral profile using a monetization platform |
US9058406B2 (en) | 2005-09-14 | 2015-06-16 | Millennial Media, Inc. | Management of multiple advertising inventories using a monetization platform |
US8819659B2 (en) | 2005-09-14 | 2014-08-26 | Millennial Media, Inc. | Mobile search service instant activation |
US8156128B2 (en) | 2005-09-14 | 2012-04-10 | Jumptap, Inc. | Contextual mobile content placement on a mobile communication facility |
US8812526B2 (en) | 2005-09-14 | 2014-08-19 | Millennial Media, Inc. | Mobile content cross-inventory yield optimization |
US8290810B2 (en) | 2005-09-14 | 2012-10-16 | Jumptap, Inc. | Realtime surveying within mobile sponsored content |
US8832100B2 (en) | 2005-09-14 | 2014-09-09 | Millennial Media, Inc. | User transaction history influenced search results |
US10038756B2 (en) | 2005-09-14 | 2018-07-31 | Millenial Media LLC | Managing sponsored content based on device characteristics |
US8615719B2 (en) | 2005-09-14 | 2013-12-24 | Jumptap, Inc. | Managing sponsored content for delivery to mobile communication facilities |
US8229914B2 (en) | 2005-09-14 | 2012-07-24 | Jumptap, Inc. | Mobile content spidering and compatibility determination |
US8660891B2 (en) | 2005-11-01 | 2014-02-25 | Millennial Media | Interactive mobile advertisement banners |
US8463249B2 (en) | 2005-09-14 | 2013-06-11 | Jumptap, Inc. | System for targeting advertising content to a plurality of mobile communication facilities |
US7860871B2 (en) | 2005-09-14 | 2010-12-28 | Jumptap, Inc. | User history influenced search results |
US8666376B2 (en) | 2005-09-14 | 2014-03-04 | Millennial Media | Location based mobile shopping affinity program |
US10911894B2 (en) | 2005-09-14 | 2021-02-02 | Verizon Media Inc. | Use of dynamic content generation parameters based on previous performance of those parameters |
US8103545B2 (en) | 2005-09-14 | 2012-01-24 | Jumptap, Inc. | Managing payment for sponsored content presented to mobile communication facilities |
US8364540B2 (en) | 2005-09-14 | 2013-01-29 | Jumptap, Inc. | Contextual targeting of content using a monetization platform |
US8302030B2 (en) | 2005-09-14 | 2012-10-30 | Jumptap, Inc. | Management of multiple advertising inventories using a monetization platform |
US8503995B2 (en) | 2005-09-14 | 2013-08-06 | Jumptap, Inc. | Mobile dynamic advertisement creation and placement |
US8027879B2 (en) | 2005-11-05 | 2011-09-27 | Jumptap, Inc. | Exclusivity bidding for mobile sponsored content |
US8209344B2 (en) | 2005-09-14 | 2012-06-26 | Jumptap, Inc. | Embedding sponsored content in mobile applications |
US7676394B2 (en) | 2005-09-14 | 2010-03-09 | Jumptap, Inc. | Dynamic bidding and expected value |
US8131271B2 (en) | 2005-11-05 | 2012-03-06 | Jumptap, Inc. | Categorization of a mobile user profile based on browse behavior |
US9201979B2 (en) | 2005-09-14 | 2015-12-01 | Millennial Media, Inc. | Syndication of a behavioral profile associated with an availability condition using a monetization platform |
US8311888B2 (en) | 2005-09-14 | 2012-11-13 | Jumptap, Inc. | Revenue models associated with syndication of a behavioral profile using a monetization platform |
US8238888B2 (en) | 2006-09-13 | 2012-08-07 | Jumptap, Inc. | Methods and systems for mobile coupon placement |
US8364521B2 (en) | 2005-09-14 | 2013-01-29 | Jumptap, Inc. | Rendering targeted advertisement on mobile communication facilities |
US7752209B2 (en) | 2005-09-14 | 2010-07-06 | Jumptap, Inc. | Presenting sponsored content on a mobile communication facility |
US8989718B2 (en) | 2005-09-14 | 2015-03-24 | Millennial Media, Inc. | Idle screen advertising |
US8195133B2 (en) | 2005-09-14 | 2012-06-05 | Jumptap, Inc. | Mobile dynamic advertisement creation and placement |
US7912458B2 (en) | 2005-09-14 | 2011-03-22 | Jumptap, Inc. | Interaction analysis and prioritization of mobile content |
US8688671B2 (en) | 2005-09-14 | 2014-04-01 | Millennial Media | Managing sponsored content based on geographic region |
US9076175B2 (en) | 2005-09-14 | 2015-07-07 | Millennial Media, Inc. | Mobile comparison shopping |
US8805339B2 (en) | 2005-09-14 | 2014-08-12 | Millennial Media, Inc. | Categorization of a mobile user profile based on browse and viewing behavior |
US20070067197A1 (en) * | 2005-09-16 | 2007-03-22 | Sbc Knowledge Ventures, L.P. | Efficiently routing customer inquiries created with a self-service application |
US7783632B2 (en) * | 2005-11-03 | 2010-08-24 | Microsoft Corporation | Using popularity data for ranking |
US8175585B2 (en) * | 2005-11-05 | 2012-05-08 | Jumptap, Inc. | System for targeting advertising content to a plurality of mobile communication facilities |
US8571999B2 (en) | 2005-11-14 | 2013-10-29 | C. S. Lee Crawford | Method of conducting operations for a social network application including activity list generation |
US8429184B2 (en) * | 2005-12-05 | 2013-04-23 | Collarity Inc. | Generation of refinement terms for search queries |
US8903810B2 (en) | 2005-12-05 | 2014-12-02 | Collarity, Inc. | Techniques for ranking search results |
US7774341B2 (en) | 2006-03-06 | 2010-08-10 | Veveo, Inc. | Methods and systems for selecting and presenting content based on dynamically identifying microgenres associated with the content |
JP4826331B2 (en) * | 2006-05-09 | 2011-11-30 | 富士ゼロックス株式会社 | Document usage tracking system |
US20070266025A1 (en) * | 2006-05-12 | 2007-11-15 | Microsoft Corporation | Implicit tokenized result ranking |
US8661031B2 (en) * | 2006-06-23 | 2014-02-25 | Rohit Chandra | Method and apparatus for determining the significance and relevance of a web page, or a portion thereof |
US7822764B2 (en) | 2006-07-18 | 2010-10-26 | Cisco Technology, Inc. | Methods and apparatuses for dynamically displaying search suggestions |
US8001114B2 (en) * | 2006-07-18 | 2011-08-16 | Wilson Chu | Methods and apparatuses for dynamically searching for electronic mail messages |
US8442972B2 (en) * | 2006-10-11 | 2013-05-14 | Collarity, Inc. | Negative associations for search results ranking and refinement |
US20080104049A1 (en) * | 2006-10-25 | 2008-05-01 | Microsoft Corporation | Document ranking utilizing parameter varying data |
US20080154879A1 (en) * | 2006-12-22 | 2008-06-26 | Yahoo! Inc. | Method and apparatus for creating user-generated document feedback to improve search relevancy |
US8010502B2 (en) * | 2007-04-13 | 2011-08-30 | Harris Corporation | Methods and systems for data recovery |
US7840569B2 (en) | 2007-10-18 | 2010-11-23 | Microsoft Corporation | Enterprise relevancy ranking using a neural network |
US9348912B2 (en) | 2007-10-18 | 2016-05-24 | Microsoft Technology Licensing, Llc | Document length as a static relevance feature for ranking search results |
US9710817B2 (en) * | 2008-09-30 | 2017-07-18 | Microsoft Technology Licensing, Llc | Adaptive run-time advertisements |
US8812493B2 (en) | 2008-04-11 | 2014-08-19 | Microsoft Corporation | Search results ranking using editing distance and document information |
US20110219319A1 (en) * | 2008-04-24 | 2011-09-08 | Lonsou (Beijing) Technologies Co., Ltd. | System and method for knowledge-based input in a browser |
US20090282023A1 (en) * | 2008-05-12 | 2009-11-12 | Bennett James D | Search engine using prior search terms, results and prior interaction to construct current search term results |
US8538943B1 (en) | 2008-07-24 | 2013-09-17 | Google Inc. | Providing images of named resources in response to a search query |
US8818992B2 (en) * | 2008-09-12 | 2014-08-26 | Nokia Corporation | Method, system, and apparatus for arranging content search results |
US20100070482A1 (en) * | 2008-09-12 | 2010-03-18 | Murali-Krishna Punaganti Venkata | Method, system, and apparatus for content search on a device |
US20100191746A1 (en) * | 2009-01-26 | 2010-07-29 | Microsoft Corporation | Competitor Analysis to Facilitate Keyword Bidding |
WO2010141799A2 (en) | 2009-06-05 | 2010-12-09 | West Services Inc. | Feature engineering and user behavior analysis |
JP2011018178A (en) * | 2009-07-08 | 2011-01-27 | Sony Corp | Apparatus and method for processing information and program |
US8875038B2 (en) | 2010-01-19 | 2014-10-28 | Collarity, Inc. | Anchoring for content synchronization |
US8244700B2 (en) * | 2010-02-12 | 2012-08-14 | Microsoft Corporation | Rapid update of index metadata |
US8244701B2 (en) | 2010-02-12 | 2012-08-14 | Microsoft Corporation | Using behavior data to quickly improve search ranking |
US20110270819A1 (en) * | 2010-04-30 | 2011-11-03 | Microsoft Corporation | Context-aware query classification |
CN102253936B (en) * | 2010-05-18 | 2013-07-24 | 阿里巴巴集团控股有限公司 | Method for recording access of user to merchandise information, search method and server |
US8738635B2 (en) | 2010-06-01 | 2014-05-27 | Microsoft Corporation | Detection of junk in search result ranking |
US20120078979A1 (en) * | 2010-07-26 | 2012-03-29 | Shankar Raj Ghimire | Method for advanced patent search and analysis |
US9798800B2 (en) | 2010-09-24 | 2017-10-24 | International Business Machines Corporation | Providing question and answers with deferred type evaluation using text with limited structure |
WO2012126180A1 (en) * | 2011-03-24 | 2012-09-27 | Microsoft Corporation | Multi-layer search-engine index |
CN102508884A (en) * | 2011-10-18 | 2012-06-20 | 盘古文化传播有限公司 | Method and device for acquiring hotpot events and real-time comments |
US9495462B2 (en) | 2012-01-27 | 2016-11-15 | Microsoft Technology Licensing, Llc | Re-ranking search results |
CN103368986B (en) | 2012-03-27 | 2017-04-26 | 阿里巴巴集团控股有限公司 | Information recommendation method and information recommendation device |
US9519629B1 (en) * | 2012-08-06 | 2016-12-13 | Amazon Technologies, Inc. | Style consolidation and optimization with strong ownership |
US9916301B2 (en) * | 2012-12-21 | 2018-03-13 | Microsoft Technology Licensing, Llc | Named entity variations for multimodal understanding systems |
US9497144B2 (en) * | 2014-03-27 | 2016-11-15 | International Business Machines Corporation | Context-based storage of a conversation of one or more instant messages as a record |
US9959355B2 (en) * | 2015-08-31 | 2018-05-01 | International Business Machines Corporation | Associating related threads in a question and answer session |
CN105843902A (en) * | 2016-03-23 | 2016-08-10 | 乐视网信息技术(北京)股份有限公司 | Interaction information sorting method and apparatus |
US10733359B2 (en) * | 2016-08-26 | 2020-08-04 | Adobe Inc. | Expanding input content utilizing previously-generated content |
US10585923B2 (en) * | 2017-04-25 | 2020-03-10 | International Business Machines Corporation | Generating search keyword suggestions from recently used application |
CA3067326A1 (en) | 2017-06-19 | 2018-12-27 | Equifax Inc. | Machine-learning system for servicing queries for digital content |
CN110555108B (en) * | 2018-05-31 | 2022-03-15 | 北京百度网讯科技有限公司 | Event context generation method, device, equipment and storage medium |
CN109670183B (en) * | 2018-12-21 | 2023-03-24 | 北京锐安科技有限公司 | Text importance calculation method, device, equipment and storage medium |
US20200342017A1 (en) * | 2019-04-24 | 2020-10-29 | Microsoft Technology Licensing, Llc | System and method for managing related feedback |
US11238219B2 (en) * | 2019-06-06 | 2022-02-01 | Rakuten Group, Inc. | Sentence extraction system, sentence extraction method and information storage medium |
Citations (144)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5418948A (en) | 1991-10-08 | 1995-05-23 | West Publishing Company | Concept matching of natural language queries with a database of document concepts |
US5678038A (en) | 1994-06-21 | 1997-10-14 | International Business Machines Corporation | Storing and retrieving heterogeneous classification systems utilizing globally unique identifiers |
US5696962A (en) | 1993-06-24 | 1997-12-09 | Xerox Corporation | Method for computerized information retrieval using shallow linguistic analysis |
US5701469A (en) | 1995-06-07 | 1997-12-23 | Microsoft Corporation | Method and system for generating accurate search results using a content-index |
US5717913A (en) | 1995-01-03 | 1998-02-10 | University Of Central Florida | Method for detecting and extracting text data using database schemas |
US5754938A (en) | 1994-11-29 | 1998-05-19 | Herz; Frederick S. M. | Pseudonymous server for system for customized electronic identification of desirable objects |
US5826261A (en) * | 1996-05-10 | 1998-10-20 | Spencer; Graham | System and method for querying multiple, distributed databases by selective sharing of local relative significance information for terms related to the query |
US5890152A (en) | 1996-09-09 | 1999-03-30 | Seymour Alvin Rapaport | Personal feedback browser for obtaining media files |
US5911139A (en) * | 1996-03-29 | 1999-06-08 | Virage, Inc. | Visual image database search engine which allows for different schema |
US5933827A (en) | 1996-09-25 | 1999-08-03 | International Business Machines Corporation | System for identifying new web pages of interest to a user |
US5940821A (en) | 1997-05-21 | 1999-08-17 | Oracle Corporation | Information presentation in a knowledge base search and retrieval system |
US5964839A (en) | 1996-03-29 | 1999-10-12 | At&T Corp | System and method for monitoring information flow and performing data collection |
US5987446A (en) * | 1996-11-12 | 1999-11-16 | U.S. West, Inc. | Searching large collections of text using multiple search engines concurrently |
US6006222A (en) | 1997-04-25 | 1999-12-21 | Culliss; Gary | Method for organizing information |
US6012067A (en) | 1998-03-02 | 2000-01-04 | Sarkar; Shyam Sundar | Method and apparatus for storing and manipulating objects in a plurality of relational data managers on the web |
US6014665A (en) * | 1997-08-01 | 2000-01-11 | Culliss; Gary | Method for organizing information |
US6070158A (en) | 1996-08-14 | 2000-05-30 | Infoseek Corporation | Real-time document collection search engine with phrase indexing |
USRE36727E (en) * | 1989-12-26 | 2000-06-06 | Kageneck; Karl-Erbo G. | Method of indexing and retrieval of electronically-stored documents |
US6078916A (en) | 1997-08-01 | 2000-06-20 | Culliss; Gary | Method for organizing information |
US6112203A (en) | 1998-04-09 | 2000-08-29 | Altavista Company | Method for ranking documents in a hyperlinked environment using connectivity and selective content analysis |
US6122647A (en) * | 1998-05-19 | 2000-09-19 | Perspecta, Inc. | Dynamic generation of contextual links in hypertext documents |
US6167434A (en) | 1998-07-15 | 2000-12-26 | Pang; Stephen Y. | Computer code for removing junk e-mail messages |
US6182068B1 (en) | 1997-08-01 | 2001-01-30 | Ask Jeeves, Inc. | Personalized search methods |
US6199059B1 (en) | 1998-04-22 | 2001-03-06 | International Computex, Inc. | System and method for classifying and retrieving information with virtual object hierarchy |
US6272507B1 (en) * | 1997-04-09 | 2001-08-07 | Xerox Corporation | System for ranking search results from a collection of documents using spreading activation techniques |
US20010037377A1 (en) * | 2000-04-27 | 2001-11-01 | Yumiko Nakano | Information searching apparatus and method |
US20010037328A1 (en) | 2000-03-23 | 2001-11-01 | Pustejovsky James D. | Method and system for interfacing to a knowledge acquisition system |
US6321228B1 (en) * | 1999-08-31 | 2001-11-20 | Powercast Media, Inc. | Internet search system for retrieving selected results from a previous search |
US20020016786A1 (en) | 1999-05-05 | 2002-02-07 | Pitkow James B. | System and method for searching and recommending objects from a categorically organized information repository |
US20020040311A1 (en) * | 2000-10-04 | 2002-04-04 | John Douglass | Web browser page rating system |
US20020059272A1 (en) * | 2000-04-20 | 2002-05-16 | Porter Edward W. | Apparatuses, methods, programming, and propagated signals for creating, editing, organizing and viewing collaborative databases |
US6397221B1 (en) | 1998-09-12 | 2002-05-28 | International Business Machines Corp. | Method for creating and maintaining a frame-based hierarchically organized databases with tabularly organized data |
US20020065800A1 (en) | 2000-11-30 | 2002-05-30 | Morlitz David M. | HTTP archive file |
US6421675B1 (en) * | 1998-03-16 | 2002-07-16 | S. L. I. Systems, Inc. | Search engine |
US20020095427A1 (en) | 2000-11-09 | 2002-07-18 | Kaplan Ari David | System, method and apparatus for the wireless monitoring and management of computer systems |
US20020099700A1 (en) * | 1999-12-14 | 2002-07-25 | Wen-Syan Li | Focused search engine and method |
US20020103806A1 (en) | 2000-10-18 | 2002-08-01 | Masafumi Yamanoue | Data supply controlling device, data supplying method, storage medium for data supplying program, and data supplying system |
US20020103737A1 (en) | 2000-09-07 | 2002-08-01 | Briere Daniel D. | Marketing collateral repository and supporting data management and communication environment |
US20020103698A1 (en) | 2000-10-31 | 2002-08-01 | Christian Cantrell | System and method for enabling user control of online advertising campaigns |
US20020116291A1 (en) | 2000-12-22 | 2002-08-22 | Xerox Corporation | Recommender system and method |
US20020129059A1 (en) | 2000-12-29 | 2002-09-12 | Eck Jeffery R. | XML auto map generator |
US6460036B1 (en) | 1994-11-29 | 2002-10-01 | Pinpoint Incorporated | System and method for providing customized electronic newspapers and target advertisements |
US6473752B1 (en) * | 1997-12-04 | 2002-10-29 | Micron Technology, Inc. | Method and system for locating documents based on previously accessed documents |
US20020174101A1 (en) * | 2000-07-12 | 2002-11-21 | Fernley Helen Elaine Penelope | Document retrieval system |
US6490575B1 (en) | 1999-12-06 | 2002-12-03 | International Business Machines Corporation | Distributed network search engine |
US6505191B1 (en) | 1998-07-24 | 2003-01-07 | Jarg Corporation | Distributed computer database system and method employing hypertext linkage analysis |
US20030014398A1 (en) * | 2001-06-29 | 2003-01-16 | Hitachi, Ltd. | Query modification system for information retrieval |
US20030020749A1 (en) * | 2001-07-10 | 2003-01-30 | Suhayya Abu-Hakima | Concept-based message/document viewer for electronic communications and internet searching |
US20030033296A1 (en) | 2000-01-31 | 2003-02-13 | Kenneth Rothmuller | Digital media management apparatus and methods |
US20030046311A1 (en) * | 2001-06-19 | 2003-03-06 | Ryan Baidya | Dynamic search engine and database |
US6546388B1 (en) * | 2000-01-14 | 2003-04-08 | International Business Machines Corporation | Metadata search results ranking system |
US20030069877A1 (en) | 2001-08-13 | 2003-04-10 | Xerox Corporation | System for automatically generating queries |
US20030079185A1 (en) | 1998-10-09 | 2003-04-24 | Sanjeev Katariya | Method and system for generating a document summary |
US20030093790A1 (en) * | 2000-03-28 | 2003-05-15 | Logan James D. | Audio and video program recording, editing and playback systems using metadata |
US20030093276A1 (en) * | 2001-11-13 | 2003-05-15 | Miller Michael J. | System and method for automated answering of natural language questions and queries |
US6571234B1 (en) * | 1999-05-11 | 2003-05-27 | Prophet Financial Systems, Inc. | System and method for managing online message board |
US6581056B1 (en) * | 1996-06-27 | 2003-06-17 | Xerox Corporation | Information retrieval system providing secondary content analysis on collections of information objects |
US20030115552A1 (en) * | 2001-11-27 | 2003-06-19 | Jorg Jahnke | Method and system for automatic creation of multilingual immutable image files |
US6583798B1 (en) | 2000-07-21 | 2003-06-24 | Microsoft Corporation | On-object user interface |
US6587856B1 (en) | 1998-12-07 | 2003-07-01 | Oracle International Corporation | Method and system for representing and accessing object-oriented data in a relational database system |
US20030123443A1 (en) * | 1999-04-01 | 2003-07-03 | Anwar Mohammed S. | Search engine with user activity memory |
US20030130982A1 (en) | 2002-01-09 | 2003-07-10 | Stephane Kasriel | Web-site analysis system |
US20030135490A1 (en) | 2002-01-15 | 2003-07-17 | Barrett Michael E. | Enhanced popularity ranking |
US20030135499A1 (en) | 2002-01-14 | 2003-07-17 | Schirmer Andrew Lewis | System and method for mining a user's electronic mail messages to determine the user's affinities |
US6602300B2 (en) | 1998-02-03 | 2003-08-05 | Fujitsu Limited | Apparatus and method for retrieving data from a document database |
US20030154071A1 (en) | 2002-02-11 | 2003-08-14 | Shreve Gregory M. | Process for the document management and computer-assisted translation of documents utilizing document corpora constructed by intelligent agents |
US20030158855A1 (en) | 2002-02-20 | 2003-08-21 | Farnham Shelly D. | Computer system architecture for automatic context associations |
US20030167266A1 (en) | 2001-01-08 | 2003-09-04 | Alexander Saldanha | Creation of structured data from plain text |
US6633868B1 (en) | 2000-07-28 | 2003-10-14 | Shermann Loyall Min | System and method for context-based document retrieval |
US20030220913A1 (en) * | 2002-05-24 | 2003-11-27 | International Business Machines Corporation | Techniques for personalized and adaptive search services |
US6665666B1 (en) | 1999-10-26 | 2003-12-16 | International Business Machines Corporation | System, method and program product for answering questions using a search engine |
US20040001104A1 (en) | 2002-06-28 | 2004-01-01 | Microsoft Corporation | Resource browser sessions search |
US20040003097A1 (en) | 2002-05-17 | 2004-01-01 | Brian Willis | Content delivery system |
US6687704B1 (en) | 2000-08-31 | 2004-02-03 | Hewlett-Packard Development Company, L.P. | Database model system and method |
US20040030741A1 (en) | 2001-04-02 | 2004-02-12 | Wolton Richard Ernest | Method and apparatus for search, visual navigation, analysis and retrieval of information from networks with remote notification and content delivery |
US6697840B1 (en) | 2000-02-29 | 2004-02-24 | Lucent Technologies Inc. | Presence awareness in collaborative systems |
US6697799B1 (en) | 1999-09-10 | 2004-02-24 | Requisite Technology, Inc. | Automated classification of items using cascade searches |
US20040036716A1 (en) | 2002-06-12 | 2004-02-26 | Jordahl Jena J. | Data storage, retrieval, manipulation and display tools enabling multiple hierarchical points of view |
US20040059730A1 (en) * | 2002-09-19 | 2004-03-25 | Ming Zhou | Method and system for detecting user intentions in retrieval of hint sentences |
US20040059564A1 (en) | 2002-09-19 | 2004-03-25 | Ming Zhou | Method and system for retrieving hint sentences using expanded queries |
US20040064447A1 (en) | 2002-09-27 | 2004-04-01 | Simske Steven J. | System and method for management of synonymic searching |
US20040068486A1 (en) | 2002-10-02 | 2004-04-08 | Xerox Corporation | System and method for improving answer relevance in meta-search engines |
US20040073534A1 (en) | 2002-10-11 | 2004-04-15 | International Business Machines Corporation | Method and apparatus for data mining to discover associations and covariances associated with data |
US6745178B1 (en) * | 2000-04-28 | 2004-06-01 | International Business Machines Corporation | Internet based method for facilitating networking among persons with similar interests and for facilitating collaborative searching for information |
US20040122656A1 (en) | 2001-03-16 | 2004-06-24 | Eli Abir | Knowledge system method and appparatus |
US20040133560A1 (en) * | 2003-01-07 | 2004-07-08 | Simske Steven J. | Methods and systems for organizing electronic documents |
US20040139106A1 (en) * | 2002-12-31 | 2004-07-15 | International Business Machines Corporation | Search engine facility with automated knowledge retrieval generation and maintenance |
US6766320B1 (en) | 2000-08-24 | 2004-07-20 | Microsoft Corporation | Search engine with natural language-based robust parsing for user query and relevance feedback learning |
US20040143569A1 (en) * | 2002-09-03 | 2004-07-22 | William Gross | Apparatus and methods for locating data |
US6772188B1 (en) | 2000-07-14 | 2004-08-03 | America Online, Incorporated | Method and apparatus for communicating with an entity automatically identified in an electronic communication |
US6778951B1 (en) | 2000-08-09 | 2004-08-17 | Concerto Software, Inc. | Information retrieval method with natural language interface |
US6785671B1 (en) | 1999-12-08 | 2004-08-31 | Amazon.Com, Inc. | System and method for locating web-based product offerings |
US6795825B2 (en) | 2000-09-12 | 2004-09-21 | Naphtali David Rishe | Database querying system and method |
US6803906B1 (en) | 2000-07-05 | 2004-10-12 | Smart Technologies, Inc. | Passive touch system and method of detecting user input |
US20040225667A1 (en) | 2003-03-12 | 2004-11-11 | Canon Kabushiki Kaisha | Apparatus for and method of summarising text |
US6820093B2 (en) | 1996-07-30 | 2004-11-16 | Hyperphrase Technologies, Llc | Method for verifying record code prior to an action based on the code |
US6820237B1 (en) * | 2000-01-21 | 2004-11-16 | Amikanow! Corporation | Apparatus and method for context-based highlighting of an electronic document |
US6834287B1 (en) | 2001-03-14 | 2004-12-21 | Trilogy Development Group, Inc. | Classification engine for managing attribute-based data |
US20040267813A1 (en) | 2003-06-30 | 2004-12-30 | Rivers-Moore Jonathan E. | Declarative solution definition |
US20040267700A1 (en) | 2003-06-26 | 2004-12-30 | Dumais Susan T. | Systems and methods for personal ubiquitous information retrieval and reuse |
US20040267730A1 (en) * | 2003-06-26 | 2004-12-30 | Microsoft Corporation | Systems and methods for performing background queries from content and activity |
US6850934B2 (en) | 2001-03-26 | 2005-02-01 | International Business Machines Corporation | Adaptive search engine query |
US6853998B2 (en) | 2001-02-07 | 2005-02-08 | International Business Machines Corporation | Customer self service subsystem for classifying user contexts |
US20050065909A1 (en) | 2003-08-05 | 2005-03-24 | Musgrove Timothy A. | Product placement engine and method |
US6874126B1 (en) | 2001-11-30 | 2005-03-29 | View Space Technologies | Method and apparatus for controlling content display by the cursor motion |
US20050114306A1 (en) | 2003-11-20 | 2005-05-26 | International Business Machines Corporation | Integrated searching of multiple search sources |
US20050125390A1 (en) * | 2003-12-03 | 2005-06-09 | Oliver Hurst-Hiller | Automated satisfaction measurement for web search |
US20050125382A1 (en) | 2003-12-03 | 2005-06-09 | Microsoft Corporation | Search system using user behavior data |
US20050198026A1 (en) | 2004-02-03 | 2005-09-08 | Dehlinger Peter J. | Code, system, and method for generating concepts |
US6948134B2 (en) | 2000-07-21 | 2005-09-20 | Microsoft Corporation | Integrated method for creating a refreshable Web Query |
US6950791B1 (en) | 2000-09-13 | 2005-09-27 | Antartica Systems, Inc. | Method for describing objects in a virtual space |
US20050222987A1 (en) | 2004-04-02 | 2005-10-06 | Vadon Eric R | Automated detection of associations between search criteria and item categories based on collective analysis of user activity data |
US6961954B1 (en) | 1997-10-27 | 2005-11-01 | The Mitre Corporation | Automated segmentation, information extraction, summarization, and presentation of broadcast news |
US6961910B2 (en) | 2000-02-17 | 2005-11-01 | International Business Machines Corporation | System for interacting with participants at a web site through an interactive visual proxy |
US6963830B1 (en) | 1999-07-19 | 2005-11-08 | Fujitsu Limited | Apparatus and method for generating a summary according to hierarchical structure of topic |
US20050262073A1 (en) | 1989-10-26 | 2005-11-24 | Michael Reed | Multimedia search system |
US6976053B1 (en) | 1999-10-14 | 2005-12-13 | Arcessa, Inc. | Method for using agents to create a computer index corresponding to the contents of networked computers |
US6976090B2 (en) | 2000-04-20 | 2005-12-13 | Actona Technologies Ltd. | Differentiated content and application delivery via internet |
US20060010150A1 (en) | 1999-05-18 | 2006-01-12 | Kom, Inc. | Method and System for Electronic File Lifecycle Management |
US7007085B1 (en) | 2001-09-28 | 2006-02-28 | Bellsouth Intellectual Property Corporation | Message log for wireline, voice mail, email, fax, pager, instant messages and chat |
US7022905B1 (en) | 1999-10-18 | 2006-04-04 | Microsoft Corporation | Classification of information and use of classifications in searching and retrieval of information |
US7027975B1 (en) | 2000-08-08 | 2006-04-11 | Object Services And Consulting, Inc. | Guided natural language interface system and method |
US7032174B2 (en) | 2001-03-27 | 2006-04-18 | Microsoft Corporation | Automatically adding proper names to a database |
US7043492B1 (en) | 2001-07-05 | 2006-05-09 | Requisite Technology, Inc. | Automated classification of items using classification mappings |
US7054860B2 (en) | 2000-06-02 | 2006-05-30 | Hitachi, Ltd. | Method and system for retrieving a document and computer readable storage medium |
US7054870B2 (en) | 2000-11-15 | 2006-05-30 | Kooltorch, Llc | Apparatus and methods for organizing and/or presenting data |
US7062442B2 (en) | 2001-02-23 | 2006-06-13 | Popcatcher Ab | Method and arrangement for search and recording of media signals |
US20060136405A1 (en) * | 2003-01-24 | 2006-06-22 | Ducatel Gary M | Searching apparatus and methods |
US7082428B1 (en) | 2002-09-16 | 2006-07-25 | Bellsouth Intellectual Property Corporation | Systems and methods for collaborative searching |
US7099860B1 (en) | 2000-10-30 | 2006-08-29 | Microsoft Corporation | Image retrieval systems and methods with semantic and feature based relevance feedback |
US7146399B2 (en) | 2001-05-25 | 2006-12-05 | 2006 Trident Company | Run-time architecture for enterprise integration with transformation generation |
US7171352B2 (en) | 2004-04-23 | 2007-01-30 | Microsoft Corporation | Linguistic object model |
US7181459B2 (en) | 1999-05-04 | 2007-02-20 | Iconfind, Inc. | Method of coding, categorizing, and retrieving network pages and sites |
US7194455B2 (en) | 2002-09-19 | 2007-03-20 | Microsoft Corporation | Method and system for retrieving confirming sentences |
US7194485B2 (en) | 2003-11-21 | 2007-03-20 | International Business Machines Corporation | Mapping XML schema components to qualified java components |
US7231395B2 (en) | 2002-05-24 | 2007-06-12 | Overture Services, Inc. | Method and apparatus for categorizing and presenting documents of a distributed database |
US7293014B2 (en) | 2001-06-18 | 2007-11-06 | Siebel Systems, Inc. | System and method to enable searching across multiple databases and files using a single search |
US7305129B2 (en) | 2003-01-29 | 2007-12-04 | Microsoft Corporation | Methods and apparatus for populating electronic forms from scanned documents |
US7318049B2 (en) | 2000-11-17 | 2008-01-08 | Gregory Fx Iannacci | System and method for an automated benefit recognition, acquisition, value exchange, and transaction settlement system using multivariable linear and nonlinear modeling |
US7412708B1 (en) | 2004-03-31 | 2008-08-12 | Google Inc. | Methods and systems for capturing information |
US7421645B2 (en) | 2000-06-06 | 2008-09-02 | Microsoft Corporation | Method and system for providing electronic commerce actions based on semantically labeled strings |
US7437353B2 (en) | 2003-12-31 | 2008-10-14 | Google Inc. | Systems and methods for unification of search results |
US7451136B2 (en) | 2000-10-11 | 2008-11-11 | Microsoft Corporation | System and method for searching multiple disparate search engines |
US7478089B2 (en) | 2003-10-29 | 2009-01-13 | Kontera Technologies, Inc. | System and method for real-time web page context analysis for the real-time insertion of textual markup objects and dynamic content |
Family Cites Families (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6492118B1 (en) * | 1999-08-27 | 2002-12-10 | Matrix Technologies Corporation | Methods of immobilizing ligands on solid supports |
JP2003223412A (en) * | 2002-01-30 | 2003-08-08 | Oki Electric Ind Co Ltd | Semiconductor integrated circuit |
-
2004
- 2004-03-31 US US10/813,875 patent/US7693825B2/en active Active
Patent Citations (147)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20050262073A1 (en) | 1989-10-26 | 2005-11-24 | Michael Reed | Multimedia search system |
USRE36727E (en) * | 1989-12-26 | 2000-06-06 | Kageneck; Karl-Erbo G. | Method of indexing and retrieval of electronically-stored documents |
US5418948A (en) | 1991-10-08 | 1995-05-23 | West Publishing Company | Concept matching of natural language queries with a database of document concepts |
US5696962A (en) | 1993-06-24 | 1997-12-09 | Xerox Corporation | Method for computerized information retrieval using shallow linguistic analysis |
US5678038A (en) | 1994-06-21 | 1997-10-14 | International Business Machines Corporation | Storing and retrieving heterogeneous classification systems utilizing globally unique identifiers |
US6460036B1 (en) | 1994-11-29 | 2002-10-01 | Pinpoint Incorporated | System and method for providing customized electronic newspapers and target advertisements |
US5754938A (en) | 1994-11-29 | 1998-05-19 | Herz; Frederick S. M. | Pseudonymous server for system for customized electronic identification of desirable objects |
US5717913A (en) | 1995-01-03 | 1998-02-10 | University Of Central Florida | Method for detecting and extracting text data using database schemas |
US5701469A (en) | 1995-06-07 | 1997-12-23 | Microsoft Corporation | Method and system for generating accurate search results using a content-index |
US5911139A (en) * | 1996-03-29 | 1999-06-08 | Virage, Inc. | Visual image database search engine which allows for different schema |
US5964839A (en) | 1996-03-29 | 1999-10-12 | At&T Corp | System and method for monitoring information flow and performing data collection |
US5826261A (en) * | 1996-05-10 | 1998-10-20 | Spencer; Graham | System and method for querying multiple, distributed databases by selective sharing of local relative significance information for terms related to the query |
US6581056B1 (en) * | 1996-06-27 | 2003-06-17 | Xerox Corporation | Information retrieval system providing secondary content analysis on collections of information objects |
US6820093B2 (en) | 1996-07-30 | 2004-11-16 | Hyperphrase Technologies, Llc | Method for verifying record code prior to an action based on the code |
US6070158A (en) | 1996-08-14 | 2000-05-30 | Infoseek Corporation | Real-time document collection search engine with phrase indexing |
US5890152A (en) | 1996-09-09 | 1999-03-30 | Seymour Alvin Rapaport | Personal feedback browser for obtaining media files |
US5933827A (en) | 1996-09-25 | 1999-08-03 | International Business Machines Corporation | System for identifying new web pages of interest to a user |
US5987446A (en) * | 1996-11-12 | 1999-11-16 | U.S. West, Inc. | Searching large collections of text using multiple search engines concurrently |
US6272507B1 (en) * | 1997-04-09 | 2001-08-07 | Xerox Corporation | System for ranking search results from a collection of documents using spreading activation techniques |
US6006222A (en) | 1997-04-25 | 1999-12-21 | Culliss; Gary | Method for organizing information |
US5940821A (en) | 1997-05-21 | 1999-08-17 | Oracle Corporation | Information presentation in a knowledge base search and retrieval system |
US6182068B1 (en) | 1997-08-01 | 2001-01-30 | Ask Jeeves, Inc. | Personalized search methods |
US6014665A (en) * | 1997-08-01 | 2000-01-11 | Culliss; Gary | Method for organizing information |
US6078916A (en) | 1997-08-01 | 2000-06-20 | Culliss; Gary | Method for organizing information |
US6961954B1 (en) | 1997-10-27 | 2005-11-01 | The Mitre Corporation | Automated segmentation, information extraction, summarization, and presentation of broadcast news |
US6473752B1 (en) * | 1997-12-04 | 2002-10-29 | Micron Technology, Inc. | Method and system for locating documents based on previously accessed documents |
US6602300B2 (en) | 1998-02-03 | 2003-08-05 | Fujitsu Limited | Apparatus and method for retrieving data from a document database |
US6012067A (en) | 1998-03-02 | 2000-01-04 | Sarkar; Shyam Sundar | Method and apparatus for storing and manipulating objects in a plurality of relational data managers on the web |
US20030055831A1 (en) * | 1998-03-16 | 2003-03-20 | S.L.I. Systems, Inc. | Search engine |
US6421675B1 (en) * | 1998-03-16 | 2002-07-16 | S. L. I. Systems, Inc. | Search engine |
US6112203A (en) | 1998-04-09 | 2000-08-29 | Altavista Company | Method for ranking documents in a hyperlinked environment using connectivity and selective content analysis |
US6199059B1 (en) | 1998-04-22 | 2001-03-06 | International Computex, Inc. | System and method for classifying and retrieving information with virtual object hierarchy |
US6122647A (en) * | 1998-05-19 | 2000-09-19 | Perspecta, Inc. | Dynamic generation of contextual links in hypertext documents |
US6167434A (en) | 1998-07-15 | 2000-12-26 | Pang; Stephen Y. | Computer code for removing junk e-mail messages |
US6505191B1 (en) | 1998-07-24 | 2003-01-07 | Jarg Corporation | Distributed computer database system and method employing hypertext linkage analysis |
US6397221B1 (en) | 1998-09-12 | 2002-05-28 | International Business Machines Corp. | Method for creating and maintaining a frame-based hierarchically organized databases with tabularly organized data |
US20030079185A1 (en) | 1998-10-09 | 2003-04-24 | Sanjeev Katariya | Method and system for generating a document summary |
US6587856B1 (en) | 1998-12-07 | 2003-07-01 | Oracle International Corporation | Method and system for representing and accessing object-oriented data in a relational database system |
US20030123443A1 (en) * | 1999-04-01 | 2003-07-03 | Anwar Mohammed S. | Search engine with user activity memory |
US7181459B2 (en) | 1999-05-04 | 2007-02-20 | Iconfind, Inc. | Method of coding, categorizing, and retrieving network pages and sites |
US7031961B2 (en) | 1999-05-05 | 2006-04-18 | Google, Inc. | System and method for searching and recommending objects from a categorically organized information repository |
US20020016786A1 (en) | 1999-05-05 | 2002-02-07 | Pitkow James B. | System and method for searching and recommending objects from a categorically organized information repository |
US6571234B1 (en) * | 1999-05-11 | 2003-05-27 | Prophet Financial Systems, Inc. | System and method for managing online message board |
US20060010150A1 (en) | 1999-05-18 | 2006-01-12 | Kom, Inc. | Method and System for Electronic File Lifecycle Management |
US6963830B1 (en) | 1999-07-19 | 2005-11-08 | Fujitsu Limited | Apparatus and method for generating a summary according to hierarchical structure of topic |
US6321228B1 (en) * | 1999-08-31 | 2001-11-20 | Powercast Media, Inc. | Internet search system for retrieving selected results from a previous search |
US6697799B1 (en) | 1999-09-10 | 2004-02-24 | Requisite Technology, Inc. | Automated classification of items using cascade searches |
US6976053B1 (en) | 1999-10-14 | 2005-12-13 | Arcessa, Inc. | Method for using agents to create a computer index corresponding to the contents of networked computers |
US7022905B1 (en) | 1999-10-18 | 2006-04-04 | Microsoft Corporation | Classification of information and use of classifications in searching and retrieval of information |
US6665666B1 (en) | 1999-10-26 | 2003-12-16 | International Business Machines Corporation | System, method and program product for answering questions using a search engine |
US6490575B1 (en) | 1999-12-06 | 2002-12-03 | International Business Machines Corporation | Distributed network search engine |
US6785671B1 (en) | 1999-12-08 | 2004-08-31 | Amazon.Com, Inc. | System and method for locating web-based product offerings |
US20020099700A1 (en) * | 1999-12-14 | 2002-07-25 | Wen-Syan Li | Focused search engine and method |
US6546388B1 (en) * | 2000-01-14 | 2003-04-08 | International Business Machines Corporation | Metadata search results ranking system |
US6820237B1 (en) * | 2000-01-21 | 2004-11-16 | Amikanow! Corporation | Apparatus and method for context-based highlighting of an electronic document |
US20030033296A1 (en) | 2000-01-31 | 2003-02-13 | Kenneth Rothmuller | Digital media management apparatus and methods |
US6961910B2 (en) | 2000-02-17 | 2005-11-01 | International Business Machines Corporation | System for interacting with participants at a web site through an interactive visual proxy |
US6697840B1 (en) | 2000-02-29 | 2004-02-24 | Lucent Technologies Inc. | Presence awareness in collaborative systems |
US20010037328A1 (en) | 2000-03-23 | 2001-11-01 | Pustejovsky James D. | Method and system for interfacing to a knowledge acquisition system |
US20030093790A1 (en) * | 2000-03-28 | 2003-05-15 | Logan James D. | Audio and video program recording, editing and playback systems using metadata |
US20020059272A1 (en) * | 2000-04-20 | 2002-05-16 | Porter Edward W. | Apparatuses, methods, programming, and propagated signals for creating, editing, organizing and viewing collaborative databases |
US6976090B2 (en) | 2000-04-20 | 2005-12-13 | Actona Technologies Ltd. | Differentiated content and application delivery via internet |
US20010037377A1 (en) * | 2000-04-27 | 2001-11-01 | Yumiko Nakano | Information searching apparatus and method |
US6745178B1 (en) * | 2000-04-28 | 2004-06-01 | International Business Machines Corporation | Internet based method for facilitating networking among persons with similar interests and for facilitating collaborative searching for information |
US7054860B2 (en) | 2000-06-02 | 2006-05-30 | Hitachi, Ltd. | Method and system for retrieving a document and computer readable storage medium |
US7421645B2 (en) | 2000-06-06 | 2008-09-02 | Microsoft Corporation | Method and system for providing electronic commerce actions based on semantically labeled strings |
US6803906B1 (en) | 2000-07-05 | 2004-10-12 | Smart Technologies, Inc. | Passive touch system and method of detecting user input |
US20020174101A1 (en) * | 2000-07-12 | 2002-11-21 | Fernley Helen Elaine Penelope | Document retrieval system |
US6772188B1 (en) | 2000-07-14 | 2004-08-03 | America Online, Incorporated | Method and apparatus for communicating with an entity automatically identified in an electronic communication |
US6583798B1 (en) | 2000-07-21 | 2003-06-24 | Microsoft Corporation | On-object user interface |
US6948134B2 (en) | 2000-07-21 | 2005-09-20 | Microsoft Corporation | Integrated method for creating a refreshable Web Query |
US6633868B1 (en) | 2000-07-28 | 2003-10-14 | Shermann Loyall Min | System and method for context-based document retrieval |
US7027975B1 (en) | 2000-08-08 | 2006-04-11 | Object Services And Consulting, Inc. | Guided natural language interface system and method |
US6778951B1 (en) | 2000-08-09 | 2004-08-17 | Concerto Software, Inc. | Information retrieval method with natural language interface |
US6766320B1 (en) | 2000-08-24 | 2004-07-20 | Microsoft Corporation | Search engine with natural language-based robust parsing for user query and relevance feedback learning |
US6687704B1 (en) | 2000-08-31 | 2004-02-03 | Hewlett-Packard Development Company, L.P. | Database model system and method |
US20020103737A1 (en) | 2000-09-07 | 2002-08-01 | Briere Daniel D. | Marketing collateral repository and supporting data management and communication environment |
US6795825B2 (en) | 2000-09-12 | 2004-09-21 | Naphtali David Rishe | Database querying system and method |
US6950791B1 (en) | 2000-09-13 | 2005-09-27 | Antartica Systems, Inc. | Method for describing objects in a virtual space |
US20020040311A1 (en) * | 2000-10-04 | 2002-04-04 | John Douglass | Web browser page rating system |
US7451136B2 (en) | 2000-10-11 | 2008-11-11 | Microsoft Corporation | System and method for searching multiple disparate search engines |
US20020103806A1 (en) | 2000-10-18 | 2002-08-01 | Masafumi Yamanoue | Data supply controlling device, data supplying method, storage medium for data supplying program, and data supplying system |
US7099860B1 (en) | 2000-10-30 | 2006-08-29 | Microsoft Corporation | Image retrieval systems and methods with semantic and feature based relevance feedback |
US20020103698A1 (en) | 2000-10-31 | 2002-08-01 | Christian Cantrell | System and method for enabling user control of online advertising campaigns |
US20020095427A1 (en) | 2000-11-09 | 2002-07-18 | Kaplan Ari David | System, method and apparatus for the wireless monitoring and management of computer systems |
US7054870B2 (en) | 2000-11-15 | 2006-05-30 | Kooltorch, Llc | Apparatus and methods for organizing and/or presenting data |
US7318049B2 (en) | 2000-11-17 | 2008-01-08 | Gregory Fx Iannacci | System and method for an automated benefit recognition, acquisition, value exchange, and transaction settlement system using multivariable linear and nonlinear modeling |
US20020065800A1 (en) | 2000-11-30 | 2002-05-30 | Morlitz David M. | HTTP archive file |
US20020116291A1 (en) | 2000-12-22 | 2002-08-22 | Xerox Corporation | Recommender system and method |
US20020129059A1 (en) | 2000-12-29 | 2002-09-12 | Eck Jeffery R. | XML auto map generator |
US20030167266A1 (en) | 2001-01-08 | 2003-09-04 | Alexander Saldanha | Creation of structured data from plain text |
US6853998B2 (en) | 2001-02-07 | 2005-02-08 | International Business Machines Corporation | Customer self service subsystem for classifying user contexts |
US7062442B2 (en) | 2001-02-23 | 2006-06-13 | Popcatcher Ab | Method and arrangement for search and recording of media signals |
US6834287B1 (en) | 2001-03-14 | 2004-12-21 | Trilogy Development Group, Inc. | Classification engine for managing attribute-based data |
US20040122656A1 (en) | 2001-03-16 | 2004-06-24 | Eli Abir | Knowledge system method and appparatus |
US6850934B2 (en) | 2001-03-26 | 2005-02-01 | International Business Machines Corporation | Adaptive search engine query |
US7032174B2 (en) | 2001-03-27 | 2006-04-18 | Microsoft Corporation | Automatically adding proper names to a database |
US20040030741A1 (en) | 2001-04-02 | 2004-02-12 | Wolton Richard Ernest | Method and apparatus for search, visual navigation, analysis and retrieval of information from networks with remote notification and content delivery |
US7146399B2 (en) | 2001-05-25 | 2006-12-05 | 2006 Trident Company | Run-time architecture for enterprise integration with transformation generation |
US7293014B2 (en) | 2001-06-18 | 2007-11-06 | Siebel Systems, Inc. | System and method to enable searching across multiple databases and files using a single search |
US20030046311A1 (en) * | 2001-06-19 | 2003-03-06 | Ryan Baidya | Dynamic search engine and database |
US20030014398A1 (en) * | 2001-06-29 | 2003-01-16 | Hitachi, Ltd. | Query modification system for information retrieval |
US7043492B1 (en) | 2001-07-05 | 2006-05-09 | Requisite Technology, Inc. | Automated classification of items using classification mappings |
US20030020749A1 (en) * | 2001-07-10 | 2003-01-30 | Suhayya Abu-Hakima | Concept-based message/document viewer for electronic communications and internet searching |
US20030069877A1 (en) | 2001-08-13 | 2003-04-10 | Xerox Corporation | System for automatically generating queries |
US7007085B1 (en) | 2001-09-28 | 2006-02-28 | Bellsouth Intellectual Property Corporation | Message log for wireline, voice mail, email, fax, pager, instant messages and chat |
US20030093276A1 (en) * | 2001-11-13 | 2003-05-15 | Miller Michael J. | System and method for automated answering of natural language questions and queries |
US20030115552A1 (en) * | 2001-11-27 | 2003-06-19 | Jorg Jahnke | Method and system for automatic creation of multilingual immutable image files |
US6874126B1 (en) | 2001-11-30 | 2005-03-29 | View Space Technologies | Method and apparatus for controlling content display by the cursor motion |
US20030130982A1 (en) | 2002-01-09 | 2003-07-10 | Stephane Kasriel | Web-site analysis system |
US20030135499A1 (en) | 2002-01-14 | 2003-07-17 | Schirmer Andrew Lewis | System and method for mining a user's electronic mail messages to determine the user's affinities |
US20030135490A1 (en) | 2002-01-15 | 2003-07-17 | Barrett Michael E. | Enhanced popularity ranking |
US20030154071A1 (en) | 2002-02-11 | 2003-08-14 | Shreve Gregory M. | Process for the document management and computer-assisted translation of documents utilizing document corpora constructed by intelligent agents |
US20030158855A1 (en) | 2002-02-20 | 2003-08-21 | Farnham Shelly D. | Computer system architecture for automatic context associations |
US20040003097A1 (en) | 2002-05-17 | 2004-01-01 | Brian Willis | Content delivery system |
US20030220913A1 (en) * | 2002-05-24 | 2003-11-27 | International Business Machines Corporation | Techniques for personalized and adaptive search services |
US7231395B2 (en) | 2002-05-24 | 2007-06-12 | Overture Services, Inc. | Method and apparatus for categorizing and presenting documents of a distributed database |
US20040036716A1 (en) | 2002-06-12 | 2004-02-26 | Jordahl Jena J. | Data storage, retrieval, manipulation and display tools enabling multiple hierarchical points of view |
US20040001104A1 (en) | 2002-06-28 | 2004-01-01 | Microsoft Corporation | Resource browser sessions search |
US20040143569A1 (en) * | 2002-09-03 | 2004-07-22 | William Gross | Apparatus and methods for locating data |
US7082428B1 (en) | 2002-09-16 | 2006-07-25 | Bellsouth Intellectual Property Corporation | Systems and methods for collaborative searching |
US20040059730A1 (en) * | 2002-09-19 | 2004-03-25 | Ming Zhou | Method and system for detecting user intentions in retrieval of hint sentences |
US7194455B2 (en) | 2002-09-19 | 2007-03-20 | Microsoft Corporation | Method and system for retrieving confirming sentences |
US20040059564A1 (en) | 2002-09-19 | 2004-03-25 | Ming Zhou | Method and system for retrieving hint sentences using expanded queries |
US20040064447A1 (en) | 2002-09-27 | 2004-04-01 | Simske Steven J. | System and method for management of synonymic searching |
US20040068486A1 (en) | 2002-10-02 | 2004-04-08 | Xerox Corporation | System and method for improving answer relevance in meta-search engines |
US20040073534A1 (en) | 2002-10-11 | 2004-04-15 | International Business Machines Corporation | Method and apparatus for data mining to discover associations and covariances associated with data |
US20040139106A1 (en) * | 2002-12-31 | 2004-07-15 | International Business Machines Corporation | Search engine facility with automated knowledge retrieval generation and maintenance |
US20040133560A1 (en) * | 2003-01-07 | 2004-07-08 | Simske Steven J. | Methods and systems for organizing electronic documents |
US20060136405A1 (en) * | 2003-01-24 | 2006-06-22 | Ducatel Gary M | Searching apparatus and methods |
US7305129B2 (en) | 2003-01-29 | 2007-12-04 | Microsoft Corporation | Methods and apparatus for populating electronic forms from scanned documents |
US20040225667A1 (en) | 2003-03-12 | 2004-11-11 | Canon Kabushiki Kaisha | Apparatus for and method of summarising text |
US20040267700A1 (en) | 2003-06-26 | 2004-12-30 | Dumais Susan T. | Systems and methods for personal ubiquitous information retrieval and reuse |
US7162473B2 (en) * | 2003-06-26 | 2007-01-09 | Microsoft Corporation | Method and system for usage analyzer that determines user accessed sources, indexes data subsets, and associated metadata, processing implicit queries based on potential interest to users |
US20040267730A1 (en) * | 2003-06-26 | 2004-12-30 | Microsoft Corporation | Systems and methods for performing background queries from content and activity |
US20040267813A1 (en) | 2003-06-30 | 2004-12-30 | Rivers-Moore Jonathan E. | Declarative solution definition |
US20050065909A1 (en) | 2003-08-05 | 2005-03-24 | Musgrove Timothy A. | Product placement engine and method |
US7478089B2 (en) | 2003-10-29 | 2009-01-13 | Kontera Technologies, Inc. | System and method for real-time web page context analysis for the real-time insertion of textual markup objects and dynamic content |
US20050114306A1 (en) | 2003-11-20 | 2005-05-26 | International Business Machines Corporation | Integrated searching of multiple search sources |
US7194485B2 (en) | 2003-11-21 | 2007-03-20 | International Business Machines Corporation | Mapping XML schema components to qualified java components |
US20050125390A1 (en) * | 2003-12-03 | 2005-06-09 | Oliver Hurst-Hiller | Automated satisfaction measurement for web search |
US20050125382A1 (en) | 2003-12-03 | 2005-06-09 | Microsoft Corporation | Search system using user behavior data |
US7437353B2 (en) | 2003-12-31 | 2008-10-14 | Google Inc. | Systems and methods for unification of search results |
US20050198026A1 (en) | 2004-02-03 | 2005-09-08 | Dehlinger Peter J. | Code, system, and method for generating concepts |
US7412708B1 (en) | 2004-03-31 | 2008-08-12 | Google Inc. | Methods and systems for capturing information |
US20050222987A1 (en) | 2004-04-02 | 2005-10-06 | Vadon Eric R | Automated detection of associations between search criteria and item categories based on collective analysis of user activity data |
US7171352B2 (en) | 2004-04-23 | 2007-01-30 | Microsoft Corporation | Linguistic object model |
Non-Patent Citations (58)
Title |
---|
"askSam(TM) Making Information Useful," askSam,-Organize your Information with askSam, http://www.asksam.com/brochure.asp, printed Mar. 15, 2004. |
"askSam™ Making Information Useful," askSam,—Organize your Information with askSam, http://www.asksam.com/brochure.asp, printed Mar. 15, 2004. |
"Overview," Stuff I've Seen-Home Page, http://research.Microsoft.com/adapt/sis/index.htm, pp. 1-2, printed May 26, 2004. |
"Searching for the next Google-New trends are helping nimble startups elbow in to the plundered market," Red Herring-The Business of Technology, Mar. 9, 2004, http://redherring.com/PrintArticle.aspx?a=4782§or=Capital, p. 1-5, printed Mar. 30, 2004. |
"Selecting Task-Relevant Sources for Just-In-Time Retrieval," pp. 1-3, no date. |
"Standardization Priorities for the Directory-Directory Interoperability Forum White Paper," The Open Group, Dec. 2001, pp. 1-21. |
"WhenU Just-In-Time Marketing," http://www.whenu.com, printed Mar. 19, 2004. |
80-20 Software-Products-80-20 One Search, http://www.80-20.com/products/one-search/retriever.asp, printed Mar. 16, 2004. |
Alexa® Web Search-Toolbar Quick Tour, http://pages.alexa.com/prod-serv/quicktour.html, pp. 1-5, printed Mar. 16, 2004. |
Barrett, R. et al., "How to Personalize the Web," IBM Research, http://www.almaden.ibm.com/cs/wbi/papers/chi97/wbipaper.html, pp. 1-13, printed Mar. 16, 2004. |
Battelle, J., CNN.com "When geeks go camping, ideas hatch," http://www.cnn.com/2004/TECH/ptech/01/09/bus2.feat.geek.camp/index.html, pp. 1-3, printed Jan. 13, 2004. |
Berlin, J., et al., "Database Schema Matching Using Machine Learning with Feature Selection," CAISE 2002, LNCS 2348, pp. 452-466, http://www.springerlink.com/contant/73u6cpt0qek8rgh0/. |
Boyan, J., et al., "A Machine Learning Architecture for Optimizing Web Search Engines," School of Computer Science, Camegie Mellon University, May 10, 1996, pp. 1-8. |
Bradenbaugh, F., "Chapter 1 The Client-Side Search Engine," JavaScript Cookbook, 1st Ed., Oct. 1999, O'Reilly(TM) Online Catalog, http://www.oreilly.com/catalog/jscook/chapter/ch01.html, pp. 1-30, printed Dec. 29, 2003. |
Bradenbaugh, F., "Chapter 1 The Client-Side Search Engine," JavaScript Cookbook, 1st Ed., Oct. 1999, O'Reilly™ Online Catalog, http://www.oreilly.com/catalog/jscook/chapter/ch01.html, pp. 1-30, printed Dec. 29, 2003. |
Brill, E., "A Simple Rule-Based Part of Speech Tagger," Department of Computer Science, University of Pennsylvania, 1992, pp. 1-5. |
Brin, S., et al, "The Anatomy of a Large-Scale Hypertextual Web Search Engine," http://www7.scu.edu.au/programme/fullpapers/1921/com1921.htm, pp. 1-18, 1998. |
Budzik, J., et al., User Interactions with Everyday Applications as Context for Just-in-time Information Access, Intelligent Information Laboratory, Northwestern University, pp. 1-8, no date. |
Chen, H., et al., "Bringing Order to the Web: Automatically Categorizing Search Results," Proceedings of the SIGCHI Conference on Human Factors in Computing Systems, Apr. 2000, p. 145-152. |
Claypool, M., et al., "Inferring User Interest," IEEE Internet Computing, 2001, pp. 1-17, vol. 5, No. 6, located at http://web.cs.wpi.edu/~claypool/papers/iui/iui.pdf. |
Claypool, M., et al., "Inferring User Interest," IEEE Internet Computing, 2001, pp. 1-17, vol. 5, No. 6, located at http://web.cs.wpi.edu/˜claypool/papers/iui/iui.pdf. |
Czerwinski, M., et al., "Visualizing Implicit Queries for Information Management and Retrieval," ACM CHI '99, May 15-20, 1999, pp. 560-567. |
DEVONthink, http://www.devon-techonologies.com/products/devonthink.php, printed Mar. 16, 2004. |
dtSearch®-http://www.dtsearch.com/, printed Mar. 15, 2004. |
Dumais, S., et al, "Stuff I've Seen: A System for Personal Information Retrieval and Re-Use," Microsoft Research, SIGIR'03, Jul. 28-Aug. 1, 2003, pp. 1-8. |
Enfish, http://www.enfish.com, printed Mar. 16, 2004. |
Fast Search & Transfer-Home-Enterprise Search, http://solutions.altavista.com/en/news/pr-020402-desktop.shtmu, printed Mar. 16, 2004. |
Fertig, S., et al., "Lifestreams: An Alternative to the Desktop Metaphor," http://www.acm.org/sigchi/chi96/proceedings/videos/Fertig/etf.htm, pp. 1-3, printed Mar. 16, 2004. |
Garofalakis, M., et al., "XTRACT: A System for Extracting Document Type Descriptors from XML Documents," SIGMOD, ACM, Jun. 2000, p. 165-176, vol. 29, No. 2. |
Geisler, G., "Enriched Links: A Framework for Improving Web Navigation Using Pop-Up Views," pp. 1-14, 2000. |
Horvitz, E., et al., "The Lumiere project: Bayesian user modeling for inferring the goals and needs of software users", Proceedings of the Fourteenth Conference on Uncertainty, 1998, pp. 256-265, Morgan Kaufmann: San Francisco. |
International Search Report and Written Opinion, PCT/US2004/038562, Apr. 6, 2005, 12 pages. |
ISYS Search Software-ISYS: desktop, http://www.isysusa.com/products/desktop/index.html, printed Mar. 16, 2004. |
Joachims, T., et al., "WebWatcher: A Tour Guide for the World Wide Web," 1996. |
Joho, H., et al., "A Study of User Interaction with a Concept-Based Interactive Query Expansion Support Tool," Advances in Information Retrieval, A Study of User Interaction, Lecture Notes in Computer Science, Mar. 2, 2004, pp. 42-56, vol. 2997. |
Jones, G., et al., "Context-Aware Retrieval for Ubiquitous Computing Environments," Mobile and Ubiquitous Information Access, Lecture Notes in Computer Science, Jan. 27, 2004, pp. 227-243, vol. 2954. |
Knezevic, P. et al., "The Architecture Of The Obelix-An Improved Internet Search Engine," Proceedings of the 33rd Annual Hawaii International Conference on System Sciences (HICSS) Jan. 4-7, 2000, Maui, HI, USA, pp. 2145-2155. |
Li, W., et al., "Semantic Integration in Heterogeneous Databases Using Neural Networks," Proceedings of the 20th International Conference on Very Large Data Bases, Sep. 12-15, 1994, pp. 1-12, Morgan Kaufmann Publishers, San Francisco, CA. |
Li, W., et al., "SEMINT: A Tool for Identifying Attribute Correspondences in Heterogeneous Databases Using Neural Networks," Data Knowl. Eng., Apr. 2000, pp. 484, vol. 33, No. 1, http://dx.doi.org/10.1016/S0169-023X(99)00044-0. |
Markoff, J., "Google Moves Toward Clash with Microsoft," The New York Times, May 19, 2004, http://www.nytimes.com/2004/5/19/technology/19google.html?ex=1085964389&ei=1&e..., pp. 1-4, printed May 19, 2004. |
Morita, M. et al., "Information Filtering Based on User Behavior Analysis and Best Match Text Retrieval," Proceedings of the Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, Dublin, Jul. 3-6, 1994, pp. 272-281. |
Naraine, R., "Future of Search Will Make You Dizzy," Enterprise, May 20, 2004, http://www.internetnews.com/ent-news/article.php/3356831, pp. 1-4, printed May 21, 2004. |
Pasca, M., "Acquisition of Categorized Named Entities for Web Search," Proceedings of the 13th ACM International Conference on Information and Knowledge Management, Nov. 2004, pp. 137-145. |
Phelps, A., "All You Can Seek," Special Services, Jul. 1999, vol. 7, Iss. 7, [online] [Retrieved on Oct. 16, 2006] Retrieved from the Internet: http://www.smartcomputing.com/editorial/article.asp?article=articles/archive/g0707/26g07/26g07.asp. |
Rhodes, B., "Margin Notes Building a Contextually Aware Associative Memory," The Proceedings of the International Conference on Intelligent User Interfaces (IUI'00), Jan. 9-12, 2000. |
Rhodes, B., et al., "Just-in-time information retrieval agents," Systems Journal, vol. 39, Nos. 3&4, 2000, pp. 685-704. |
Rhodes, B., et al., "Remembrance Agent-A continuously running automated information retrieval system," The Proceedings of the First International Conference on the Practical Application of Intelligent Agents and Multi Agent Technology (PAAM '98), pp. 487-495. |
Rizzo, T., "WinFS 101: Introducing the New Windows File System," Longhorn Developer Center Home: Headline Archive: WinFS 101: Introducing the New . . . , http://msdn.Microsoft.com/Longhorn/archive/default.aspx?pull+/library/en-us/dnwinfs/htm..., pp. 1-5, printed Apr. 21, 2004. |
Scha, R., et al., "An Augmented Context Free Grammar for Discourse," Proceedings of the 12th Conference on Computational Linguistics-vol. 2, Computational Linguistics, Aug. 22-27, 1988, pp. 573-577, Morristown, NJ, http://dx.doi.org/10.3115/991719.991756. |
Shedherd, M., et al., "Browsing and Keyword-Based Profiles: A Cautionary Tale," Proceedings of the 34th Hawaii International Conference on System Sciences, Jan. 3-6, 2001, pp. 1365-1373. |
Sherman, C., "HotBot's New Desktop Search Toolbar," www.searchenginewatch.com, http://searchenginewatch.com/searchday/print.php/34711-339921, pp. 1-3, printed Apr. 14, 2004. |
Sullivan, D., "Alta Vista Releases Search Software," The Search Engine Report, Aug. 4, 1998, pp. 1-2. |
U.S. Appl. No. 10/749,440, filed Dec. 31, 2003, Badros et al. |
WebWatcher Home Page, "Welcome to the WebWatcher Project," http://www-2.cs.cmu.edu/~webwatcher/, printed Oct. 15, 2003. |
WebWatcher Home Page, "Welcome to the WebWatcher Project," http://www-2.cs.cmu.edu/˜webwatcher/, printed Oct. 15, 2003. |
White, R., et al., "The Use of Implicit Evidence for Relevance Feedback in Web Retrieval," Lecture Notes in Computer Science, Jan. 1, 2002, pp. 93-109, vol. 2291. |
X1 instantly searches files & email. For outlook, Outlook, http://www.x1.com/, printed Mar. 15, 2004. |
Zellweger, P., et al., "Fluid Links for Informed and Incremental Link Transitions," Proceedings of Hypertext'98, Pittsburgh, PA, Jun. 20-24, 1998, pp. 50-57. |
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US11809432B2 (en) | 2002-01-14 | 2023-11-07 | Awemane Ltd. | Knowledge gathering system based on user's affinity |
US20130173609A1 (en) * | 2004-11-22 | 2013-07-04 | Facebook, Inc. | Systems and methods for sorting search results |
US20130080424A1 (en) * | 2004-11-22 | 2013-03-28 | Facebook, Inc. | Systems and methods for sorting search results |
US8788488B2 (en) * | 2004-11-22 | 2014-07-22 | Facebook, Inc. | Ranking search results based on recency |
US20060271520A1 (en) * | 2005-05-27 | 2006-11-30 | Ragan Gene Z | Content-based implicit search query |
US8401841B2 (en) * | 2006-08-31 | 2013-03-19 | Orcatec Llc | Retrieval of documents using language models |
US20080059187A1 (en) * | 2006-08-31 | 2008-03-06 | Roitblat Herbert L | Retrieval of Documents Using Language Models |
US8036926B2 (en) * | 2007-03-12 | 2011-10-11 | International Business Machines Corporation | Techniques for selecting calendar events by examining content of user's recent e-mail activity |
US20080228548A1 (en) * | 2007-03-12 | 2008-09-18 | Mcbrearty Gerald F | System and method for selecting calendar events by examining content of user's recent e-mail activity |
US20080294619A1 (en) * | 2007-05-23 | 2008-11-27 | Hamilton Ii Rick Allen | System and method for automatic generation of search suggestions based on recent operator behavior |
US20090055426A1 (en) * | 2007-08-20 | 2009-02-26 | Samsung Electronics Co., Ltd. | Method and system for generating playlists for content items |
US8156118B2 (en) * | 2007-08-20 | 2012-04-10 | Samsung Electronics Co., Ltd. | Method and system for generating playlists for content items |
US8370351B2 (en) | 2007-08-20 | 2013-02-05 | Samsung Electronics Co., Ltd. | Method and system for generating playlists for content items |
US20100094831A1 (en) * | 2008-10-14 | 2010-04-15 | Microsoft Corporation | Named entity resolution using multiple text sources |
US20100145939A1 (en) * | 2008-12-05 | 2010-06-10 | Yahoo! Inc. | Determining related keywords based on lifestream feeds |
US8515908B2 (en) | 2008-12-05 | 2013-08-20 | Yahoo! Inc. | Determining related keywords based on lifestream feeds |
US8112393B2 (en) * | 2008-12-05 | 2012-02-07 | Yahoo! Inc. | Determining related keywords based on lifestream feeds |
US8849806B2 (en) * | 2010-03-23 | 2014-09-30 | Blackberry Limited | Method, system and apparatus for efficiently determining priority of data in a database |
US20110238671A1 (en) * | 2010-03-23 | 2011-09-29 | Research In Motion Limited | Method, system and apparatus for efficiently determining priority of data in a database |
US9111289B2 (en) * | 2011-08-25 | 2015-08-18 | Ebay Inc. | System and method for providing automatic high-value listing feeds for online computer users |
US10311488B2 (en) | 2011-08-25 | 2019-06-04 | Ebay Inc. | System and method for providing automatic high-value listing feeds for online computer users |
US9098543B2 (en) | 2013-03-14 | 2015-08-04 | Wal-Mart Stores, Inc. | Attribute detection |
US9436744B2 (en) | 2014-05-08 | 2016-09-06 | Accenture Global Services Limited | Combining internal and external search results |
US9838348B2 (en) | 2014-12-31 | 2017-12-05 | Yahoo Holdings, Inc. | Electronic message search system and method |
US20230281257A1 (en) * | 2022-01-31 | 2023-09-07 | Walmart Apollo, Llc | Systems and methods for determining and utilizing search token importance using machine learning architectures |
Also Published As
Publication number | Publication date |
---|---|
US20070276829A1 (en) | 2007-11-29 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US7693825B2 (en) | Systems and methods for ranking implicit search results | |
US9009153B2 (en) | Systems and methods for identifying a named entity | |
US8631001B2 (en) | Systems and methods for weighting a search query result | |
US8543572B2 (en) | Systems and methods for analyzing boilerplate | |
US7664734B2 (en) | Systems and methods for generating multiple implicit search queries | |
US7788274B1 (en) | Systems and methods for category-based search | |
US11860921B2 (en) | Category-based search | |
US7873632B2 (en) | Systems and methods for associating a keyword with a user interface area | |
US20070276801A1 (en) | Systems and methods for constructing and using a user profile | |
US20070282797A1 (en) | Systems and methods for refreshing a content display | |
US7437353B2 (en) | Systems and methods for unification of search results | |
US7725508B2 (en) | Methods and systems for information capture and retrieval | |
KR100932999B1 (en) | Browsing documents by links automatically generated based on user information and content | |
US9672232B1 (en) | Systems and methods for selectively storing event data | |
US20090276408A1 (en) | Systems And Methods For Generating A User Interface | |
US20080059419A1 (en) | Systems and methods for providing search results | |
US7580568B1 (en) | Methods and systems for identifying an image as a representative image for an article | |
US6804704B1 (en) | System for collecting and storing email addresses with associated descriptors in a bookmark list in association with network addresses of electronic documents using a browser program | |
US20050114324A1 (en) | System and method for improved searching on the internet or similar networks and especially improved MetaNews and/or improved automatically generated newspapers | |
US20070043761A1 (en) | Semantic discovery engine | |
US20050149498A1 (en) | Methods and systems for improving a search ranking using article information | |
US8667013B1 (en) | Systems and methods for determining an article association measure | |
US7761439B1 (en) | Systems and methods for performing a directory search | |
WO2002093418A1 (en) | Personal document system and method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
AS | Assignment |
Owner name: GOOGLE INC., CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WANG, NINIANE;LAWRENCE, STEPHEN R.;REEL/FRAME:015880/0220 Effective date: 20040716 Owner name: GOOGLE INC.,CALIFORNIA Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:WANG, NINIANE;LAWRENCE, STEPHEN R.;REEL/FRAME:015880/0220 Effective date: 20040716 |
|
STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
FPAY | Fee payment |
Year of fee payment: 4 |
|
AS | Assignment |
Owner name: GOOGLE LLC, CALIFORNIA Free format text: CHANGE OF NAME;ASSIGNOR:GOOGLE INC.;REEL/FRAME:044101/0610 Effective date: 20170929 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 8TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1552) Year of fee payment: 8 |
|
MAFP | Maintenance fee payment |
Free format text: PAYMENT OF MAINTENANCE FEE, 12TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1553); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY Year of fee payment: 12 |