A system and method are provided for searching for desired items from a network of information resources. In particular, the system and method have advantageous applicability to searching for World Wide Web pages having desired content. An initial set of pages are selected, preferably by running a conventional keyword-based query, and then further selecting pages pointing to, or pointed to from, the pages found by the keyword-based query. Alternatively, the invention may be applied to a single page, where the initial set includes pages pointed to by the single page and pages which point to the single page. Then, iteratively, authoritativeness values are computed for the pages of the initial set, based on the number of links to and from the pages. One or more communities, or "neighborhoods", of related pages are defined based on the authoritativeness values thus produced. Such communities of pages are likely to be of particular interest and value to the user who is interested in the... |
Citations|
| US5257185 | May 21, 1990 | Oct 26, 1993 | Ann W. Farley | Interactive, cross-referenced knowledge system | | US5446891 | Nov 2, 1994 | Aug 29, 1995 | International Business Machines Corporation | System for adjusting hypertext links with weighed user goals and activities | | US5778363 | Dec 30, 1996 | Jul 7, 1998 | Intel Corporation | Method for measuring thresholded relevance of a document to a specified topic | | US5826031 | Jun 10, 1996 | Oct 20, 1998 | Sun Microsystems, Inc. | Method and system for prioritized downloading of embedded web objects | | US5835905 | Apr 9, 1997 | Nov 10, 1998 | Xerox Corporation | System for predicting documents relevant to focus documents by spreading activation through network representations of a linked collection of documents |
Referenced by|
| US6311177 | Feb 16, 1999 | Oct 30, 2001 | International Business Machines Corporation | Accessing databases when viewing text on the web | | US6336112 | Mar 16, 2001 | Jan 1, 2002 | International Business Machines Corporation | Method for interactively creating an information database including preferred information elements, such as, preferred-authority, world wide web pages | | US6397210 | Aug 4, 1999 | May 28, 2002 | International Business Machines Corporation | Network interactive tree search method and system | | US6397218 | Aug 4, 1999 | May 28, 2002 | International Business Machines Corporation | Network interactive search engine server and method | | US6401118 | Aug 13, 1998 | Jun 4, 2002 | Online Monitoring Services | Method and computer program product for an online monitoring search engine | | US6408316 | Dec 17, 1998 | Jun 18, 2002 | International Business Machines Corporation | Bookmark set creation according to user selection of selected pages satisfying a search condition | | US6665837 | Aug 10, 1998 | Dec 16, 2003 | Overture Services, Inc. | Method for identifying related pages in a hyperlinked database | | US6671714 | Nov 23, 1999 | Dec 30, 2003 | | Method, apparatus and business system for online communications with online and offline recipients | | US6675170 | Jun 29, 2000 | Jan 6, 2004 | NEC Laboratories America, Inc. | Method to efficiently partition large hyperlinked databases by hyperlink structure | | US6684254 | May 31, 2000 | Jan 27, 2004 | International Business Machines Corporation | Hyperlink filter for "pirated" and "disputed" copyright material on the internet in a method, system and program | | US6711569 | Jul 24, 2001 | Mar 23, 2004 | Bright Planet Corporation | Method for automatic selection of databases for searching | | US6745178 | Apr 28, 2000 | Jun 1, 2004 | International Business Machines Corporation | Internet based method for facilitating networking among persons with similar interests and for facilitating collaborative searching for information | | US6799176 | Jul 6, 2001 | Sep 28, 2004 | The Board of Trustees of the Leland Stanford Junior University | Method for scoring documents in a linked database | | US6839702 | Dec 13, 2000 | Jan 4, 2005 | Google Inc. | Systems and methods for highlighting search results | | US6938206 | Jan 19, 2001 | Aug 30, 2005 | Transolutions, Inc. | System and method for creating a clinical resume | | US7000199 | May 9, 2001 | Feb 14, 2006 | FairIsaac and Company Inc. | Methodology for viewing large strategies via a computer workstation | | US7010527 | Aug 13, 2001 | Mar 7, 2006 | Oracle International Corp. | Linguistically aware link analysis method and system | | US7028029 | Aug 23, 2004 | Apr 11, 2006 | Google Inc. | Adaptive computation of ranking | | US7058628 | Jul 2, 2001 | Jun 6, 2006 | The Board of Trustees of the Leland Stanford Junior University | Method for node ranking in a linked database | | US7076483 | Aug 27, 2001 | Jul 11, 2006 | Xyleme SA | Ranking nodes in a graph | | US7096214 | Dec 13, 2000 | Aug 22, 2006 | Google Inc. | System and method for supporting editorial opinion in the ranking of search results | | US7111232 | Mar 6, 2002 | Sep 19, 2006 | Thomas Layne Bascom | Method and system for making document objects available to users of a network | | US7139974 | Mar 6, 2002 | Nov 21, 2006 | Thomas Layne Bascom | Framework for managing document objects stored on a network | | US7158971 | Apr 10, 2002 | Jan 2, 2007 | Thomas Layne Bascom | Method for searching document objects on a network | | US7159023 | Dec 16, 2003 | Jan 2, 2007 | Alexa Internet | Use of web usage trail data to identify relationships between browsable items | | US7165069 | Jun 28, 1999 | Jan 16, 2007 | Alexa Internet | Analysis of search activities of users to identify related network sites | | US7167871 | Sep 3, 2002 | Jan 23, 2007 | Xerox Corporation | Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections | | US7188117 | Sep 3, 2002 | Mar 6, 2007 | Xerox Corporation | Systems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections | | US7194466 | May 1, 2003 | Mar 20, 2007 | Microsoft Corporation | Object clustering using inter-layer links | | US7216123 | Aug 22, 2003 | May 8, 2007 | Board of Trustees of the LeLand Stanford Junior University | Methods for ranking nodes in large directed graphs | | US7249126 | Dec 29, 2004 | Jul 24, 2007 | Shopping.com | Systems and methods for dynamically updating relevance of a selected item | | US7260568 | Apr 15, 2004 | Aug 21, 2007 | Microsoft Corporation | Verifying relevance between keywords and web site contents | | US7266553 | Jul 1, 2002 | Sep 4, 2007 | Microsoft Corporation | Content data indexing | | US7269587 | Dec 1, 2004 | Sep 11, 2007 | The Board of Trustees of the Leland Stanford Junior University | Scoring documents in a linked database | | US7281005 | Oct 20, 2003 | Oct 9, 2007 | Telenor ASA | Backward and forward non-normalized link weight analysis method, system, and computer program product | | US7289985 | Apr 15, 2004 | Oct 30, 2007 | Microsoft Corporation | Enhanced document retrieval | | US7293007 | Apr 29, 2004 | Nov 6, 2007 | Microsoft Corporation | Method and system for identifying image relatedness using link and page layout analysis | | US7299270 | Jul 10, 2001 | Nov 20, 2007 | Lycos, Inc. | Inferring relations between internet objects that are not connected directly | | US7305389 | Apr 15, 2004 | Dec 4, 2007 | Microsoft Corporation | Content propagation for enhanced document retrieval | | US7366705 | Aug 16, 2004 | Apr 29, 2008 | Microsoft Corporation | Clustering based text classification | | US7372903 | Sep 22, 1999 | May 13, 2008 | MediaTek, Inc. | Apparatus and method for object based rate control in a coding system | | US7386543 | Jun 30, 2006 | Jun 10, 2008 | Google Inc. | System and method for supporting editorial opinion in the ranking of search results | | US7386792 | Jan 18, 2002 | Jun 10, 2008 | Thomas Layne Bascom | System and method for collecting, storing, managing and providing categorized information related to a document object | | US7389241 | Apr 9, 2002 | Jun 17, 2008 | Thomas Layne Bascom | Method for users of a network to provide other users with access to link relationships between documents | | US7421432 | Dec 13, 2000 | Sep 2, 2008 | Google Inc. | Hypertext browser assistant | | US7493320 | Aug 16, 2004 | Feb 17, 2009 | Telenor ASA | Method, system, and computer program product for ranking of documents using link analysis, with remedies for sinks | | US7499934 | May 23, 2006 | Mar 3, 2009 | International Business Machines Corporation | Method for linking documents | | US7512587 | Jul 1, 2004 | Mar 31, 2009 | Microsoft Corporation | Efficient computation of web page rankings | | US7529756 | Dec 22, 2000 | May 5, 2009 | West Services, Inc. | System and method for processing formatted text documents in a database | | US7565630 | Jun 15, 2004 | Jul 21, 2009 | Google Inc. | Customization of search results for search queries received from third party sites | | US7584183 | Feb 1, 2006 | Sep 1, 2009 | Yahoo! Inc. | Method for node classification and scoring by combining parallel iterative scoring calculation | | US7593981 | Nov 3, 2006 | Sep 22, 2009 | Alexa Internet | Detection of search behavior based associations between web sites | | US7630973 | Nov 3, 2003 | Dec 8, 2009 | Yahoo! Inc. | Method for identifying related pages in a hyperlinked database | | US7668822 | Sep 18, 2006 | Feb 23, 2010 | Become, Inc. | Method for assigning quality scores to documents in a linked database | | US7676464 | Mar 17, 2006 | Mar 9, 2010 | International Business Machines Corporation | Page-ranking via user expertise and content relevance | | US7676555 | Dec 4, 2006 | Mar 9, 2010 | BrightPlanet Corporation | System and method for efficient control and capture of dynamic database content | | US7680812 | Sep 16, 2005 | Mar 16, 2010 | Telenor ASA | Method, system, and computer program product for searching for, navigating among, and ranking of documents in a personal web | | US7689585 | Apr 15, 2004 | Mar 30, 2010 | Microsoft Corporation | Reinforced clustering of multi-type data objects for search term suggestion | | US7689666 | Aug 28, 2007 | Mar 30, 2010 | | System and method for restricting internet access of a computer | | US7716223 | Dec 1, 2004 | May 11, 2010 | Google Inc. | Variable personalization of search results in a search engine | | US7752186 | Dec 20, 2004 | Jul 6, 2010 | AOL Inc. | Grouping multimedia and streaming media search results | | US7778954 | Mar 6, 2006 | Aug 17, 2010 | West Publishing Corporation | Systems, methods, and software for presenting legal case histories | | US7797344 | Dec 23, 2005 | Sep 14, 2010 | Become, Inc. | Method for assigning relative quality scores to a collection of linked documents | | US7809705 | Feb 13, 2007 | Oct 5, 2010 | Yahoo! Inc. | System and method for determining web page quality using collective inference based on local and global information | | US7831526 | Aug 27, 2007 | Nov 9, 2010 | Fair Isaac Corporation | Article and method for finding a compact representation to visualize complex decision trees | | US7853586 | Oct 18, 2004 | Dec 14, 2010 | Google Inc. | Highlighting occurrences of terms in documents or search results | | US7873652 | Sep 2, 2005 | Jan 18, 2011 | Charles E. Hill & Associates, Inc. | Electronic presentation generation system and method | | US7882105 | Jul 24, 2006 | Feb 1, 2011 | France Telecom | Method of ranking a set of electronic documents of the type possibly containing hypertext links to other electronic documents | | US7908260 | Dec 31, 2007 | Mar 15, 2011 | BrightPlanet Corporation II, Inc. | Source editing, internationalization, advanced configuration wizard, and summary page selection for information automation systems | | US7908277 | Feb 5, 2007 | Mar 15, 2011 | The Board of Trustees of the Leland Stanford Junior University | Annotating links in a document based on the ranks of documents pointed to by the links | | US7912831 | Oct 3, 2006 | Mar 22, 2011 | Yahoo! Inc. | System and method for characterizing a web page using multiple anchor sets of web pages | | US7925967 | Jun 8, 2001 | Apr 12, 2011 | AOL Inc. | Metadata quality improvement | | US7970768 | Aug 20, 2007 | Jun 28, 2011 | Microsoft Corporation | Content data indexing with content associations | | US7987115 | Dec 27, 2007 | Jul 26, 2011 | Institute for Information Industry | Advertisement selection systems and methods for internet articles | | US7987189 | Aug 20, 2007 | Jul 26, 2011 | Microsoft Corporation | Content data indexing and result ranking | | US7991755 | Dec 17, 2004 | Aug 2, 2011 | International Business Machines Corporation | Dynamically ranking nodes and labels in a hyperlinked database | | US8055663 | Dec 20, 2006 | Nov 8, 2011 | Institute for Information Industry | Systems and methods for measuring behavior characteristics | | US8065301 | Jul 11, 2007 | Nov 22, 2011 | Shopping.com | Systems and methods for dynamically updating relevance of a selected item | | US8095529 | Jan 4, 2005 | Jan 10, 2012 | AOL Inc. | Full-text relevancy ranking | | US8126884 | Jan 28, 2010 | Feb 28, 2012 | The Board of Trustees of the Leland Stanford Junior University | Scoring documents in a linked database | | US8131715 | Jan 19, 2010 | Mar 6, 2012 | The Board of Trustees of the Leland Stanford Junior University | Scoring documents in a database | | US8131717 | Jan 19, 2010 | Mar 6, 2012 | The Board of Trustees of the Leland Stanford Junior University | Scoring documents in a database | | US8135731 | Dec 2, 2004 | Mar 13, 2012 | International Business Machines Corporation | Administration of search results | | US8161040 | Apr 30, 2008 | Apr 17, 2012 | Piffany, Inc. | Criteria-specific authority ranking | | US8161050 | Nov 20, 2008 | Apr 17, 2012 | Microsoft Corporation | Visualizing hyperlinks in a search results list | | US8176033 | Apr 21, 2009 | May 8, 2012 | NTT DoCoMo, Inc. | Document processing device and document processing method | | US8180776 | Mar 9, 2010 | May 15, 2012 | Google Inc. | Variable personalization of search results in a search engine | | US8195651 | Feb 2, 2010 | Jun 5, 2012 | The Board of Trustees of the Leland Stanford Junior University | Scoring documents in a linked database | | US8200609 | Aug 29, 2008 | Jun 12, 2012 | Fair Isaac Corporation | Construction of decision logic with graphs | | US8209311 | Jul 1, 2010 | Jun 26, 2012 | AOL Inc. | Methods and systems for grouping uniform resource locators based on masks | | US8229782 | Dec 17, 2009 | Jul 24, 2012 | Amazon.com, Inc. | Methods and systems for processing distributed feedback | | US8237716 | Sep 8, 2008 | Aug 7, 2012 | Fair Isaac Corporation | Algorithm for drawing directed acyclic graphs |
Claims1. A computer program product, for use with a computer system, for directing the computer system to execute a search of information resources, the resources having content-based links between each other, to identify a desired subset of the information resources which satisfy a desired criterion, the computer program product comprising: - a computer-readable medium;
- means, provided on the recording medium, for directing the computer system to identify an initial set of information resources;
- means, provided on the recording medium, for directing the computer system to define initial authoritativeness information for the initial set;
- means, provided on the recording medium, for directing the computer system to use the initial authoritativeness information as input authoritativeness information, to execute the steps of:
- (i) producing first authoritativeness information about a set of information resources pointed to by links in resources of the input set, and
- (ii) producing second authoritativeness information about a set of information resources having links that point to resources of the input set; and
- means, provided on the recording medium, for directing the computer system to produce a final set of information resources based on the first and second authoritativeness information.
2. A computer program product as recited in claim 1, wherein the information resources include World Wide Web pages, and the content-based links include hyperlinks. 3. A computer program product as recited in claim 1, wherein the means for directing to identify an initial set of information resources includes means, provided on the recording medium, for directing the computer system to obtain, as an input, an information resource containing subject matter of interest. 4. A computer program product as recited in claim 3, wherein the means for directing to identify an initial set of information resources includes means, provided on the recording medium, for directing the computer system to identify a further set of information resources linked to the input information resource. 5. A computer program product as recited in claim 1, wherein: - the means for directing to execute the steps of producing first and second authoritativeness information is operative in a series of iterations;
- the initial authoritativeness information is used as input authoritativeness information for a first iteration; and the produced first and second authoritativeness information is a result of the iteration, the first and second authoritativeness information produced in a given iteration to be used as the input authoritativeness information for the next iteration.
6. A computer program product as recited in claim 1 further comprising means, provided on the recording medium, for directing the computer system to execute the steps of producing first authoritativeness information and producing second authoritativeness information in a series of iterations until a predetermined condition is met. 7. A computer program product as recited in claim 6, wherein the predetermined condition includes the execution of a specified number of iterations. 8. A computer program product as recited in claim 6, wherein the predetermined condition includes a steady state in which further iterations result in substantially the same results. 9. A computer program product as recited in claim 6, wherein the means for directing to identify an initial set of information resources includes means, provided on the recording medium, for directing the computer system to execute a keyword-based query search, results of the search including information resources to be included in the initial set. 10. A computer program product as recited in claim 9, wherein the means for directing to identify an initial set of information resources further includes means, provided on the recording medium, for directing the computer system to identify information resources linked to or from the information resources which are the results of the search, the former information resources also to be included in the initial set. 11. A computer program product as recited in claim 10, wherein the means for directing to define initial authoritativeness information includes means, provided on the recording medium, for directing the computer system to select an initial numerical authoritativeness value for each of the information resources of the initial set. 12. A computer program product as recited in claim 11, wherein the means for directing to define initial authoritativeness information further includes means, provided on the recording medium, for directing the computer system to define an authority value and a hub value for each of the information resources of the initial set. 13. A computer program product as recited in claim 12, wherein the defined authority values and hub values are processed as vectors, each vector containing a respective term corresponding with each respective one of the information resources of the initial set, and having stored therein the value defined for that respective one of the information resources of the initial set. 14. A computer program product as recited in claim 12, wherein: - an initial hub value is defined as 1 if the information resource was found by the keyword-based query search, and 0 if the information resource is linked to or from the information resources which are the results of the search; and
- an initial authority value is defined as 0 for all information resources.
15. A computer program product as recited in claim 12, wherein, for each iteration: - the hub value for an information resource is updated as the sum of the authority values for authority information resources which point to the hub information resource; and
- the authority value for an information resource is updated as the sum of the hub values for hub information resources which are pointed to by the information resource.
16. A computer program product as recited in claim 15, wherein each iteration further includes normalizing the hub and authority values for the information resources. 17. A computer program product as recited in claim 1, wherein the means for directing to produce a final set of information resources includes means, provided on the recording medium, for directing the computer system to select information resources from the set based on their hub and authority values. 18. A computer program product as recited in claim 17, wherein the means for directing to select includes means, provided on the recording medium, for directing the computer system to select information resources whose hub values or authority values have greatest magnitudes. 19. A computer program product as recited in claim 17, wherein the means for directing to select includes means, provided on the recording medium, for directing the computer system to select a plurality of successive communities, selecting each successive community including selecting information resources whose hub values or authority values have greatest magnitudes of those information resources not already selected for a prior community. 20. A method for executing a search of information resources, the resources having content-based links between each other, to identify a desired subset of the information resources which satisfy a desired criterion, the method comprising the steps of: - identifying an initial set of information resources;
- defining initial authoritativeness information for the initial set;
- using the initial authoritativeness information as input authoritativeness information, executing the steps of:
- (i) producing first authoritativeness information about a set of information resources pointed to by links in resources of the input set, and
- (ii) producing second authoritativeness information about a set of information resources having links that point to resources of the input set; and
- producing a final set of information resources based on the first and second authoritativeness information.
21. A method as recited in claim 20, wherein the information resources include World Wide Web pages, and the content-based links include hyperlinks. 22. A method as recited in claim 20, wherein the step of identifying an initial set of information resources includes obtaining, as an input, an information resource containing subject matter of interest. 23. A method as recited in claim 22, wherein the step of identifying an initial set of information resources includes identifying a further set of information resources linked to the input information resource. 24. A method as recited in claim 20, wherein: - the step of executing the steps of producing first and second authoritativeness information is executed in a series of iterations;
- the initial authoritativeness information is used as input authoritativeness information for a first iteration; and
- the produced first and second authoritativeness information is a result of the iteration, the first and second authoritativeness information produced in a given iteration to be used as the input authoritativeness information for the next iteration.
25. A method as recited in claim 20, wherein the steps of producing first authoritativeness information and producing second authoritativeness information are executed in a series of iterations until a predetermined condition is met. 26. A method as recited in claim 25, wherein the predetermined condition includes the execution of a specified number of iterations. 27. A method as recited in claim 25, wherein the predetermined condition includes a steady state in which further iterations result in substantially the same results. 28. A method as recited in claim 25, wherein the step of identifying an initial set of information resources includes executing a keyword-based query search, results of the search including information resources to be included in the initial set. 29. A method as recited in claim 28, wherein the step of identifying an initial set of information resources further includes identifying information resources linked to or from the information resources which are the results of the search, the former information resources also to be included in the initial set. 30. A method as recited in claim 29, wherein the step of defining initial authoritativeness information includes selecting an initial numerical authoritativeness value for each of the information resources of the initial set. 31. A method as recited in claim 30, wherein the step of defining initial authoritativeness information further includes defining an authority value and a hub value for each of the information resources of the initial set. 32. A method as recited in claim 31, wherein the defined authority values and hub values are processed as vectors, each vector containing a respective term corresponding with each respective one of the information resources of the initial set, and having stored therein the value defined for that respective one of the information resources of the initial set. 33. A method as recited in claim 31, wherein: - an initial hub value is defined as 1 if the information resource was found by the keyword-based query search, and 0 if the information resource is linked to or from the information resources which are the results of the search; and
- an initial authority value is defined as 0 for all information resources.
34. A method as recited in claim 31, wherein, for each iteration: - the hub value for an information resource is updated as the sum of the authority values for authority information resources which point to the hub information resource; and
- the authority value for an information resource is updated as the sum of the hub values for hub information resources which are pointed to by the information resource.
35. A method as recited in claim 34, wherein each iteration further includes normalizing the hub and authority values for the information resources. 36. A method as recited in claim 20, wherein: - each information resource is associated with an authority value and a hub value; and
- the step of producing a final set of information resources includes selecting information resources from the set based on the hub and authority values.
37. A method as recited in claim 36, wherein the step of selecting includes selecting information resources whose hub values or authority values have greatest magnitudes. 38. A method as recited in claim 36, wherein the step of selecting includes selecting a plurality of successive communities, selecting each successive community including selecting information resources whose hub values or authority values have greatest magnitudes of those information resources not already selected for a prior community. 39. A system for executing a search of information resources, the resources having content-based links between each other, to identify a desired subset of the information resources which satisfy a desired criterion, the system comprising: - means for identifying an initial set of information resources;
- means for defining initial authoritativeness information for the initial set;
- means for using the initial authoritativeness information as input authoritativeness information, to execute the steps of:
- (i) producing first authoritativeness information about a set of information resources pointed to by links in resources of the input set, and
- (ii) producing second authoritativeness information about a set of information resources having links that point to resources of the input set; and
- means for producing a final set of information resources based on the first and second authoritativeness information.
40. A system as recited in claim 39, wherein the information resources include World Wide Web pages, and the content-based links include hyperlinks. 41. A system as recited in claim 39, wherein the means for identifying an initial set of information resources includes means for obtaining, as an input, an information resource containing subject matter of interest. 42. A system as recited in claim 41, wherein the means for identifying an initial set of information resources includes means for identifying a further set of information resources linked to the input information resource. 43. A system as recited in claim 39, wherein: - the means for executing the steps of producing first and second authoritativeness information is operative in a series of iterations;
- the initial authoritativeness information is used as input authoritativeness information for a first iteration; and
- the produced first and second authoritativeness information is a result of the iteration, the first and second authoritativeness information produced in a given iteration to be used as the input authoritativeness information for the next iteration.
44. A system as recited in claim 39 further comprising means for executing the steps of producing first authoritativeness information and producing second authoritativeness information in a series of iterations until a predetermined condition is met. 45. A system as recited in claim 44, wherein the predetermined condition includes the execution of a specified number of iterations. 46. A system as recited in claim 44, wherein the predetermined condition includes a steady state in which further iterations result in substantially the same results. 47. A system as recited in claim 44, wherein the means for identifying an initial set of information resources includes means for executing a keyword-based query search, results of the search including information resources to be included in the initial set. 48. A system as recited in claim 47, wherein the means for identifying an initial set of information resources further includes means for identifying information resources linked to or from the information resources which are the results of the search, the former information resources also to be included in the initial set. 49. A system as recited in claim 48, wherein the means for defining initial authoritativeness information includes means for selecting an initial numerical authoritativeness value for each of the information resources of the initial set. 50. A system as recited in claim 49, wherein the means for defining initial authoritativeness information further includes means for defining an authority value and a hub value for each of the information resources of the initial set. 51. A system as recited in claim 50, wherein the defined authority values and hub values are processed as vectors, each vector containing a respective term corresponding with each respective one of the information resources of the initial set, and having stored therein the value defined for that respective one of the information resources of the initial set. 52. A system as recited in claim 50, wherein: - an initial hub value is defined as 1 if the information resource was found by the keyword-based query search, and 0 if the information resource is linked to or from the information resources which are the results of the search; and
- an initial authority value is defined as 0 for all information resources.
53. A system as recited in claim 50, wherein, for each iteration: - the hub value for an information resource is updated as the sum of the authority values for authority information resources which point to the hub information resource; and
- the authority value for an information resource is updated as the sum of the hub values for hub information resources which are pointed to by the information resource.
54. A system as recited in claim 53, wherein each iteration further includes normalizing the hub and authority values for the information resources. 55. A system as recited in claim 39, wherein the means for producing a final set of information resources includes means for selecting information resources from the set based on their hub and authority values. 56. A system as recited in claim 55, wherein the means for selecting includes means for selecting information resources whose hub values or authority values have greatest magnitudes. 57. A system as recited in claim 55, wherein the means for selecting includes means for selecting a plurality of successive communities, selecting each successive community including selecting information resources whose hub values or authority values have greatest magnitudes of those information resources not already selected for a prior community. |