Search Images Maps Play YouTube News Gmail Drive More »
Sign in
Screen reader users: click this link for accessible mode. Accessible mode has the same essential features but works better with your reader.

Patents

  1. Advanced Patent Search
Publication numberUS6185614 B1
Publication typeGrant
Application numberUS 09/084,452
Publication dateFeb 6, 2001
Filing dateMay 26, 1998
Priority dateMay 26, 1998
Fee statusPaid
Publication number084452, 09084452, US 6185614 B1, US 6185614B1, US-B1-6185614, US6185614 B1, US6185614B1
InventorsGennaro A. Cuomo, Binh Q. Nguyen, Sandeep K. Singhal
Original AssigneeInternational Business Machines Corp.
Export CitationBiBTeX, EndNote, RefMan
External Links: USPTO, USPTO Assignment, Espacenet
Method and system for collecting user profile information over the world-wide web in the presence of dynamic content using document comparators
US 6185614 B1
Abstract
Disclosed is a method and system for collecting profile information about users accessing dynamically generated content from one or more servers. In a specific embodiment, a server dynamically generates a web page in response to a user request. The server customizes the web page content based on the requested universal resource identifier (URI) and one or more of: the user's identity, access permissions, demographic information, and previous behavior at the site. The web server then passes the URI, user identity, and dynamically generated web page to an access information collector. The access information collector generates document comparators from the current web page content and compares them to document comparators associated with previously retrieved web pages. If the current web page is sufficiently similar to some previously retrieved web page, the access information collector logs the URI, user identity, and a document key associated with the matching previously retrieved page. Otherwise, the access information collector generates a new key; stores the new key and the document comparators in a database; and logs the URI, user identity, and the newly generated document key.
Images(5)
Previous page
Next page
Claims(27)
What we claim is:
1. A method of collecting information about document retrievals over the World-Wide Web, comprising the steps of:
receiving a requesting user identity, requested Universal Resource Identifier (URI), and a content of a retrieved document;
selecting a Candidate Document from a Retrieved Document Database, said Candidate Document associated with a Candidate Document Key;
comparing said retrieved document to said Candidate Document to determine a sufficiency of said Candidate Document;
associating said retrieved document with a newly generated Retrieved Document Key if said Candidate Document is not deemed to be sufficient;
adding said retrieved document to said Received Document Database; and
adding a Log File Entry including said requesting user identity, said requested URI, and said Retrieved Document Key.
2. The method of claim 1, wherein each of a plurality of documents in said Retrieved Document Database is associated with a Document Comparator and wherein a first Document Comparator may be compared to a second Document Comparator using a Document Comparator Function.
3. The method of claim 2, wherein said step of comparing to determine a sufficiency of said Candidate Document further comprises the steps of:
computing said first Document Comparator for said retrieved document;
retrieving said second Document Comparator for said Candidate Document;
computing with said Document Comparator Function a numeric measure of a difference between said first Document Comparator and said second Document Comparator; and
comparing said numeric measure against a predefined Document Difference Threshold.
4. The method of claim 2, wherein each said Document Comparator comprises content of said each of a plurality of documents associated therewith.
5. The method of claim 4, wherein a URI for said Candidate Document is equal to a URI for said retrieved document.
6. The method of claim 2, wherein each said Document Comparator is computed by associating predefined portions of said each of a plurality of documents to a binary token.
7. The method of claim 2, wherein each said Document Comparator comprises a list of significant words or phrases in said each of a plurality of documents.
8. The method of claim 2, wherein each said Document Comparator comprises a Comparator for each of a plurality of predefined sections of said each of a plurality of documents.
9. The method of claim 2, wherein said step of selecting a Candidate Document comprises selecting from a Document Comparator Database.
10. A system for collecting information about document retrievals over the World-Wide Web, comprising:
means for receiving a requesting user identity, requested Universal Resource Identifier (URI), and a content of a retrieved document;
means for selecting a Candidate Document from a Retrieved Document Database, said Candidate Document associated with a Candidate Document Key;
means for comparing said retrieved document to said Candidate Document to determine a sufficiency of said Candidate Document;
means for associating said retrieved document with a newly generated Retrieved Document Key if said Candidate Document is not deemed to be sufficient;
means for adding said retrieved document to said Received Document Database; and
means for adding a Log File Entry including said requesting user identity, said requested URI, and said Retrieved Document Key.
11. The system of claim 10, wherein each of a plurality of documents in said Retrieved Document Database is associated with a Document Comparator and wherein a first Document Comparator may be compared to a second Document Comparator using a Document Comparator Function.
12. The system of claim 11, wherein said means for comparing to determine a sufficiency of said Candidate Document further comprises:
means for computing said first Document Comparator for said retrieved document;
means for retrieving said second Document Comparator for said Candidate Document;
means for computing with said Document Comparator Function a numeric measure of a difference between said first Document Comparator and said second Document Comparator; and
means for comparing said numeric measure against a predefined Document Difference Threshold.
13. The system of claim 11, wherein each said Document Comparator comprises content of said each of a plurality of documents associated therewith.
14. The system of claim 13, wherein a URI for said Candidate Document is equal to a URI for said retrieved document.
15. The system of claim 11, wherein each said Document Comparator is computed by associating predefined portions of said each of a plurality of documents to a binary token.
16. The system of claim 11, wherein each said Document Comparator comprises a list of significant words or phrases in said each of a plurality of documents.
17. The system of claim 11, wherein each said Document Comparator comprises a Comparator for each of a plurality of predefined sections of said each of a plurality of documents.
18. The system of claim 11, wherein said means for selecting a Candidate Document comprises selecting from a Document Comparator Database.
19. A computer program product recorded on computer readable medium for collecting information about document retrievals over the World-Wide Web, comprising:
computer readable means for receiving a requesting user identity, requested Universal Resource Identifier (URI), and a content of a retrieved document;
computer readable means for selecting a Candidate Document from a Retrieved Document Database, said Candidate Document associated with a Candidate Document Key;
computer readable means for comparing said retrieved document to said Candidate Document to determine a sufficiency of said Candidate Document;
computer readable means for associating said retrieved document with a newly generated Retrieved Document Key if said Candidate Document is not deemed to be sufficient;
computer readable means for adding said retrieved document to said Received Document Database; and
computer readable means for adding a Log File Entry including said requesting user identity, said requested URI, and said Retrieved Document Key.
20. The program product of claim 19, wherein each of a plurality of documents in said Retrieved Document Database is associated with a Document Comparator and wherein a first Document Comparator may be compared to a second Document Comparator according to a predefined distance metric.
21. The program product of claim 20, wherein said computer readable means for comparing to determine a sufficiency of said Candidate Document further comprises:
computer readable means for computing said first Document Comparator for said retrieved document;
computer readable means for retrieving said second Document Comparator for said Candidate Document;
computer readable means for computing with said Document Comparator Function a numeric measure of a difference between said first Document Comparator and said second Document Comparator; and
computer readable means for comparing said numeric measure against a predefined Document Difference Threshold.
22. The program product of claim 20, wherein each said Document Comparator comprises content of said each of a plurality of documents associated therewith.
23. The program product of claim 22, wherein a URI for said Candidate Document is equal to a URI for said retrieved document.
24. The program product of claim 20, wherein each said Document Comparator is computed by associating predefined portions of said each of a plurality of documents to a binary token.
25. The program product of claim 20, wherein each said Document Comparator comprises a list of significant words or phrases in said each of a plurality of documents.
26. The program product of claim 20, wherein each said Document Comparator comprises a Comparator for each of a plurality of predefined sections of said each of a plurality of documents.
27. The program product of claim 20, wherein said computer readable means for selecting a Candidate Document comprises selecting from a Document Comparator Database.
Description
FIELD OF THE INVENTION

This invention relates in general to computer software, and in particular to a method and system for collecting profile information about users accessing Web pages from a plurality of Web servers. More particularly, the present invention relates to a method and system by which user profile information can be collected when the Web content is generated dynamically for each request at the Web server.

BACKGROUND OF THE INVENTION

In the World-Wide Web, a content provider deploys a plurality of Web servers that deliver Web pages to clients. When requesting a Web page, the client supplies a Uniform Resource Locator (URL) or Universal Resource Identifier (URI) to the server. The server associates this URI with a particular page of content and delivers that information to the requesting client.

As the World-Wide Web is being used increasingly to support commerce and targeted advertising, content providers desire to collect information about which users are accessing the site and what site content those users are accessing. This information can be used to establish “profiles” for each site visitor and enable tuning of the Web site content to meet the visitors' interests. Traditionally, this visitor information is collected by the Web server or a proxy server in the form of a log file. This log file contains, among other things, the requesting host address, the requested URI, and the time at which the request was received. Because each URI represents a particular piece of static content at the Web site, the URI is sufficient for a user profile analyzer to evaluate which content was received by each user and to detect similarities among the behavior of different users.

Recent Web servers are providing support for server-side scripting, whereby the URI is associated with a program or script that is executed at the Web server. This script is responsible for receiving the URI and the user identity and using this information to dynamically generate the content that should be returned to the requesting user. This generated content may account for the user's previous behavior at the site, his access permissions, his demographic information, or any number of other factors. Dynamic server content is supported by most Web servers today, including Microsoft's Active Server Pages, Sun's Dynamic Server Pages, industry-standard servlets, Common Gateway Interface (CGI) executables, and other mechanisms.

As a result of this direction, a particular URI can no longer be associated with particular content at the Web site. On different requests, the URI may return wholly different content depending on the requesting user and the context in which the request was issued. Consequently, existing methods for capturing user information are insufficient for producing meaningful user profiles. More specifically, the reliance on URIs alone prevents the accurate characterization of which users are exhibiting similar access behavior. Therefore, a method is needed for efficiently collecting user access information in the presence of dynamically-generated content at a Web server, in order to support the accurate generation of user profiles.

SUMMARY OF THE INVENTION

One object of the present invention is to provide, within a networked environment, a method of associating each user's request for World-Wide Web information to the content of the retrieved document when that document was generated dynamically.

Another object of the present invention is to group together user requests that retrieve the same document content. Yet another object of the present invention is to ignore minor variations in document content as might occur when the documents differ only in the presence of the requesting user's name. Still yet another object of the present invention is to enable the use of a range of metrics for comparing two documents for similarity.

To achieve the foregoing objects and in accordance with the purpose of the invention as broadly described herein, a method and system are disclosed for collecting information about user accesses by analyzing the content of retrieved documents and associating Document Comparators with each document. These and other features, aspects, and advantages of the present invention will become better understood with reference to the following description, appended claims, and accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

For a more complete understanding of the present invention and for further advantages thereof, reference is now made to the following Detailed Description taken in conjunction with the accompanying Drawings, in which:

FIG. 1 is a pictorial representation of a data processing system which may be utilized to implement a method and system of the present invention;

FIG. 2 shows a block diagram of a World-Wide Web environment in which user access information may be generated in accordance with the present invention;

FIG. 3 shows a sample data structure for representing the information collected by the Access Information Collector in accordance with the present invention; and

FIG. 4 is a flowchart showing how an Access Information Collector analyzes a document retrieved from a Web server and updates its data structures.

DETAILED DESCRIPTION OF THE INVENTION

Referring to FIG. 1, there is depicted a graphical representation of a data processing system 8, which may be utilized to implement the present invention. As may be seen, data processing system 8 may include a plurality of networks, such as Local Area Networks (LAN) 10 and 32, each of which preferably includes a plurality of individual computers 12 and 30, respectively. Of course, those skilled in the art will appreciate that a plurality of Intelligent Work Stations (IWS) coupled to a host processor may be utilized for each such network. Each said network may also consist of a plurality of processors coupled via a communications medium, such as shared memory, shared storage, or an interconnection network. As is common in such data processing systems, each individual computer may be coupled to a storage device 14 and/or a printer/output device 16 and may be provided with a pointing device such as a mouse 17.

The data processing system 8 may also include multiple mainframe computers, such as mainframe computer 18, which may be preferably coupled to LAN 10 by means of communications link 22. The mainframe computer 18 may also be coupled to a storage device 20 which may serve as remote storage for LAN 10. Similarly, LAN 10 may be coupled via communications link 24 through a sub-system control unit/communications controller 26 and communications link 34 to a gateway server 28. The gateway server 28 is preferably an IWS which serves to link LAN 32 to LAN 10.

With respect to LAN 32 and LAN 10, a plurality of documents or resource objects may be stored within storage device 20 and controlled by mainframe computer 18, as resource manager or library service for the resource objects thus stored. Of course, those skilled in the art will appreciate that mainframe computer 18 may be located a great geographic distance from LAN 10 and similarly, LAN 10 may be located a substantial distance from LAN 32. For example, LAN 32 may be located in California while LAN 10 may be located within North Carolina and mainframe computer 18 may be located in New York.

Software program code which employs the present invention is typically stored in the memory of a storage device 14 of a stand alone workstation or LAN server from which a developer may access the code for distribution purposes, the software program code may be embodied on any of a variety of known media for use with a data processing system such as a diskette or CD-ROM or may be distributed to users from a memory of one computer system over a network of some type to other computer systems for use by users of such other systems. Such techniques and methods for embodying software code on media and/or distributing software code are well-known and will not be further discussed herein.

Referring now to FIG. 2, components of a World-Wide Web system are shown in which user information may be gathered in accordance with the present invention. A plurality of clients (generally indicated by reference numerals 200, 201, and 202) access information over a network 205 using World-Wide Web browsers such as NETSCAPE NAVIGATOR, a trademark of Netscape, Inc. or MICROSOFT INTERNET EXPLORER, a trademark of Microsoft, Inc. These clients access a plurality of Web servers (generally indicated by reference numerals 210, 211, and 212) such as LOTUS GO, a trademark of Lotus, Inc., MICROSOFT INTERNET INFORMATION SERVICE (IIS), a trademark of Microsoft, Inc. or NETSCAPE FASTTRACK, a trademark of Netscape, Inc.

In accessing these Web servers, the clients 200, 201 and 202 specify a URI. Each of these Web servers 210, 211, and 212 accesses a Static Content Database (generally indicated by reference numerals 220, 221, and 222) and a Dynamic Content Generator (generally indicated by reference numerals 230, 231, and 232) that receives a URI and other information about the user and generates Web content suitable for display by the browsers at the clients 200, 201, and 202. These Dynamic Content Generators 230, 231, and 232 may take many forms, including Active Server Pages, servlets, Common Gateway Interface (CGI) binaries, or Dynamic Server Pages.

Upon receiving a URI request from a client, the Web server 210, 211, or 212 either retrieves the content from the Static Content Database 220, 221, or 222 or from the Dynamic Content Generator 230, 231, or 232. An Access Information Collector 240 receives client requests and content returned from the Static Content Database 220, 221, or 222 or from the Dynamic Content Generator 230, 231, or 232 and collects log information that can be used to analyze the access patterns of various users. It should be understood that the physical location of the components shown in FIG. 2 may vary. In particular, the Access Information Collector 240 may be embedded in the Web servers 210, 211, and 212. Moreover, the Dynamic Content Generators 200, 201, and 202 and Static Content Databases 220, 221, and 222 may be co-located with the Web servers 210, 211, and 212.

FIG. 3 illustrates the information collected by the Access Information Collector in accordance with the present invention. A Log File 300 contains a sequence of Access Records.

Each Access Record includes at least a time stamp 301, a requested URI 313, and a Document Key 312.

A Retrieved Document Database 310 contains a repository of Document Records corresponding to documents retrieved by users. Each Document Record 311 is indexed by a Document Key 312 and contains an associated URI 313, document text 314, and a Document Comparator 315. The Document Key 312, when combined with the URI 313, serves to uniquely identity the Document Record 311. Document Keys may be assigned sequentially or by any other appropriate method.

The Document Comparator 315 is a representation of the document's contents and is used by a Document Comparator Function to determine whether there are substantial predefined similarities, as will be subsequently described in greater detail, between the current document and other previously retrieved documents. The Document Comparator Function receives the Document Comparators for two documents and determines whether the two documents are substantially similar. To make this determination, the Function may employ a Document Difference Threshold, a numeric value that indicates how much two documents may differ before they are no longer deemed to be substantially similar. The use of the Document Difference Threshold depends on the particular Document Comparator Function being used. The use of a Document Difference Threshold allows the Document Comparator Function to ignore minor differences between two documents. Such minor differences include timestamps, client name, or client-specific data.

In the present embodiment of this invention, the Document Comparator 315 is the actual content of the document itself, and the Document Comparator Function for any two documents is defined to be the number of character insertions, deletions, or modifications required to convert one document to the other. This computation is well understood in the prior art (see, for example, the use of tries, as described in Chapter 11 of Alan Tharp, File Organization and Processing, Wiley, 1988) and will not be discussed further. Alternative embodiments of this invention may compute a Document Comparator 315 by mapping each word, paragraph, or section of the document to a binary token. In this case, the Document Comparator Function might count the number of matching binary tokens, and the Document Difference Threshold would designate what percentage of the tokens must match (see, for example, “Copy Detection Mechanisms for Digital Documents,” by Sergey Brin, James Davis, and Hector Garcia-Molina, in Proceedings of the 1995 SIGMOD International Conference on Management of Data, pages 398-409, May 1995). Yet another embodiment of this invention may define a Document Comparator 315 as a list of the most significant (as predefined) words or phrases in the document; the Document Comparator Function may simply count how many words or phrases occur in both documents, and the Document Difference Threshold would designate what percentage of words in each document must appear in the other. Other comparison methods are well established in the prior art. The essential element of a Document Comparator 315 is that a metric (i.e. the Document Comparator Function) must exist for comparing two different Document Comparators to determine by how much their respective documents differ. Indeed, a Document Comparator 315 may actually comprise multiple Comparators, one per each predefined section of the document, each having an associated Document Comparator Function.

Finally, a Document Comparator Index 320 associates each Document Comparator 315 with the corresponding Document Key 312. The Index 320 is used to improve the performance of the Document Comparator 315 evaluations and the selection of Candidate Documents (see FIG. 4). However, it is a performance optimization that may be omitted by alternative embodiments of this invention.

Though the data structures have been illustrated in FIG. 3 with a particular embodiment, alternative representations of this information are possible. The essential attributes of these implementations is the association of each Document Comparator 315 to a Document Key 312, the association of each user URI 313 retrieval with a particular Document Key 312, and the association of each Document Key 312 with particular document content. It should be noted that various optimizations are also possible. For example, instead of storing each document's full content, the Retrieved Document Database 310 may store only a list of most significant words or phrases.

When a document is accessed from the Web server (with a particular URI), the Access Information Collector 240 analyzes the retrieved document (using the Document Comparator Function) to determine whether it is substantially similar to another document that has been previously retrieved from that Web server using the same URI. If a substantially similar document has already been generated by the Web server, then the user's access is associated with that previous document; however, if a substantially similar document has not been previously generated by the Web server, then the user's access is associated with this new document. In this way, the Access Information Collector 240 distinguishes between different dynamically-generated documents retrieved using the same URI while also merging access information about documents that are nearly identical.

Referring now to FIG. 4, a flowchart depicts the steps taken by the Access Information Collector 240 to analyze a document retrieved from a Web server and to update the Log File 300, Retrieved Document Database 310, and Document Comparator Index 320 (as shown in FIG. 3). At block 400, the Access Information Collector 240 receives the requested URI, the time of the request, the identity of the requesting client, and the content of the retrieved document. At block 402, a Document Comparator 315 is computed for the retrieved document. At block 404, a Candidate Document and Candidate Document Comparator are selected from the Retrieved Document Database 310. The Candidate Document is a document in the Retrieved Document Database 310 whose URI matches that of the retrieved document. (It should be understood that alternative embodiments of this invention may remove the restriction that the URI of the retrieved document and the URI of the Candidate Document match. Alternative embodiments of this invention may also introduce additional restrictions on what constitutes a Candidate Document.) At decision block 406, it is determined whether or not a Candidate Document has been found. If the answer to decision block 406 is yes, then at decision block 408, the Document Comparator Function is invoked with the Document Comparators of the retrieved document and of the Candidate Document to determine whether or not the retrieved document and the Candidate Document are substantially similar.

Continuing with FIG. 4, if the answer to decision block 408 is yes, then it is determined that the retrieved document is sufficiently similar to the Candidate Document and no new entry is required to either the Retrieved Document Database 310 or to the Document Comparator Index 320. At block 410, the Document Key is retrieved for the Candidate Document. At block 415, a new entry is added to the Log File, including the time stamp, requested URI, and candidate document's Document Key. The process then terminates at block 490. If the answer to decision block 408 is no, then control returns to block 404, where another Candidate Document is selected for evaluation.

If the answer to decision block 406 is no, then it is determined that the retrieved document is new. At block 420, a new Document Key is generated for the retrieved document. At block 425, a new entry is added to the Retrieved Document Database 310 to associate the retrieved document's Document Key with a new Document Record containing the retrieved URI, retrieved document, and retrieved document's Document Comparator. At block 430, a new entry is added to the Document Comparator Index 320 database to associate the retrieved document's Document Comparator with the retrieved document's Document Key. At block 435, a new entry is added to the Log File, including the time stamp, requested URI, and retrieved document's Document Key. The process then terminates at block 490.

Thus, each user access is associated with a Document Key representing a document in the Retrieved Document Database with a sufficiently close Document Comparator. Each URI is, therefore, potentially linked with multiple documents, each having different content. At the same time, the analysis ignores minor differences between documents, as might arise when page content is customized in minor ways to reflect the identity of the requesting user.

Although the present invention has been described with respect to a specific preferred embodiment thereof, various changes and modifications may be suggested to one skilled in the art and it is intended that the present invention encompass such changes and modifications as fall within the scope of the appended claims.

Patent Citations
Cited PatentFiling datePublication dateApplicantTitle
US5649186 *Aug 7, 1995Jul 15, 1997Silicon Graphics IncorporatedSystem and method for a computer-based dynamic information clipping service
US5732218Jan 2, 1997Mar 24, 1998Lucent Technologies Inc.Management-data-gathering system for gathering on clients and servers data regarding interactions between the servers, the clients, and users of the clients during real use of a network of clients and servers
US5740430 *Nov 6, 1995Apr 14, 1998C/Net, Inc.Method and apparatus for server-independent caching of dynamically-generated customized pages
US5745900 *Aug 9, 1996Apr 28, 1998Digital Equipment CorporationMethod for indexing duplicate database records using a full-record fingerprint
US5813007 *Jun 20, 1996Sep 22, 1998Sun Microsystems, Inc.Automatic updates of bookmarks in a client computer
US5890164 *Jun 24, 1996Mar 30, 1999Sun Microsystems, Inc.Estimating the degree of change of web pages
US5892917Sep 27, 1995Apr 6, 1999Microsoft CorporationSystem for log record and log expansion with inserted log records representing object request for specified object corresponding to cached object copies
US5893908 *Nov 21, 1996Apr 13, 1999Ricoh Company LimitedDocument management system
US5895470 *Apr 9, 1997Apr 20, 1999Xerox CorporationSystem for categorizing documents in a linked collection of documents
US5898836 *Jan 14, 1997Apr 27, 1999Netmind Services, Inc.Change-detection tool indicating degree and location of change of internet documents by comparison of cyclic-redundancy-check(CRC) signatures
US5909677 *Jun 18, 1996Jun 1, 1999Digital Equipment CorporationMethod for determining the resemblance of documents
US5913208 *Jul 9, 1996Jun 15, 1999International Business Machines CorporationIdentifying duplicate documents from search results without comparing document content
US5941944 *Mar 3, 1997Aug 24, 1999Microsoft CorporationIn a computer system
US5978842 *Jul 18, 1997Nov 2, 1999Netmind Technologies, Inc.Distributed-client change-detection tool with change-detection augmented by multiple clients
US5983268 *Mar 25, 1997Nov 9, 1999Netmind Technologies, Inc.Spreadsheet user-interface for an internet-document change-detection tool
US5987480 *Jul 25, 1996Nov 16, 1999Donohue; MichaelMethod and system for delivering documents customized for a particular user over the internet using imbedded dynamic content
US5999929 *Sep 29, 1997Dec 7, 1999Continuum Software, IncWorld wide web link referral system and method for generating and providing related links for links identified in web pages
US6012087 *May 20, 1998Jan 4, 2000Netmind Technologies, Inc.Unique-change detection of dynamic web pages using history tables of signatures
WO1998031155A2Jan 9, 1998Jul 16, 1998Relevantknowledge IncMonitoring of remote file access on a public computer network
Non-Patent Citations
Reference
1 *Brin, S., et al., "Copy Detection Mechansims for Digital Documents," Proc. Of the 1995 ACM SIGMOD Int'l. Conf. on Management of Data, ACM, pp. 398-409, May, 1995.
2 *Garcia-Molina, H., et al, "dSCAM: Finding Document Copies Across Multiple Databases," Proc. of the 4th Int'l. Conf.on Parallel and Distributed Information Systems, IEEE, pp. 68-79, May 1995.
Referenced by
Citing PatentFiling datePublication dateApplicantTitle
US6393421 *Aug 6, 1999May 21, 2002Neriel PaglinCommunication method and system utilizing a specific communication code uniquely assigned to the data record
US6446035 *May 5, 1999Sep 3, 2002Xerox CorporationFinding groups of people based on linguistically analyzable content of resources accessed
US6615247 *Jul 1, 1999Sep 2, 2003Micron Technology, Inc.System and method for customizing requested web page based on information such as previous location visited by customer and search term used by customer
US6662215Jul 10, 2000Dec 9, 2003I Novation Inc.System and method for content optimization
US6687734 *Mar 21, 2000Feb 3, 2004America Online, IncorporatedSystem and method for determining if one web site has the same information as another web site
US6728721 *Jun 23, 2000Apr 27, 2004Microsoft CorporationSystem and method for maintaining a user's state within a database table
US6748386 *May 8, 2001Jun 8, 2004Nec CorporationSystem and method for automated construction of URL, cookie, and database query mapping
US6757691 *Nov 9, 1999Jun 29, 2004America Online, Inc.Predicting content choices by searching a profile database
US6804659 *Jan 14, 2000Oct 12, 2004Ricoh Company Ltd.Content based web advertising
US6816849 *May 26, 1999Nov 9, 2004Gerald B. Halt, Jr.Advanced internet interface
US6978419 *Nov 15, 2000Dec 20, 2005Justsystem CorporationMethod and apparatus for efficient identification of duplicate and near-duplicate documents and text spans using high-discriminability text fragments
US6983478Feb 1, 2000Jan 3, 2006Bellsouth Intellectual Property CorporationMethod and system for tracking network use
US6996612 *Dec 29, 2000Feb 7, 2006Vignette CorporationMethod of providing information related to activity of a user and a data processing system program product
US7010540 *Mar 15, 2004Mar 7, 2006Microsoft CorporationSystem and method for maintaining a user's state within a database table
US7017183Jun 29, 2001Mar 21, 2006Plumtree Software, Inc.System and method for administering security in a corporate portal
US7020652Dec 21, 2001Mar 28, 2006Bellsouth Intellectual Property Corp.System and method for customizing content-access lists
US7031977Feb 28, 2002Apr 18, 2006Plumtree Software, Inc.Efficiently storing indented threads in a threaded discussion application
US7039704 *Jan 24, 2003May 2, 2006The Cobalt Group, Inc.Business platform with networked, association-based business entity access management and active content website configuration
US7085755Nov 7, 2002Aug 1, 2006Thomson Global Resources AgElectronic document repository management and access system
US7086075Dec 21, 2001Aug 1, 2006Bellsouth Intellectual Property CorporationMethod and system for managing timed responses to A/V events in television programming
US7092942May 31, 2002Aug 15, 2006Bea Systems, Inc.Managing secure resources in web resources that are accessed by multiple portals
US7111232Mar 6, 2002Sep 19, 2006Thomas Layne BascomMethod and system for making document objects available to users of a network
US7124093Sep 7, 2004Oct 17, 2006Ricoh Company, Ltd.Method, system and computer code for content based web advertising
US7127609Mar 26, 2001Oct 24, 2006Siemens Medical Solutions Health Services CorporationSystem and user interface for adaptively processing and communicating URL data between applications
US7131062 *Dec 9, 1998Oct 31, 2006International Business Machines CorporationSystems, methods and computer program products for associating dynamically generated web page content with web site visitors
US7139972Jul 26, 2001Nov 21, 2006International Business Machines CorporationPreemptive downloading of web pages with terms associated with user interest keywords
US7139974Mar 6, 2002Nov 21, 2006Thomas Layne BascomFramework for managing document objects stored on a network
US7149704 *Jan 25, 2002Dec 12, 2006Claria CorporationSystem, method and computer program product for collecting information about a network user
US7158971Apr 10, 2002Jan 2, 2007Thomas Layne BascomMethod for searching document objects on a network
US7171429May 7, 2004Jan 30, 2007Bea Systems, Inc.Efficiently storing indented threads in a threaded discussion application
US7174330May 7, 2004Feb 6, 2007Bea Systems, Inc.Efficiently storing indented threads in a threaded discussion application
US7200862 *Sep 30, 2002Apr 3, 2007Microsoft CorporationSecuring uniform resource identifier namespaces
US7212979Dec 14, 2001May 1, 2007Bellsouth Intellectuall Property CorporationSystem and method for identifying desirable subscribers
US7246230Jan 29, 2002Jul 17, 2007Bea Systems, Inc.Single sign-on over the internet using public-key cryptography
US7254585May 7, 2004Aug 7, 2007Bea Systems, Inc.Efficiently storing indented threads in a threaded discussion application
US7257589Jul 12, 2001Aug 14, 2007Ricoh Company, Ltd.Techniques for targeting information to users
US7260773Mar 17, 2003Aug 21, 2007Uri ZernikDevice system and method for determining document similarities and differences
US7269835Jul 18, 2005Sep 11, 2007At&T Bls Intellectual Property, Inc.Method and system for managing timed responses to A/V events in television programming
US7363266 *Apr 27, 2000Apr 22, 2008General Electric Capital CorporationSystems and methods for asset valuation
US7366724Aug 10, 2006Apr 29, 2008Kurt FriedenManaging secure resources in web resources that are accessed by multiple portals
US7386792Jan 18, 2002Jun 10, 2008Thomas Layne BascomSystem and method for collecting, storing, managing and providing categorized information related to a document object
US7389241Apr 9, 2002Jun 17, 2008Thomas Layne BascomMethod for users of a network to provide other users with access to link relationships between documents
US7398533 *Sep 27, 2000Jul 8, 2008Sun Microsystems, Inc.Remote function invocation with messaging in a distributed computing environment
US7412720Nov 2, 2001Aug 12, 2008Bea Systems, Inc.Delegated authentication using a generic application-layer network protocol
US7424471Jan 8, 2007Sep 9, 2008Lsr TechnologiesSystem for searching network accessible data sets
US7444658Dec 14, 2001Oct 28, 2008At&T Intellectual Property I, L.P.Method and system to perform content targeting
US7468729Dec 21, 2004Dec 23, 2008Aol Llc, A Delaware Limited Liability CompanyUsing an avatar to generate user profile information
US7475346 *Dec 22, 1999Jan 6, 2009Unisys CorporationSystem and method for providing a customizable web portal service
US7484176Dec 30, 2003Jan 27, 2009Aol Llc, A Delaware Limited Liability CompanyReactive avatars
US7587323Dec 14, 2001Sep 8, 2009At&T Intellectual Property I, L.P.System and method for developing tailored content
US7593858Dec 8, 2006Sep 22, 2009At&T Intellectual Property I, L.P.System and method for identifying desirable subscribers
US7603382Nov 5, 2004Oct 13, 2009Halt Jr Gerald BAdvanced internet interface providing user display access of customized webpages
US7636755Dec 30, 2003Dec 22, 2009Aol LlcMultiple avatar personalities
US7653660Dec 29, 2006Jan 26, 2010Bea Systems, Inc.Efficiently storing indented threads in a threaded discussion application
US7660827Dec 29, 2006Feb 9, 2010Bea Systems, Inc.Efficiently storing indented threads in a threaded discussion application
US7702541 *Dec 15, 2000Apr 20, 2010Yahoo! Inc.Targeted e-commerce system
US7725472 *May 30, 2001May 25, 2010Hottolink, Inc.Distributed monitoring system providing knowledge services
US7725487 *Dec 1, 2003May 25, 2010National Institute Of Information And Communications TechnologyContent synchronization system and method of similar web pages
US7779147Sep 29, 2006Aug 17, 2010Amazon Technologies, Inc.Method and system for advertisement placement based on network trail proximity
US7809801Sep 29, 2006Oct 5, 2010Amazon Technologies, Inc.Method and system for keyword selection based on proximity in network trails
US7831706May 25, 2005Nov 9, 2010Vignette Software LlcMethod of gathering information related to activity of a user and a data processing system program product
US7849498May 15, 2007Dec 7, 2010Siemens Medical Solutions Usa, Inc.System and user interface supporting context sharing between concurrently operating applications
US7860861 *Aug 10, 2005Dec 28, 2010Sony CorporationInformation processing apparatus, information processing method, and program for the same
US7861174Feb 15, 2002Dec 28, 2010Oracle International CorporationMethod and system for assembling concurrently-generated content
US7861309May 1, 2006Dec 28, 2010The Cobalt Group, Inc.Business platform with networked, association-based business entity access management and active content website configuration
US7908554Dec 29, 2004Mar 15, 2011Aol Inc.Modifying avatar behavior based on user action or mood
US7912752Oct 31, 2001Mar 22, 2011Context Web, Inc.Internet contextual communication system
US7913176Dec 21, 2004Mar 22, 2011Aol Inc.Applying access controls to communications with avatars
US7941431May 12, 2006May 10, 2011Thomson Reuters Global ResourcesElectronic document repository management and access system
US7945476Oct 31, 2001May 17, 2011Context Web, Inc.Internet contextual advertisement delivery system
US7949671Feb 22, 2010May 24, 2011Aol Inc.System and method for the transformation and canonicalization of semantically structured data
US8122057May 2, 2011Feb 21, 2012Aol Inc.System and method for the transformation and canonicalization of semantically structured data
US8161064Nov 24, 2009Apr 17, 2012Lsr TechnologiesSystem for searching network accessible data sets
US8250144Dec 22, 2009Aug 21, 2012Blattner Patrick DMultiple avatar personalities
US8281404Mar 20, 2006Oct 2, 2012Oracle International Corp.System and method for administering security in a corporate portal
US8307291Aug 11, 2005Nov 6, 2012American Express Travel Related Services Company, Inc.Web page security system and method
US8341126Dec 1, 2005Dec 25, 2012Oracle International CorporationEfficiently storing indented threads in a threaded discussion application
US8381091Sep 16, 2002Feb 19, 2013International Business Machines CorporationReal-time method, system and program product for collecting web form data
US8386488Apr 27, 2004Feb 26, 2013International Business Machines CorporationMethod and system for matching appropriate content with users by matching content tags and profiles
US8402378Nov 7, 2008Mar 19, 2013Microsoft CorporationReactive avatars
US8412740Feb 9, 2012Apr 2, 2013Microsoft CorporationSystem and method for the transformation and canonicalization of semantically structured data
US8438164Jul 5, 2007May 7, 2013Ricoh Co., Ltd.Techniques for targeting information to users
US8515998Nov 20, 2006Aug 20, 2013Bascom Research LLPFramework for managing document objects stored on a network
US8566439 *Oct 1, 2007Oct 22, 2013Ebay IncMethod and system for intelligent request refusal in response to a network deficiency detection
US8620747Jan 6, 2006Dec 31, 2013Ricoh Company, Ltd.Method, system and computer code for content based web advertising
US8627215Feb 25, 2011Jan 7, 2014Microsoft CorporationApplying access controls to communications with avatars
US8751302Aug 11, 2006Jun 10, 2014Pulsepoint, Inc.Method and system for placement and pricing of internet-based advertisements or services
US20090089419 *Oct 1, 2007Apr 2, 2009Ebay Inc.Method and system for intelligent request refusal in response to a network deficiency detection
WO2001098919A1 *Jun 21, 2001Dec 27, 2001Keller AshleyLocating information in a network based on user's evaluation
WO2002005136A1 *Jul 5, 2001Jan 17, 2002Novation Inc IA system and method for content optimization
WO2002080014A1 *Feb 15, 2002Oct 10, 2002Plumtree Software IncAssembling concurrently-generated personalized web pages
WO2002091193A1 *Dec 28, 2001Nov 14, 2002IbmWeb page annotation systems
WO2003056805A2 *Dec 16, 2002Jul 10, 2003Bellsouth Intellect Pty CorpSystem and method for storing and distributing television viewing patterns from a clearinghouse
WO2004044676A2 *Oct 27, 2003May 27, 2004Thomson Legal And Regulatory GElectronic document repository management and access system
Classifications
U.S. Classification709/224, 709/203, 714/E11.202, 707/E17.109, 707/999.107, 707/999.104
International ClassificationG06F11/34, G06F17/30
Cooperative ClassificationY10S707/99948, Y10S707/99945, G06F11/3495, G06F17/30867
European ClassificationG06F11/34T12, G06F17/30W1F
Legal Events
DateCodeEventDescription
Feb 11, 2014ASAssignment
Effective date: 20140131
Owner name: BANK OF AMERICA, N.A., TEXAS
Free format text: SECURITY AGREEMENT;ASSIGNOR:ACTIVISION PUBLISHING, INC.;REEL/FRAME:032240/0257
Feb 28, 2013ASAssignment
Effective date: 20121231
Owner name: ACTIVISION PUBLISHING, INC., CALIFORNIA
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:INTERNATIONAL BUSINESS MACHINES CORPORATION;REEL/FRAME:029900/0285
Sep 21, 2012SULPSurcharge for late payment
Year of fee payment: 11
Sep 21, 2012FPAYFee payment
Year of fee payment: 12
Sep 17, 2012REMIMaintenance fee reminder mailed
Jul 30, 2008FPAYFee payment
Year of fee payment: 8
Jul 12, 2004FPAYFee payment
Year of fee payment: 4
May 26, 1998ASAssignment
Owner name: IBM CORPORATION, NEW YORK
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:CUOMO, GENNARO A.;NGUYEN, BINH Q.;SINGHAL, SANDEEP K.;REEL/FRAME:009210/0362
Effective date: 19980515