Publication number: US6112231 A
Publication type: Grant
Application number: US 09/179,735
Publication date: Aug 29, 2000
Filing date: Oct 27, 1998
Priority date: Oct 18, 1996
Fee status: Paid
Also published as: CA2216795A1, CA2216795C, DE69719902D1, DE69719902T2, EP0837407A1, EP0837407B1, US6138141
Publication number: 09179735, 179735, US 6112231 A, US 6112231A, US-A-6112231, US6112231 A, US6112231A
Inventors: Antonio DeSimone, Sandeep Sibal
Original Assignee: At&T Corp.
Server to cache protocol for improved web performance
US 6112231 A
Abstract
On the Internet (106), rather than retrieving a frequently requested Web object from its originating server (105) in response to a request from a client terminal (101, 102), the object can instead be retrieved from a cache (103) within the Internet Access Service Provider (IASP) (104), which connects the client terminal to the Internet. What is stored in the cache may, however, not be the most recent version of the object. Distinct from providing the Web object itself, information about changes to the object is provided by the server in response to a cache request that is asynchronous to a request from a client for the object. Such information about changes to an object includes the date and time when the object was last modified, the byte size of the modified object, and information on the type of content of the object. After receiving this information about changes to an object, the cache may then request that a copy of the object be downloaded to it.
Claims(9)
What is claimed is:
1. A method of communicating on a data network between a cache unit and server unit comprising the steps of:
the cache unit making a request to the server, independent of any request by a client processor coupled to the cache over said network, for information about a change to an object in the server;
providing information about the change to the object whenever such change occurs, until directed by said cache unit to stop;
deciding based on the information about the change to the object whether to request a copy of the object; and
if, based on the information about the change to the object, the decision is to request a copy of the object, requesting that a copy of the object be provided to the cache unit.
2. The method of claim 1 further comprising the step of providing a copy of the object to the cache.
3. The method of claim 1 wherein the request to the server is made asynchronous to a request for a copy of the object by a client terminal connected to the cache.
4. The method of claim 1 wherein information about a change to the object is provided on a periodic basis.
5. The method of claim 1 wherein information about a change to the object is provided when the object changes in the server.
6. The method of claim 1 wherein the step of making a request to the server for information about a change to the object comprises the step of registering a request with the server for information about a change to the object, and the step of providing information about a change to the object occurs in response to a change taking place in the server.
7. The method of claim 1 wherein the step of making a request to the server for information about a change to the object comprises the step of making a request to determine whether the object has been modified since a specified previous time.
8. The method of claim 7 wherein the information about the change to the object comprises information relating to when the object changed.
9. The method of claim 7 wherein the information about the change to the object further comprises the size of the object.
Description

This application is a divisional of U.S. patent application Ser. No. 08/733,486, filed Oct. 18, 1996.

CROSS REFERENCE TO RELATED APPLICATIONS

This application relates to subject matter described in co-pending U.S. patent application Ser. No. 08/733,485, now U.S. Pat. No. 5,787,470, filed simultaneously herewith, for Antonio DeSimone, David H. Shur, and Sandeep Sibal, the first and third named inventors therein being co-inventors herein, and assigned to the assignee hereof.

TECHNICAL FIELD

This invention relates to data communications and computer networking, and more particularly, to the transfer of digital information on packet data networks such as the Internet, between caches and servers.

BACKGROUND OF THE INVENTION

In a transaction on the World Wide Web between a client terminal and a Web server in which the client terminal retrieves a Web object from a server connected on the Internet, the client terminal normally accesses the Internet through an Internet Access Service Provider (IASP). Such an object may be one or more pages of textual information, a picture, a sound clip, a video clip, a JAVA applet or other software, any combination of the former, or anything that is capable of being transmitted digitally over the Internet to a client terminal. The term "object" will be used hereinafter to include all of the foregoing. A cache, located within the IASP network, functions as an intermediary in transactions involving the retrieval of such Web objects from servers by a client terminal. In particular, in its simplest form, a cache within the IASP saves a copy of a retrieved object for itself when the object is moved from the server to the requesting client terminal. This caching operation is transparent to the user and, under normal circumstances, does not incur any significant delay due to the copying operation which is performed simultaneously as the object is retrieved from the server and delivered to the client terminal.

Advantageously, the cache within the IASP network can satisfy subsequent requests for those objects that are stored therein, thereby obviating the necessity of retrieving the object from the originating server on the Internet. This reduces the delay as perceived by the user to access the object and further, saves bandwidth on links that connect the IASP network to the Internet. FIG. 1 is a block diagram of a prior art network in which plural client terminals, such as 101 and 102, are connected to a cache 103 within IASP 104. Cache 103, in turn, is connected to a server 105 connected to the Internet 106. By storing a copy of an object from server 105 in cache 103 when it is first retrieved by client 101, subsequent requests for that same object by client 101, or any other client connected to IASP 104, such as client 102, can be satisfied directly from cache 103.

The problem of satisfying subsequent requests for an object from the cache, however, is that the copy of the object stored in the cache may differ from the object in the server if the latter has been modified since the initial request for the object was made and the object was copied and stored in the cache. Thus, the copy of the object provided to the requesting client from the cache may not be current and may be a stale or outdated version of the object as it currently exists in the server from which it originated.

A prior art attempt at tackling the problem of cache staleness uses a "conditional GET" method. Unlike a standard "GET" method in which an object is retrieved from the server upon each and every request by a client, in the "conditional GET" method, a decision is made to retrieve or not to retrieve an object based on the staleness of the object in the cache, i.e., the length of time that the copy of the object has been in the cache since the object was last verified. Unlike the standard "GET" method, the "conditional GET" method includes an "If-Modified-Since" field in the request, which is typically added to the client terminal's request by the cache, before the request is issued from the cache to the server. The "If-Modified-Since" field carries the date of the modification time of the copy of the object in the cache (the modification time being sent to the cache by the server as a field in the header along with the copy of the object itself, on the first response), and the server responds with either a "Not Modified" message if the object the cache carries is current; or the entire new version of the object if the object the cache carries is stale. Advantageously, if the copy of the object in the cache is not stale, the "conditional GET" method prevents wasteful consumption of resources. However, while the number of bytes transferred in that case is small, it still involves a non-negligible delay to retrieve the "Not Modified" message. Also, these requests load the network, cache and servers.
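By way of illustration only, the server's side of the "conditional GET" exchange described above may be sketched in Python as follows. The function name conditional_get and the use of the numeric HTTP status codes 304 ("Not Modified") and 200 are assumptions made for this sketch; the timestamps are merely sample values.

```python
from datetime import datetime, timezone


def conditional_get(server_last_modified, if_modified_since, body):
    """Answer a conditional GET: a short 'Not Modified' reply or the full object."""
    if server_last_modified <= if_modified_since:
        return 304, b""   # the copy held by the cache is current
    return 200, body      # the copy held by the cache is stale


# Modification time the cache recorded from the first response (sample value).
cached_stamp = datetime(1996, 10, 29, 19, 43, 31, tzinfo=timezone.utc)

unchanged = conditional_get(cached_stamp, cached_stamp, b"<html>v1</html>")
changed = conditional_get(
    datetime(1996, 10, 29, 19, 54, 2, tzinfo=timezone.utc),
    cached_stamp,
    b"<html>v2</html>",
)
```

Even in the unchanged case a full round trip to the server is made, which is the delay noted above.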

Many caches today use the concept of "gracing" to determine whether an object should be retrieved from the cache or directly from the server. A gracing period is associated with each Web object and defines the period of staleness that accessing clients are assumed to be willing to tolerate. Thus, if an object is received by a cache at time t1, a subsequent request for that object at any time up to t1 + Δt will be satisfied from the cache rather than from the server. The gracing time Δt will vary depending upon the type of object being retrieved. Inasmuch as certain objects, such as the New York Times, are likely to be updated relatively frequently, and caches have no idea when the pages are updated, the gracing period needs to be very small if it is assumed that accessing clients want to read only that which is most current. As in the other prior art method, delays in filling requests and wasteful use of resources are inherent.
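The gracing test reduces to a single comparison, which may be sketched as follows. The function name serve_from_cache and the use of seconds as the time unit are assumptions made for this sketch.

```python
def serve_from_cache(received_at, now, grace_seconds):
    """True if a request at `now` falls no later than t1 + delta-t."""
    return now <= received_at + grace_seconds


# Object received at t1 = 1000 s with a gracing period of 300 s (sample values).
within_grace = serve_from_cache(1000.0, 1200.0, 300.0)
past_grace = serve_from_cache(1000.0, 1301.0, 300.0)
```

Nothing in this test reflects whether the object actually changed in the server, which is why a short gracing period forces frequent and often needless retrievals.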

SUMMARY OF THE INVENTION

In accordance with the present invention, information about changes to a Web object in a server is provided to a cache, distinct from providing the Web object itself. Such information about an object is provided by a server in response to a cache request, asynchronous to a request from any client for the object. Further, such information about changes is meant to include information about an object that has been newly created in a server as well as an object that has been modified. In addition to the date and time when the object was last modified, such information about an object typically may also include, for example, the type of content of an object, and the byte size of an object, the latter being useful to the cache for planning disk space usage.

The mechanism of providing information to the cache about changes to an object in a server may be initiated in a first mode by the cache requesting the server to inform it about all or a subset of objects that have changed since a given date and time, or in a second mode by a server informing the cache on a periodic basis about changes to objects that the cache has previously indicated an interest in. The latter requires a registration phase during which the cache provides to the server the range of objects that interest it as well as the mode in which it wants to be updated. After the registration phase, therefore, information is provided to the cache in either the first mode, i.e., periodically sending information updates on all changed objects specified in the registration phase, or in the second mode, i.e., event driven wherein an update is sent whenever the object changes. During this second phase, both modes can co-exist.

Sending information notification messages rather than the contents of the object itself has several advantages over the prior art methodologies. Firstly, it enables cache coherency even if the cache may not have space left on its system to copy the Web object itself. Secondly, a logical separation is provided between information regarding the modification time of an object and the content of the object itself. The cache can therefore choose what objects it wants to refresh or cache, before it actually downloads them. Specifically, if the cache is notified of changes at intervals that are less than half the cache's gracing period for those objects, it does not have to access the server again to determine if the object is fresh if it has the object cached. Precious access time from the user's perspective is thus saved, which can be significant over links with large transfer delays.

Following the exchange of information between the server and the cache, the new or modified Web objects can be retrieved individually or as a batch using multipart messages, Keep-Alive connections, compression techniques, or any other scheme.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram of a prior art network showing a cache connected within an Internet Access Service Provider, interconnecting a client terminal and a server within the Internet;

FIG. 2 shows the interactions between a cache and a server along a timeline in accordance with a cache-initiated notification mechanism of the present invention;

FIG. 3 shows the interactions between a cache and a server along a timeline in accordance with a server-initiated notification mechanism of the present invention; and

FIG. 4 shows the message interaction between a cache and a server for a specific implementation of the present invention.

DETAILED DESCRIPTION

With reference to FIG. 2, a mechanism for a cache-initiated notification is shown. This mode is a Request-Response style mechanism, where information on contents is sent by the server to the cache in response to a request made by the cache. Thus, as shown in FIG. 2, the cache 201 initiates a request 202 to the server 203, which request may include an address list of URLs for which it desires information about changes. This list may include objects with individual URLs, or a plurality of objects within a range of URLs using a wildcard symbol to represent all those objects whose URLs share address commonality. As previously noted, "changes" is intended to mean both modifications to existing objects stored in a server as well as creation of new objects in the server. As can be noted in FIG. 2, the request 202 includes an "if modified since" (IMS) date, thereby indicating that information about changes to such listed URLs is requested only if the URL has been changed since that IMS date.

In response to that request 202, the server 203 checks those listed URLs to determine whether they in fact have been changed since the IMS date. The response 204 thereto is a list of those URLs that in fact have been changed since the specified IMS date, together with the times at which those URLs were changed. This information may also include the size of the requested URL object, as well as other information about the object, such as the type of content of the object. The cache 201 then parses the response and decides, using the retrieved information about those requested URLs, specifically which URLs it desires to then download. Based on its own set of rules, the cache decides to download a copy of an object based on the time at which it was last changed, the size of the object relative to the amount of free disk storage the cache has available, the contents of an object, and/or a combination of any of these pieces of information provided by the server, in addition to any other information that the cache may have available, such as a history of requests for the object by client terminals. Thus, for example, if the cache decides to retrieve the object with URL url, it makes a request 205 to GET the object with URL url using the HyperText Transfer Protocol (HTTP), which is the predominant World Wide Web Internet protocol. The server 203, in reply to that GET request, sends a response back to cache 201 that includes a copy of the body of the object with URL url. The cache 201 then sequentially makes a request to the server to GET a copy of each changed object it wants downloaded to it. As can be noted, this mechanism is totally independent of requests by a client terminal for the object.
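The cache's decision step may be sketched as follows. The two rules used here (skip an object whose cached copy is already current, and skip an object that does not fit in the remaining free space) are merely one hypothetical rule set; the names ChangeInfo and choose_downloads are likewise assumptions of this sketch.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass(frozen=True)
class ChangeInfo:
    url: str
    last_modified: datetime
    size: int  # bytes


def choose_downloads(changes, cached, free_bytes):
    """Return the URLs the cache would GET, in the order they were reported."""
    wanted = []
    for info in changes:
        have = cached.get(info.url)
        if have is not None and have >= info.last_modified:
            continue  # the cached copy is already current
        if info.size > free_bytes:
            continue  # hypothetical rule: skip objects that do not fit
        wanted.append(info.url)
        free_bytes -= info.size
    return wanted


changes = [
    ChangeInfo("http://www.nytimes.com/index.html", datetime(1996, 10, 29, 19, 54, 2), 575),
    ChangeInfo("http://www.nytimes.com/info/textpath.html", datetime(1996, 10, 29, 19, 56, 34), 4096),
]
# The cache already holds the current index.html (sample state).
cached = {"http://www.nytimes.com/index.html": datetime(1996, 10, 29, 19, 54, 2)}
picked = choose_downloads(changes, cached, free_bytes=8192)
```

Because the decision uses only the change information, no object body is transferred until the cache has chosen to request it.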

The mechanism for a server-initiated notification scheme is shown in FIG. 3. This second mode is useful when the cache is interested in changes to specific objects, and wants the server to automatically send information on changes to those objects, either whenever they change or on a periodic basis. In this mode the cache 301 first registers a request 302 with server 303 for update events for a range of URLs. The server 303 then registers those requests and transmits an acknowledgment 304 back to cache 301. After registration, server 303 transmits an update message back to cache 301 whenever one of those registered URLs is changed. Thus, when URL url1 is changed, a message 305 containing information about the object is transmitted to the cache 301. In response thereto, the cache may decide to make a request 306 to GET url1 using the HTTP/1.0 protocol. Alternatively, in response to an information message that a particular URL has been changed, the cache may decide not to download it. Thus, as noted in FIG. 3, when URL url2 is changed and an update message 307 on URL url2 is transmitted to cache 301, the cache decides not to make a request to download the modified url2. In a similar manner, when URL url3 is changed and a message transmitted to the cache, the cache makes a request to download the modified object.
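The registration and event-driven update steps of FIG. 3 may be sketched in a single process as follows. The class NotifyingServer, the callback on_update, the 1024-byte threshold, and the object sizes are all hypothetical; the standard-library function fnmatchcase stands in for whatever wildcard matching an actual server would apply to the registered range of URLs.

```python
from fnmatch import fnmatchcase


class NotifyingServer:
    """Keeps registrations and pushes change information to matching caches."""

    def __init__(self):
        self._registrations = []  # list of (url_pattern, callback) pairs

    def register(self, url_pattern, callback):
        self._registrations.append((url_pattern, callback))
        return "ACK"  # stands in for acknowledgment 304

    def object_changed(self, url, size):
        for pattern, callback in self._registrations:
            if fnmatchcase(url, pattern):
                callback(url, size)  # update message carrying change information


fetched = []


def on_update(url, size):
    # Hypothetical cache policy: download only objects of at most 1024 bytes.
    if size <= 1024:
        fetched.append(url)


server = NotifyingServer()
ack = server.register("http://www.nytimes.com/*", on_update)
server.object_changed("http://www.nytimes.com/url1", 575)    # cache decides to GET
server.object_changed("http://www.nytimes.com/url2", 4096)   # cache declines
server.object_changed("http://example.org/other", 10)        # not registered
```

The server sends a message only when a registered object changes, and the cache remains free to ignore any message.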

After the registration period in which the cache specifies to the server those objects for which it desires information, such information about those objects can be transmitted to the cache upon a change to the object, as described above, or on a periodic basis, or a combination of both. The above-described server-initiated mechanism will reduce the traffic due to queries being made by caches about content changes in the server. For example, if the New York Times alters its content five times a day at instants generally not known in advance to the cache, it will send five update messages to those caches that have registered with it. Assuming caches want to maintain a coherency lag of no greater than six minutes, absence of a server-initiated mechanism could mean that 240 queries are made to the server by each cache per day. Furthermore, if notifications are initiated by the server, they can be multicast in a loose synchronous fashion at the network or application layers as well.
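The figure of 240 queries follows from polling once every six minutes over a 24-hour day, as the following short calculation shows. The variable names are chosen for this sketch only.

```python
MINUTES_PER_DAY = 24 * 60          # 1440 minutes
coherency_lag_minutes = 6          # maximum staleness the cache will accept

polls_per_day = MINUTES_PER_DAY // coherency_lag_minutes   # cache-driven polling
updates_per_day = 5                                         # server-initiated messages

messages_saved = polls_per_day - updates_per_day
```

Under these assumptions each registered cache receives 5 messages per day instead of issuing 240 queries, a difference of 235 messages per cache per day.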

The cache-initiated and the server-initiated mechanisms described above can be implemented in a real network in various ways. One possible implementation of these mechanisms extends the HTTP protocol by defining a new request method called CONTENTS. Alternate designs may use a separate protocol suite, outside of HTTP.

FIG. 4 illustrates this protocol mechanism for a cache-initiated notification mode between a cache 401 and a server 402, which is illustratively shown as being the New York Times having a Web address of www.nytimes.com. In the Request made to the server for information about specific URLs, the HTTP/1.0 and HTTP/1.1 syntax for the Request-Line is:

Request-Line = Method SP (space) Request-URL SP HTTP-Version CRLF (Carriage Return Line Feed)

and the syntax for the Request-URL is:

Request-URL = "*" | absoluteURL | abs-path

This does not permit expressing a complete set of URLs. "*" is therefore chosen as the Request-URL, which fortuitously means that by default the request pertains to all of the contents of the server or serving cache. The Request-Line is thus:

Request-Line = "CONTENTS" SP "*" SP HTTP-Version CRLF

The If-Modified-Since field in the request header is used to specify that only those content changes that took place after the date specified by the If-Modified-Since field are of interest. The Range field is used to specify the URLs that are of interest. This is a field whose syntax has yet to be specified in version 1.1 of the HTTP protocol, and it would be desirable if all regular expressions could be expressed by this field. If this should be insufficient, a new field may need to be created for this purpose. What is desired here is that if a cache is only interested in some select HTML pages, for example all the HTML pages of the New York Times except those from the Sports Section, and the JPEG images of the Louvre, it should be able to specify that union using the Range field. Finally, the Unless field is used to specify any other restriction the cache may want to apply on the URLs that interest it. A new application type termed application/www-contents is also defined to support the response that the server or serving cache returns.

In FIG. 4, in request 403, the request line is:

CONTENTS * HTTP/1.0

wherein CONTENTS is the method requesting a list of URLs, the "*" means that the method is controlled by the protocol, and HTTP/1.0 is that protocol. This line is followed by a CRLF. The next line is:

Accept: application/www-contents

which means that if the server sends the cache a response in accordance with that defined application, the cache will be able to understand it. The next line is:

If-Modified-Since: Sat, Oct. 29, 1996 19:43:31 GMT

which means that only URLs that have changed since Saturday, Oct. 29, 1996 at 19:43:31 GMT are of interest. The last line is:

Range: http://www.nytimes.com/*

This defines the range of URLs which are of interest, with "*" indicating that all "www.nytimes.com" objects are of interest.
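The four lines of request 403 may be assembled as sketched below. The function name build_contents_request is hypothetical, the date string is passed through verbatim as written above, and the empty line that terminates the header block follows ordinary HTTP practice rather than anything stated in the foregoing description.

```python
def build_contents_request(since, url_range, version="HTTP/1.0"):
    """Assemble a CONTENTS request as one CRLF-delimited string."""
    lines = [
        f"CONTENTS * {version}",
        "Accept: application/www-contents",
        f"If-Modified-Since: {since}",
        f"Range: {url_range}",
        "",  # empty line ends the header block (assumed, per usual HTTP framing)
        "",
    ]
    return "\r\n".join(lines)


request = build_contents_request(
    "Sat, Oct. 29, 1996 19:43:31 GMT",
    "http://www.nytimes.com/*",
)
```

The resulting string begins with the Request-Line given above and carries the three header fields in the order in which they are described.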

The response of the server to the cache also needs to follow a specific format. Instead of defining this within the protocol, it is left to the server to specify the format, although the format definition itself needs to have a specific syntax. The format of the file sent in response to the request contains a sequence of lines containing ASCII characters terminated by either the sequence LF (line feed) or CRLF. Each line may contain either a directive or a part of an entry. Entries consist of a sequence of fields relating to a single HTTP object. If a field is unused in a particular entry, "-" marks the omitted field. Directives provide information about the version, as well as header fields of the objects that follow. Lines beginning with the # character contain directives. The following directives are defined:

Version: <integer>.<integer>

The version of the extended log file format used.

Syntax: [<specifier> . . . ]

Specifies the fields recorded in the log. The strings SP and CRLF have special meaning.

Remark: <text>

Comment information. Data recorded in this field should be ignored by analysis tools.

The directives Version and Syntax are required. The Syntax directive may appear multiple times, with the understanding that all entries obey the Syntax directive that is above and closest to them. The Syntax directive specifies the data recorded in the fields of each entry.

In the response message 404, the line "201 O.K." indicates that the request was understood and that a valid response follows. Content-Type on the next line indicates that a special document with a certain syntax follows that is not just textual in nature. The directive #Version defines the type of syntax, specifically version 1.0. The directive #Syntax: Last-Modified CRLF URL SP Content-Length indicates that each entry that follows consists of the last-modified date and, on the next line, the URL that has been modified, followed by a space and the size of the object in bytes. Thus the response 404 indicates that two objects matched the request 403. The first object has URL http://www.nytimes.com/index.html, having been last modified on Saturday, Oct. 29, 1996 at 19:54:02 GMT, and having a length of 575 bytes. The second object has a URL of http://www.nytimes.com/info/textpath.html having been last modified on Saturday, Oct. 29, 1996 at 19:56:34 GMT, and having a length of 4096 bytes.
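A cache might parse the body of such a response as sketched below. The function parse_contents is hypothetical and handles only the one layout named by the #Syntax directive above, not arbitrary Syntax directives; the sample body is a reconstruction from the description of response 404, not a copy of FIG. 4, and the dates are kept as uninterpreted strings.

```python
def parse_contents(body):
    """Parse entries laid out as 'Last-Modified CRLF URL SP Content-Length'."""
    directives = {}
    entries = []
    pending_date = None
    for line in body.splitlines():  # accepts LF or CRLF line endings
        line = line.strip()
        if not line:
            continue
        if line.startswith("#"):
            name, _, value = line[1:].partition(":")
            directives[name.strip()] = value.strip()
        elif pending_date is None:
            pending_date = line  # first line of an entry: Last-Modified
        else:
            url, _, length = line.rpartition(" ")
            size = None if length == "-" else int(length)  # "-" marks an omitted field
            entries.append((url, pending_date, size))
            pending_date = None
    return directives, entries


sample_body = "\r\n".join([
    "#Version: 1.0",
    "#Syntax: Last-Modified CRLF URL SP Content-Length",
    "Sat, Oct. 29, 1996 19:54:02 GMT",
    "http://www.nytimes.com/index.html 575",
    "Sat, Oct. 29, 1996 19:56:34 GMT",
    "http://www.nytimes.com/info/textpath.html 4096",
])
directives, entries = parse_contents(sample_body)
```

Each parsed entry gives the cache the URL, the modification time, and the byte size, which is the information it needs to decide whether to issue a GET.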

Cache 401 receives response 404 and chooses which objects to download from server 402 by means of a GET request. Thus, as noted in FIG. 4, cache 401 issues a request 405 to GET http://www.nytimes.com/info/textpath.html HTTP/1.0, from server 402. Server 402 subsequently fills that request by forwarding the body of that object back to cache 401, where it replaces the stale version of the object in the cache.

The above-described embodiments are illustrative of the principles of the present invention. Other embodiments may be devised by those skilled in the art without departing from the spirit and scope of the present invention.

Patent Citations
Cited Patent | Filing date | Publication date | Applicant | Title
US5151989 * | Feb 13, 1987 | Sep 29, 1992 | International Business Machines Corporation | Directory cache management in a distributed data processing system
US5581704 * | Dec 6, 1993 | Dec 3, 1996 | Panasonic Technologies, Inc. | System for maintaining data coherency in cache memory by periodically broadcasting invalidation reports from server to client
US5734898 * | Jun 26, 1995 | Mar 31, 1998 | International Business Machines Corporation | Client-server computer system and method for updating the client, server, and objects
US5787470 * | Oct 18, 1996 | Jul 28, 1998 | At&T Corp | Inter-cache protocol for improved WEB performance
US5793965 * | Mar 22, 1995 | Aug 11, 1998 | Sun Microsystems, Inc. | Method and apparatus for determining the type of an object in a distributed object system
US5826253 * | Apr 19, 1996 | Oct 20, 1998 | Borland International, Inc. | Database system with methodology for notifying clients of any additions, deletions, or modifications occurring at the database server which affect validity of a range of data records cached in local memory buffers of clients
Referenced by
Citing Patent | Filing date | Publication date | Applicant | Title
US6199107 * | Jul 22, 1998 | Mar 6, 2001 | Microsoft Corporation | Partial file caching and read range resume system and method
US6233606 | Dec 1, 1998 | May 15, 2001 | Microsoft Corporation | Automatic cache synchronization
US6272593 | Apr 10, 1998 | Aug 7, 2001 | Microsoft Corporation | Dynamic network cache directories
US6442654 * | Dec 10, 1999 | Aug 27, 2002 | International Business Machines Corporation | Operating system support for in-server caching of documents
US6553409 | Jul 9, 1999 | Apr 22, 2003 | Microsoft Corporation | Background cache synchronization
US6854018 * | Apr 8, 2000 | Feb 8, 2005 | Nec Corporation | System and method for intelligent web content fetch and delivery of any whole and partial undelivered objects in ascending order of object size
US7054912 * | Mar 8, 2002 | May 30, 2006 | Kabushiki Kaisha Toshiba | Data transfer scheme using caching technique for reducing network load
US7185011 | Oct 31, 2000 | Feb 27, 2007 | Microsoft Corporation | Method and system for directory balancing
US7243136 * | Jan 17, 2001 | Jul 10, 2007 | Inktomi Corporation | Approach for managing and providing content to users
US7636765 | Feb 15, 2006 | Dec 22, 2009 | Kabushiki Kaisha Toshiba | Data transfer scheme using caching technique for reducing network load
US7680875 * | Apr 1, 2002 | Mar 16, 2010 | Novell, Inc. | Markers for cached objects
US7779068 | Feb 7, 2005 | Aug 17, 2010 | Nec Corporation | System and method for intelligent web content fetch and delivery of any whole and partial undelivered objects in ascending order of object size
US8756130 * | Mar 26, 2009 | Jun 17, 2014 | Scottrade, Inc. | System and method for the automated brokerage of financial instruments
US8805950 * | Feb 6, 2008 | Aug 12, 2014 | Aol Inc. | Client web cache
US20090187502 * | Mar 26, 2009 | Jul 23, 2009 | Scottrade, Inc. | System and Method for the Automated Brokerage of Financial Instruments
US20130212636 * | Dec 10, 2012 | Aug 15, 2013 | Wistron Corporation | Electronic device and a method of synchronous image display
US20140019576 * | Jul 13, 2012 | Jan 16, 2014 | International Business Machines Corporation | Intelligent edge caching
Classifications
U.S. Classification: 709/213, 711/124, 709/203, 711/147, 711/130, 707/E17.12, 709/223, 711/146, 709/216, 707/999.01, 707/999.202
International Classification: G06F13/00, G06F17/30, G06F12/00
Cooperative Classification: Y10S707/99953, G06F17/30902
European Classification: G06F17/30W9C
Legal Events
Date | Code | Event | Description
Jan 27, 2012 | FPAY | Fee payment | Year of fee payment: 12
Jan 17, 2008 | FPAY | Fee payment | Year of fee payment: 8
Jan 29, 2004 | FPAY | Fee payment | Year of fee payment: 4