US6049874A - System and method for backing up computer files over a wide area computer network - Google Patents

System and method for backing up computer files over a wide area computer network Download PDF

Info

Publication number
US6049874A
US6049874A US09/041,149 US4114998A US6049874A US 6049874 A US6049874 A US 6049874A US 4114998 A US4114998 A US 4114998A US 6049874 A US6049874 A US 6049874A
Authority
US
United States
Prior art keywords
block
computer
file
data
data center
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
US09/041,149
Inventor
Fred W. McClain
Thomas B. Bolt
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
CDD TECHNOLOGIES LLC
DOMINION VENTURE FINANCE LLC
Original Assignee
Fairbanks Systems Group
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Family has litigation
US case filed in Texas Eastern District Court litigation Critical https://portal.unifiedpatents.com/litigation/Texas%20Eastern%20District%20Court/case/6%3A13-cv-00702 Source: District Court Jurisdiction: Texas Eastern District Court "Unified Patents Litigation Data" by Unified Patents is licensed under a Creative Commons Attribution 4.0 International License.
US case filed in Texas Eastern District Court litigation https://portal.unifiedpatents.com/litigation/Texas%20Eastern%20District%20Court/case/6%3A13-cv-00700 Source: District Court Jurisdiction: Texas Eastern District Court "Unified Patents Litigation Data" by Unified Patents is licensed under a Creative Commons Attribution 4.0 International License.
US case filed in Texas Eastern District Court litigation https://portal.unifiedpatents.com/litigation/Texas%20Eastern%20District%20Court/case/6%3A13-cv-00699 Source: District Court Jurisdiction: Texas Eastern District Court "Unified Patents Litigation Data" by Unified Patents is licensed under a Creative Commons Attribution 4.0 International License.
US case filed in Texas Eastern District Court litigation https://portal.unifiedpatents.com/litigation/Texas%20Eastern%20District%20Court/case/6%3A13-cv-00470 Source: District Court Jurisdiction: Texas Eastern District Court "Unified Patents Litigation Data" by Unified Patents is licensed under a Creative Commons Attribution 4.0 International License.
US case filed in Texas Eastern District Court litigation https://portal.unifiedpatents.com/litigation/Texas%20Eastern%20District%20Court/case/6%3A13-cv-00339 Source: District Court Jurisdiction: Texas Eastern District Court "Unified Patents Litigation Data" by Unified Patents is licensed under a Creative Commons Attribution 4.0 International License.
US case filed in Texas Eastern District Court litigation https://portal.unifiedpatents.com/litigation/Texas%20Eastern%20District%20Court/case/6%3A13-cv-00338 Source: District Court Jurisdiction: Texas Eastern District Court "Unified Patents Litigation Data" by Unified Patents is licensed under a Creative Commons Attribution 4.0 International License.
US case filed in Texas Eastern District Court litigation https://portal.unifiedpatents.com/litigation/Texas%20Eastern%20District%20Court/case/6%3A13-cv-00335 Source: District Court Jurisdiction: Texas Eastern District Court "Unified Patents Litigation Data" by Unified Patents is licensed under a Creative Commons Attribution 4.0 International License.
First worldwide family litigation filed litigation https://patents.darts-ip.com/?family=25046505&utm_source=google_patent&utm_medium=platform_link&utm_campaign=public_patent_search&patent=US6049874(A) "Global patent litigation dataset” by Darts-ip is licensed under a Creative Commons Attribution 4.0 International License.
Assigned to FAIRBANKS SYSTEMS GROUP reassignment FAIRBANKS SYSTEMS GROUP ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: BOLT, THOMAS B., MCCLAIN, FRED W.
Priority to US09/041,149 priority Critical patent/US6049874A/en
Application filed by Fairbanks Systems Group filed Critical Fairbanks Systems Group
Priority to US09/100,914 priority patent/US6038665A/en
Priority to AU30764/99A priority patent/AU3076499A/en
Priority to CA002323522A priority patent/CA2323522A1/en
Priority to PCT/US1999/005248 priority patent/WO1999046660A2/en
Priority to EP99912379A priority patent/EP1062558A2/en
Priority to JP2000535980A priority patent/JP2002507016A/en
Publication of US6049874A publication Critical patent/US6049874A/en
Application granted granted Critical
Assigned to SKYDESK, INC. reassignment SKYDESK, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: FAIRBANKS SYSTEMS GROUP
Assigned to IMPERIAL BANK reassignment IMPERIAL BANK SECURITY INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SKYDESK, INC.
Assigned to DOMINION VENTURE FINANCE L.L.C. reassignment DOMINION VENTURE FINANCE L.L.C. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SKYDESK, INC.
Assigned to SKYDESK, INC. reassignment SKYDESK, INC. REASSIGNMENT AND RELEASE OF SECURITY INTEREST Assignors: COMERICA BANK-CALIFORNIA AS SUCCESSOR IN INTEREST TO IMPERIAL BANK
Assigned to SWAPDRIVE, INC. reassignment SWAPDRIVE, INC. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SKYDESK, INC.
Assigned to SILICON VALLEY BANK reassignment SILICON VALLEY BANK SECURITY AGREEMENT Assignors: SWAPDRIVE, INC.
Assigned to SWAPDRIVE INC reassignment SWAPDRIVE INC RELEASE Assignors: SILICON VALLEY BANK
Assigned to SYMANTEC CORPORATION reassignment SYMANTEC CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SWAPDRIVE, INC.
Assigned to DOMINION VENTURA FINANCE L.L.C. reassignment DOMINION VENTURA FINANCE L.L.C. CORRECTIVE ASSIGNMENT TO CORRECT THE NATURE OF CONVEYANCE: FROM ASSIGNMENT TO SECURITY AGREEMENT PREVIOUSLY RECORDED ON REEL 012252 FRAME 0524. ASSIGNOR(S) HEREBY CONFIRMS THE DOMINION VENTURA FINANCE L.L.C.. Assignors: SKYDESK, INC.
Assigned to SKY DESK, INC. reassignment SKY DESK, INC. RELEASE BY SECURED PARTY (SEE DOCUMENT FOR DETAILS). Assignors: DOMINION VENTURE FINANCE L.L.C.
Assigned to DOMINION VENTURE FINANCE L.L.C. reassignment DOMINION VENTURE FINANCE L.L.C. CORRECTIVE ASSIGNMENT TO CORRECT THE REEL/FRAME TO 012252/0523; AND CORRECT NAME TO DOMINION VENTURE FINANCE L.L.C. PREVIOUSLY RECORDED ON REEL 029929 FRAME 0864. ASSIGNOR(S) HEREBY CONFIRMS THE DOMINION VENTURE FINANCE L.L.C.. Assignors: SKYDESK, INC.
Assigned to CDD TECHNOLOGIES, LLC reassignment CDD TECHNOLOGIES, LLC ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: SYMANTEC CORPORATION
Anticipated expiration legal-status Critical
Expired - Lifetime legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q40/00Finance; Insurance; Tax strategies; Processing of corporate or income taxes
    • G06Q40/08Insurance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1448Management of the data involved in backup or backup restore
    • G06F11/1451Management of the data involved in backup or backup restore by selection of backup contents
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/14Error detection or correction of the data by redundancy in operation
    • G06F11/1402Saving, restoring, recovering or retrying
    • G06F11/1446Point-in-time backing up or restoration of persistent data
    • G06F11/1458Management of the backup or restore process
    • G06F11/1461Backup scheduling policy
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99951File or database maintenance
    • Y10S707/99952Coherency, e.g. same view to multiple users
    • Y10S707/99953Recoverability
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10TECHNICAL SUBJECTS COVERED BY FORMER USPC
    • Y10STECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y10S707/00Data processing: database and file management or data structures
    • Y10S707/99951File or database maintenance
    • Y10S707/99952Coherency, e.g. same view to multiple users
    • Y10S707/99955Archiving or backup

Definitions

  • the present invention relates generally to computer file back up systems, and more particularly to automated computer file back up systems.
  • Extraordinarily large amounts of information are stored on electronic media, such as personal computer hard disk drives and other well-known data storage media. Increasingly, the information is exclusively stored on such media; no "hard” (i.e., paper) copies exist for much information. It can readily be appreciated that such so-called “paperless” information storage systems consume less space and virtually no natural resources, in contrast to systems that require information to be stored on paper. Unfortunately, it is not uncommon for electronic data storage media to "crash", i.e., to become damaged or otherwise lose its information. Such information loss can be devastating to the information owner and highly costly.
  • back up systems suffer from several drawbacks. First, they usually require user interaction to initiate the back up process, and consequently occupy the time of a person who must undertake the tedious chore. Further, systems which back up computer files and store the files in the same premises as the system being backed up are of little value in the event of a catastrophe such as a fire or flood. This is because the backed up file copies can be expected to perish along with the files that are resident in the system, thereby rendering the back up system a failure.
  • a second, more rigorous check is made to determine whether the segment has been changed since the last back up.
  • the second check consists of comparing a cyclic redundancy check (CRC) product of the segment against a previously recorded CRC value. If the CRCs do not match, a file change is indicated, and the changed portion of the file is sent via modem, LAN, WAN, or other network device to the off-site facility. Otherwise, the segment is assumed to have not been changed since the last back up.
  • CRC cyclic redundancy check
  • the present invention recognizes that some files need not initially be backed up at all. And, the present invention recognizes that file restoration can be provided for in more than one way to reduce the time required to restore lost files to a computer system.
  • the present invention recognizes that data backup undertaken at predetermined intervals might risk failing to backup data that is generated and lost between the intervals. Fortunately, the present invention recognizes that data backup via wide area networks can be undertaken as a user is actively using a computer, thereby providing real time and, hence, almost completely up to date data backup.
  • the data is preferably first encrypted for security reasons.
  • the present invention recognizes that the longer a user employs a single key for encrypting data, the greater the risk that the key will be broken. For this reason, it is preferably to use more than one key to encrypt data.
  • the present invention understands that the use of more than one key over a period of time is rendered somewhat difficult because it is difficult during subsequent decryption to associate the correct encryption key with the blocks that were originally encrypted with the key.
  • the present invention still further recognizes that in addition, it is desirable and possible to provide a session-unique key for each transmission session that cannot be broken unless every bit of data to be transmitted is known a priori.
  • an object of the present invention to provide a system and method for backing up computer files at predetermined intervals or continuously, as he computer is being used.
  • Another object of the present invention is to provide a system and method for backing up computer files to a remote facility via modem, LAN, WAN, or other network device.
  • Still another object of the present invention is to provide a system and method for backing up computer files by automatically sending only changed portions of the files to an off-site facility.
  • Yet another object of the present invention is to provide a system and method for backing up computer files that is easy to use and cost-effective.
  • Another object of the present invention is to provide easy to use and effective encryption processes for backing up data via a wide area network.
  • Yet another object of the present invention is to provide a data backup system which automatically updates a centrally stored common software library.
  • a computer program product includes a computer program storage device that is readable by a digital processing system and a program means on the program storage device.
  • the program device is realized in a critical machine component that causes the digital processing system to perform method steps to back up at least some blocks in at least one local computer file from at least one user computer in a computer network.
  • the method steps include transmitting the local file to a data center that is remote from the user computer to render a remote version, with the data center being accessible via the computer network. For each of the blocks (i.e., each sequence of two thousand (2000) or so bytes) in the local file, two respective characters thereof are copied, with the characters defining a respective first comparison value. Additionally, the method steps include generating respective digital signature codes defining second comparison values.
  • the method steps include, for each block, periodically determining a test digital signature code and comparing it to the respective second comparison value.
  • a block of the local file is designated as a transmission block to be transmitted to the data center to thereby update the remote version when the test digital signature code of the block does not equal the second comparison value of the block.
  • the method steps may further include gathering transmission blocks in a transfer chunk.
  • the transfer chunk is transmitted to the data center.
  • transfer chunks are transmitted to the data center only during a predetermined period.
  • the method steps may additionally include, for each local file block, determining whether a duplicate of the local file block is stored in a common file library at the data center. The file block is copied to the data center only when a duplicate of the local file block is not stored in a common file library.
  • the method steps yet further include periodically copying remote versions of file blocks at the data center onto a portable data storage medium.
  • Local files are restored by copying remote versions of file blocks from the portable data storage medium when the remote versions that are stored on the portable data storage medium are at least as current as the remote versions stored at the data center, and otherwise they are restored by copying remote versions of the file blocks from the data center.
  • the computer program product is also disclosed in combination with the digital processing apparatus, and in further combination with the computer network.
  • a computer-implemented method using the above-described steps is also disclosed.
  • a system for backing up files in user computers.
  • the system includes a data center including a library of common computer file blocks, and a plurality of user computers remote from the data center and in communication with the data center.
  • Common library logic means determine, for each user computer, which if any of the file blocks of the user computer are common file blocks by virtue of being contained in the common library.
  • initialization logic means are associated with each user computer for causing file blocks except common file blocks to be transmitted to the data center.
  • subfile incremental back up logic means are associated with each user computer for determining changed blocks that have been changed since a predetermined back up time. Changed block signals are generated in response thereto.
  • Chunk transmission logic means are responsive to the subfile incremental logic means for sending only changed blocks to the data center in chunks having a predetermined size.
  • synchronizing logic means are responsive to the changed block signals for synchronizing the subfile incremental logic means.
  • a computer program product in another aspect, includes a computer program storage device readable by a digital processing system and logic means on the program storage device.
  • the logic means include instructions that are executable by the digital processing system for performing method steps for backing up at least some blocks in at least one local computer file from at least one user computer in a computer network.
  • the method steps performed by the computer program product include transmitting the local file to a data center remote from the user computer to render a remote version.
  • the data center is accessible via the computer network and has an electronic storage capacity of at least one hundred terabytes.
  • the local file is transmitted only when the data center does not have an exact copy of the local file.
  • a system for backing up files in user computers includes a data center including a common library of computer data and a plurality of user computers remote from the data center and in communication with the data center. Further, common library logic means determine which if any of data blocks of the user computers are common computer file blocks by virtue of being contained in the common library, it being understood that computer file blocks that are determined not to be common computer file blocks are non-common file blocks. Common library addition means are provided for adding, to the common library, a non-common file block when the non-common file block appears at least a predetermined number of times in the user computers.
  • a computer-implemented method for backing up at least one local computer file from at least one user computer in a computer network includes transmitting the local file to a data center remote from the user computer to render a remote version. Changes to a first block of the local file are received, it being understood that the changes are established by a user of the user computer generating input events. Immediately upon receiving the changes, at least the first block is designated as a transmission block to be transmitted to the data center for updating the remote version. The method then includes transmitting at least the first block to the data center substantially while the user is generating input events.
  • a computer-implemented method for backing up computer file blocks includes, at a user computer, encrypting at least one block using one or more encryption keys. The method further includes storing the one or more encryption keys in a key file, and transmitting the block to a remote data center. Next, the method contemplates, at a restore time, transmitting the block from the data center to the user computer, and selecting one of the keys from the key file. The block is decrypted with the key, and the method then determines whether the decryption was successful. When the decryption is not successful, another key is selected from the key list and the data is decrypted, and the above steps repeated until the decryption is successful.
  • a method for providing a session key for a transmission session for transmitting data between a transmitting computer and a receiving computer includes generating a digital signature unique to the data, and using the digital signature to encrypt the data.
  • a method for backing up data in a system including user computers and at least one data storage facility includes, at a user computer, determining whether data associated with the user computer is resident in a common software library associated with the data storage facility. The data is not backed up when the data is resident in the common software library. Otherwise, the data is automatically added to the common software library when the data is present in at least a predetermined number of user computers in the system.
  • a method for backing up computer data in a computer system including user computers and a data storage facility online with a wide area network.
  • the method includes, for at least one user computer online with the wide area network, as data is entered into the user computer during input events, transmitting at least part of the data to the data storage facility via the wide area network between input events.
  • FIG. 1 is a schematic diagram of the computer file back up system of the present invention
  • FIG. 2 is a flow chart of the initialization process
  • FIG. 3 is a flow chart of the subfile incremental back up process
  • FIG. 4 is a flow chart of the restore process
  • FIG. 5 is a flow chart of the process for automatically updating the common software library
  • FIG. 6A is a flow chart of the process for encrypting data to be backed up
  • FIG. 6B is a flow chart of the process for decrypting backed up data received from the data center
  • FIG. 7A is a flow chart of the process for transferring data from a user computer to a data center using the message digest (MD)-5 comparison value of the data being transferred as an encryption random number;
  • MD message digest
  • FIG. 7B is a flow chart of the process for validating the data transfer shown in FIG. 7A.
  • FIG. 8 is a flow chart of the process for continuously backing up the computer files of a user computer while a user is using the computer and the computer is online with a remote data center via a wide area network.
  • a system is shown, generally designated 10, which includes a server computer, referred to herein as a data center 12, which is part of a computer network.
  • the data center 12 is part of the computer network 14 referred to as the Internet, and the data center 12 can communicate with other computers on the network 14 via a modem, LAN, WAN, or other network device 16.
  • the data center 12 includes an electronic data storage repository 18 with an associated common library 20.
  • the data repository 18 has a multi-terabyte or indeed a petabyte capacity.
  • the capacity of the repository 18 is over one hundred terabytes, and more preferably is five hundred terabytes or more.
  • a control module 22 controls the flow of data into and out of the repository 18 in consonance with the novel logic described below.
  • the common library 20 stores the blocks of computer files that are widely distributed and used by many users and, hence, that are common to many users.
  • the common library 20 can store copies of well-known off-the-shelf personal computer operating systems, network browsers, and so on.
  • the common library 20 can be automatically updated as discussed further below in reference to FIG. 5.
  • the data storage repository 18 with common library 20 can be physically implemented in a suitable medium, such as magnetic tape, hard disk arrays, compact disk read-only memory (CD-ROM), DASD arrays, or even floppy diskettes.
  • each user computer 24 includes a respective software-implemented back up module 28 that undertakes the inventive steps of the present invention. It is to be understood that in another embodiment, the back up modules 28 can be remote from, but accessible to, the user computers 24. Additionally, each user computer 24 preferably includes a respective CD-ROM drive 30 for purposes to be made clear shortly. As those skilled in the art will recognize, the user computers 24 can also be associated with respective video monitors, printers, and other output devices, and keyboards, keypads, mice, and other input devices.
  • FIGS. 2-8 illustrate the structure of the back up module 30 of the present invention as embodied in computer program software.
  • FIGS. 2-8 illustrate the structure of the back up module 30 of the present invention as embodied in computer program software.
  • FIGS. 2-8 illustrate the structures of logic elements, such as computer program code elements or electronic logic circuits, that function according to this invention.
  • the invention is practiced in its essential embodiment by a machine component that renders the logic elements in a form that instructs a digital processing apparatus (that is, a computer) to perform a sequence of function steps corresponding to those shown in the Figures.
  • a digital processing apparatus that is, a computer
  • These instructions may reside on a program storage device including a data storage medium to establish a computer program product, such as a programmed computer diskette.
  • a computer program product such as a programmed computer diskette.
  • such media can also be found in semiconductor devices, on magnetic tape, on optical disks, on a DASD array, on a conventional hard disk drive, on electronic read-only memory or on electronic random access memory, or other appropriate data storage device.
  • the logic means are computer-executable instructions that are written in C ++ language code.
  • the process begins at start state 32 for each user computer 24 and moves to block 34, wherein a file index counter "j" is set equal to unity.
  • decision diamond 36 it is determined whether a copy of the j th file, using the so-called “message digest five" ("MD5") code discussed further below, is already stored in the common library 20. This can be done on a file-by-file basis by comparing the MD5 of each j th file to the MD5s of the files in the common library 20.
  • MD5 mesage digest five
  • the process moves to block 38 to flag the file as being in the common library and, hence, not a candidate for wholesale transfer to the data center 12.
  • greater granularity can be achieved by comparing the MD5 of each block of each j th file to a list of common block MD5s, and flagging only those blocks having MD5s that match an MD5 on the common library 20 list of MD5s as being in the common library and, hence, not candidates for wholesale transfer to the data center 12.
  • the modified portions will be detected and transferred to the data center 12 during the subfile incremental procedure discussed below in reference to FIG. 3.
  • the use of the common library 20 avoids the necessity of transmitting to the data center 12 very large, off-the-shelf computer files that are resident on many if not most personal computer systems and that would otherwise require backing up. In other words, the common library 20 reduces the time and cost of making initial copies of the files in the user computers 24.
  • a block-by-block comparison value listing is recorded that has two entries for every block i (wherein "i" is a block index counter) of every files j flagged as being in the common library 20.
  • the first entry is the first two characters ("XX 1 ") of the first byte of each block i .
  • the second entry is a digital signature ("MD5 i "), the value of which is uniquely defined by the contents of the block i .
  • the first entry establishes a first comparison value
  • the second entry establishes a second comparison value.
  • the digital signature used is the 128 bit-long "message digest five" ("MD5") code known in the cryptology art.
  • MD5 code of a block changes a great deal with even very small changes of the block.
  • the digital signature of the present invention advantageously is more sensitive to changes in a block than are check sums, cyclic redundancy codes, and so-called "exclusive or" codes.
  • decision diamond 40 determines whether the last file in the user computer 24 has been tested. If so, the initialization process terminates at state 42. Otherwise, the process moves to block 44 to increment by one the file index counter "j", and then to loop back to decision diamond 36.
  • the logic proceeds to block 46 to trickle copy each non-resident block i of the file j to the storage repository 18 of the data center 12.
  • trickle copy is meant that the non-resident blocks of the file are transmitted to the data center 12 during a predetermined transmission period, also referred to herein as a "window", until the entire non-resident portions file has been transmitted, or until the transmission period elapses.
  • the data to be backed up can be transmitted real time as the user inputs data to the user computer 24, provided the user computer 24 is online with the network 14.
  • the respective back up module 28 completes the transmission of the chunk, and then terminates. If the transmission is interrupted during the transmission period, the module 28 notes the value of the block index counter "i", such that transmission can be resumed from there during the next transmission period. Then, at the start of the next transmission period, the back up module 28 increments the block index counter "i" by unity and commences transmitting the file j where it previously had left off. As indicated in FIG. 2, the block-by-block listing of XX i and MD5 i is generated at block 46 as described previously in reference to block 38. From block 46, the logic proceeds to decision diamond 40 to function as previously described.
  • FIG. 3 the subfile incremental back up logic can be seen. It is to be understood that as intended by the present invention, periodically (e.g., on a daily basis) the back up module 28 of each user computer 24 will undertake the process discussed below.
  • decision diamond 48 it is determined whether a user-selected predetermined back up period has commenced. When it has, the process moves to block 50 to enter a do loop for each file j that has been designated by the user to be backed up. Then, the process moves to block 52 to enter a do loop for each block i of the file j that was selected at block 50.
  • the back up module 28 computes the digital signature MD5 i new of the block i under test.
  • the digital signature MD5 i new of the block i under test is compared, at decision diamond 56, with the digital signature MD5 i old that is recorded in the listing that was generated in FIG. 2, as described above.
  • the comparison of the MD5 i+ 1 new of the i th +1 block to be tested is first compared to MD5 l+ 1 old , i.e., to the l th +1 element in the ordered list of old digital signatures. If no match is found, the MD5 i+ 1 new of the i th +1 block is then serially tested against the digital signatures in the list after the l th +1 element in the ordered list of old digital signatures.
  • the process moves to decision diamond 58.
  • decision diamond 58 the logic determines whether the last block of the file j has been tested, and if not, the process loops back to retrieve the next block i and compute its digital signature as shown and described. In other words, if further blocks exist for the file j under test, the index counter "i" is incremented by one, and the process loops back to block 54.
  • the logic determines, at decision diamond 60, whether the last file in the user computer 24 has been tested. If not, the process loops back to retrieve the next file and proceed as before to test the blocks in the file. Otherwise, the process moves to decision diamond 62.
  • decision diamond 62 it is determined whether the back up period has not elapsed, i.e., whether the allocated back up window is still open. If the period has elapsed, the logic loops back to decision diamond 48 to await the commencement of the next allocated back up period. In contrast, if the back up window is still open, the logic attempts to establish communication with the data center 12 via the modems 26, 16 (or other network devices, such as LANs, WANs, etc.).
  • decision diamond 64 it is determined whether the communication connection was successful. If it wasn't, the logic moves to block 66 to wait a predetermined time-out period (preferably five minutes) before looping back to decision diamond 62 to retry the connection if the window is still open. In any case, when it is determined that communication has been established between the user computer 24 and the data center 12 subsequent to testing all files j in the user computer 12, the logic encrypts and compresses any remaining updated blocks (described further below) and transmits them to the data center 12, at block 68. From block 68, the process loops back to decision diamond 48.
  • a predetermined time-out period preferably five minutes
  • the logic moves to block 70, wherein a do loop is entered for the bytes k of the block 1 .
  • "k" is an index counter initially equal to unity.
  • the logic determines whether the first two characters ("XX k new ") of the block starting with the byte k under test equal the first comparison value ("XX old ”) that corresponds to the block i in the comparison value listing. It is to be understood that the comparison value listing parallels the ordered list of digital signatures MD5 old .
  • the two characters XX k new are compared to the next two characters XX old in the ordered list until a match is found or until the end of the list is reached.
  • the logic moves to decision diamond 73 to determine whether the current byte k is the last byte of the candidate block under test. Stated differently, at decision diamond 73 the logic determines whether the byte index counter "k" equals about two thousand forty eight (2048). If it does, the logic moves to block 74 to increment the block index counter "i" by unity, and then the logic loops proceeds to decision diamond 58. In this instance, the entire block under test is flagged for copying. If it is determined that the byte index counter "k” does not equal one thousand at decision diamond 73, the logic loops back to increment "k” by unity and test the next byte in the block i at decision diamond 72.
  • decision diamond 75 the digital signature MD5 of the block having as its first byte the byte k under test is determined and compared to the second comparison value, i.e., the digital signature MD5 old that corresponds to the matched characters XX old in the comparison value listing.
  • the digital signature MD5 being tested is tested against subsequent digital signatures MD5 old in the ordered list, if necessary, to find a match. If no match is found, the logic moves to decision diamond 73.
  • the logic returns "resynchronized" and moves to block 76.
  • a positive test at decision diamond 75 indicates that the logic has found an old, unchanged block that previously has been backed up, and, hence, that the logic is resynchronized with the comparison value listing.
  • the changed block(s) (also referred to herein as “transmission blocks”) are moved to a "next chunk" file.
  • the comparison value listing is updated to include the first two characters and digital signatures of the changed block(s), for use as the first and second comparison values, respectively, during the test of the blocks during the next back up cycle.
  • decision diamond 78 it is determined whether the chunk file is full. In the presently preferred embodiment, the chunk file is full when its size is five megabytes (5 MB).
  • the logic returns to decision diamond 58. In contrast, if the chunk file is full, the process moves to decision diamond 80 to determine whether the back up period has not elapsed, i.e., whether the allocated back up window is still open. If the period has elapsed, the logic loops back to decision diamond 48 to await the commencement of the next allocated back up period. In contrast, if the back up window is still open, the logic transmits the chunk when a successful connection has been established with the data center 12, using the procedure of steps 64 and 66 discussed above. From block 82, the logic returns to decision diamond 58.
  • FIG. 4 shows the logic by which lost files may be restored to a user computer 24.
  • backed up file blocks at the data center 12 are periodically (e.g., weekly, monthly, quarterly, etc.) copied from the storage repository 18 to CD-ROM disks, and the disks then transported to the same physical location as the associated user computer 24.
  • CD-ROM disks are referred to as "local back up disks”.
  • a directory of blocks in the data center 12 that correspond to the user computer 24 is downloaded from the data center 12.
  • a do loop is entered for each block i of the lost file(s) as follows. It is determined at decision diamond 88 whether the requested version of the block of the file (ordinarily the latest version) is on the local back up disks. If so, the block is preferentially restored from the local back up disks at block 90. Otherwise, it is restored via the modems 26, 16 from the data center 12 at block 92. From block 90 or block 92, the logic loops back to retrieve the next block i , and continues this process until the entire requested files have been restored.
  • the common software library 20 shown in FIG. 1 may be automatically updated by commencing at block 100 in FIG. 5.
  • the server i.e., at the data center 12 shown in FIG. 1 undertakes, for each digital signature MD5 code with its attendant block received for storage, a loop by moving to decision diamond 102, wherein it is determined whether the digital signature MD5 code under test has been recorded as a candidate for the library 20.
  • decision diamond 102 The first time the particular digital signature MD5 code is received from any user computer, the test at decision diamond 102 is negative, and the process consequently moves to block 104 to record the digital signature MD5 code on a candidate list.
  • a counter J MD5 for the particular digital signature MD5 code under test is set equal to unity at block 104. Moving from block 104 to block 106, the next received digital signature MD5 code is retrieved and the above process repeated as indicated by the dashed loop back line in FIG. 5.
  • the test at decision diamond 102 is positive, and the process moves to block 108 to increment the counter J MD5 that is associated with the digital signature MD5 code under test by one. It is to be understood that the counter J MD5 that is associated with the digital signature MD5 code under test can be incremented whenever the digital signature MD5 code under test is received from any user computer, or only when it received from a user computer that has not previously sent the digital signature MD5 code under test.
  • decision diamond 110 it is determined whether the value of the counter J MD5 that is associated with the digital signature MD5 code under test equals a predetermined value "k". If not, the process loops back to block 106, but otherwise the process moves to block 112 to add the digital signature MD5 code under test, along with its associated block(s), to the common software library 20. The process then loops back to block 106 to retrieve the next digital signature MD5 code.
  • FIGS. 6A and 6B show the present inventive steps for allowing a user to use multiple encryption keys to transmit data for back up, thereby increasing security.
  • the digital signature MD5 code for each block for which backup has been selected by the steps disclosed above in FIGS. 2-4 is determined and attached to its respective data block, and then the digital signature MD5 code with data block is compressed using compression principles known in the art.
  • the data block, but not the MD5 code is encrypted with a user-selected key on a user key list. From block 116, the process ends at state 120, it being understood that further transmission steps as previously disclosed can be undertaken.
  • the user computer invokes the steps shown in FIG. 6B.
  • the user computer receives the blocks to be restored from the data center and then undertakes the following steps for each block.
  • the computer selects the first encryption key in the list, and then at block 128 the computer decrypts and decompresses the block.
  • decision diamond 130 the user computer determines whether decompression failed as indicated by, e.g., data overflow. If decompression failed, the process moves to block 132 to select the next key in the list, and then loops back to block 128 to again attempt decompression.
  • the process moves to block 136 to further validate the key under test by stripping the digital signature MD5 code from the packet and then calculating anew the digital signature MD5 code of the data that is associated with the decrypted block.
  • the digital signature MD5 code is a 128 bit-long string that is appended to the data with which it is associated in a known location, and that consequently the digital signature MD5 code easily can be stripped.
  • the calculated digital signature MD5 code of the decrypted data is compared to the decrypted digital signature MD5 code that had been stripped from the data at block 136. If the two digital signature MD5 codes match each other at decision diamond 138, the key under test has been validated and the process returns "correct key" at block 140. Then the process retrieves the next data portion at block 124. Otherwise, the process loops back to block 132 to retrieve the next key in the user computer's key list.
  • FIG. 7A shows a method for providing a unique, session-specific session key for encryption purposes.
  • the digital signature MD5 code for the entire set of data to be transmitted is calculated by the computer that is to transmit the data.
  • the digital signature MD5 code is used as the encryption key for the data transmission session.
  • the two 64 bit halves of the 128 bit-long digital signature MD5 code are combined using an exclusive-OR (XOR) operation and then 8 bits, e.g., the last 8 bits, of the resulting 64 bit string are discarded.
  • XOR exclusive-OR
  • the present logic follows two branches, denoted in the Figures as "A" and "B". More specifically, at block 146A the data is encrypted using the digital signature MD5 code (or portion thereof) as a random number, and then at block 148A the data is transmitted to the receiving computer. On the other hand, at block 146B the portion of the digital signature MD5 code that is to serve as the session key (i.e., the portion of the digital signature MD5 code that is used at block 146A to encrypt the data) is transferred from the transmitting computer to the receiving computer using conventional private key/public key encryption principles, or more preferably using Diffie-Helman encryption principles.
  • the digital signature code can be encrypted by the transmitting computer using the receiving computer's public key, and then at block 148B the receiving computer can decrypt the digital signature code using its private key. Then, as shown in FIG. 7A, block 150 receives the results of blocks 148A and 148B to decrypt the data using the digital signature code session key.
  • FIG. 7B additionally shows that the present logic can validate the data received at block 152 by undertaking the validation steps shown at blocks 152-158. More particularly, at block 152 the receiving computer calculates the digital signature code of the data that is associated with the decrypted data from block 150. Next, at decision diamond 154 the calculated digital signature code of the decrypted data is compared to the digital signature code that had been received and decrypted at block 148B. It is to be understood that the in the event that only 56 bits of an XOR'd digital signature MD5 code are used as the session key, the comparison at decision diamond 154 can be between the session key and an XOR'd version of the digital signature MD5 code calculated at block 152.
  • a method is shown for continuous data backup via a wide area computer network while a user computer is online with the network and while a user is using the user computer.
  • the process moves to decision diamond 162.
  • the process determines whether the user computer is online with the network, e.g., the network 14 shown in FIG. 1. If not, the process moves to block 164 to store the data for transmission thereof when the computer is online.
  • the process determines that the user computer is online with the network
  • the present logic moves to decision diamond 166 to determine whether the central processing unit (CPU) of the user computer is busy. Specifically, the determination at decision diamond 166 is whether the CPU is busy with a task that has a higher priority assigned to it than the priority of the backup task shown in FIG. 8. If the CPU is not occupied with another task of higher priority than the present data backup task, the process moves to block 168 to transmit the block until the entire block has been transmitted, or until the computer becomes reoccupied with a task of higher priority than backup.
  • CPU central processing unit
  • the loop between block 170 and decision diamond 166, and the "busy" loop at block 168, illustrate the principles of the above discussion.
  • the logic can proceed to block 170 to wait a predetermined period .increment.t, and then retry the determination at decision diamond 166.
  • the length of .increment.t is very short, on the order of milliseconds or microseconds.
  • the logic can essentially define ".increment.t" to be "whenever the CPU is idle", and when the CPU is idle, proceed to block 168 to transmit data.
  • the CPU might receive a data input via a key stroke by the user of the computer, and if so, the process idles at block 170 until the key stroke is completed, before transmitting data at block 168.
  • the present process transmits backup data between CPU tasks that support a user's needs, e.g., between key strokes, with the data that is changed by the key strokes being determined as backup data and transmitted offsite via the Internet for storage virtually real time as the data is being created by the user of the computer.

Abstract

A system backs up computer files to a remote site via modem, LAN, WAN, or other network device 16. Files of a user computer that are found in a common library at the remote site initially are not copied to the remote site, whereas files not in the library are copied to the remote site and potentially automatically added to the library when a sufficient number of computers contain the files. Then, periodically or continuously if the user is online via the modem, the user computer determines which blocks have been changed, and the user computer transmits only changed blocks to the remote site. The blocks are gathered in "chunk" files, and when a chunk file reaches a predetermined size, it is transmitted to the remote site for updating the back up version of the respective file. The process then resumes identifying changed blocks. In addition to flagging the changed block for transfer, the process resynchronizes the local data file with the backed up version using a two-step comparison, first comparing the first two characters in the block with a pre-stored character set, and then, if the first comparison results in a match, comparing a digital signature of the changed block with a pre-stored signature. If either comparison results in a mismatch, the test is repeated using, as the first byte of the test block, the next byte in the sequence. Novel encryption procedures are also disclosed.

Description

RELATED APPLICATIONS
The present application is a continuation-in-part of U.S. patent Ser. No. 08/757,134, filed Dec. 3, 1996, U.S. Pat. No. 5,794,254 granted Aug. 11, 1998, for an invention entitled "SYSTEM AND METHOD FOR BACKING UP COMPUTER FILES OVER A WIDE AREA COMPUTER NETWORK", from which priority is claimed.
FIELD OF THE INVENTION
The present invention relates generally to computer file back up systems, and more particularly to automated computer file back up systems.
BACKGROUND
Extraordinarily large amounts of information are stored on electronic media, such as personal computer hard disk drives and other well-known data storage media. Increasingly, the information is exclusively stored on such media; no "hard" (i.e., paper) copies exist for much information. It can readily be appreciated that such so-called "paperless" information storage systems consume less space and virtually no natural resources, in contrast to systems that require information to be stored on paper. Unfortunately, it is not uncommon for electronic data storage media to "crash", i.e., to become damaged or otherwise lose its information. Such information loss can be devastating to the information owner and highly costly.
Accordingly, many computer system managers routinely make back up copies of computer files. Typically, once each day (or at some other specified periodicity) a system manager will cause the computer system to copy files that are resident on the hard disk drives of the system onto storage media such as other hard disk drives or magnetic tape. Should a file or entire hard disk drive in the system be damaged, lost, or otherwise rendered inaccessible, the back up copy of the file that is stored on, e.g., the storage tape can be copied back into the system.
Such back up systems suffer from several drawbacks. First, they usually require user interaction to initiate the back up process, and consequently occupy the time of a person who must undertake the tedious chore. Further, systems which back up computer files and store the files in the same premises as the system being backed up are of little value in the event of a catastrophe such as a fire or flood. This is because the backed up file copies can be expected to perish along with the files that are resident in the system, thereby rendering the back up system a failure.
An example of an alternate back up system is disclosed in U.S. Pat. No. 5,479,654, which teaches sending changed portions of computer files via modem to an off-site electronic storage facility. As taught in the 5,479,654 patent, segments of computer files are checked for changes that have been made since the previous back up. This check first consists of comparing a so-called "exclusive-OR" ("XOR") product of the file against an XOR value that was calculated previously for the stored back up version of the file. If the XORs do not agree, a file change is indicated, and the changed portion of the file is sent via modem, LAN, WAN, or other network device to the off-site facility.
On the other hand, if the XORs do agree, a second, more rigorous check is made to determine whether the segment has been changed since the last back up. The second check consists of comparing a cyclic redundancy check (CRC) product of the segment against a previously recorded CRC value. If the CRCs do not match, a file change is indicated, and the changed portion of the file is sent via modem, LAN, WAN, or other network device to the off-site facility. Otherwise, the segment is assumed to have not been changed since the last back up.
Unfortunately, the system disclosed in the 5,479,654 patent can require two calculations per check. Each calculation consumes computing time, and as recognized by the present invention, it is consequently desirable to minimize the number of calculations undertaken to determine whether a change has been made to a computer file. As further recognized herein, it is nevertheless desirable to undertake an initial "quick and dirty" check to reduce the number of computationally rigorous checks that must be made.
Moreover, as recognized by the present invention it is necessary to manage the transmission of data to the off-site storage facility, to avoid one user interfering with the back up operations of another user. Still further, the present invention recognizes that some files need not initially be backed up at all. And, the present invention recognizes that file restoration can be provided for in more than one way to reduce the time required to restore lost files to a computer system.
In addition to the above considerations, the present invention recognizes that data backup undertaken at predetermined intervals might risk failing to backup data that is generated and lost between the intervals. Fortunately, the present invention recognizes that data backup via wide area networks can be undertaken as a user is actively using a computer, thereby providing real time and, hence, almost completely up to date data backup.
Moreover, when data is to be transmitted via wide area networks, the data is preferably first encrypted for security reasons. The present invention recognizes that the longer a user employs a single key for encrypting data, the greater the risk that the key will be broken. For this reason, it is preferably to use more than one key to encrypt data. Unfortunately, in the context of data backup over time, the present invention understands that the use of more than one key over a period of time is rendered somewhat difficult because it is difficult during subsequent decryption to associate the correct encryption key with the blocks that were originally encrypted with the key. As recognized herein, however, it is possible to facilitate the use of more than one encryption key to thereby reduce the risk of encryption system compromise. The present invention still further recognizes that in addition, it is desirable and possible to provide a session-unique key for each transmission session that cannot be broken unless every bit of data to be transmitted is known a priori.
Accordingly, it is an object of the present invention to provide a system and method for backing up computer files at predetermined intervals or continuously, as he computer is being used. Another object of the present invention is to provide a system and method for backing up computer files to a remote facility via modem, LAN, WAN, or other network device. Still another object of the present invention is to provide a system and method for backing up computer files by automatically sending only changed portions of the files to an off-site facility. Yet another object of the present invention is to provide a system and method for backing up computer files that is easy to use and cost-effective. Another object of the present invention is to provide easy to use and effective encryption processes for backing up data via a wide area network. Yet another object of the present invention is to provide a data backup system which automatically updates a centrally stored common software library.
SUMMARY OF THE INVENTION
A computer program product includes a computer program storage device that is readable by a digital processing system and a program means on the program storage device. As intended by the present invention, the program device is realized in a critical machine component that causes the digital processing system to perform method steps to back up at least some blocks in at least one local computer file from at least one user computer in a computer network. In accordance with the present invention, the method steps include transmitting the local file to a data center that is remote from the user computer to render a remote version, with the data center being accessible via the computer network. For each of the blocks (i.e., each sequence of two thousand (2000) or so bytes) in the local file, two respective characters thereof are copied, with the characters defining a respective first comparison value. Additionally, the method steps include generating respective digital signature codes defining second comparison values.
Moreover, the method steps include, for each block, periodically determining a test digital signature code and comparing it to the respective second comparison value. A block of the local file is designated as a transmission block to be transmitted to the data center to thereby update the remote version when the test digital signature code of the block does not equal the second comparison value of the block.
In a preferred embodiment, the method steps further include determining whether the first two characters of the block, starting at a test bytei, equal one of the first comparison values, wherein i=a natural number, when the test digital signature code of a block does not equal the respective second comparison value. Furthermore, the method steps include determining a block digital signature code using the test bytei as the first byte of a test block when the first two characters of the block equal one of the first comparison values, and determining whether the block digital signature code equals one of the second comparison values. When the block digital signature code equals one of the second comparison values, the method returns "resynchronized". Otherwise, when either the first two characters of the block do not equal one of the first comparison values, or when the block digital signature code does not equal one of the second comparison values, the method sets i=i+1 and repeats the steps of this paragraph.
The method steps may further include gathering transmission blocks in a transfer chunk. When the size of the transfer chunk equals a predetermined size, the transfer chunk is transmitted to the data center. Preferably, transfer chunks are transmitted to the data center only during a predetermined period. Still further, the method steps may additionally include, for each local file block, determining whether a duplicate of the local file block is stored in a common file library at the data center. The file block is copied to the data center only when a duplicate of the local file block is not stored in a common file library.
In preferred embodiments, the method steps yet further include periodically copying remote versions of file blocks at the data center onto a portable data storage medium. Local files are restored by copying remote versions of file blocks from the portable data storage medium when the remote versions that are stored on the portable data storage medium are at least as current as the remote versions stored at the data center, and otherwise they are restored by copying remote versions of the file blocks from the data center.
The computer program product is also disclosed in combination with the digital processing apparatus, and in further combination with the computer network. A computer-implemented method using the above-described steps is also disclosed.
In another aspect, a system is disclosed for backing up files in user computers. The system includes a data center including a library of common computer file blocks, and a plurality of user computers remote from the data center and in communication with the data center. Common library logic means determine, for each user computer, which if any of the file blocks of the user computer are common file blocks by virtue of being contained in the common library. Also, initialization logic means are associated with each user computer for causing file blocks except common file blocks to be transmitted to the data center. And, subfile incremental back up logic means are associated with each user computer for determining changed blocks that have been changed since a predetermined back up time. Changed block signals are generated in response thereto. Chunk transmission logic means are responsive to the subfile incremental logic means for sending only changed blocks to the data center in chunks having a predetermined size. As disclosed in further detail below, synchronizing logic means are responsive to the changed block signals for synchronizing the subfile incremental logic means.
In another aspect, a computer program product includes a computer program storage device readable by a digital processing system and logic means on the program storage device. The logic means include instructions that are executable by the digital processing system for performing method steps for backing up at least some blocks in at least one local computer file from at least one user computer in a computer network. The method steps performed by the computer program product include transmitting the local file to a data center remote from the user computer to render a remote version. The data center is accessible via the computer network and has an electronic storage capacity of at least one hundred terabytes. The local file is transmitted only when the data center does not have an exact copy of the local file.
In another aspect, a system for backing up files in user computers includes a data center including a common library of computer data and a plurality of user computers remote from the data center and in communication with the data center. Further, common library logic means determine which if any of data blocks of the user computers are common computer file blocks by virtue of being contained in the common library, it being understood that computer file blocks that are determined not to be common computer file blocks are non-common file blocks. Common library addition means are provided for adding, to the common library, a non-common file block when the non-common file block appears at least a predetermined number of times in the user computers.
In another aspect, a computer-implemented method for backing up at least one local computer file from at least one user computer in a computer network includes transmitting the local file to a data center remote from the user computer to render a remote version. Changes to a first block of the local file are received, it being understood that the changes are established by a user of the user computer generating input events. Immediately upon receiving the changes, at least the first block is designated as a transmission block to be transmitted to the data center for updating the remote version. The method then includes transmitting at least the first block to the data center substantially while the user is generating input events.
In another aspect, a computer-implemented method for backing up computer file blocks includes, at a user computer, encrypting at least one block using one or more encryption keys. The method further includes storing the one or more encryption keys in a key file, and transmitting the block to a remote data center. Next, the method contemplates, at a restore time, transmitting the block from the data center to the user computer, and selecting one of the keys from the key file. The block is decrypted with the key, and the method then determines whether the decryption was successful. When the decryption is not successful, another key is selected from the key list and the data is decrypted, and the above steps repeated until the decryption is successful.
In another aspect, a method for providing a session key for a transmission session for transmitting data between a transmitting computer and a receiving computer includes generating a digital signature unique to the data, and using the digital signature to encrypt the data.
In another aspect, a method for backing up data in a system including user computers and at least one data storage facility includes, at a user computer, determining whether data associated with the user computer is resident in a common software library associated with the data storage facility. The data is not backed up when the data is resident in the common software library. Otherwise, the data is automatically added to the common software library when the data is present in at least a predetermined number of user computers in the system.
In another aspect, a method is disclosed for backing up computer data in a computer system including user computers and a data storage facility online with a wide area network. The method includes, for at least one user computer online with the wide area network, as data is entered into the user computer during input events, transmitting at least part of the data to the data storage facility via the wide area network between input events.
The details of the present invention, both as to its structure and operation, can best be understood in reference to the accompanying drawings, in which like reference numerals refer to like parts, and in which:
BRIEF DESCRIPTION OF THE DRAWINGS
FIG. 1 is a schematic diagram of the computer file back up system of the present invention;
FIG. 2 is a flow chart of the initialization process;
FIG. 3 is a flow chart of the subfile incremental back up process;
FIG. 4 is a flow chart of the restore process;
FIG. 5 is a flow chart of the process for automatically updating the common software library;
FIG. 6A is a flow chart of the process for encrypting data to be backed up;
FIG. 6B is a flow chart of the process for decrypting backed up data received from the data center;
FIG. 7A is a flow chart of the process for transferring data from a user computer to a data center using the message digest (MD)-5 comparison value of the data being transferred as an encryption random number;
FIG. 7B is a flow chart of the process for validating the data transfer shown in FIG. 7A; and
FIG. 8 is a flow chart of the process for continuously backing up the computer files of a user computer while a user is using the computer and the computer is online with a remote data center via a wide area network.
DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT
Referring initially to FIG. 1, a system is shown, generally designated 10, which includes a server computer, referred to herein as a data center 12, which is part of a computer network. In the preferred embodiment, the data center 12 is part of the computer network 14 referred to as the Internet, and the data center 12 can communicate with other computers on the network 14 via a modem, LAN, WAN, or other network device 16.
As shown in FIG. 1, the data center 12 includes an electronic data storage repository 18 with an associated common library 20. Preferably, to permit scaling of the system 10 to support a large number of users, the data repository 18 has a multi-terabyte or indeed a petabyte capacity. Preferably, the capacity of the repository 18 is over one hundred terabytes, and more preferably is five hundred terabytes or more. A control module 22 controls the flow of data into and out of the repository 18 in consonance with the novel logic described below.
Per the present invention, the common library 20 stores the blocks of computer files that are widely distributed and used by many users and, hence, that are common to many users. For example, the common library 20 can store copies of well-known off-the-shelf personal computer operating systems, network browsers, and so on. The common library 20 can be automatically updated as discussed further below in reference to FIG. 5. As intended by the present invention, the data storage repository 18 with common library 20 can be physically implemented in a suitable medium, such as magnetic tape, hard disk arrays, compact disk read-only memory (CD-ROM), DASD arrays, or even floppy diskettes.
Plural user computers 24, e.g., personal computers or laptop computers, communicate with the network 14 via respective user modems 26 or other appropriate network connection in accordance with well-known principles. In the preferred embodiment, each user computer 24 includes a respective software-implemented back up module 28 that undertakes the inventive steps of the present invention. It is to be understood that in another embodiment, the back up modules 28 can be remote from, but accessible to, the user computers 24. Additionally, each user computer 24 preferably includes a respective CD-ROM drive 30 for purposes to be made clear shortly. As those skilled in the art will recognize, the user computers 24 can also be associated with respective video monitors, printers, and other output devices, and keyboards, keypads, mice, and other input devices.
FIGS. 2-8 illustrate the structure of the back up module 30 of the present invention as embodied in computer program software. Those skilled in the art will appreciate that the Figures illustrate the structures of logic elements, such as computer program code elements or electronic logic circuits, that function according to this invention. Manifestly, the invention is practiced in its essential embodiment by a machine component that renders the logic elements in a form that instructs a digital processing apparatus (that is, a computer) to perform a sequence of function steps corresponding to those shown in the Figures.
These instructions may reside on a program storage device including a data storage medium to establish a computer program product, such as a programmed computer diskette. Alternatively, such media can also be found in semiconductor devices, on magnetic tape, on optical disks, on a DASD array, on a conventional hard disk drive, on electronic read-only memory or on electronic random access memory, or other appropriate data storage device. In an illustrative embodiment of the invention, the logic means are computer-executable instructions that are written in C++ language code.
Referring now to FIG. 2, the initialization process can be understood. The process begins at start state 32 for each user computer 24 and moves to block 34, wherein a file index counter "j" is set equal to unity. Moving to decision diamond 36, it is determined whether a copy of the jth file, using the so-called "message digest five" ("MD5") code discussed further below, is already stored in the common library 20. This can be done on a file-by-file basis by comparing the MD5 of each jth file to the MD5s of the files in the common library 20. For each file that is stored in the common library 20, as indicated by a match of a file's MD5 with one of the MD5s of the files in the library 20, the process moves to block 38 to flag the file as being in the common library and, hence, not a candidate for wholesale transfer to the data center 12.
Alternatively, greater granularity can be achieved by comparing the MD5 of each block of each jth file to a list of common block MD5s, and flagging only those blocks having MD5s that match an MD5 on the common library 20 list of MD5s as being in the common library and, hence, not candidates for wholesale transfer to the data center 12. Thus, it is to be understood that in the event that the jth file as it exists in the user computer 24 has been modified from the common, off-the-shelf version in the common library 20, the modified portions will be detected and transferred to the data center 12 during the subfile incremental procedure discussed below in reference to FIG. 3.
In any case, it can now be appreciated that the use of the common library 20 avoids the necessity of transmitting to the data center 12 very large, off-the-shelf computer files that are resident on many if not most personal computer systems and that would otherwise require backing up. In other words, the common library 20 reduces the time and cost of making initial copies of the files in the user computers 24.
Additionally, at block 38 a block-by-block comparison value listing is recorded that has two entries for every blocki (wherein "i" is a block index counter) of every filesj flagged as being in the common library 20. The first entry is the first two characters ("XX1 ") of the first byte of each blocki. In contrast, the second entry is a digital signature ("MD5i "), the value of which is uniquely defined by the contents of the blocki. The first entry establishes a first comparison value, and the second entry establishes a second comparison value.
As mentioned above, in the presently preferred embodiment, the digital signature used is the 128 bit-long "message digest five" ("MD5") code known in the cryptology art. As recognized by the present invention, the MD5 code of a block changes a great deal with even very small changes of the block. Thus, the digital signature of the present invention advantageously is more sensitive to changes in a block than are check sums, cyclic redundancy codes, and so-called "exclusive or" codes.
From block 38, the logic proceeds to decision diamond 40 to determine whether the last file in the user computer 24 has been tested. If so, the initialization process terminates at state 42. Otherwise, the process moves to block 44 to increment by one the file index counter "j", and then to loop back to decision diamond 36.
In contrast, if it is determined at decision diamond 36 that a copy of the ith block of the jth file is not in the common library 20, the logic proceeds to block 46 to trickle copy each non-resident blocki of the filej to the storage repository 18 of the data center 12. By "trickle copy" is meant that the non-resident blocks of the file are transmitted to the data center 12 during a predetermined transmission period, also referred to herein as a "window", until the entire non-resident portions file has been transmitted, or until the transmission period elapses. Alternatively, as discussed below in reference to FIG. 8, the data to be backed up can be transmitted real time as the user inputs data to the user computer 24, provided the user computer 24 is online with the network 14.
If the transmission period elapses before the non-resident portion of the file has been transmitted in its entirety, the respective back up module 28 completes the transmission of the chunk, and then terminates. If the transmission is interrupted during the transmission period, the module 28 notes the value of the block index counter "i", such that transmission can be resumed from there during the next transmission period. Then, at the start of the next transmission period, the back up module 28 increments the block index counter "i" by unity and commences transmitting the filej where it previously had left off. As indicated in FIG. 2, the block-by-block listing of XXi and MD5i is generated at block 46 as described previously in reference to block 38. From block 46, the logic proceeds to decision diamond 40 to function as previously described.
Turning now to FIG. 3, the subfile incremental back up logic can be seen. It is to be understood that as intended by the present invention, periodically (e.g., on a daily basis) the back up module 28 of each user computer 24 will undertake the process discussed below.
Commencing at decision diamond 48, it is determined whether a user-selected predetermined back up period has commenced. When it has, the process moves to block 50 to enter a do loop for each filej that has been designated by the user to be backed up. Then, the process moves to block 52 to enter a do loop for each blocki of the filej that was selected at block 50.
Moving to block 54, the back up module 28 computes the digital signature MD5i new of the blocki under test. Next, the digital signature MD5i new of the blocki under test is compared, at decision diamond 56, with the digital signature MD5i old that is recorded in the listing that was generated in FIG. 2, as described above.
It is to be understood that the listing of digital signatures MD5old is an ordered list from i=0 to n, wherein n=number of digital signatures in the list. It is to be further understood that the digital signature MD5i new of the blocki under test is first compared to the ith MD5old in the list, and if it doesn't match, the digital signature MD5i new of the blocki under test is next compared to the next MD5old in the ordered list (i.e., the ith +1 MD5old), and so on, until a match is found, or until the end of the ordered list has been reached. When a match is found anywhere in the list, the test at decision diamond 56 is positive, and the process moves to decision diamond 58. Also, when a match is found at the lth position in the list, wherein l>i, the comparison of the MD5 i+ 1new of the ith +1 block to be tested is first compared to MD5 l+ 1old, i.e., to the lth +1 element in the ordered list of old digital signatures. If no match is found, the MD5 i+ 1new of the ith +1 block is then serially tested against the digital signatures in the list after the lth +1 element in the ordered list of old digital signatures.
Returning to the discussion of the ith block, when a digital signature match is found, indicating that no change was made to the blocki under test since the last back up, the process moves to decision diamond 58. At decision diamond 58, the logic determines whether the last block of the filej has been tested, and if not, the process loops back to retrieve the next blocki and compute its digital signature as shown and described. In other words, if further blocks exist for the filej under test, the index counter "i" is incremented by one, and the process loops back to block 54.
If, on the other hand, the last block of the filej has been tested, the logic determines, at decision diamond 60, whether the last file in the user computer 24 has been tested. If not, the process loops back to retrieve the next file and proceed as before to test the blocks in the file. Otherwise, the process moves to decision diamond 62.
At decision diamond 62, it is determined whether the back up period has not elapsed, i.e., whether the allocated back up window is still open. If the period has elapsed, the logic loops back to decision diamond 48 to await the commencement of the next allocated back up period. In contrast, if the back up window is still open, the logic attempts to establish communication with the data center 12 via the modems 26, 16 (or other network devices, such as LANs, WANs, etc.).
At decision diamond 64, it is determined whether the communication connection was successful. If it wasn't, the logic moves to block 66 to wait a predetermined time-out period (preferably five minutes) before looping back to decision diamond 62 to retry the connection if the window is still open. In any case, when it is determined that communication has been established between the user computer 24 and the data center 12 subsequent to testing all filesj in the user computer 12, the logic encrypts and compresses any remaining updated blocks (described further below) and transmits them to the data center 12, at block 68. From block 68, the process loops back to decision diamond 48.
Returning to the negative loop originating at decision diamond 56, when the digital signature of the blocki does not match one of the signatures stored in the listing for the block, a change to the blocki is indicated, and the blocki therefore becomes a candidate for back up. Additionally, the process resynchronizes itself with the block-by-block comparison value listing discussed above.
Accordingly, after a negative test at decision diamond 56 (i.e., when the logic essentially returns a "changed block" signal), the logic moves to block 70, wherein a do loop is entered for the bytesk of the block1. "k" is an index counter initially equal to unity. Moving to decision diamond 72, the logic determines whether the first two characters ("XXk new ") of the block starting with the bytek under test equal the first comparison value ("XXold ") that corresponds to the blocki in the comparison value listing. It is to be understood that the comparison value listing parallels the ordered list of digital signatures MD5old. Consequently, if the first comparison does not yield a match, similar to the testing at decision diamond 56 the two characters XXk new are compared to the next two characters XXold in the ordered list until a match is found or until the end of the list is reached.
When the end of the list of comparison values is reached without a match, the logic moves to decision diamond 73 to determine whether the current bytek is the last byte of the candidate block under test. Stated differently, at decision diamond 73 the logic determines whether the byte index counter "k" equals about two thousand forty eight (2048). If it does, the logic moves to block 74 to increment the block index counter "i" by unity, and then the logic loops proceeds to decision diamond 58. In this instance, the entire block under test is flagged for copying. If it is determined that the byte index counter "k" does not equal one thousand at decision diamond 73, the logic loops back to increment "k" by unity and test the next byte in the blocki at decision diamond 72.
If, however, it is determined at decision diamond 72 that "XXk new " equals an "XXold ", the logic proceeds to decision diamond 75. At decision diamond 75, the digital signature MD5 of the block having as its first byte the bytek under test is determined and compared to the second comparison value, i.e., the digital signature MD5old that corresponds to the matched characters XXold in the comparison value listing. As was the case with the test at decision diamond 56, the digital signature MD5 being tested is tested against subsequent digital signatures MD5old in the ordered list, if necessary, to find a match. If no match is found, the logic moves to decision diamond 73.
If, however, the digital signature MD5 of the block having as its first byte the bytek under test is determined to be equal to one of the digital signatures MD5old in the ordered list at decision diamond 75, the logic returns "resynchronized" and moves to block 76. In other words, a positive test at decision diamond 75 indicates that the logic has found an old, unchanged block that previously has been backed up, and, hence, that the logic is resynchronized with the comparison value listing.
At block 76, the changed block(s) (also referred to herein as "transmission blocks") are moved to a "next chunk" file. Additionally, at block 76 the comparison value listing is updated to include the first two characters and digital signatures of the changed block(s), for use as the first and second comparison values, respectively, during the test of the blocks during the next back up cycle. Moving to decision diamond 78, it is determined whether the chunk file is full. In the presently preferred embodiment, the chunk file is full when its size is five megabytes (5 MB).
If the chunk file is not full, the logic returns to decision diamond 58. In contrast, if the chunk file is full, the process moves to decision diamond 80 to determine whether the back up period has not elapsed, i.e., whether the allocated back up window is still open. If the period has elapsed, the logic loops back to decision diamond 48 to await the commencement of the next allocated back up period. In contrast, if the back up window is still open, the logic transmits the chunk when a successful connection has been established with the data center 12, using the procedure of steps 64 and 66 discussed above. From block 82, the logic returns to decision diamond 58.
FIG. 4 shows the logic by which lost files may be restored to a user computer 24. It is to be understood that as envisioned herein, backed up file blocks at the data center 12 are periodically (e.g., weekly, monthly, quarterly, etc.) copied from the storage repository 18 to CD-ROM disks, and the disks then transported to the same physical location as the associated user computer 24. Hereinafter, these CD-ROM disks are referred to as "local back up disks".
At block 84, a directory of blocks in the data center 12 that correspond to the user computer 24 is downloaded from the data center 12. Next, at block 86 a do loop is entered for each blocki of the lost file(s) as follows. It is determined at decision diamond 88 whether the requested version of the block of the file (ordinarily the latest version) is on the local back up disks. If so, the block is preferentially restored from the local back up disks at block 90. Otherwise, it is restored via the modems 26, 16 from the data center 12 at block 92. From block 90 or block 92, the logic loops back to retrieve the next blocki, and continues this process until the entire requested files have been restored.
Further inventive features of the present invention can be appreciated in reference to FIGS. 5-8. With particular regard to FIG. 5, the common software library 20 shown in FIG. 1 may be automatically updated by commencing at block 100 in FIG. 5. At block 100, the server (i.e., at the data center 12 shown in FIG. 1) undertakes, for each digital signature MD5 code with its attendant block received for storage, a loop by moving to decision diamond 102, wherein it is determined whether the digital signature MD5 code under test has been recorded as a candidate for the library 20. The first time the particular digital signature MD5 code is received from any user computer, the test at decision diamond 102 is negative, and the process consequently moves to block 104 to record the digital signature MD5 code on a candidate list. Also, a counter JMD5 for the particular digital signature MD5 code under test is set equal to unity at block 104. Moving from block 104 to block 106, the next received digital signature MD5 code is retrieved and the above process repeated as indicated by the dashed loop back line in FIG. 5.
On the other hand, when the digital signature MD5 code previously has been received from a user computer, the test at decision diamond 102 is positive, and the process moves to block 108 to increment the counter JMD5 that is associated with the digital signature MD5 code under test by one. It is to be understood that the counter JMD5 that is associated with the digital signature MD5 code under test can be incremented whenever the digital signature MD5 code under test is received from any user computer, or only when it received from a user computer that has not previously sent the digital signature MD5 code under test.
Then, at decision diamond 110 it is determined whether the value of the counter JMD5 that is associated with the digital signature MD5 code under test equals a predetermined value "k". If not, the process loops back to block 106, but otherwise the process moves to block 112 to add the digital signature MD5 code under test, along with its associated block(s), to the common software library 20. The process then loops back to block 106 to retrieve the next digital signature MD5 code.
FIGS. 6A and 6B show the present inventive steps for allowing a user to use multiple encryption keys to transmit data for back up, thereby increasing security. Commencing at block 114, at a user computer, the digital signature MD5 code for each block for which backup has been selected by the steps disclosed above in FIGS. 2-4 is determined and attached to its respective data block, and then the digital signature MD5 code with data block is compressed using compression principles known in the art. Moving to block 116, the data block, but not the MD5 code, is encrypted with a user-selected key on a user key list. From block 116, the process ends at state 120, it being understood that further transmission steps as previously disclosed can be undertaken.
Subsequently, when the user computer, for example, loses the data packet processed by the steps shown in FIG. 6A and accordingly requests and receives a copy of the packet from the data center 12, the user computer invokes the steps shown in FIG. 6B. Commencing at block 124, the user computer receives the blocks to be restored from the data center and then undertakes the following steps for each block. At block 126, the computer selects the first encryption key in the list, and then at block 128 the computer decrypts and decompresses the block. Moving to decision diamond 130, the user computer determines whether decompression failed as indicated by, e.g., data overflow. If decompression failed, the process moves to block 132 to select the next key in the list, and then loops back to block 128 to again attempt decompression.
If, on the other hand, decompression is determined at decision diamond 130 to have been executed satisfactorily, the process moves to block 136 to further validate the key under test by stripping the digital signature MD5 code from the packet and then calculating anew the digital signature MD5 code of the data that is associated with the decrypted block. In undertaking the process at block 136, the present invention understands that the digital signature MD5 code is a 128 bit-long string that is appended to the data with which it is associated in a known location, and that consequently the digital signature MD5 code easily can be stripped.
At decision diamond 138 the calculated digital signature MD5 code of the decrypted data is compared to the decrypted digital signature MD5 code that had been stripped from the data at block 136. If the two digital signature MD5 codes match each other at decision diamond 138, the key under test has been validated and the process returns "correct key" at block 140. Then the process retrieves the next data portion at block 124. Otherwise, the process loops back to block 132 to retrieve the next key in the user computer's key list.
FIG. 7A shows a method for providing a unique, session-specific session key for encryption purposes. Commencing at block 142, the digital signature MD5 code for the entire set of data to be transmitted is calculated by the computer that is to transmit the data. Moving to block 144, the digital signature MD5 code is used as the encryption key for the data transmission session. In the event that only 56 bits are required for the key, the two 64 bit halves of the 128 bit-long digital signature MD5 code are combined using an exclusive-OR (XOR) operation and then 8 bits, e.g., the last 8 bits, of the resulting 64 bit string are discarded.
From block 144, the present logic follows two branches, denoted in the Figures as "A" and "B". More specifically, at block 146A the data is encrypted using the digital signature MD5 code (or portion thereof) as a random number, and then at block 148A the data is transmitted to the receiving computer. On the other hand, at block 146B the portion of the digital signature MD5 code that is to serve as the session key (i.e., the portion of the digital signature MD5 code that is used at block 146A to encrypt the data) is transferred from the transmitting computer to the receiving computer using conventional private key/public key encryption principles, or more preferably using Diffie-Helman encryption principles. That is, at block 146B the digital signature code can be encrypted by the transmitting computer using the receiving computer's public key, and then at block 148B the receiving computer can decrypt the digital signature code using its private key. Then, as shown in FIG. 7A, block 150 receives the results of blocks 148A and 148B to decrypt the data using the digital signature code session key.
FIG. 7B additionally shows that the present logic can validate the data received at block 152 by undertaking the validation steps shown at blocks 152-158. More particularly, at block 152 the receiving computer calculates the digital signature code of the data that is associated with the decrypted data from block 150. Next, at decision diamond 154 the calculated digital signature code of the decrypted data is compared to the digital signature code that had been received and decrypted at block 148B. It is to be understood that the in the event that only 56 bits of an XOR'd digital signature MD5 code are used as the session key, the comparison at decision diamond 154 can be between the session key and an XOR'd version of the digital signature MD5 code calculated at block 152.
In any case, if the two digital signature codes match each other at decision diamond 154, the session key under test has been validated and the process returns "transmission validated" at block 156. Otherwise, the process returns "transmission error" at block 158.
Now referring to FIG. 8, a method is shown for continuous data backup via a wide area computer network while a user computer is online with the network and while a user is using the user computer. Commencing at block 160, upon determination by a user computer that a changed data block is to be backed up in accordance with the principles discussed above (with the exception that a backup start time is not waited for but rather that the process perpetually determines, real-time, data blocks to be backed up), the process moves to decision diamond 162. At decision diamond 162, the process determines whether the user computer is online with the network, e.g., the network 14 shown in FIG. 1. If not, the process moves to block 164 to store the data for transmission thereof when the computer is online.
On the other hand, if, at decision diamond 162, the process determines that the user computer is online with the network, the present logic moves to decision diamond 166 to determine whether the central processing unit (CPU) of the user computer is busy. Specifically, the determination at decision diamond 166 is whether the CPU is busy with a task that has a higher priority assigned to it than the priority of the backup task shown in FIG. 8. If the CPU is not occupied with another task of higher priority than the present data backup task, the process moves to block 168 to transmit the block until the entire block has been transmitted, or until the computer becomes reoccupied with a task of higher priority than backup. It is to be understood that when transmission of a block is interrupted by a higher priority task, the user computer undertakes the higher priority task, and when finished returns to the present data backup task to complete transmission of the data block. The logic then moves to block 169 to determine further blocks for transmission, and to transmit the blocks virtually real-time as the user uses the computer for other tasks.
The loop between block 170 and decision diamond 166, and the "busy" loop at block 168, illustrate the principles of the above discussion. When the CPU is busy at decision diamond 166 with, e.g., the requirement to execute an input event such as a key stroke, the logic can proceed to block 170 to wait a predetermined period .increment.t, and then retry the determination at decision diamond 166. It will be appreciated that the length of .increment.t is very short, on the order of milliseconds or microseconds. Alternatively, the logic can essentially define ".increment.t" to be "whenever the CPU is idle", and when the CPU is idle, proceed to block 168 to transmit data.
For example, during the test at decision diamond 166, the CPU might receive a data input via a key stroke by the user of the computer, and if so, the process idles at block 170 until the key stroke is completed, before transmitting data at block 168. Thus, in one embodiment the present process transmits backup data between CPU tasks that support a user's needs, e.g., between key strokes, with the data that is changed by the key strokes being determined as backup data and transmitted offsite via the Internet for storage virtually real time as the data is being created by the user of the computer.
While the particular SYSTEM AND METHOD FOR BACKING UP COMPUTER FILES OVER A WIDE AREA COMPUTER NETWORK as herein shown and described in detail is fully capable of attaining the above-described objects of the invention, it is to be understood that it is the presently preferred embodiment of the present invention and is thus representative of the subject matter which is broadly contemplated by the present invention, that the scope of the present invention fully encompasses other embodiments which may become obvious to those skilled in the art, and that the scope of the present invention is accordingly to be limited by nothing other than the appended claims, in which reference to an element in the singular is not intended to mean "one and only one" unless explicitly so stated, but rather "one or more".

Claims (10)

What is claimed is:
1. A system for backing up files in user computers, comprising:
a data center including a common library of computer data;
a plurality of user computers remote from the data center and in communication with the data center;
common library logic means for determining which if any of data blocks of the user computers are common computer file blocks by virtue of being contained in the common library, computer file blocks that are determined not to be common computer file blocks being non-common file blocks; and
common library addition means for adding to the common library a non-common file block when the non-common file block appears at least a predetermined number of times in the user computers.
2. The system of claim 1, further comprising:
initialization logic means associated with each user computer for causing file blocks except common file blocks to be transmitted to the data center;
subfile incremental back up logic means associated with each user computer for determining changed blocks that have been changed since a predetermined back up time and generating changed block signals in response thereto;
chunk transmission logic means responsive to the subfile incremental logic means for sending only changed blocks to the data center in chunks having a predetermined size;
synchronizing logic means responsive to the changed block signals for synchronizing the subfile incremental logic means; and
logic means for generating a listing of the blocks of the files stored at the data center and, associated with each block, the first two characters of the block and a digital signature of the block, the subfile incremental logic means using the listing to undertake the determining function.
3. The system of claim 1, wherein a non-common file block is added to the common library when the non-common file block appears in at least a predetermined number of user computers.
4. The system of claim 2, further comprising:
logic means for periodically copying remote versions at the data center onto a portable data storage medium; and
logic means for restoring local files by copying remote versions from the portable data storage medium when the remote versions stored on the portable data storage medium are at least as current as the remote versions stored at the data center, and otherwise restoring local files by copying remote versions from the data center.
5. A computer-implemented method for backing up at least one local computer file from at least one user computer in a computer network, comprising:
(a) transmitting the local file to a data center remote from the user computer to render a remote version, the data center being accessible via the computer network; (b) receiving changes to at least a first block of the local file, the changes being established by a user of the user computer generating input events;
(c) immediately upon receiving the changes, designating at least the first block as a transmission block to be transmitted to the data center for updating the remote version;
(d) transmitting at least the first block to the data center substantially while the user is generating input events;
(e) for each of the blocks in the local file copying two respective characters thereof defining respective first comparison values and generating respective digital signature codes defining second comparison values;
(f) for each block, periodically determining a test digital signature code and comparing it to the respective second comparison value: and
(g) when the test digital signature code of a block does not equal the respective second comparison value, determining whether the first two characters of the block equal one of the first comparison values.
6. The computer-implemented method of claim 5, wherein the transmitting step is undertaken between input events.
7. The computer-implemented method of claim 5, further comprising the steps of:
(h) when the first two characters equal one of the first comparison values, determining a block digital signature code using a test bytei as the first byte of a test block;
(i) determining whether the block digital signature code equals one of the second comparison values;
(j) when the block digital signature code equals one of the second comparison values, returning "resynchronized"; otherwise
(k) when either the first two characters of the test bytei do not equal one of the first comparison values, or when the block digital signature code does not equal one of the second comparison values, setting i=i+1 and repeating select steps.
8. The computer-implemented method of claim 7, further comprising the steps of:
for at least some of the local file blocks, determining whether a duplicate of the local file block is stored in a common file block library at the data center; and
undertaking step (a) only when a duplicate of the local file block is not stored in a common file block library.
9. The computer-implemented method of claim 8, further comprising the steps of:
periodically copying remote versions at the data center onto a portable data storage medium; and
restoring local file blocks by copying remote versions from the portable data storage medium when the remote versions stored on the portable data storage medium are at least as current as the remote versions stored at the data center, and otherwise restoring local file blocks by copying remote versions from the data center.
10. A method for backing up data in a system including user computers and at least one data storage facility, comprising the steps of:
at a user computer, determining whether data associated with the user computer is resident in a common software library associated with the data storage facility;
not backing up the data when the data is resident in the common software library; and
automatically adding the data to the common software library when the data is present in at least a predetermined number of user computers in the system.
US09/041,149 1996-12-03 1998-03-12 System and method for backing up computer files over a wide area computer network Expired - Lifetime US6049874A (en)

Priority Applications (7)

Application Number Priority Date Filing Date Title
US09/041,149 US6049874A (en) 1996-12-03 1998-03-12 System and method for backing up computer files over a wide area computer network
US09/100,914 US6038665A (en) 1996-12-03 1998-06-19 System and method for backing up computer files over a wide area computer network
CA002323522A CA2323522A1 (en) 1998-03-12 1999-03-11 System and method for backing up computer files over a wide area computer network
PCT/US1999/005248 WO1999046660A2 (en) 1998-03-12 1999-03-11 System and method for backing up computer files over a wide area computer network
EP99912379A EP1062558A2 (en) 1998-03-12 1999-03-11 System and method for backing up computer files over a wide area computer network
JP2000535980A JP2002507016A (en) 1998-03-12 1999-03-11 System and method for backing up computer files in a wide area computer network
AU30764/99A AU3076499A (en) 1998-03-12 1999-03-11 System and method for backing up computer files over a wide area computer network

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/757,134 US5794254A (en) 1996-12-03 1996-12-03 Incremental computer file backup using a two-step comparison of first two characters in the block and a signature with pre-stored character and signature sets
US09/041,149 US6049874A (en) 1996-12-03 1998-03-12 System and method for backing up computer files over a wide area computer network

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US08/757,134 Continuation-In-Part US5794254A (en) 1996-12-03 1996-12-03 Incremental computer file backup using a two-step comparison of first two characters in the block and a signature with pre-stored character and signature sets

Related Child Applications (1)

Application Number Title Priority Date Filing Date
US09/100,914 Continuation-In-Part US6038665A (en) 1996-12-03 1998-06-19 System and method for backing up computer files over a wide area computer network

Publications (1)

Publication Number Publication Date
US6049874A true US6049874A (en) 2000-04-11

Family

ID=25046505

Family Applications (3)

Application Number Title Priority Date Filing Date
US08/757,134 Expired - Lifetime US5794254A (en) 1996-12-03 1996-12-03 Incremental computer file backup using a two-step comparison of first two characters in the block and a signature with pre-stored character and signature sets
US09/041,149 Expired - Lifetime US6049874A (en) 1996-12-03 1998-03-12 System and method for backing up computer files over a wide area computer network
US09/080,171 Expired - Lifetime US6014676A (en) 1996-12-03 1998-05-16 System and method for backing up computer files over a wide area computer network

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US08/757,134 Expired - Lifetime US5794254A (en) 1996-12-03 1996-12-03 Incremental computer file backup using a two-step comparison of first two characters in the block and a signature with pre-stored character and signature sets

Family Applications After (1)

Application Number Title Priority Date Filing Date
US09/080,171 Expired - Lifetime US6014676A (en) 1996-12-03 1998-05-16 System and method for backing up computer files over a wide area computer network

Country Status (5)

Country Link
US (3) US5794254A (en)
EP (1) EP1012701A4 (en)
JP (1) JP2001508894A (en)
CA (1) CA2273920A1 (en)
WO (1) WO1998027483A1 (en)

Cited By (51)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20020016913A1 (en) * 2000-08-04 2002-02-07 Wheeler Lynn Henry Modifying message data and generating random number digital signature within computer chip
US20020069376A1 (en) * 2000-12-01 2002-06-06 Gregg Leon Edward Method, article of manufacture and apparatus for copying information to a storage medium
US20020073106A1 (en) * 1996-04-12 2002-06-13 Computer Associates Think, Inc. A Dela Corporation Intelligent data inventory and asset management system method and apparatus
US20020174137A1 (en) * 2001-05-15 2002-11-21 Wolff Daniel Joseph Repairing alterations to computer files
US6496949B1 (en) * 1999-08-06 2002-12-17 International Business Machines Corp. Emergency backup system, method and program product therefor
US20030037247A1 (en) * 2000-05-23 2003-02-20 Kiyohiro Obara Computing system and data decryption method and computer system with remote copy facility
US6594677B2 (en) 2000-12-22 2003-07-15 Simdesk Technologies, Inc. Virtual tape storage system and method
WO2003090098A1 (en) * 2001-02-15 2003-10-30 Avica Technology Programming content distribution
US20040078602A1 (en) * 2002-10-10 2004-04-22 Pb&J Software, Llc Method and system for sharing storage space on a computer
US6742028B1 (en) 2000-09-15 2004-05-25 Frank Wang Content management and sharing
US20040186861A1 (en) * 2003-01-17 2004-09-23 Phatak Shirish Hemant Method and system for use of storage caching with a distributed file system
US20040215628A1 (en) * 2003-04-23 2004-10-28 International Business Machines Corporation Apparatus and method to copy computer files
US20040215962A1 (en) * 2000-05-05 2004-10-28 Microsoft Corporation Encryption systems and methods for identifying and coalescing identical objects encrypted with different keys
US20050021950A1 (en) * 2002-10-10 2005-01-27 Pb&J Software, Llc Method and system for sharing storage space on a computer
US20050071380A1 (en) * 2003-09-29 2005-03-31 Micka William F. Apparatus and method to coordinate multiple data storage and retrieval systems
US20050235043A1 (en) * 2004-04-15 2005-10-20 Microsoft Corporation Efficient algorithm and protocol for remote differential compression
US20050256974A1 (en) * 2004-05-13 2005-11-17 Microsoft Corporation Efficient algorithm and protocol for remote differential compression on a remote device
US6968478B1 (en) * 2003-12-18 2005-11-22 Xilinx, Inc. Method and apparatus for data transfer validation
US20050262167A1 (en) * 2004-05-13 2005-11-24 Microsoft Corporation Efficient algorithm and protocol for remote differential compression on a local device
US20060047855A1 (en) * 2004-05-13 2006-03-02 Microsoft Corporation Efficient chunking algorithm
US20060080737A1 (en) * 2004-10-13 2006-04-13 International Business Machines Corporation System and method for reducing virus scan time
US20060085561A1 (en) * 2004-09-24 2006-04-20 Microsoft Corporation Efficient algorithm for finding candidate objects for remote differential compression
US20060143713A1 (en) * 2004-12-28 2006-06-29 International Business Machines Corporation Rapid virus scan using file signature created during file write
US20060155735A1 (en) * 2005-01-07 2006-07-13 Microsoft Corporation Image server
US20060155674A1 (en) * 2005-01-07 2006-07-13 Microsoft Corporation Image server
US20060185017A1 (en) * 2004-12-28 2006-08-17 Lenovo (Singapore) Pte. Ltd. Execution validation using header containing validation data
US20070094348A1 (en) * 2005-01-07 2007-04-26 Microsoft Corporation BITS/RDC integration and BITS enhancements
US20070106714A1 (en) * 2002-10-10 2007-05-10 Rothbarth James N Method and system using an external hard drive to implement back-up files
US20070198659A1 (en) * 2006-01-25 2007-08-23 Lam Wai T Method and system for storing data
US20080022650A1 (en) * 2006-07-28 2008-01-31 Pascoe William M Composite yarn and process for producing the same
US20080215667A1 (en) * 2003-10-09 2008-09-04 Pb&J Software, Llc Method and system for sharing storage space on a computer
US20090006640A1 (en) * 2007-06-28 2009-01-01 Michael Lambertus Hubertus Brouwer Incremental secure backup and restore of user settings and data
US7559011B1 (en) 2006-02-10 2009-07-07 Xilinx, Inc. Circuit having a programmable circuit and method of validating a bitstream loaded into a programmable device
US7734733B1 (en) 2007-06-15 2010-06-08 Packeteer, Inc. WAFS disconnected-mode read-write access
US20100160544A1 (en) * 2005-09-09 2010-06-24 Charles Smith Method of applying silane coating to metal composition
US20100185855A1 (en) * 2000-02-18 2010-07-22 Margolus Norman H Data Repository and Method for Promoting Network Storage of Data
US20100257142A1 (en) * 2009-04-03 2010-10-07 Microsoft Corporation Differential file and system restores from peers and the cloud
US20100257403A1 (en) * 2009-04-03 2010-10-07 Microsoft Corporation Restoration of a system from a set of full and partial delta system snapshots across a distributed system
US20110004513A1 (en) * 2003-02-05 2011-01-06 Hoffberg Steven M System and method
US8041641B1 (en) * 2006-12-19 2011-10-18 Symantec Operating Corporation Backup service and appliance with single-instance storage of encrypted data
US20120123945A1 (en) * 2010-11-17 2012-05-17 Inside Secure Nfc transaction method and system
US20120136832A1 (en) * 2010-11-30 2012-05-31 Network Appliance, Inc. Incremental restore of data between storage systems having dissimilar storage operating systems associated therewith
US20120294200A1 (en) * 2011-05-19 2012-11-22 Wistron Neweb Corporation Wireless communication method
US8346966B1 (en) 2007-07-19 2013-01-01 Blue Coat Systems, Inc. Transparent file system access for wide area network file system acceleration
US8468387B2 (en) 2009-04-03 2013-06-18 Microsoft Corporation Bare metal machine recovery
US8904181B1 (en) 2001-03-23 2014-12-02 David P. Felsher System and method for secure three-party communications
US9135284B1 (en) 2008-03-13 2015-09-15 Blue Coat Systems, Inc. Composite execution of rename operations in wide area file systems
US9442850B1 (en) 2008-03-25 2016-09-13 Blue Coat Systems, Inc. Efficient directory refresh operations in wide area file systems
US9794797B2 (en) 2005-10-04 2017-10-17 Steven M. Hoffberg Multifactorial optimization system and method
US10361802B1 (en) 1999-02-01 2019-07-23 Blanding Hovenweep, Llc Adaptive pattern recognition based control system and method
US11044195B1 (en) 2008-08-21 2021-06-22 United Services Automobile Association (Usaa) Preferential loading in data centers

Families Citing this family (95)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3540511B2 (en) * 1996-06-18 2004-07-07 株式会社東芝 Electronic signature verification device
US6038665A (en) * 1996-12-03 2000-03-14 Fairbanks Systems Group System and method for backing up computer files over a wide area computer network
US5794254A (en) * 1996-12-03 1998-08-11 Fairbanks Systems Group Incremental computer file backup using a two-step comparison of first two characters in the block and a signature with pre-stored character and signature sets
US6374250B2 (en) * 1997-02-03 2002-04-16 International Business Machines Corporation System and method for differential compression of data from a plurality of binary sources
AU6151598A (en) * 1997-02-11 1998-08-26 Connected Corporation File comparison for data backup and file synchronization
KR100195152B1 (en) * 1997-03-31 1999-06-15 윤종용 Method for initializing asymmetrical data transmitter
JP4077907B2 (en) * 1997-08-04 2008-04-23 富士通株式会社 Computer data backup device, data backup method, and computer-readable recording medium recording data backup program
US6199074B1 (en) * 1997-10-09 2001-03-06 International Business Machines Corporation Database backup system ensuring consistency between primary and mirrored backup database copies despite backup interruption
US6101585A (en) * 1997-11-04 2000-08-08 Adaptec, Inc. Mechanism for incremental backup of on-line files
US6018747A (en) * 1997-11-26 2000-01-25 International Business Machines Corporation Method for generating and reconstructing in-place delta files
US6799206B1 (en) 1998-03-31 2004-09-28 Qualcomm, Incorporated System and method for the intelligent management of archival data in a computer network
US6236993B1 (en) * 1998-06-24 2001-05-22 Victor V. Fanberg Computer file comparison method
KR100293937B1 (en) * 1998-07-27 2001-07-12 윤종용 Atomatic database backup method in private exchange
US6243717B1 (en) * 1998-09-01 2001-06-05 Camstar Systems, Inc. System and method for implementing revision management of linked data entities and user dependent terminology
US6571245B2 (en) * 1998-12-07 2003-05-27 Magically, Inc. Virtual desktop in a computer network
CA2262316A1 (en) * 1999-02-22 2000-08-22 Ibm Canada Limited-Ibm Canada Limitee System and method for detecting release-to-release binary compatibility in compiled object code
US20020026321A1 (en) * 1999-02-26 2002-02-28 Sadeg M. Faris Internet-based system and method for fairly and securely enabling timed-constrained competition using globally time-sychronized client subsystems and information servers having microsecond client-event resolution
DE60037119T3 (en) 1999-03-29 2012-10-04 Gotuit Media Corp., ELECTRONIC STORAGE OF MUSIC DATA AND PROGRAMS, WITH THE DETECTION OF PROGRAM SEGMENTS, SUCH AS MUSIC LECTURES RECORDED, AND SYSTEM FOR THE MANAGEMENT AND PLAYING OF SUCH PROGRAM SEGMENTS
US7239226B2 (en) * 2001-07-10 2007-07-03 American Express Travel Related Services Company, Inc. System and method for payment using radio frequency identification in contact and contactless transactions
EP1242890A4 (en) * 1999-10-12 2007-06-20 Cms Peripherals Inc Automatic backup system
US6490596B1 (en) 1999-11-09 2002-12-03 International Business Machines Corporation Method of transmitting streamlined data updates by selectively omitting unchanged data parts
JP4158297B2 (en) * 1999-11-12 2008-10-01 株式会社日立製作所 Method for matching attribute data in distributed system and distributed system
US6341341B1 (en) 1999-12-16 2002-01-22 Adaptec, Inc. System and method for disk control with snapshot feature including read-write snapshot half
US6460054B1 (en) 1999-12-16 2002-10-01 Adaptec, Inc. System and method for data storage archive bit update after snapshot backup
US6510491B1 (en) 1999-12-16 2003-01-21 Adaptec, Inc. System and method for accomplishing data storage migration between raid levels
US7062648B2 (en) * 2000-02-18 2006-06-13 Avamar Technologies, Inc. System and method for redundant array network storage
US6826711B2 (en) 2000-02-18 2004-11-30 Avamar Technologies, Inc. System and method for data protection with multidimensional parity
US7194504B2 (en) * 2000-02-18 2007-03-20 Avamar Technologies, Inc. System and method for representing and maintaining redundant data sets utilizing DNA transmission and transcription techniques
US20010044879A1 (en) * 2000-02-18 2001-11-22 Moulton Gregory Hagan System and method for distributed management of data storage
US7509420B2 (en) 2000-02-18 2009-03-24 Emc Corporation System and method for intelligent, globally distributed network storage
US6704730B2 (en) 2000-02-18 2004-03-09 Avamar Technologies, Inc. Hash file system and method for use in a commonality factoring system
KR20010089012A (en) * 2000-03-20 2001-09-29 전성영 An Internet Safe Service System And Its Method
AU6533501A (en) * 2000-06-05 2001-12-24 Miralink Corp Flexible remote data mirroring
WO2001098952A2 (en) * 2000-06-20 2001-12-27 Orbidex System and method of storing data to a recording medium
US6553388B1 (en) * 2000-07-20 2003-04-22 International Business Machines Corporation Database deltas using Cyclic Redundancy Checks
AU2001293290A1 (en) 2000-09-21 2002-04-02 Integrity Pc Innovations, Inc. An automatic real-time file management method and apparatus
WO2002025438A1 (en) * 2000-09-22 2002-03-28 Patchlink.Com Corporation Non-invasive automatic offsite patch fingerprinting and updating system and method
US20040003266A1 (en) * 2000-09-22 2004-01-01 Patchlink Corporation Non-invasive automatic offsite patch fingerprinting and updating system and method
US6810398B2 (en) * 2000-11-06 2004-10-26 Avamar Technologies, Inc. System and method for unorchestrated determination of data sequences using sticky byte factoring to determine breakpoints in digital sequences
US8479189B2 (en) 2000-11-17 2013-07-02 Hewlett-Packard Development Company, L.P. Pattern detection preprocessor in an electronic device update generation system
US7409685B2 (en) 2002-04-12 2008-08-05 Hewlett-Packard Development Company, L.P. Initialization and update of software and/or firmware in electronic devices
US6606690B2 (en) 2001-02-20 2003-08-12 Hewlett-Packard Development Company, L.P. System and method for accessing a storage area network as network attached storage
US20020156921A1 (en) * 2001-04-19 2002-10-24 International Business Machines Corporation Automatic backup of wireless mobile device data onto gateway server while device is idle
US20030023621A1 (en) * 2001-07-25 2003-01-30 Jay Muse Remote activated internet file transfer and storage device
US6662198B2 (en) 2001-08-30 2003-12-09 Zoteca Inc. Method and system for asynchronous transmission, backup, distribution of data and file sharing
US6952758B2 (en) * 2002-07-31 2005-10-04 International Business Machines Corporation Method and system for providing consistent data modification information to clients in a storage system
US7389313B1 (en) * 2002-08-07 2008-06-17 Symantec Operating Corporation System and method for creating a snapshot copy of a database
US7461372B2 (en) * 2002-10-11 2008-12-02 Hewlett-Packard Development Company, L.P. System for optimizing distribution of information employing a universal dictionary
SG136793A1 (en) * 2002-10-22 2007-11-29 Sql View Pte Ltd E-archiving system (eas), methodology and process
US7055008B2 (en) * 2003-01-22 2006-05-30 Falconstor Software, Inc. System and method for backing up data
US7107419B1 (en) * 2003-02-14 2006-09-12 Google Inc. Systems and methods for performing record append operations
US8555273B1 (en) 2003-09-17 2013-10-08 Palm. Inc. Network for updating electronic devices
US7761569B2 (en) 2004-01-23 2010-07-20 Tiversa, Inc. Method for monitoring and providing information over a peer to peer network
US8156175B2 (en) 2004-01-23 2012-04-10 Tiversa Inc. System and method for searching for specific types of people or information on a peer-to-peer network
US7685384B2 (en) * 2004-02-06 2010-03-23 Globalscape, Inc. System and method for replicating files in a computer network
US7698159B2 (en) * 2004-02-13 2010-04-13 Genworth Financial Inc. Systems and methods for performing data collection
US20050182666A1 (en) * 2004-02-13 2005-08-18 Perry Timothy P.J. Method and system for electronically routing and processing information
US7320003B2 (en) * 2004-02-13 2008-01-15 Genworth Financial, Inc. Method and system for storing and retrieving document data using a markup language string and a serialized string
US7904895B1 (en) 2004-04-21 2011-03-08 Hewlett-Packard Develpment Company, L.P. Firmware update in electronic devices employing update agent in a flash memory card
US20060004890A1 (en) * 2004-06-10 2006-01-05 International Business Machines Corporation Methods and systems for providing directory services for file systems
US7484051B2 (en) * 2004-06-14 2009-01-27 International Business Machines Corporation Apparatus, system and method for reliably updating a data group in a read-before-write data replication environment using a comparison file
US7580959B2 (en) * 2004-06-14 2009-08-25 International Business Machines Corporation Apparatus, system, and method for providing efficient disaster recovery storage of data using differencing
US7243201B2 (en) * 2004-07-22 2007-07-10 International Business Machines Corporation Application-based commit for local storage subsystems and remote storage subsystems
US8526940B1 (en) 2004-08-17 2013-09-03 Palm, Inc. Centralized rules repository for smart phone customer care
GB2417800A (en) * 2004-09-07 2006-03-08 Siemens Ag Data updating management apparatus and method
US20060069890A1 (en) * 2004-09-30 2006-03-30 Emc Corporation Triangular asynchronous replication with minimal synchronous storage
US8078813B2 (en) * 2004-09-30 2011-12-13 Emc Corporation Triangular asynchronous replication
JP4741835B2 (en) * 2004-12-17 2011-08-10 Necエンジニアリング株式会社 Memory dump system
FR2882448B1 (en) * 2005-01-21 2007-05-04 Meiosys Soc Par Actions Simpli METHOD OF MANAGING, JOURNALIZING OR REJECTING THE PROGRESS OF AN APPLICATION PROCESS
US20070038884A1 (en) * 2005-08-10 2007-02-15 Spare Backup, Inc. System and method of remote storage of data using client software
US7962585B2 (en) * 2005-08-15 2011-06-14 Microsoft Corporation Partial item change tracking and synchronization
WO2007131190A2 (en) 2006-05-05 2007-11-15 Hybir Inc. Group based complete and incremental computer file backup system, process and apparatus
EP2025095A2 (en) 2006-06-08 2009-02-18 Hewlett-Packard Development Company, L.P. Device management in a network
US8752044B2 (en) 2006-07-27 2014-06-10 Qualcomm Incorporated User experience and dependency management in a mobile device
US9141627B2 (en) * 2006-09-26 2015-09-22 Sony Corporation Providing a user access to data files distributed in a plurality of different types of user devices
US9922330B2 (en) 2007-04-12 2018-03-20 Kroll Information Assurance, Llc System and method for advertising on a peer-to-peer network
JP5390513B2 (en) * 2007-06-05 2014-01-15 アストリウム・リミテッド Remote inspection system and method
US8438558B1 (en) 2009-03-27 2013-05-07 Google Inc. System and method of updating programs and data
US8311964B1 (en) 2009-11-12 2012-11-13 Symantec Corporation Progressive sampling for deduplication indexing
US8473463B1 (en) 2010-03-02 2013-06-25 Symantec Corporation Method of avoiding duplicate backups in a computing system
US8370315B1 (en) 2010-05-28 2013-02-05 Symantec Corporation System and method for high performance deduplication indexing
US8983952B1 (en) 2010-07-29 2015-03-17 Symantec Corporation System and method for partitioning backup data streams in a deduplication based storage system
US8756197B1 (en) 2010-08-13 2014-06-17 Symantec Corporation Generating data set views for backup restoration
US8291170B1 (en) 2010-08-19 2012-10-16 Symantec Corporation System and method for event driven backup data storage
US8392376B2 (en) 2010-09-03 2013-03-05 Symantec Corporation System and method for scalable reference management in a deduplication based storage system
US8396841B1 (en) 2010-11-30 2013-03-12 Symantec Corporation Method and system of multi-level and multi-mode cloud-based deduplication
US8392384B1 (en) 2010-12-10 2013-03-05 Symantec Corporation Method and system of deduplication-based fingerprint index caching
US9600513B2 (en) 2011-06-09 2017-03-21 International Business Machines Corporation Database table comparison
US8589640B2 (en) 2011-10-14 2013-11-19 Pure Storage, Inc. Method for maintaining multiple fingerprint tables in a deduplicating storage system
US10275397B2 (en) 2013-02-22 2019-04-30 Veritas Technologies Llc Deduplication storage system with efficient reference updating and space reclamation
US9575680B1 (en) 2014-08-22 2017-02-21 Veritas Technologies Llc Deduplication rehydration
US10423495B1 (en) 2014-09-08 2019-09-24 Veritas Technologies Llc Deduplication grouping
US10146752B2 (en) 2014-12-31 2018-12-04 Quantum Metric, LLC Accurate and efficient recording of user experience, GUI changes and user interaction events on a remote web document
US10015229B2 (en) * 2015-02-24 2018-07-03 International Business Machines Corporation Metadata sharing to decrease file transfer time
WO2017011829A1 (en) * 2015-07-16 2017-01-19 Quantum Metric, LLC Document capture using client-based delta encoding with server

Citations (94)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3711863A (en) * 1972-01-21 1973-01-16 Honeywell Inf Systems Source code comparator computer program
US3715734A (en) * 1970-11-12 1973-02-06 J Fajans Memory storage device and method of making the same
US4361832A (en) * 1977-01-28 1982-11-30 Cole Martin T Automatic centralized monitoring system
US4377000A (en) * 1980-05-05 1983-03-15 Westinghouse Electric Corp. Automatic fault detection and recovery system which provides stability and continuity of operation in an industrial multiprocessor control
US4491934A (en) * 1982-05-12 1985-01-01 Heinz Karl E Data compression process
US4558302A (en) * 1983-06-20 1985-12-10 Sperry Corporation High speed data compression and decompression apparatus and method
US4616315A (en) * 1985-01-11 1986-10-07 Burroughs Corporation System memory for a reduction processor evaluating programs stored as binary directed graphs employing variable-free applicative language codes
US4641274A (en) * 1982-12-03 1987-02-03 International Business Machines Corporation Method for communicating changes made to text form a text processor to a remote host
US4654819A (en) * 1982-12-09 1987-03-31 Sequoia Systems, Inc. Memory back-up system
US4686620A (en) * 1984-07-26 1987-08-11 American Telephone And Telegraph Company, At&T Bell Laboratories Database backup method
US4727509A (en) * 1984-06-28 1988-02-23 Information Exchange Systems, Inc. Master/slave system for replicating/formatting flexible magnetic diskettes
US4792896A (en) * 1983-12-07 1988-12-20 516277 Ontario Limited Storage controller emulator providing transparent resource sharing in a computer system
US4807182A (en) * 1986-03-12 1989-02-21 Advanced Software, Inc. Apparatus and method for comparing data groups
US4819154A (en) * 1982-12-09 1989-04-04 Sequoia Systems, Inc. Memory back up system with one cache memory and two physically separated main memories
US4881075A (en) * 1987-10-15 1989-11-14 Digital Equipment Corporation Method and apparatus for adaptive data compression
US4893307A (en) * 1988-02-29 1990-01-09 International Business Machines Corporation Method and apparatus for linking SNA terminals to an SNA host over a packet switched communications network
US4910666A (en) * 1986-12-18 1990-03-20 Bull Hn Information Systems Inc. Apparatus for loading and verifying a control store memory of a central subsystem
US4914576A (en) * 1986-12-18 1990-04-03 Bull Hn Information Systems Inc. Apparatus and method of loading a control store memory of a central subsystem
US5008936A (en) * 1988-12-09 1991-04-16 The Exchange System Limited Partnership Backup/restore technique in a microcomputer-based encryption system
US5043871A (en) * 1986-03-26 1991-08-27 Hitachi, Ltd. Method and apparatus for database update/recovery
US5051947A (en) * 1985-12-10 1991-09-24 Trw Inc. High-speed single-pass textual search processor for locating exact and inexact matches of a search pattern in a textual stream
US5060185A (en) * 1988-03-25 1991-10-22 Ncr Corporation File backup system
US5133065A (en) * 1989-07-27 1992-07-21 Personal Computer Peripherals Corporation Backup computer program for networks
US5155845A (en) * 1990-06-15 1992-10-13 Storage Technology Corporation Data storage system for providing redundant copies of data on different disk drives
US5163148A (en) * 1989-08-11 1992-11-10 Digital Equipment Corporation File backup system for producing a backup copy of a file which may be updated during backup
US5193154A (en) * 1987-07-10 1993-03-09 Hitachi, Ltd. Buffered peripheral system and method for backing up and retrieving data to and from backup memory device
US5202982A (en) * 1990-03-27 1993-04-13 Sun Microsystems, Inc. Method and apparatus for the naming of database component files to avoid duplication of files
US5210866A (en) * 1990-09-12 1993-05-11 Storage Technology Corporation Incremental disk backup system for a dynamically mapped data storage subsystem
EP0541281A2 (en) * 1991-11-04 1993-05-12 AT&T Corp. Incremental-computer-file backup using signatures
US5212784A (en) * 1990-10-22 1993-05-18 Delphi Data, A Division Of Sparks Industries, Inc. Automated concurrent data backup system
US5212772A (en) * 1991-02-11 1993-05-18 Gigatrend Incorporated System for storing data in backup tape device
US5235601A (en) * 1990-12-21 1993-08-10 Array Technology Corporation On-line restoration of redundancy information in a redundant array system
US5239637A (en) * 1989-06-30 1993-08-24 Digital Equipment Corporation Digital data management system for maintaining consistency of data in a shadow set
US5239647A (en) * 1990-09-07 1993-08-24 International Business Machines Corporation Data storage hierarchy with shared storage level
US5239659A (en) * 1991-06-19 1993-08-24 Storage Technology Corporation Phantom duplex copy group apparatus for a disk drive array data storge subsystem
US5241668A (en) * 1992-04-20 1993-08-31 International Business Machines Corporation Method and system for automated termination and resumption in a time zero backup copy process
US5263154A (en) * 1992-04-20 1993-11-16 International Business Machines Corporation Method and system for incremental time zero backup copying of data
US5274807A (en) * 1990-11-01 1993-12-28 At&T Bell Laboratories Method for reducing magnetic storage volume for computer disk image backup
US5274802A (en) * 1991-02-22 1993-12-28 Gte Mobilnet Incorporated Method for restoring lost databases by comparing existing database and generic database, and generating cellular switch commands to update the generic database
US5276860A (en) * 1989-12-19 1994-01-04 Epoch Systems, Inc. Digital data processor with improved backup storage
US5276865A (en) * 1992-05-06 1994-01-04 Thorpe Tracy J Automatic computer back-up system
US5278838A (en) * 1991-06-18 1994-01-11 Ibm Corp. Recovery from errors in a redundant array of disk drives
US5293613A (en) * 1991-08-29 1994-03-08 International Business Machines Corporation Recovery control register
US5295258A (en) * 1989-12-22 1994-03-15 Tandem Computers Incorporated Fault-tolerant computer system with online recovery and reintegration of redundant components
US5321832A (en) * 1989-05-26 1994-06-14 Hitachi, Ltd. System of database copy operations using a virtual page control table to map log data into physical store order
WO1994017474A1 (en) * 1993-01-21 1994-08-04 Apple Computer, Inc. Apparatus and method for backing up data from networked computer storage devices
US5347653A (en) * 1991-06-28 1994-09-13 Digital Equipment Corporation System for reconstructing prior versions of indexes using records indicating changes between successive versions of the indexes
US5357607A (en) * 1991-04-01 1994-10-18 Xerox Corporation File storage process for electronic printing systems having multiple disks
US5367698A (en) * 1991-10-31 1994-11-22 Epoch Systems, Inc. Network file migration system
US5375232A (en) * 1992-09-23 1994-12-20 International Business Machines Corporation Method and system for asynchronous pre-staging of backup copies in a data processing storage subsystem
US5404508A (en) * 1992-12-03 1995-04-04 Unisys Corporation Data base backup and recovery system and method
US5416840A (en) * 1993-07-06 1995-05-16 Phoenix Technologies, Ltd. Software catalog encoding method and system
US5435004A (en) * 1994-07-21 1995-07-18 International Business Machines Corporation Computerized system and method for data backup
US5438671A (en) * 1991-07-19 1995-08-01 Dell U.S.A., L.P. Method and system for transferring compressed bytes of information between separate hard disk drive units
US5446871A (en) * 1993-03-23 1995-08-29 International Business Machines Corporation Method and arrangement for multi-system remote data duplexing and recovery
US5448718A (en) * 1992-04-20 1995-09-05 International Business Machines Corporation Method and system for time zero backup session security
US5454099A (en) * 1989-07-25 1995-09-26 International Business Machines Corporation CPU implemented method for backing up modified data sets in non-volatile store for recovery in the event of CPU failure
US5475834A (en) * 1992-10-26 1995-12-12 International Business Machines Corporation Integration of migration level two and backup tape processing using multiple inventory entries
US5479654A (en) * 1990-04-26 1995-12-26 Squibb Data Systems, Inc. Apparatus and method for reconstructing a file from a difference signature and an original file
US5495533A (en) * 1994-04-29 1996-02-27 International Business Machines Corporation Personal key archive
US5497483A (en) * 1992-09-23 1996-03-05 International Business Machines Corporation Method and system for track transfer control during concurrent copy operations in a data processing storage subsystem
US5506986A (en) * 1992-07-14 1996-04-09 Electronic Data Systems Corporation Media management system using historical data to access data sets from a plurality of data storage devices
US5513314A (en) * 1995-01-27 1996-04-30 Auspex Systems, Inc. Fault tolerant NFS server system and mirroring protocol
US5513351A (en) * 1994-07-28 1996-04-30 International Business Machines Corporation Protecting a system during system maintenance by usage of temporary filenames in an alias table
US5515502A (en) * 1993-09-30 1996-05-07 Sybase, Inc. Data backup system with methods for stripe affinity backup to multiple archive devices
US5546534A (en) * 1993-07-19 1996-08-13 Intelligence Quotient International Ltd. Method of operating a computer system
US5548750A (en) * 1992-12-08 1996-08-20 Telefonaktiebolaget Lm Ericsson System for taking backup in a data base
US5564037A (en) * 1995-03-29 1996-10-08 Cheyenne Software International Sales Corp. Real time data migration system and method employing sparse files
US5566331A (en) * 1994-01-24 1996-10-15 University Corporation For Atmospheric Research Mass storage system for file-systems
US5574906A (en) * 1994-10-24 1996-11-12 International Business Machines Corporation System and method for reducing storage requirement in backup subsystems utilizing segmented compression and differencing
US5584023A (en) * 1993-12-27 1996-12-10 Hsu; Mike S. C. Computer system including a transparent and secure file transform mechanism
US5586322A (en) * 1992-06-11 1996-12-17 Beck; Robert E. Workgroup organized network manager with workstation comparison system
US5594900A (en) * 1992-12-02 1997-01-14 International Business Machines Corporation System and method for providing a backup copy of a database
US5596706A (en) * 1990-02-28 1997-01-21 Hitachi, Ltd. Highly reliable online system
US5604862A (en) * 1995-03-14 1997-02-18 Network Integrity, Inc. Continuously-snapshotted protection of computer files
US5606693A (en) * 1991-10-02 1997-02-25 International Business Machines Corporation Distributed database management over a network
US5615364A (en) * 1992-10-16 1997-03-25 Siemens Energy & Automation, Inc. Apparatus and method for performing the backup of a primary database using a back up database
US5623662A (en) * 1993-12-15 1997-04-22 Supercomm, Inc. Revenue sharing system with data filtering using history, periodic, and exclusion databases
US5640561A (en) * 1992-10-13 1997-06-17 International Business Machines Corporation Computerized method and system for replicating a database using log records
US5649089A (en) * 1994-11-30 1997-07-15 Motorola, Inc. Method and apparatus for maintaining a redundant database system
US5649196A (en) * 1993-07-01 1997-07-15 Legent Corporation System and method for distributed storage management on networked computer systems using binary object identifiers
US5659743A (en) * 1994-12-05 1997-08-19 Legent Corporation Method and apparatus for a pattern based spaced management system
US5659614A (en) * 1994-11-28 1997-08-19 Bailey, Iii; John E. Method and system for creating and storing a backup copy of file data stored on a computer
US5664186A (en) * 1992-05-21 1997-09-02 International Business Machines Corporation Computer file management and backup system
US5668991A (en) * 1994-03-31 1997-09-16 International Computers Limited Database management system
US5673381A (en) * 1994-05-27 1997-09-30 Cheyenne Software International Sales Corp. System and parallel streaming and data stripping to back-up a network
US5675725A (en) * 1993-07-19 1997-10-07 Cheyenne Advanced Technology Limited Computer backup system operable with open files
US5708820A (en) * 1994-10-25 1998-01-13 Samsung Electronics Co., Ltd. Network hibernation system for suspending and resuming operation of computer system operable in network environment in event of power failure or period of inactivity
US5713024A (en) * 1994-06-10 1998-01-27 Exabyte Corporation Cold boot data backup system
US5765173A (en) * 1996-01-11 1998-06-09 Connected Corporation High performance backup via selective file saving which can perform incremental backups and exclude files and uses a changed block signature list
US5778395A (en) * 1995-10-23 1998-07-07 Stac, Inc. System for backing up files from disk volumes on multiple nodes of a computer network
US5794254A (en) * 1996-12-03 1998-08-11 Fairbanks Systems Group Incremental computer file backup using a two-step comparison of first two characters in the block and a signature with pre-stored character and signature sets
US5799141A (en) * 1995-06-09 1998-08-25 Qualix Group, Inc. Real-time data protection system and method
US5864853A (en) * 1994-09-14 1999-01-26 Kabushiki Kaisha Toshiba Portable file system operable under various computer environments

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5623822A (en) * 1995-05-23 1997-04-29 Montenay International Corp. Method of operating a waste-to-energy plant having a waste boiler and gas turbine cycle

Patent Citations (98)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3715734A (en) * 1970-11-12 1973-02-06 J Fajans Memory storage device and method of making the same
US3711863A (en) * 1972-01-21 1973-01-16 Honeywell Inf Systems Source code comparator computer program
US4361832A (en) * 1977-01-28 1982-11-30 Cole Martin T Automatic centralized monitoring system
US4377000A (en) * 1980-05-05 1983-03-15 Westinghouse Electric Corp. Automatic fault detection and recovery system which provides stability and continuity of operation in an industrial multiprocessor control
US4491934A (en) * 1982-05-12 1985-01-01 Heinz Karl E Data compression process
US4641274A (en) * 1982-12-03 1987-02-03 International Business Machines Corporation Method for communicating changes made to text form a text processor to a remote host
US4654819A (en) * 1982-12-09 1987-03-31 Sequoia Systems, Inc. Memory back-up system
US4819154A (en) * 1982-12-09 1989-04-04 Sequoia Systems, Inc. Memory back up system with one cache memory and two physically separated main memories
US4558302B1 (en) * 1983-06-20 1994-01-04 Unisys Corp
US4558302A (en) * 1983-06-20 1985-12-10 Sperry Corporation High speed data compression and decompression apparatus and method
US4792896A (en) * 1983-12-07 1988-12-20 516277 Ontario Limited Storage controller emulator providing transparent resource sharing in a computer system
US4727509A (en) * 1984-06-28 1988-02-23 Information Exchange Systems, Inc. Master/slave system for replicating/formatting flexible magnetic diskettes
US4686620A (en) * 1984-07-26 1987-08-11 American Telephone And Telegraph Company, At&T Bell Laboratories Database backup method
US4616315A (en) * 1985-01-11 1986-10-07 Burroughs Corporation System memory for a reduction processor evaluating programs stored as binary directed graphs employing variable-free applicative language codes
US5051947A (en) * 1985-12-10 1991-09-24 Trw Inc. High-speed single-pass textual search processor for locating exact and inexact matches of a search pattern in a textual stream
US4807182A (en) * 1986-03-12 1989-02-21 Advanced Software, Inc. Apparatus and method for comparing data groups
US5043871A (en) * 1986-03-26 1991-08-27 Hitachi, Ltd. Method and apparatus for database update/recovery
US4910666A (en) * 1986-12-18 1990-03-20 Bull Hn Information Systems Inc. Apparatus for loading and verifying a control store memory of a central subsystem
US4914576A (en) * 1986-12-18 1990-04-03 Bull Hn Information Systems Inc. Apparatus and method of loading a control store memory of a central subsystem
US5193154A (en) * 1987-07-10 1993-03-09 Hitachi, Ltd. Buffered peripheral system and method for backing up and retrieving data to and from backup memory device
US4881075A (en) * 1987-10-15 1989-11-14 Digital Equipment Corporation Method and apparatus for adaptive data compression
US4893307A (en) * 1988-02-29 1990-01-09 International Business Machines Corporation Method and apparatus for linking SNA terminals to an SNA host over a packet switched communications network
US5060185A (en) * 1988-03-25 1991-10-22 Ncr Corporation File backup system
US5008936A (en) * 1988-12-09 1991-04-16 The Exchange System Limited Partnership Backup/restore technique in a microcomputer-based encryption system
US5321832A (en) * 1989-05-26 1994-06-14 Hitachi, Ltd. System of database copy operations using a virtual page control table to map log data into physical store order
US5239637A (en) * 1989-06-30 1993-08-24 Digital Equipment Corporation Digital data management system for maintaining consistency of data in a shadow set
US5454099A (en) * 1989-07-25 1995-09-26 International Business Machines Corporation CPU implemented method for backing up modified data sets in non-volatile store for recovery in the event of CPU failure
US5133065A (en) * 1989-07-27 1992-07-21 Personal Computer Peripherals Corporation Backup computer program for networks
US5163148A (en) * 1989-08-11 1992-11-10 Digital Equipment Corporation File backup system for producing a backup copy of a file which may be updated during backup
US5276860A (en) * 1989-12-19 1994-01-04 Epoch Systems, Inc. Digital data processor with improved backup storage
US5295258A (en) * 1989-12-22 1994-03-15 Tandem Computers Incorporated Fault-tolerant computer system with online recovery and reintegration of redundant components
US5596706A (en) * 1990-02-28 1997-01-21 Hitachi, Ltd. Highly reliable online system
US5202982A (en) * 1990-03-27 1993-04-13 Sun Microsystems, Inc. Method and apparatus for the naming of database component files to avoid duplication of files
US5479654A (en) * 1990-04-26 1995-12-26 Squibb Data Systems, Inc. Apparatus and method for reconstructing a file from a difference signature and an original file
US5155845A (en) * 1990-06-15 1992-10-13 Storage Technology Corporation Data storage system for providing redundant copies of data on different disk drives
US5239647A (en) * 1990-09-07 1993-08-24 International Business Machines Corporation Data storage hierarchy with shared storage level
US5210866A (en) * 1990-09-12 1993-05-11 Storage Technology Corporation Incremental disk backup system for a dynamically mapped data storage subsystem
US5212784A (en) * 1990-10-22 1993-05-18 Delphi Data, A Division Of Sparks Industries, Inc. Automated concurrent data backup system
US5274807A (en) * 1990-11-01 1993-12-28 At&T Bell Laboratories Method for reducing magnetic storage volume for computer disk image backup
US5235601A (en) * 1990-12-21 1993-08-10 Array Technology Corporation On-line restoration of redundancy information in a redundant array system
US5212772A (en) * 1991-02-11 1993-05-18 Gigatrend Incorporated System for storing data in backup tape device
US5274802A (en) * 1991-02-22 1993-12-28 Gte Mobilnet Incorporated Method for restoring lost databases by comparing existing database and generic database, and generating cellular switch commands to update the generic database
US5357607A (en) * 1991-04-01 1994-10-18 Xerox Corporation File storage process for electronic printing systems having multiple disks
US5278838A (en) * 1991-06-18 1994-01-11 Ibm Corp. Recovery from errors in a redundant array of disk drives
US5239659A (en) * 1991-06-19 1993-08-24 Storage Technology Corporation Phantom duplex copy group apparatus for a disk drive array data storge subsystem
US5347653A (en) * 1991-06-28 1994-09-13 Digital Equipment Corporation System for reconstructing prior versions of indexes using records indicating changes between successive versions of the indexes
US5438671A (en) * 1991-07-19 1995-08-01 Dell U.S.A., L.P. Method and system for transferring compressed bytes of information between separate hard disk drive units
US5293613A (en) * 1991-08-29 1994-03-08 International Business Machines Corporation Recovery control register
US5606693A (en) * 1991-10-02 1997-02-25 International Business Machines Corporation Distributed database management over a network
US5668986A (en) * 1991-10-02 1997-09-16 International Business Machines Corporation Method and apparatus for handling data storage requests in a distributed data base environment
US5367698A (en) * 1991-10-31 1994-11-22 Epoch Systems, Inc. Network file migration system
EP0541281A2 (en) * 1991-11-04 1993-05-12 AT&T Corp. Incremental-computer-file backup using signatures
US5559991A (en) * 1991-11-04 1996-09-24 Lucent Technologies Inc. Incremental computer file backup using check words
US5263154A (en) * 1992-04-20 1993-11-16 International Business Machines Corporation Method and system for incremental time zero backup copying of data
US5241668A (en) * 1992-04-20 1993-08-31 International Business Machines Corporation Method and system for automated termination and resumption in a time zero backup copy process
US5448718A (en) * 1992-04-20 1995-09-05 International Business Machines Corporation Method and system for time zero backup session security
US5276865A (en) * 1992-05-06 1994-01-04 Thorpe Tracy J Automatic computer back-up system
US5664186A (en) * 1992-05-21 1997-09-02 International Business Machines Corporation Computer file management and backup system
US5586322A (en) * 1992-06-11 1996-12-17 Beck; Robert E. Workgroup organized network manager with workstation comparison system
US5506986A (en) * 1992-07-14 1996-04-09 Electronic Data Systems Corporation Media management system using historical data to access data sets from a plurality of data storage devices
US5375232A (en) * 1992-09-23 1994-12-20 International Business Machines Corporation Method and system for asynchronous pre-staging of backup copies in a data processing storage subsystem
US5497483A (en) * 1992-09-23 1996-03-05 International Business Machines Corporation Method and system for track transfer control during concurrent copy operations in a data processing storage subsystem
US5640561A (en) * 1992-10-13 1997-06-17 International Business Machines Corporation Computerized method and system for replicating a database using log records
US5615364A (en) * 1992-10-16 1997-03-25 Siemens Energy & Automation, Inc. Apparatus and method for performing the backup of a primary database using a back up database
US5475834A (en) * 1992-10-26 1995-12-12 International Business Machines Corporation Integration of migration level two and backup tape processing using multiple inventory entries
US5594900A (en) * 1992-12-02 1997-01-14 International Business Machines Corporation System and method for providing a backup copy of a database
US5404508A (en) * 1992-12-03 1995-04-04 Unisys Corporation Data base backup and recovery system and method
US5548750A (en) * 1992-12-08 1996-08-20 Telefonaktiebolaget Lm Ericsson System for taking backup in a data base
WO1994017474A1 (en) * 1993-01-21 1994-08-04 Apple Computer, Inc. Apparatus and method for backing up data from networked computer storage devices
US5446871A (en) * 1993-03-23 1995-08-29 International Business Machines Corporation Method and arrangement for multi-system remote data duplexing and recovery
US5649196A (en) * 1993-07-01 1997-07-15 Legent Corporation System and method for distributed storage management on networked computer systems using binary object identifiers
US5416840A (en) * 1993-07-06 1995-05-16 Phoenix Technologies, Ltd. Software catalog encoding method and system
US5675725A (en) * 1993-07-19 1997-10-07 Cheyenne Advanced Technology Limited Computer backup system operable with open files
US5546534A (en) * 1993-07-19 1996-08-13 Intelligence Quotient International Ltd. Method of operating a computer system
US5671350A (en) * 1993-09-30 1997-09-23 Sybase, Inc. Data backup system with methods for stripe affinity backup to multiple archive devices
US5515502A (en) * 1993-09-30 1996-05-07 Sybase, Inc. Data backup system with methods for stripe affinity backup to multiple archive devices
US5623662A (en) * 1993-12-15 1997-04-22 Supercomm, Inc. Revenue sharing system with data filtering using history, periodic, and exclusion databases
US5584023A (en) * 1993-12-27 1996-12-10 Hsu; Mike S. C. Computer system including a transparent and secure file transform mechanism
US5566331A (en) * 1994-01-24 1996-10-15 University Corporation For Atmospheric Research Mass storage system for file-systems
US5668991A (en) * 1994-03-31 1997-09-16 International Computers Limited Database management system
US5495533A (en) * 1994-04-29 1996-02-27 International Business Machines Corporation Personal key archive
US5673381A (en) * 1994-05-27 1997-09-30 Cheyenne Software International Sales Corp. System and parallel streaming and data stripping to back-up a network
US5713024A (en) * 1994-06-10 1998-01-27 Exabyte Corporation Cold boot data backup system
US5435004A (en) * 1994-07-21 1995-07-18 International Business Machines Corporation Computerized system and method for data backup
US5513351A (en) * 1994-07-28 1996-04-30 International Business Machines Corporation Protecting a system during system maintenance by usage of temporary filenames in an alias table
US5864853A (en) * 1994-09-14 1999-01-26 Kabushiki Kaisha Toshiba Portable file system operable under various computer environments
US5574906A (en) * 1994-10-24 1996-11-12 International Business Machines Corporation System and method for reducing storage requirement in backup subsystems utilizing segmented compression and differencing
US5708820A (en) * 1994-10-25 1998-01-13 Samsung Electronics Co., Ltd. Network hibernation system for suspending and resuming operation of computer system operable in network environment in event of power failure or period of inactivity
US5659614A (en) * 1994-11-28 1997-08-19 Bailey, Iii; John E. Method and system for creating and storing a backup copy of file data stored on a computer
US5649089A (en) * 1994-11-30 1997-07-15 Motorola, Inc. Method and apparatus for maintaining a redundant database system
US5659743A (en) * 1994-12-05 1997-08-19 Legent Corporation Method and apparatus for a pattern based spaced management system
US5513314A (en) * 1995-01-27 1996-04-30 Auspex Systems, Inc. Fault tolerant NFS server system and mirroring protocol
US5604862A (en) * 1995-03-14 1997-02-18 Network Integrity, Inc. Continuously-snapshotted protection of computer files
US5564037A (en) * 1995-03-29 1996-10-08 Cheyenne Software International Sales Corp. Real time data migration system and method employing sparse files
US5799141A (en) * 1995-06-09 1998-08-25 Qualix Group, Inc. Real-time data protection system and method
US5778395A (en) * 1995-10-23 1998-07-07 Stac, Inc. System for backing up files from disk volumes on multiple nodes of a computer network
US5765173A (en) * 1996-01-11 1998-06-09 Connected Corporation High performance backup via selective file saving which can perform incremental backups and exclude files and uses a changed block signature list
US5794254A (en) * 1996-12-03 1998-08-11 Fairbanks Systems Group Incremental computer file backup using a two-step comparison of first two characters in the block and a signature with pre-stored character and signature sets

Cited By (102)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6847982B2 (en) * 1996-04-12 2005-01-25 Computer Associates Think, Inc. Intelligent data inventory and asset management system method and apparatus
US20020073106A1 (en) * 1996-04-12 2002-06-13 Computer Associates Think, Inc. A Dela Corporation Intelligent data inventory and asset management system method and apparatus
US10361802B1 (en) 1999-02-01 2019-07-23 Blanding Hovenweep, Llc Adaptive pattern recognition based control system and method
US6496949B1 (en) * 1999-08-06 2002-12-17 International Business Machines Corp. Emergency backup system, method and program product therefor
US20100185855A1 (en) * 2000-02-18 2010-07-22 Margolus Norman H Data Repository and Method for Promoting Network Storage of Data
US9177175B2 (en) * 2000-02-18 2015-11-03 Permabit Technology Corporation Data repository and method for promoting network storage of data
US7574598B2 (en) 2000-05-05 2009-08-11 Microsoft Corporation Identifying and coalescing identical objects encrypted with different keys
US7266689B2 (en) 2000-05-05 2007-09-04 Microsoft Corporation Encryption systems and methods for identifying and coalescing identical objects encrypted with different keys
US6983365B1 (en) * 2000-05-05 2006-01-03 Microsoft Corporation Encryption systems and methods for identifying and coalescing identical objects encrypted with different keys
US20050235146A1 (en) * 2000-05-05 2005-10-20 Microsoft Corporation Identifying and coalescing identical objects encrypted with different keys
US7437555B2 (en) 2000-05-05 2008-10-14 Microsoft Corporation Encryption systems and methods for identifying and coalescing identical objects encrypted with different keys
US20040215962A1 (en) * 2000-05-05 2004-10-28 Microsoft Corporation Encryption systems and methods for identifying and coalescing identical objects encrypted with different keys
US20040221159A1 (en) * 2000-05-05 2004-11-04 Microsoft Corporation Encryption systems and methods for identifying and coalescing identical objects encrypted with different keys
US20040221160A1 (en) * 2000-05-05 2004-11-04 Microsoft Corporation Encryption systems and methods for identifying and coalescing identical objects encrypted with different keys
US7159110B2 (en) * 2000-05-05 2007-01-02 Microsoft Corporation Identifying and coalescing identical objects encrypted with different keys
US20050229012A1 (en) * 2000-05-05 2005-10-13 Microsoft Corporation Identifying and coalescing identical objects encrypted with different keys
US7779253B2 (en) * 2000-05-05 2010-08-17 Microsoft Corporation Encryption systems and methods for identifying and coalescing identical objects encrypted with different keys
US6966001B2 (en) * 2000-05-23 2005-11-15 Hitachi, Ltd. Computing system and data decryption method and computer system with remote copy facility
US20030037247A1 (en) * 2000-05-23 2003-02-20 Kiyohiro Obara Computing system and data decryption method and computer system with remote copy facility
US20020016913A1 (en) * 2000-08-04 2002-02-07 Wheeler Lynn Henry Modifying message data and generating random number digital signature within computer chip
US7784106B2 (en) 2000-08-04 2010-08-24 First Data Corporation Manufacturing unique devices that generate digital signatures
US6742028B1 (en) 2000-09-15 2004-05-25 Frank Wang Content management and sharing
US20020069376A1 (en) * 2000-12-01 2002-06-06 Gregg Leon Edward Method, article of manufacture and apparatus for copying information to a storage medium
US6779129B2 (en) * 2000-12-01 2004-08-17 International Business Machines Corporation Method, article of manufacture and apparatus for copying information to a storage medium
US6594677B2 (en) 2000-12-22 2003-07-15 Simdesk Technologies, Inc. Virtual tape storage system and method
WO2003090098A1 (en) * 2001-02-15 2003-10-30 Avica Technology Programming content distribution
US9419951B1 (en) 2001-03-23 2016-08-16 St. Luke Technologies, Llc System and method for secure three-party communications
US8904181B1 (en) 2001-03-23 2014-12-02 David P. Felsher System and method for secure three-party communications
US20020174137A1 (en) * 2001-05-15 2002-11-21 Wolff Daniel Joseph Repairing alterations to computer files
US7356535B2 (en) * 2002-10-10 2008-04-08 Pb & J Software, Llc Method and system for sharing storage space on a computer
US7310736B2 (en) 2002-10-10 2007-12-18 Pb&J Software, Llc Method and system for sharing storage space on a computer
US20040078602A1 (en) * 2002-10-10 2004-04-22 Pb&J Software, Llc Method and system for sharing storage space on a computer
US20050021950A1 (en) * 2002-10-10 2005-01-27 Pb&J Software, Llc Method and system for sharing storage space on a computer
US20070106714A1 (en) * 2002-10-10 2007-05-10 Rothbarth James N Method and system using an external hard drive to implement back-up files
US20070198685A1 (en) * 2003-01-17 2007-08-23 Phatak Shirish H Method and system for use of storage caching with a distributed file system
US7103617B2 (en) * 2003-01-17 2006-09-05 Tacit Networks, Inc. Method and system for use of storage caching with a distributed file system
US20040186861A1 (en) * 2003-01-17 2004-09-23 Phatak Shirish Hemant Method and system for use of storage caching with a distributed file system
WO2004068469A3 (en) * 2003-01-17 2005-03-03 Tacit Networks Inc Method and system for use of storage caching with a distributed file system
US9818136B1 (en) 2003-02-05 2017-11-14 Steven M. Hoffberg System and method for determining contingent relevance
US8600830B2 (en) 2003-02-05 2013-12-03 Steven M. Hoffberg System and method for providing a payment to a non-winning auction participant
US10163137B2 (en) 2003-02-05 2018-12-25 Steven M. Hoffberg System and method for incentivizing participation in a market transaction
US10943273B2 (en) 2003-02-05 2021-03-09 The Hoffberg Family Trust 2004-1 System and method for determining contingent relevance
US11790413B2 (en) 2003-02-05 2023-10-17 Hoffberg Family Trust 2 System and method for communication
US20110004513A1 (en) * 2003-02-05 2011-01-06 Hoffberg Steven M System and method
US7747586B2 (en) 2003-04-23 2010-06-29 International Business Machines Corporation Apparatus and method to map and copy computer files
US20040215628A1 (en) * 2003-04-23 2004-10-28 International Business Machines Corporation Apparatus and method to copy computer files
US20050071380A1 (en) * 2003-09-29 2005-03-31 Micka William F. Apparatus and method to coordinate multiple data storage and retrieval systems
US20080215667A1 (en) * 2003-10-09 2008-09-04 Pb&J Software, Llc Method and system for sharing storage space on a computer
US6968478B1 (en) * 2003-12-18 2005-11-22 Xilinx, Inc. Method and apparatus for data transfer validation
US20050235043A1 (en) * 2004-04-15 2005-10-20 Microsoft Corporation Efficient algorithm and protocol for remote differential compression
US7555531B2 (en) 2004-04-15 2009-06-30 Microsoft Corporation Efficient algorithm and protocol for remote differential compression
US8117173B2 (en) 2004-04-15 2012-02-14 Microsoft Corporation Efficient chunking algorithm
US20090271528A1 (en) * 2004-04-15 2009-10-29 Microsoft Corporation Efficient chunking algorithm
US20050256974A1 (en) * 2004-05-13 2005-11-17 Microsoft Corporation Efficient algorithm and protocol for remote differential compression on a remote device
US20050262167A1 (en) * 2004-05-13 2005-11-24 Microsoft Corporation Efficient algorithm and protocol for remote differential compression on a local device
US20060047855A1 (en) * 2004-05-13 2006-03-02 Microsoft Corporation Efficient chunking algorithm
US20100064141A1 (en) * 2004-09-24 2010-03-11 Microsoft Corporation Efficient algorithm for finding candidate objects for remote differential compression
US7613787B2 (en) 2004-09-24 2009-11-03 Microsoft Corporation Efficient algorithm for finding candidate objects for remote differential compression
US20060085561A1 (en) * 2004-09-24 2006-04-20 Microsoft Corporation Efficient algorithm for finding candidate objects for remote differential compression
US8112496B2 (en) 2004-09-24 2012-02-07 Microsoft Corporation Efficient algorithm for finding candidate objects for remote differential compression
US20060080737A1 (en) * 2004-10-13 2006-04-13 International Business Machines Corporation System and method for reducing virus scan time
US7401361B2 (en) 2004-10-13 2008-07-15 Lenovo (Singapore) Pte. Ltd. System and method for reducing virus scan time
US7752667B2 (en) 2004-12-28 2010-07-06 Lenovo (Singapore) Pte Ltd. Rapid virus scan using file signature created during file write
US20060143713A1 (en) * 2004-12-28 2006-06-29 International Business Machines Corporation Rapid virus scan using file signature created during file write
US7805765B2 (en) 2004-12-28 2010-09-28 Lenovo (Singapore) Pte Ltd. Execution validation using header containing validation data
US20060185017A1 (en) * 2004-12-28 2006-08-17 Lenovo (Singapore) Pte. Ltd. Execution validation using header containing validation data
US7849462B2 (en) 2005-01-07 2010-12-07 Microsoft Corporation Image server
US20060155674A1 (en) * 2005-01-07 2006-07-13 Microsoft Corporation Image server
US20060155735A1 (en) * 2005-01-07 2006-07-13 Microsoft Corporation Image server
US8073926B2 (en) 2005-01-07 2011-12-06 Microsoft Corporation Virtual machine image server
US20070094348A1 (en) * 2005-01-07 2007-04-26 Microsoft Corporation BITS/RDC integration and BITS enhancements
US20100160544A1 (en) * 2005-09-09 2010-06-24 Charles Smith Method of applying silane coating to metal composition
US9794797B2 (en) 2005-10-04 2017-10-17 Steven M. Hoffberg Multifactorial optimization system and method
USRE49334E1 (en) 2005-10-04 2022-12-13 Hoffberg Family Trust 2 Multifactorial optimization system and method
US10567975B2 (en) 2005-10-04 2020-02-18 Hoffberg Family Trust 2 Multifactorial optimization system and method
US20070198659A1 (en) * 2006-01-25 2007-08-23 Lam Wai T Method and system for storing data
US7559011B1 (en) 2006-02-10 2009-07-07 Xilinx, Inc. Circuit having a programmable circuit and method of validating a bitstream loaded into a programmable device
US20080022650A1 (en) * 2006-07-28 2008-01-31 Pascoe William M Composite yarn and process for producing the same
US8812442B1 (en) * 2006-12-19 2014-08-19 Symantec Operating Corporation Backup service and appliance with single-instance storage of encrypted data
US8041641B1 (en) * 2006-12-19 2011-10-18 Symantec Operating Corporation Backup service and appliance with single-instance storage of encrypted data
US20100146083A1 (en) * 2007-06-15 2010-06-10 Packeteer, Inc. WAFS Disconnected-Mode Read-Write Access
US7734733B1 (en) 2007-06-15 2010-06-08 Packeteer, Inc. WAFS disconnected-mode read-write access
US7962600B2 (en) 2007-06-15 2011-06-14 Packeteer, Inc. WAFS disconnected-mode read-write access
US8209540B2 (en) * 2007-06-28 2012-06-26 Apple Inc. Incremental secure backup and restore of user settings and data
US8671279B2 (en) 2007-06-28 2014-03-11 Apple Inc. Incremental secure backup and restore of user settings and data
US20090006640A1 (en) * 2007-06-28 2009-01-01 Michael Lambertus Hubertus Brouwer Incremental secure backup and restore of user settings and data
US8346966B1 (en) 2007-07-19 2013-01-01 Blue Coat Systems, Inc. Transparent file system access for wide area network file system acceleration
US9135284B1 (en) 2008-03-13 2015-09-15 Blue Coat Systems, Inc. Composite execution of rename operations in wide area file systems
US10133744B2 (en) 2008-03-13 2018-11-20 Symantec Corporation Composite execution of rename operations in wide area file systems
US9442850B1 (en) 2008-03-25 2016-09-13 Blue Coat Systems, Inc. Efficient directory refresh operations in wide area file systems
US11683263B1 (en) 2008-08-21 2023-06-20 United Services Automobile Association (Usaa) Preferential loading in data centers
US11044195B1 (en) 2008-08-21 2021-06-22 United Services Automobile Association (Usaa) Preferential loading in data centers
US8468387B2 (en) 2009-04-03 2013-06-18 Microsoft Corporation Bare metal machine recovery
US20100257142A1 (en) * 2009-04-03 2010-10-07 Microsoft Corporation Differential file and system restores from peers and the cloud
US8805953B2 (en) 2009-04-03 2014-08-12 Microsoft Corporation Differential file and system restores from peers and the cloud
US20100257403A1 (en) * 2009-04-03 2010-10-07 Microsoft Corporation Restoration of a system from a set of full and partial delta system snapshots across a distributed system
US10169754B2 (en) 2010-11-17 2019-01-01 Inside Secure Method and system for NFC transaction
US10185950B2 (en) 2010-11-17 2019-01-22 Inside Secure NFC transaction server
US20120123945A1 (en) * 2010-11-17 2012-05-17 Inside Secure Nfc transaction method and system
US20120136832A1 (en) * 2010-11-30 2012-05-31 Network Appliance, Inc. Incremental restore of data between storage systems having dissimilar storage operating systems associated therewith
US8688645B2 (en) * 2010-11-30 2014-04-01 Netapp, Inc. Incremental restore of data between storage systems having dissimilar storage operating systems associated therewith
US20120294200A1 (en) * 2011-05-19 2012-11-22 Wistron Neweb Corporation Wireless communication method

Also Published As

Publication number Publication date
US6014676A (en) 2000-01-11
JP2001508894A (en) 2001-07-03
EP1012701A1 (en) 2000-06-28
US5794254A (en) 1998-08-11
CA2273920A1 (en) 1998-06-25
WO1998027483A1 (en) 1998-06-25
EP1012701A4 (en) 2001-10-31

Similar Documents

Publication Publication Date Title
US6049874A (en) System and method for backing up computer files over a wide area computer network
US6038665A (en) System and method for backing up computer files over a wide area computer network
US20200293693A1 (en) Group based complete and incremental computer file backup system, process and apparatus
US7134041B2 (en) Systems and methods for data backup over a network
US5778395A (en) System for backing up files from disk volumes on multiple nodes of a computer network
US9578097B2 (en) Block based access to a dispersed data storage network
US6754827B1 (en) Secure File Archive through encryption key management
JP4446738B2 (en) System and method for efficiently backing up computer files
US6615225B1 (en) System and method for relating files in a distributed data storage environment
US7769971B2 (en) Replication and restoration of single-instance storage pools
US10534919B1 (en) Backup service and appliance with single-instance storage of encrypted data
US8341117B2 (en) Method, system, and program for personal data management using content-based replication
US5765173A (en) High performance backup via selective file saving which can perform incremental backups and exclude files and uses a changed block signature list
US8176338B1 (en) Hash-based data block processing with intermittently-connected systems
KR100878861B1 (en) System for identifying common digital sequences
US20120197853A1 (en) System and method for sampling based elimination of duplicate data
US8285997B2 (en) Backup apparatus with higher security and lower network bandwidth consumption
US6668262B1 (en) Methods and apparatus for modifying a database
US7634657B1 (en) Reducing the probability of undetected collisions in hash-based data block processing
JP2005538467A (en) Method and apparatus for managing data health of backup data and disaster recovery data
WO2000050999A1 (en) Method and system for mirroring and archiving mass storage
WO1999046660A2 (en) System and method for backing up computer files over a wide area computer network
US8195612B1 (en) Method and apparatus for providing a catalog to optimize stream-based data restoration
Xu Hydra: A platform for survivable and secure data storage systems
KR20230104877A (en) How to ensure confidentiality and integrity of data and metadata stored in untrusted environments

Legal Events

Date Code Title Description
AS Assignment

Owner name: FAIRBANKS SYSTEMS GROUP, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:MCCLAIN, FRED W.;BOLT, THOMAS B.;REEL/FRAME:009033/0826

Effective date: 19980311

STCF Information on status: patent grant

Free format text: PATENTED CASE

AS Assignment

Owner name: SKYDESK, INC., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:FAIRBANKS SYSTEMS GROUP;REEL/FRAME:010901/0520

Effective date: 20000603

AS Assignment

Owner name: IMPERIAL BANK, CALIFORNIA

Free format text: SECURITY INTEREST;ASSIGNOR:SKYDESK, INC.;REEL/FRAME:011190/0549

Effective date: 20000929

AS Assignment

Owner name: DOMINION VENTURE FINANCE L.L.C., CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SKYDESK, INC.;REEL/FRAME:012252/0523

Effective date: 20010706

AS Assignment

Owner name: SKYDESK, INC., CALIFORNIA

Free format text: REASSIGNMENT AND RELEASE OF SECURITY INTEREST;ASSIGNOR:COMERICA BANK-CALIFORNIA AS SUCCESSOR IN INTEREST TO IMPERIAL BANK;REEL/FRAME:012958/0791

Effective date: 20020603

AS Assignment

Owner name: SWAPDRIVE, INC., DISTRICT OF COLUMBIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SKYDESK, INC.;REEL/FRAME:013211/0690

Effective date: 20020603

FPAY Fee payment

Year of fee payment: 4

SULP Surcharge for late payment
AS Assignment

Owner name: SILICON VALLEY BANK, CALIFORNIA

Free format text: SECURITY AGREEMENT;ASSIGNOR:SWAPDRIVE, INC.;REEL/FRAME:018590/0215

Effective date: 20061130

AS Assignment

Owner name: SWAPDRIVE INC, DISTRICT OF COLUMBIA

Free format text: RELEASE;ASSIGNOR:SILICON VALLEY BANK;REEL/FRAME:019541/0970

Effective date: 20070705

FPAY Fee payment

Year of fee payment: 8

AS Assignment

Owner name: SYMANTEC CORPORATION, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SWAPDRIVE, INC.;REEL/FRAME:023163/0218

Effective date: 20080606

FPAY Fee payment

Year of fee payment: 12

AS Assignment

Owner name: SKY DESK, INC., CALIFORNIA

Free format text: RELEASE BY SECURED PARTY;ASSIGNOR:DOMINION VENTURE FINANCE L.L.C.;REEL/FRAME:029930/0053

Effective date: 20020531

Owner name: DOMINION VENTURA FINANCE L.L.C., CALIFORNIA

Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE NATURE OF CONVEYANCE: FROM ASSIGNMENT TO SECURITY AGREEMENT PREVIOUSLY RECORDED ON REEL 012252 FRAME 0524. ASSIGNOR(S) HEREBY CONFIRMS THE DOMINION VENTURA FINANCE L.L.C.;ASSIGNOR:SKYDESK, INC.;REEL/FRAME:029929/0864

Effective date: 20010706

AS Assignment

Owner name: DOMINION VENTURE FINANCE L.L.C., CALIFORNIA

Free format text: CORRECTIVE ASSIGNMENT TO CORRECT THE REEL/FRAME TO 012252/0523; AND CORRECT NAME TO DOMINION VENTURE FINANCE L.L.C. PREVIOUSLY RECORDED ON REEL 029929 FRAME 0864. ASSIGNOR(S) HEREBY CONFIRMS THE DOMINION VENTURE FINANCE L.L.C.;ASSIGNOR:SKYDESK, INC.;REEL/FRAME:029992/0229

Effective date: 20010706

AS Assignment

Owner name: CDD TECHNOLOGIES, LLC, TEXAS

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:SYMANTEC CORPORATION;REEL/FRAME:030088/0009

Effective date: 20121105