WO2004074996A2 - Broadcast bridge apparatus for transferring data to subsystems in a storage controller - Google Patents

Broadcast bridge apparatus for transferring data to subsystems in a storage controller Download PDF

Info

Publication number
WO2004074996A2
WO2004074996A2 PCT/US2004/004098 US2004004098W WO2004074996A2 WO 2004074996 A2 WO2004074996 A2 WO 2004074996A2 US 2004004098 W US2004004098 W US 2004004098W WO 2004074996 A2 WO2004074996 A2 WO 2004074996A2
Authority
WO
WIPO (PCT)
Prior art keywords
pci
data
bus
bridge
broadcast
Prior art date
Application number
PCT/US2004/004098
Other languages
French (fr)
Other versions
WO2004074996A3 (en
Inventor
Gene Maine
Original Assignee
Chapparal Network Storage, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chapparal Network Storage, Inc. filed Critical Chapparal Network Storage, Inc.
Publication of WO2004074996A2 publication Critical patent/WO2004074996A2/en
Publication of WO2004074996A3 publication Critical patent/WO2004074996A3/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F11/00Error detection; Error correction; Monitoring
    • G06F11/07Responding to the occurrence of a fault, e.g. fault tolerance
    • G06F11/16Error detection or correction of the data by redundancy in hardware
    • G06F11/20Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements
    • G06F11/2053Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant
    • G06F11/2056Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring
    • G06F11/2087Error detection or correction of the data by redundancy in hardware using active fault-masking, e.g. by switching out faulty elements or by switching in spare elements where persistent mass storage functionality or persistent mass storage control functionality is redundant by mirroring with a common controller
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network
    • H04L67/1097Protocols in which an application is distributed across nodes in the network for distributed storage of data in networks, e.g. transport arrangements for network file system [NFS], storage area networks [SAN] or network attached storage [NAS]
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/06Digital input from, or digital output to, record carriers, e.g. RAID, emulated record carriers or networked record carriers
    • G06F3/0601Interfaces specially adapted for storage systems
    • G06F3/0668Interfaces specially adapted for storage systems adopting a particular infrastructure
    • G06F3/0671In-line storage system
    • G06F3/0673Single storage device

Definitions

  • This invention relates in general to the field of mass storage systems and particularly to data transfers to redundant controllers in a mass storage system.
  • Today's computer networks include vast amounts of storage, require high data throughput, and demand high data availability. Many networks support hundreds or even thousands of users connected to them. Many networks store extremely valuable data, such as bank account information, personal medical information, databases whose unavailability equates to huge sums of lost revenue due to inability to sell a product or provide a service, and scientific data gathered over large amounts of time and at great expense.
  • a typical computer network includes one or more computers connected to one or more storage devices, such as disk drives or tape drives, by one or more storage controllers.
  • One technique for providing higher data availability in computer networks is to include redundant components in the network.
  • Providing redundant components means providing two or more of the components such that if one of the components fails, one of the other redundant components continues to perform the function of the failed component. In many cases, the failed component can be quickly replaced to restore the system to its original data availability level.
  • RAID redundant array of inexpensive disks
  • High performance storage controllers typically include relatively large memories for buffering data transferred between the host computers and the storage devices.
  • the storage controller receives the data from the host computer, writes the data into the storage controller memory, and informs the host computer that the data has been successfully transferred. Subsequently, the storage controller writes the data from its memory into the storage device. Buffering the data in this manner provides at least two advantages.
  • the buffering serves to alleviate bottlenecks that might arise from transfer speed mismatches between the host/storage controller interface and the storage controller/storage device interface.
  • the buffered data may be cached, such mat when a host subsequently reads the data, the storage controller can simply provide the data from its cache memory rather than having to first read the data from the storage device.
  • a potential problem with the buffered data approach described above is that if the storage controller memory fails, the data is lost forever.
  • a conventional approach is to provide two or more redundant memory subsystems.
  • the data is received from the host and written to the first redundant memory subsystem, and subsequently copied by the first memory subsystem to the second memory subsystem.
  • the second memory subsystem continues operation and writes the data to the storage device.
  • the redundant write is lower performing than a non-redundant write in at least two ways.
  • the initial write of the data and the copy of the data are serialized, which means the redundant write takes approximately twice as long to perform as a non-redundant write.
  • the redundant write consumes considerably more resource bandwidth than a non-redundant write.
  • the fact that the first memory is both written and read by a redundant write consumes twice the memory bandwidth of a non-redundant write.
  • the copy of the data from the first to the second memory subsystem consumes additional bandwidth on the bus connecting the two memory subsystems.
  • the present invention provides a bus bridge apparatus for improving redundant write performance in storage controllers by writing the data directly and concurrently to the memory subsystems rather than having the primary memory subsystem copy the data to the other memory subsystem. Accordingly, in attainment of the aforementioned object, it is a feature of the present invention to provide a broadcast bridge apparatus.
  • the apparatus includes a first port that receives data transmitted on a first local bus and a second port, coupled to the first port, which receives the data from the first port and provides the data for retransmission on a second local bus to a first memory subsystem.
  • the apparatus also includes a third port, coupled to the first port, which receives a copy of the data from the first port and selectively provides the copy of the data for retransmission on a third local bus to a second memory subsystem.
  • a bus bridge apparatus for broadcasting data from a first local bus on one side of the bridge to a plurality of redundant storage controllers coupled to second and third local buses on an opposite side of the bridge to relieve the redundant controllers from copying the data to one another.
  • the apparatus includes a first FIFO memory, coupled to receive data from the first local bus. The data is associated with a first write transaction on the first local bus.
  • the apparatus also includes first master logic, coupled to the first FIFO memory, which causes a second write transaction on the second local bus to transfer the data from the first FIFO memory to a first of the plurality of redundant storage controllers.
  • the apparatus also includes a second FIFO memory, coupled to receive the data from the first local bus.
  • the apparatus also includes second master logic, coupled to the second FIFO memory, which causes a third write transaction on the third local bus to transfer the data from the second FIFO memory to a second of the plurality of redundant storage controllers.
  • a PCI-X bus bridge for bridging a first PCI-X bus to second and third PCI-X buses.
  • the bus bridge includes first, second, and third PCI-X interfaces, coupled to the first, second, and third PCI-X buses, respectively.
  • the first PCI-X interface is configured to receive a plurality of write transactions from the first PCI-X bus.
  • the bus bridge also includes a plurality of broadcast bridge circuits, coupling the first PCI-X interface to the second and third PCI-X interfaces, each for causing both of the second and third PCI-X interfaces to retransmit a respective one of the plurality of write transactions on the second and third PCI-X buses, respectively.
  • the bus bridge includes a PCI-X target circuit that receives a PCI-X write command from a first PCI-X bus coupled to one side of the bus bridge.
  • the PCI-X write command specifies an address of data to be written.
  • the bus bridge also includes a control input to the PCI-X target circuit that indicates whether the address is within an address range of an address space of the first PCI-X bus.
  • the bus bridge also includes a write FIFO, coupled to the PCI-X target circuit, which receives the data for retransmission on a second PCI-X bus coupled to a side of the bus bridge opposite the first PCI-X bus.
  • the bus bridge also includes a broadcast FIFO, coupled to the PCI-X target circuit, which receives a copy of the data for retransmission on a third PCI-X bus coupled to the opposite side of the bus bridge.
  • the broadcast FIFO receives the copy of the data only if the address is within the address range.
  • the present invention provides a method for selectively performing a data transfer across a bus bridge to a plurality of memory subsystems in a storage controller.
  • the method includes receiving data on a first bus on one side of the bus bridge, writing the data to a first of the plurality of memory subsystems on a second bus on an opposite side of the bus bridge from the first bus, and determining whether the bus bridge is enabled to perform broadcast data transfers.
  • the method also includes writing a copy of the data to a second of the plurality of memory subsystems on a third bus on the opposite side of the bus bridge only if the bus bridge is enabled to perform broadcast data transfers.
  • the bus bridge writes the copy of the data to the second of the plurality of memory subsystems on the third bus substantially concurrently with the writing of the data to the first of the plurality of memory subsystems on the second bus.
  • the controller includes at least one I/O interface circuit that receives data from a host computer and writes the data to one or more storage devices.
  • the controller also includes a primary memory subsystem that buffers the data before being written to the storage devices.
  • the controller also includes a secondary memory subsystem that stores a redundant copy of the data.
  • the controller also includes a plurality of bus bridges that bridge a bus coupled to the at least one I/O interface circuit with a plurality of buses coupled to the primary and secondary memory subsystems. The plurality of bus bridges write the data received on the bus concurrently to the primary and secondary memory subsystems on first and second of the plurality of buses, respectively.
  • An advantage of the present invention is that it avoids the need for a memory subsystem to copy received data to its mirrored memory subsystem, which has several benefits.
  • Third, the fact that the memory subsystem processor does not have to issue a copy of the data reduces the command overhead time associated with a mirrored write, thereby freeing up memory subsystem processor bandwidth.
  • software development time may be reduced since the write of the data copy is performed in hardware rather than in software.
  • FIGURE 1 is a block diagram of a computer network, including a redundant network storage controller according to the present invention.
  • FIGURE 2 is a block diagram of the redundant network storage controller of Figure 1 according to the present invention.
  • FIGURE 3 is a block diagram of a PCI-X bridge controller of Figure 2 according to the present invention.
  • FIGURE 4 is a block diagram of a broadcast bridge apparatus of Figure 3 according to the present invention.
  • FIGURE 5 is a block diagram illustrating selective broadcast, or mirrored, writes to the redundant memory subsystems of Figure 2 based on a broadcast memory range according to the present invention.
  • FIGURE 6 is a flowchart illustrating operation of the redundant network storage controller of Figure 2 according to the present invention.
  • FIGURE 7 is a block diagram of the redundant network storage controller of Figure 2 illustrating data flow in a broadcast write example according to the present invention.
  • FIGURE 8 is a block diagram of the PCI-X bridge of Figure 3 illustrating data flow in the broadcast write example of Figure 7 according to the present invention.
  • FIGURE 9 is a block diagram of a related art conventional redundant network storage controller for illustrating data flow of a conventional mirrored write.
  • the computer network 100 comprises a plurality of host computers 104 coupled to a plurality of storage devices 106 through networks 114 and a redundant network storage controller 102.
  • the hosts 104 may be any type of computer, such as a file server, print server, enterprise server, or workstation.
  • the storage devices 106 may be any type of storage devices, such as disk drives, tape drives, or writeable CDROM drives.
  • computer network 100 comprises a storage area network (SAN), such as a Fibre Channel (FC) or Infrniband (IB) storage area network (SAN).
  • SAN storage area network
  • computer network 100 comprises a computer network with network attached storage (NAS).
  • the networks 114 comprise any of various network types, including a FC network, an finiband network, an Ethernet network, a Token Ring network, an Arcnet network, an FDDI network, an LocalTalk network, an ATM network, etc.
  • the interfaces 108 between hosts 104 and networks 114 and the interfaces 116 between networks 114 and redundant network storage controller 102 may be any one of various interfaces, such as a Fibre Channel, Ethernet, Infiniband (IB), Token Ring, Arcnet, FDDI, LocalTalk, or ATM.
  • the interfaces 112 between redundant network storage controller 102 and storage devices 106 may be any one of various interfaces, such as a Fibre Channel, Ethernet, advanced technology attachment (ATA), serial ATA (SATA), small computer systems interface (SCSI), or Iiifiniband interface.
  • hosts 104 are coupled directly to redundant network storage controller 102 rather than indirectly through networks 114.
  • the redundant network storage controller 102 includes redundant components for increased data availability.
  • the redundant network storage controller 102 writes the data into a plurality of redundant memory subsystems so that if one of the memory subsystems fails, the other memory subsystem can write the data to the destination storage devices 106.
  • the redundant network storage controller 102 writes the data to the plurality of redundant memory subsystems simultaneously, rather than writing the data to a first of the memory subsystems and then having the first memory subsystem copy the data to the second memory subsystem, and to other redundant memory subsystems, if present.
  • redundant network storage controller 102 includes four interface modules 206 coupled to two redundant memory subsystems 222 by four PCI-X buses 212.
  • the PCI-X buses and devices described herein conform substantially to the PCI-X 1.0 specification.
  • the four interface modules 206 are referred to individually as interface module-A 206A, interface module-B 206B, interface module-C 206C, and interface module-D 206D.
  • the two memory subsystems 222 are referred to individually as memory subsystem-A 222A and memory subsystem-B 222B.
  • the PCI-X buses 212 are referred to individually as PCI-X bus-A 212A, PCI- X bus-B 212B, PCI-X bus-C 212C, and PCI-X bus-D 212D.
  • Memory subsystem-A 222A is coupled to PCI-X bus-D 212D and PCI-X bus-C 212C.
  • Memory subsystem-B 222B is coupled to PCI-X bus-B 212B and PCI-X bus-A 212A.
  • the memory subsystems 222 provide arbitration for PCI-X buses 212.
  • Each of the interface modules 206 includes two interface controllers 204 and a four-ported PCI-X bus bridge 202.
  • One of the interface controllers 204 also referred to as I O interface circuits 204, is coupled to one of the PCI-X bridge 202 ports via a first PCI-X bus 208A and the other interface controller 204 is coupled to another one , of the PCI-X bridge 202 ports via a second PCI-X bus 208B.
  • the interface controllers 204 are dual-ported Fibre Channel (FC) to PCI-X controllers. That is, each interface controller 204 includes two FC ports for coupling the redundant network storage controller 102 to hosts 104 and/or storage devices 106 via interfaces 108 and/or 112 of Figure 1.
  • FC Fibre Channel
  • PCI-X bridge 202 provides arbitration for PCI-X buses 208.
  • Interface controllers 204 perform FC protocol to PCI-X protocol conversion.
  • Interface controllers 204 receive FC frames on their FC ports and responsively initiate PCI-X commands via their PCI-X ports on PCI-X buses 208 targeted at the memory subsystems 222.
  • interface controllers 204 are the target of PCI-X commands from PCI-X bus 208 initiated by the memory controllers 222 and retransmitted by PCI-X bridge 202; the interface controllers 204 transmit FC frames on interfaces 108 and/or 112 to hosts 104 and/or storage devices 106 in response to the PCI-X commands.
  • interface controllers 204 receive FC frames with write data destined for the memory subsystems 222 and initiate PCI-X burst write commands, such as memory write 'or memory write block commands, on their PCI-X ports to the PCI-X bridge 202 for retransmission to each of the memory subsystems 222 in a broadcasted, or mirrored, fashion, as discussed below.
  • Each PCI-X bridge 202 includes four PCI-X ports for coupling to four PCI-X buses. Two of the PCI-X bridge 202 ports are coupled to the PCI-X port of the two interface controllers 204 via PCI-X buses 208A and 208B.
  • the other two PCI-X bridge 202 ports are coupled to two of the four PCI-X buses 212.
  • the PCI-X bridge 202 of interface module-A 206A is coupled to PCI-X bus-A 212A and PCI-X bus-C 212C;
  • the PCI-X bridge 202 of interface module-B 206B is coupled to PCI- X bus-D 212D and PCI-X bus-B 212B;
  • the PCI-X bridge 202 of interface module-C 206C is coupled to PCI-X bus-C 212C and PCI-X bus-A 212A;
  • the PCI-X bridge 202 of interface module-D 206D is coupled to PCI-X bus-B 212B and PCI-X bus-D 212D.
  • each of the PCI-X bridges 202 is coupled to each of the memory subsystems 222.
  • the PCI-X bridges 202 act as the target of PCI-X commands on PCI- X buses 208 and retransmit the commands as initiators on PCI-X buses 212; conversely, the PCI-X bridges 202 act as the target of PCI-X commands on PCI-X buses 212 and retransmit the commands as initiators on PCI-X buses 208, thereby enabling communication between the interface controllers 204 and the memory subsystems 222.
  • a PCI-X bridge 202 receives a PCI-X burst write command on one of its PCI-X buses 208 on one side of the PCI-X bridge 202 and selectively broadcasts the write data on both of its PCI-X buses 212 on the other side of the PCI-X bridge 202 to each of the redundant memory subsystems 222, as discussed below.
  • the memory subsystems 222 each include a memory controller 224 coupled to a memory 226 and a microprocessor 228.
  • the memory controller 224 includes two PCI-X bus ports for interfacing with two of the PCI-X buses 212.
  • the memory controller 224 enables the transfer of data between the two PCI-X buses 212 and the memory 226.
  • the memory controller 224 also provides the processor 228 access to the memory 226.
  • the memory 226 is double-data-rate (DDR) SDRAM memory.
  • the microprocessor 228 performs storage device control functions, such as RAID functionality. For example, a memory subsystem 222 may receive a logical disk write command from a host 104 and receive the write command data into its memory 226.
  • the microprocessor 228 determines which sectors of which of the disk drives 106 are implicated by the disk write command and writes the data from the memory 226 to the appropriate disk 106 sectors by initiating PCI-X burst write commands to one or more interface modules 206.
  • memory subsystems 222 also provide out-of-band configuration and management interfaces for redundant network storage controller 102.
  • PCI-X bridge controller 202 includes four PCI-X interfaces 332, or ports 332, (referred to individually as 332A, 332B, 332C, and 332D), four broadcast bridges 302 (referred to individually as 302A, 302B, 302C, and 302D), and four sets of control logic 334 (referred to individually as 334A, 334B, 334C, and 334D).
  • Control logic 334 includes arbitration and multiplexing logic for selectively coupling the broadcast bridges 302 to the PCI-X interfaces 332 in predetermined combinations, as shown, and as described below.
  • control logic 334 performs arbitration and steers control and data signals of PCI-X buses 208/212 of Figure 2 between the broadcast bridges 302 and the PCI-X interfaces 332, as shown, depending upon which of the broadcast bridges 302 is currently performing a PCI-X transaction on the associated PCI-X bus 208/212, as described below.
  • control logic 334A selectively couples PCI-X signals 322A from broadcast bridge 302A or PCI-X signals 322B from broadcast bridge 302B to PCI-X interface 332A; control logic 334B selectively couples PCI-X signals 322C from broadcast bridge 302C or PCI-X signals 322D from broadcast bridge 302D to PCI-X interface 332B; control logic 334C selectively couples PCI-X signals 324A from broadcast bridge 302A or PCI-X signals 324B from broadcast bridge 302B or PCI-X signals 326C from broadcast bridge 302C or PCI-X signals 326D from broadcast bridge 302D to PCI-X interface 332C; control logic 334D selectively couples PCI-X signals 326A from broadcast bridge 302A or PCI-X signals 326B from broadcast bridge 302B or PCI-X signals 324C from broadcast bridge 302C or PCI-X signals 324D from broadcast bridge 302D to PCI-X interface 332D.
  • PCI-X signals 322A, 322B, 322C, or 322D are referred to generically as PCI-X signals 322;
  • PCI-X signals 324A, 324B, 324C, or 324D are referred to generically as PCI-X signals 324;
  • PCI-X signals 326A, 326B, 326C, or 326D are referred to generically as PCI-X signals 326.
  • PCI-X interfaces 332A and 332B are on the interface module 206 side of PCI-X bridge 202 and are coupled to PCI-X buses 208A and 208B, respectively.
  • PCI-X interfaces 332C and 332D are on the memory subsystem 222 side of PCI-X bridge 202 and are coupled to two of the PCI-X buses 212.
  • Figure 3 illustrates the PCI-X bridge 202 of interface module-A 206A, which is representative of the other PCI-X bridges 202 of Figure 2. Because Figure 3 illustrates the PCI-X bridge 202 of interface module-A 206A, PCI-X interfaces 332C and 332D are shown coupled to PCI-X buses 212A and 212C, respectively, to correspond with Figure 2.
  • Each of the broadcast bridges 302 includes send/receive first-in-first-out (FIFO) memories 304, a broadcast FIFO memory 306, and three PCI-X ports, denoted portl 312, portO 314, and port2 316, all of which are described in more detail with respect to Figure 4.
  • the FIFO memories are also referred to as buffers.
  • Port2 316 is capable of being a PCI-X bus master, and portl 312 and portO 314 are capable of being a PCI-X master and a PCI-X target.
  • Portl 312, the inputs of receive FIFOs 304, the outputs of send FIFOs 304, and the input of broadcast FIFO 306 are all coupled to PCI-X signals 322.
  • PortO 314, the outputs of receive FIFOs 304, and the inputs of send FIFOs 304 are all coupled to PCI-X signals 324.
  • Port2 316 and the output of broadcast FIFO 306 are all coupled to PCI-X signals 326.
  • portl 312 is coupled to portO 314 and port2316, as described below with respect to Figure 4.
  • PCI-X bridge 202 also includes a CPU bridge 352 coupled to control logic 334C and 334D.
  • CPU bridge 352 is a communication path between the two memory subsystems 222 of Figure 2, and in particular for facilitating communication between the memory subsystem 222 processors 228 of Figure 2 via PCI-X memory commands.
  • CPU bridge 352 provides address translation between the two memory subsystems 222, so that a desired PCI-X address range in one memory subsystem 222 may be mapped to a different PCI-X address range in the other memory subsystem 222.
  • the primary memory subsystem 222 of a mirrored write operation employs CPU bridge 352 to update tables in the secondary memory subsystem 222 that specify the presence and location of mirrored data in the secondary memory subsystem 222. Because the mirrored data is used by the secondary memory subsystem 222 only if the primary memory subsystem 222 fails, updating ,the tables via the CPU bridge 352 advantageously alleviates the need for the primary memory subsystem 222 to interrupt the secondary memory subsystem 222 on every transaction.
  • the PCI-X bridge 202 also includes a broadcast enable bit 354 contained in a control register coupled to CPU bridge 352.
  • PCI-X bridge 202 also includes a software-programmable broadcast address range register 356 coupled to CPU bridge 352.
  • the broadcast address range is predetermined, i.e., hardwired, into the PCI-X bridge 202 rather than being programmable in address range register 356.
  • the broadcast enable bit 354 and broadcast address range are used to selectively enable/disable broadcast writes described herein.
  • broadcast, or mirrored, writes may be selectively enabled/disabled.
  • Some application environments, such as video streaming, may prefer the higher performance afforded by two independent memory subsystems over the higher data availability of redundant memory subsystems and choose to disable broadcasted writes.
  • the memory subsystems 222 program the broadcast enable bit 354 based on whether host software configures the memory subsystems 222 as redundant or non-redundant.
  • the broadcast bridge 302 is includes portl 312, portO 314, port2 316, broadcast FIFO 306, and send/receive FIFOs 304 coupled to PCI-X signals 322, 324, and 326 of Figure 3.
  • the FIFOs 304 and 306 are used to buffer PCI-X transaction data as it is transferred through PCI-X bridge 202 between the PCI-X buses 208 and 212.
  • each of the send/receive FIFOs 304 and the broadcast FIFO 306 are capable of storing and forwarding up to 4KB of data, which is the maximum allowable PCI-X transfer size.
  • the FIFOs 304/306 include outputs that indicate how full or empty the FIFOs 304/306 are to enable the PCI-X interfaces 332 know when one or more blocks of data are present in the FIFOs 304/306 and to disconnect on block boundaries if necessary.
  • the send/receive FIFOs 304 include a write receive FIFO 492, a read receive FIFO 494, a write send FIFO 496, and a read send FIFO 498.
  • the inputs to write receive FIFO 492 and read receive FIFO 494 are coupled to PCI-X signals 322 and their outputs are coupled to PCI-X signals 324.
  • the inputs to write send FIFO 496 and read send FIFO 498 are coupled to PCI-X signals 324 and their outputs are coupled to PCI-X signals 322.
  • the input to broadcast FIFO 306 is coupled to PCI-X signals 322 and its output is coupled to PCI-X signals 326.
  • the send FIFOs 496/498 are used to transfer data in the direction from portO 314 to portl 312, i.e., from a PCI-X bus 212 on the memory subsystem 222 side of PCI-X bridge 202 to a PCI-X bus 208 on the interface controller 204 side of PCI-X bridge 202.
  • the receive FIFOs 492/494 are used to transfer data in the direction from portl 312 to portO 314, i.e., from a PCI-X bus 208 on the interface controller 204 side of PCI-X bridge 202 to a PCI-X bus 212 on the memory subsystem 222 side of PCI-X bridge 202.
  • the broadcast FIFO 306 is used to transfer data in the direction from portl 312 to port2 316, i.e., from a PCI-X bus 208 on the interface controller 204 side of PCI-X bridge 202 to a PCI-X bus 212 on the memory subsystem 222 side of PCI-X bridge 202.
  • broadcast FIFO 306 is selectively used to broadcast a mirrored copy of the write data to a secondary memory subsystem 222, as described below.
  • Portl 312 includes a PCI-X target circuit 414 coupled to an address/size register 412.
  • PCI-X target 414 is configured to function as a target of PCI-X commands initiated by the interface controller 204 of Figure 2, which is selectively coupled to portl 312 via control logic 334 and PCI-X interface 332 on PCI-X bus 208.
  • PCI-X target 414 receives from PCI-X signals 322 a PCI-X command 424, such as a PCI- X burst write command, initiated by one of the interface controllers 204 on a PCI-X bus 208.
  • a start byte address and byte transfer count 422 of the burst write command are stored in address/size register 412.
  • PCI-X target 414 generates a write_pending signal 432 provided to a PCI-X master circuit 444 in portO 314 to inform portO 314 that portl 312 has received a PCI-X burst write command and that associated write data is being stored into write receive FIFO 492.
  • portO 314 PCI-X master 444 loads the PCI-X burst write command address and count into an address/size register 442 of portO 314 from address/size register 412 of portl 312, and initiates a PCI-X burst write command on PCI-X signals 324 to perform a transfer of the write data from the write receive FIFO 492 to the primary memory subsystem 222 of Figure 2, which is selectively coupled to portO 314 via control logic 334 and PCI-X interface 332 on PCI-X bus 212.
  • PortO 314 PCI-X master 444 generates a write_done signal 434 to notify PCI-X target 414 that the PCI-X burst write command transferring the write data to the primary memory subsystem 222 has completed.
  • PCI-X target 414 also generates a broadcast_pending signal 436 provided to a PCI-X master circuit 474 in port2 316 to inform port2 316 that portl 312 has received a PCI-X burst write command and that a copy of associated write data is being stored into broadcast FIFO 306.
  • PCI-X target 414 only asserts the broadcast_pending signal 436 if portl 312 determines that a broadcast, or mirrored, write is desired.
  • Portl 312 determines whether a broadcast write is desired by examining a broadcast_enable input 408 and an in_broadcast_range input 406.
  • the broadcast_enable input 408 indicates the value of the broadcast enable bit 354 of Figure 3.
  • the in_broadcast_range input 406 is the output of range checking logic 402 comprised in portl 312.
  • the range checking logic 402 receives the PCI-X burst write command start byte address in address/size register 412 and a broadcast_address_range signal 404.
  • the broadcast_address_range signal 404 indicates the value stored in broadcast address range register 356 of Figure 3.
  • the value of the broadcast_address_range signal 404 is hardwired.
  • the range checking logic 402 compares the start byte address from address/size register 412 with broadcast_address_range signal 404 and generates a true value on in_broadcast_range signal 406 if the start byte address from address/size register 412 is within the address range specified by broadcast_addressjrange signal 404; otherwise, range checking logic 402 generates a false value on in_broadcast_range signal 406.
  • port2 316 PCI-X master 474 loads the PCI- X write block command address and count into an address/size register 472 of port2 316 from address/size register 412 of portl 312, and initiates a PCI-X burst write command on PCI-X signals 326 to perform a transfer of the write data from the broadcast FIFO 306 to the secondary memory subsystem 222 of Figure 2, which is selectively coupled to port2 316 via control logic 334 and PCI-X interface 332 on PCI-X bus 212.
  • Port2 316 PCI-X master 474 generates a broadcast_done signal 438 to notify PCI-X target 414 that the PCI-X burst write command transferring the write data to the secondary memory subsystem 222 has completed.
  • PCI-X target 414 and PCI-X master 444 operate in a similar manner just described to perform PCI-X read commands initiated by an interface controller 204 coupled to portl 312 to transfer data across PCI-X bridge 202 from a memory subsystem 222 to an interface controller 204 via read send FIFO 498.
  • the read command data transfer is not a broadcasted read.
  • Portl 312 also includes a PCI-X master circuit 416 coupled to PCI-X target 414 and PCI-X signals 322.
  • PortO 314 also includes a PCI-X target circuit 446 coupled to PCI-X master 444 and PCI-X signals 324.
  • PCI-X target 446 and PCI-X master 416 operate in a manner similar to target 414 and master 444 described above to perform PCI-X commands initiated by a memory subsystem 222 coupled to portO 314 to transfer data across PCI-X bridge 202 between a memory subsystem 222 and an interface controller 204 via write send FIFO 496 and read receive FIFO 494 in a non-broadcasted manner.
  • PCI-X target 414 also generates a busy output signal 426 to indicate whether it is currently busy servicing a PCI-X command initiated by an interface controller 204 on PCI-X bus 208.
  • PCI-X target 414 also receives a busy input signal 428 to indicate whether the other paired broadcast bridge 302 is currently busy servicing a PCI-X command initiated by an interface controller 204 on PCI-X bus 208.
  • the busy output 426 of portl 312 of broadcast bridge-A 302A is provided as the busy input 428 to portl 312 of broadcast bridge-B 302B.
  • the PCI-X bridge 202 includes two broadcast bridges 302 for each PCI-X bus pair 208/212, as shown, which accommodates overlapping PCI-X commands for increased performance over a single broadcast bridge 302 per PCI-X bus pair 208/212 configuration.
  • more than two broadcast bridges 302 per PCI-X bus pair 208/212 may be employed in PCI-X bridge 202 depending upon the demands of the application employing the PCI-X bridge 202.
  • each of the broadcast bridges 302 coupled to a PCI-X interface 332 sees the beginning of a PCI-X command initiated on PCI-X bus 208/212 via control logic 334, and the non-busy broadcast bridges 302 (i.e., those not asserting their busy output 426) respond to the PCI-X command in a round-robin fashion.
  • the control logic 334 couples to the PCI-X interface 332 the PCI-X signals 322 coupled to the non-busy broadcast bridge 302 that is selected to service the PCI-X command.
  • portO 314, portl 312, and port2 316 operate according to different clock sources since each of the four PCI-X interfaces 332 of Figure 3 operate based on independent clock sources. Therefore, synchronization logic is provided to synchronize the signals that communicate between portO 314, portl 312, and ⁇ ort2 316, such as writejpending signal 432, write_done signal 434, broadcast_pending signal 436, and broadcast _done signal 438.
  • FIG. 5 a block diagram illustrating selective broadcast, or mirrored, writes to the redundant memory subsystems 222 of Figure 2 based on a broadcast memory range according to the present invention is shown.
  • Figure 5 shows an address space of one of the PCI-X buses 208 of Figure 2 to which one of the interface controllers 204 of Figure 2 is attached.
  • the address space is divided into three ranges: a first non-broadcast address range 502, a broadcast address range 504, and a second non-broadcast range 506.
  • the broadcast address range 504 is stored in, i.e., known to, the PCI-X bridge 202 of Figure 2 and indicated on broadcast_address_range signal 404 of Figure 4.
  • the broadcast address range 504 is stored in the PCI-X bridge 202 because the broadcast address range 504 is hard-coded into the PCI-X bridge 202. In another embodiment, the broadcast address range 504 is stored in the PCI-X bridge 202 because the PCI-X bridge 202 includes broadcast address range register 356 of Figure 3, which specifies the broadcast address range 504. In one embodiment, broadcast address range register 356 is programmable by software. The first and second non-broadcast address ranges 502 and 506, respectively, represent the remainder of the address space of the PCI-X bus 208.
  • Figure 5 also shows an address space of the PCI-X bus 212 of Figure 2 to which one of the memory subsystems 222 of Figure 2 is attached, which will be referred to as the primary memory subsystem 222.
  • the primary memory subsystem 222 address space is also divided into three ranges: a first non-broadcast address range 512 corresponding to first non-broadcast address range 502, a broadcast address range 514 corresponding to broadcast address range 504, and a second non-broadcast range 516 corresponding to second non-broadcast range 506.
  • Figure 5 also shows an address space of the PCI-X bus 212 of Figure 2 to which the other of the memory subsystems 222 of Figure 2 is attached, which will be referred to as the secondary memory subsystem 222.
  • the secondary memory subsystem 222 address space is also divided into three ranges: a first non-broadcast address range 522 co ⁇ esponding to first non-broadcast address range 502, a broadcast address range 524 corresponding to broadcast address range 504, and a second non-broadcast range 526 corresponding to second non-broadcast range 506.
  • write requests on the interface controller 204 side PCI-X bus 208 having an address in broadcast address range 504 are retransmitted by PCI-X bridge 202 to the corresponding address in both the primary broadcast address range 514 and the secondary broadcast address range 524 - assuming the broadcast enable bit 354 is set.
  • write requests on the interface controller 204 side PCI-X bus 208 having an address in one of the non-broadcast address ranges 502/506 are retransmitted by PCI-X bridge 202 only to the corresponding address in the primary broadcast address range 512/516.
  • multiple broadcast address ranges may be used, rather than a single broadcast address range. If the PCI-X burst write command start byte address falls within any of the multiple broadcast address ranges, the PCI-X bridge 202 performs a broadcast write.
  • FIG. 6 a flowchart illustrating operation of the redundant network storage controller 102 of Figure 2 according to the present invention is shown. Flow begins at block 602.
  • one of the interface modules 206 of Figure 2 receives write data from a host 104 of Figure 1.
  • the interface module 206 may receive a Fibre Channel frame containing the write data.
  • the interface controller 204 of Figure 2 receiving the write data generates a PCI-X burst write command, such as a PCI-X memory write block command, on the PCI-X bus 208 of Figure 2 coupled to the interface controller 204.
  • Flow proceeds to block 604.
  • the PCI-X write command generated at block 602 is conveyed to portl 312 of each of the two broadcast bridges 302 of Figure 3, which are coupled to the PCI-X bus 208 via PCI-X interface 332 and control logic 334 of Figure 3.
  • One of the broadcast bridges 302 that is not busy (as determined by busy inputs 428) according to the round-robin scheme responds to the PCI-X write command and asserts its busy output 426 of Figure 4. Consequently, control logic 334 uncouples the non-responding broadcast bridge 302 from PCI-X signals 332 and continues coupling the responding broadcast bridge 202 to PCI-X signals 322. Flow proceeds to decision block 606.
  • portl 312 determines whether the address of the PCI-X write command is in the broadcast address range stored in the PCI-X bridge 202 (such as in broadcast address range register 356 in one embodiment), and whether the broadcast enable bit 354 is set. That is, portl 312 determines whether both of the broadcast_enable 408 and in_broadcast_range 406 signals are true. If not, flow proceeds to block 612 to perform a non-broadcast, or non-mirrored, write. Otherwise, flow proceeds in parallel to one flow beginning at block 622 and another flow beginning at block 632 to perform a broadcast, or mirrored, write.
  • portl 312 notifies portO 314 of Figure 4 of the pending PCI-X write command via write_pending signal 432. That is, portl 312 notifies portO 314 that write data is being written into write receive FIFO 492 and provides the write command address and count to address/size register 442. Flow proceeds to block 614.
  • portO 314 retransmits the PCI-X write command to the primary memory subsystem 222 on PCI-X bus 212 via PCI-X signals 324, which are selectively coupled to PCI-X bus 212 via control logic 334 and PCI-X interface 332.
  • Retransmitting the PCI-X write command includes providing the write data from write receive FIFO 492 to the primary memory subsystem 222 via PCI-X bus 212.
  • Flow proceeds to block 616.
  • the memory controller 224 of the primary memory subsystem 222 receives the retransmitted PCI-X burst write command, including the write data from the write receive FIFO 492, and writes the data into its memory 226. Flow proceeds to block 618.
  • portO 314 notifies portl 312 that the write data has been transmitted to the primary memory subsystem 222, i.e., that the PCI-X write command has completed. Flow proceeds to block 619.
  • portl 312 deasserts its busy output signal 426 in response to the write command completion. Flow ends at block 619.
  • Blocks 622 through 628 are essentially the same as blocks 612 through 618. However, blocks 622 through 628 are part of a mirrored write due to the operations performed in blocks 632 through 638, as described below.
  • portl 312 notifies portO 314 of the pending PCI-X write command via write_pending signal 432. That is, portl 312 notifies portO 314 that write data is being written into write receive FIFO 492 and provides the write command address and count to address/size register 442. Flow proceeds to block 624.
  • portO 314 retransmits the PCI-X write command to the primary memory subsystem 222 on PCI-X bus 212 via PCI-X signals 324, which are selectively coupled to PCI-X bus 212 via control logic 334 and PCI-X interface 332.
  • Retransmitting the PCI-X write command includes providing the write data from write receive FIFO 492 to the primary memory subsystem 222 via PCI-X bus 212.
  • Flow proceeds to block 626.
  • the memory controller 224 of the primary memory subsystem 222 receives the retransmitted PCI-X burst write command, including the write data from the write receive FIFO 492, and writes the data into its memory 226. Flow proceeds to block 628.
  • portO 314 notifies portl 312 that the write data has been transmitted to the primary memory subsystem 222, i.e., that the PCI-X write command has completed. Flow proceeds to block 629.
  • portl 312 notifies port2 316 of Figure 4 of the pending PCI-X write command via broadcastjending signal 436. That is, portl 312 notifies port2 316 that write data is being written into broadcast FIFO 306 and provides the write command address and count to address/size register 472. Flow proceeds to block 634.
  • port2 316 retransmits the PCI-X write command to the secondary memory subsystem 222 on PCI-X bus 212 via PCI-X signals 326, which are selectively coupled to PCI-X bus 212 via control logic 334 and PCI-X interface 332.
  • Retransmitting the PCI-X write command includes providing a copy of the write data from broadcast FIFO 306 to the secondary memory subsystem 222 via PCI-X bus 212.
  • Flow proceeds to block 636.
  • the memory controller 224 of the secondary memory subsystem 222 receives the retransmitted PCI-X burst write command, including the copy of the write data from the broadcast FIFO 306, and writes the data into its memory 226. Flow proceeds to block 638.
  • port2 316 notifies portl 312 that the write data has been transmitted to the secondary memory subsystem 222, i.e., that the PCI-X write command has completed. Flow proceeds to block 629.
  • portl 312 has been notified by each of portO 314 and port2 316 that their respective PCI-X write commands to the primary and secondary memory subsystem 222, respectively, have completed, and responsively deasserts its busy output 426. Now the broadcast bridge 302 is ready to receive data associated with another PCI-X write command into its write receive FIFO 492 and its broadcast FIFO 306. Flow ends at block 629.
  • FIG. 7 a block diagram of the redundant network storage controller 102 of Figure 2 illustrating data flow in a broadcast write example according to the present invention is shown.
  • memory subsystem-A 222A is the primary memory subsystem and memory subsystem-B 222B is the secondary memory subsystem of a broadcast write.
  • the flow of data through redundant network storage controller 102 is indicated in Figure 7 by thick shaded arrows with numbers contained therein. The direction of the arrows shows the direction of data flow. The sequence of the numbers inside the arrows specifies the sequence of data flowing through the redundant network storage controller 102.
  • a host 104 of Figure 1 transmits write data to redundant network storage controller 102 of Figure 7.
  • the write data is received by the interface controller 204 coupled to interface 108, according to block 602 of Figure 6, as shown by arrow 1.
  • the interface controller 204 generates a PCI-X burst write command on PCI-X bus 208A in response to reception of the host write data, according to block 602, as shown by arrow 2.
  • PCI-X bridge 202 receives the PCI-X write command, according to block 604 of Figur ⁇ e
  • PCI-X bridge 202 determines that the write command address is within the broadcast address range and that broadcast enable bit is set, according to block 606 of Figure 6. Consequently, PCI-X bridge 202 transmits the write data to the primary memory subsystem-A 222A on PCI-X bus 212C, according to blocks 622 and 624 of Figure 6, as shown by arrow 3 A. Concurrently, PCI-X bridge 202 transmits a copy of the write data to the secondary memory subsystem-B 222B on PCI-X bus 212A, according to blocks 632 and 634 of Figure 6, as shown by arrow 3B.
  • the primary memory subsystem-A 222A receives the data and writes the data into its cache memory 226, according to block 626 of Figure 6, as shown by arrow 4A.
  • secondary memory subsystem-B 222B receives the copy of the data and writes the data into its cache memory 226, accordmg to block 636 of Figure 6, as shown by arrow 4B.
  • FIG. 8 a block diagram of the PCI-X bridge 202 of Figure 3 illustrating data flow in the broadcast write example of Figure 7 according to the present invention is shown.
  • the flow of data through PCI-X bridge 202 is indicated by thick shaded arrows denoted 2, 3A, and 3B, corresponding to the data flow example arrows of Figure 7.
  • PCI-X interface 332A of Figure 8 receives the PCI-X write command on PCI-X bus 208A from interface controller 204 of Figure 7 and conveys the command to control logic 334A, as shown by arrow 2.
  • broadcast bridge 302B is the next non-busy broadcast bridge 302 in the round-robin scheme; therefore, control logic 334A selectively couples PCI-X signals 322B to PCI-X interface 332A since portl 312 of broadcast bridge 302B responds to the PCI-X write command and asserts its busy output 426, according to block 604 of Figure 6, as shown by arrow 2.
  • Write receive FIFO 492 of Figure 4 included in send/receive FIFOs 304 of Figure 8, provides the write data on PCI-X signals 324B to control logic 334C; control logic 334C selectively provides the write data to PCI-X interface 332C, which in turn provides the data on PCI-X bus 212A, according to blocks 622 and 624 of Figure 6, as shown by arrow 3A.
  • broadcast FIFO 306 of Figure 8 provides a copy of the write data on PCI-X signals 326B to control logic 334D; control logic 334D selectively provides the write data to PCI-X interface 332D, which in turn provides the data on PCI-X bus 212C, according to blocks 632 and 634 of Figure 6, as shown by arrow 3B.
  • control logic 334D selectively provides the write data to PCI-X interface 332D, which in turn provides the data on PCI-X bus 212C, according to blocks 632 and 634 of Figure 6, as shown by arrow 3B.
  • Figure 9 may be compared to Figure 7 in order to more fully appreciate the advantages of the present invention over the conventional method.
  • the conventional storage controller 900 includes only two interface modules 906, denoted interface module-C 906C and interface module-D 906D, rather than the four interface modules 206 of Figure 7.
  • PCI-X ports of the interface controllers 204 of Figure 9 are coupled directly to the memory subsystems 222 via the PCI-X buses 212. That is, one of the interface controllers 204 of interface module 906C is coupled to PCI-X bus 212C and the other interface controller 204 is coupled to PCI-X bus 212A. Similarly, one of the interface controllers 204 of interface module 906D is coupled to PCI-X bus 212B and the other interface controller 204 is coupled to PCI-X bus 212D.
  • each of the interface modules 906 of the conventional storage controller 900 includes a two-ported PCI-X bridge 902 rather than the four-ported PCI-X bridge 202 of Figure 7.
  • PCI-X bridge 902 of interface module 906C couples PCI-X bus 212A and 212C.
  • PCI-X bridge 902 of interface module 906D couples PCI-X bus 212B and 212D.
  • the memory controller 924 of Figure 9 includes a DMA controller for performing data transfers between its memory 226 and its PCI-X buses 212.
  • a host transmits write data to conventional storage controller 900 of Figure 9, as shown by arrow 1.
  • the write data is received by the interface controller 204 of interface module 906C, which generates a PCI-X write command on PCI-X bus 212C to the primary memory subsystem-A 222 A in response to reception of the host write data, as shown by arrow 2.
  • the primary memory subsystem-A 222A receives the data and writes the data into its cache memory 226, as shown by arrow 3.
  • the processor 228 of the primary memory subsystem-A 222A receives notification of the transfer of the write data into its memory 226 and instructs the memory controller 924 to read the just-written write data from its memory 226, as shown by arrow 4, and to copy the write data out on PCI-X bus 212D to interface module-D 906D, as shown by arrow 5.
  • PCI-X bridge 902 of interface module-D 206D receives the copy of the write data from the primary memory subsystem-A 222A and retransmits the PCI-X write command, including the copy of the write data, on PCI-X bus 212B to the secondary memory subsystem-B 222B, as shown by arrow 6.
  • the secondary memory subsystem-B 222B receives the copy of the data and writes it into its cache memory 226, as shown by arrow 7.
  • the conventional storage controller 900 takes a substantially longer time to perform a mirrored write than the redundant network storage controller 102 of Figure 7 since the conventional storage controller 900 performs its copy of the data to the secondary memory subsystem 222 in series with the write of the data to the primary memory subsystem 222; whereas, the redundant network storage controller 102 broadcasts the data to the secondary memory subsystem 222 simultaneously with the write of the data to the primary memory subsystem 222.
  • the redundant network storage controller 102 performs the broadcast write directly to the secondary memory subsystem 222, there is no need for the primary memory subsystem 222 to copy the data to the secondary memory subsystem 222, which makes more efficient use of the bandwidth of the PCI-X buses 212 and the bus between the memory 226 and memory controller 224. As may be observed, twice as much memory 226 bus bandwidth is consumed and 50% more PCI-X bus bandwidth 212 is consumed by the conventional storage controller 900 to perform a mirrored write than the redundant network storage controller 102 of the present invention.
  • PCI-X bus In addition, although embodiments have been described with respect to the PCI-X bus, the invention is adaptable to work with other buses, such as the PCI bus, PCI Express bus, PCI-X2 bus, EISA bus, VESA bus, Futurebus, VME bus, MultiBus, RapidlO bus, AGP bus, ISA bus, 3GIO bus, Hypertransport bus, Fibre Channel, Ethernet, ATA, SATA, SCSI, Infiniband, etc.
  • the bus bridge has been described with a particular number of broadcast bridges, the number of broadcast bridges per bus bridge may be varied to meet the demands of the particular application in which the bus bridge is employed.

Abstract

A bus bridge apparatus for performing broadcasted writes to redundant memory subsystems in a network storage controller is disclosed. The bus bridge includes a PCI-X target that receives a write command on a first PCI-X bus on one side of the bridge. The target is coupled to two PCI-X masters coupled to primary and secondary memory subsystems by respective PCI-X buses on the other side of the bridge. A first FIFO buffers the write command data between the target and the first master, and a second FIFO buffers a copy of the data between the target and the second master. The first and second masters concurrently retransmit the write command on their respective PCI-X buses to the primary and secondary memory subsystems. However, the second master only retransmits if broadcasting is enabled and the write command address is in a broadcast address range known by the bus bridge.

Description

BROADCAST BRIDGE APPARATUS FOR TRANSFERRING DATA TO SUBSYSTEMS IN A
STORAGE CONTROLLER by
Gene Maine
FIELD OF THE INVENTION
[0001] This invention relates in general to the field of mass storage systems and particularly to data transfers to redundant controllers in a mass storage system.
BACKGROUND OF THE INVENTION
[0002] Today's computer networks include vast amounts of storage, require high data throughput, and demand high data availability. Many networks support hundreds or even thousands of users connected to them. Many networks store extremely valuable data, such as bank account information, personal medical information, databases whose unavailability equates to huge sums of lost revenue due to inability to sell a product or provide a service, and scientific data gathered over large amounts of time and at great expense.
[0003] A typical computer network includes one or more computers connected to one or more storage devices, such as disk drives or tape drives, by one or more storage controllers. One technique for providing higher data availability in computer networks is to include redundant components in the network. Providing redundant components means providing two or more of the components such that if one of the components fails, one of the other redundant components continues to perform the function of the failed component. In many cases, the failed component can be quickly replaced to restore the system to its original data availability level.
[0004] A popular example of providing redundant components in a system is the notion of a redundant array of inexpensive disks (RAID). With a RAID, data is written to the plurality of disk drives in such a manner that if one of the disk drives fails, the data may be recovered from the remaining disk drives. In the simplest RAID configuration, commonly referred to as RAID level 1, all data is written to two disk drives which are maintained as a mirrored pair. If one of the mirrored drives fails, the desired data may be read from the remaining disk in the mirrored pair.
[0005] Another example of providing redundancy is within a storage controller in a computer network. High performance storage controllers typically include relatively large memories for buffering data transferred between the host computers and the storage devices. In particular, when a host computer writes data to a storage device via the storage controller, the storage controller receives the data from the host computer, writes the data into the storage controller memory, and informs the host computer that the data has been successfully transferred. Subsequently, the storage controller writes the data from its memory into the storage device. Buffering the data in this manner provides at least two advantages. First, the buffering serves to alleviate bottlenecks that might arise from transfer speed mismatches between the host/storage controller interface and the storage controller/storage device interface. Second, the buffered data may be cached, such mat when a host subsequently reads the data, the storage controller can simply provide the data from its cache memory rather than having to first read the data from the storage device.
[0006] A potential problem with the buffered data approach described above is that if the storage controller memory fails, the data is lost forever. To alleviate this problem, a conventional approach is to provide two or more redundant memory subsystems. In the conventional redundant storage controller memory approach, the data is received from the host and written to the first redundant memory subsystem, and subsequently copied by the first memory subsystem to the second memory subsystem. By this approach, if the first memory subsystem fails, the second memory subsystem continues operation and writes the data to the storage device.
[0007] Unfortunately, there appears to be a paradigm in mass storage design such that performance and data availability are two opposing goals. Redundancy seems to imply lower performance. In the redundant storage controller example above, the redundant write is lower performing than a non-redundant write in at least two ways. First, the initial write of the data and the copy of the data are serialized, which means the redundant write takes approximately twice as long to perform as a non-redundant write. Second, the redundant write consumes considerably more resource bandwidth than a non-redundant write. In particular, the fact that the first memory is both written and read by a redundant write consumes twice the memory bandwidth of a non-redundant write. Additionally, the copy of the data from the first to the second memory subsystem consumes additional bandwidth on the bus connecting the two memory subsystems.
[0008] Therefore what is needed is an apparatus and method for providing higher performance redundant writes to redundant memory subsystems in storage controllers.
SUMMARY
[0009] The present invention provides a bus bridge apparatus for improving redundant write performance in storage controllers by writing the data directly and concurrently to the memory subsystems rather than having the primary memory subsystem copy the data to the other memory subsystem. Accordingly, in attainment of the aforementioned object, it is a feature of the present invention to provide a broadcast bridge apparatus. The apparatus includes a first port that receives data transmitted on a first local bus and a second port, coupled to the first port, which receives the data from the first port and provides the data for retransmission on a second local bus to a first memory subsystem. The apparatus also includes a third port, coupled to the first port, which receives a copy of the data from the first port and selectively provides the copy of the data for retransmission on a third local bus to a second memory subsystem. [0010] In another aspect, it is a feature of the present invention to provide a bus bridge apparatus for broadcasting data from a first local bus on one side of the bridge to a plurality of redundant storage controllers coupled to second and third local buses on an opposite side of the bridge to relieve the redundant controllers from copying the data to one another. The apparatus includes a first FIFO memory, coupled to receive data from the first local bus. The data is associated with a first write transaction on the first local bus. The apparatus also includes first master logic, coupled to the first FIFO memory, which causes a second write transaction on the second local bus to transfer the data from the first FIFO memory to a first of the plurality of redundant storage controllers. The apparatus also includes a second FIFO memory, coupled to receive the data from the first local bus. The apparatus also includes second master logic, coupled to the second FIFO memory, which causes a third write transaction on the third local bus to transfer the data from the second FIFO memory to a second of the plurality of redundant storage controllers.
[0011] In another aspect, it is a feature of the present invention to provide a PCI-X bus bridge for bridging a first PCI-X bus to second and third PCI-X buses. The bus bridge includes first, second, and third PCI-X interfaces, coupled to the first, second, and third PCI-X buses, respectively. The first PCI-X interface is configured to receive a plurality of write transactions from the first PCI-X bus. The bus bridge also includes a plurality of broadcast bridge circuits, coupling the first PCI-X interface to the second and third PCI-X interfaces, each for causing both of the second and third PCI-X interfaces to retransmit a respective one of the plurality of write transactions on the second and third PCI-X buses, respectively.
[0012] In another aspect, it is a feature of the present invention to provide a PCI-X bus bridge. The bus bridge includes a PCI-X target circuit that receives a PCI-X write command from a first PCI-X bus coupled to one side of the bus bridge. The PCI-X write command specifies an address of data to be written. The bus bridge also includes a control input to the PCI-X target circuit that indicates whether the address is within an address range of an address space of the first PCI-X bus. The bus bridge also includes a write FIFO, coupled to the PCI-X target circuit, which receives the data for retransmission on a second PCI-X bus coupled to a side of the bus bridge opposite the first PCI-X bus. The bus bridge also includes a broadcast FIFO, coupled to the PCI-X target circuit, which receives a copy of the data for retransmission on a third PCI-X bus coupled to the opposite side of the bus bridge. The broadcast FIFO receives the copy of the data only if the address is within the address range.
[0013] In another aspect, it is a feature of the present invention to provide a method for selectively performing a data transfer across a bus bridge to a plurality of memory subsystems in a storage controller. The method includes receiving data on a first bus on one side of the bus bridge, writing the data to a first of the plurality of memory subsystems on a second bus on an opposite side of the bus bridge from the first bus, and determining whether the bus bridge is enabled to perform broadcast data transfers. The method also includes writing a copy of the data to a second of the plurality of memory subsystems on a third bus on the opposite side of the bus bridge only if the bus bridge is enabled to perform broadcast data transfers. The bus bridge writes the copy of the data to the second of the plurality of memory subsystems on the third bus substantially concurrently with the writing of the data to the first of the plurality of memory subsystems on the second bus.
[0014] In another aspect, it is a feature of the present invention to provide a redundant network storage controller. The controller includes at least one I/O interface circuit that receives data from a host computer and writes the data to one or more storage devices. The controller also includes a primary memory subsystem that buffers the data before being written to the storage devices. The controller also includes a secondary memory subsystem that stores a redundant copy of the data. The controller also includes a plurality of bus bridges that bridge a bus coupled to the at least one I/O interface circuit with a plurality of buses coupled to the primary and secondary memory subsystems. The plurality of bus bridges write the data received on the bus concurrently to the primary and secondary memory subsystems on first and second of the plurality of buses, respectively.
[0015] An advantage of the present invention is that it avoids the need for a memory subsystem to copy received data to its mirrored memory subsystem, which has several benefits. First, the actual data transfer time is reduced since the two writes to the two memory subsystems are performed concurrently rather than sequentially. Second, less local bus bandwidth is consumed since the present invention performs one less local bus write transaction than the conventional method. Third, the fact that the memory subsystem processor does not have to issue a copy of the data reduces the command overhead time associated with a mirrored write, thereby freeing up memory subsystem processor bandwidth. Finally, software development time may be reduced since the write of the data copy is performed in hardware rather than in software.
[0016] Other features and advantages of the present invention will become apparent upon study of the remaining portions of the specification and drawings.
BRIEF DESCRIPTION OF THE DRAWINGS
[0017] FIGURE 1 is a block diagram of a computer network, including a redundant network storage controller according to the present invention.
[0018] FIGURE 2 is a block diagram of the redundant network storage controller of Figure 1 according to the present invention.
[0019] FIGURE 3 is a block diagram of a PCI-X bridge controller of Figure 2 according to the present invention.
[0020] FIGURE 4 is a block diagram of a broadcast bridge apparatus of Figure 3 according to the present invention. [0021] FIGURE 5 is a block diagram illustrating selective broadcast, or mirrored, writes to the redundant memory subsystems of Figure 2 based on a broadcast memory range according to the present invention.
[0022] FIGURE 6 is a flowchart illustrating operation of the redundant network storage controller of Figure 2 according to the present invention.
[0023] FIGURE 7 is a block diagram of the redundant network storage controller of Figure 2 illustrating data flow in a broadcast write example according to the present invention.
[0024] FIGURE 8 is a block diagram of the PCI-X bridge of Figure 3 illustrating data flow in the broadcast write example of Figure 7 according to the present invention.
[0025] FIGURE 9 is a block diagram of a related art conventional redundant network storage controller for illustrating data flow of a conventional mirrored write.
DETAILED DESCRIPTION
[0026] Referring now to Figure 1, a block diagram of a computer network 100 including a redundant network storage controller 102 according to the present invention is shown. The computer network 100 comprises a plurality of host computers 104 coupled to a plurality of storage devices 106 through networks 114 and a redundant network storage controller 102. The hosts 104 may be any type of computer, such as a file server, print server, enterprise server, or workstation. The storage devices 106 may be any type of storage devices, such as disk drives, tape drives, or writeable CDROM drives.
[0027] In one embodiment, computer network 100 comprises a storage area network (SAN), such as a Fibre Channel (FC) or Infrniband (IB) storage area network (SAN). In another embodiment, computer network 100 comprises a computer network with network attached storage (NAS). The networks 114 comprise any of various network types, including a FC network, an finiband network, an Ethernet network, a Token Ring network, an Arcnet network, an FDDI network, an LocalTalk network, an ATM network, etc. The interfaces 108 between hosts 104 and networks 114 and the interfaces 116 between networks 114 and redundant network storage controller 102 may be any one of various interfaces, such as a Fibre Channel, Ethernet, Infiniband (IB), Token Ring, Arcnet, FDDI, LocalTalk, or ATM. The interfaces 112 between redundant network storage controller 102 and storage devices 106 may be any one of various interfaces, such as a Fibre Channel, Ethernet, advanced technology attachment (ATA), serial ATA (SATA), small computer systems interface (SCSI), or Iiifiniband interface. In one embodiment, hosts 104 are coupled directly to redundant network storage controller 102 rather than indirectly through networks 114.
[0028] The redundant network storage controller 102 includes redundant components for increased data availability. In particular, when the hosts 104 write data to the redundant network storage controller 102, the redundant network storage controller 102 writes the data into a plurality of redundant memory subsystems so that if one of the memory subsystems fails, the other memory subsystem can write the data to the destination storage devices 106. Advantageously, as described below, the redundant network storage controller 102 writes the data to the plurality of redundant memory subsystems simultaneously, rather than writing the data to a first of the memory subsystems and then having the first memory subsystem copy the data to the second memory subsystem, and to other redundant memory subsystems, if present.
[0029] Referring now to Figure 2, a block diagram of the redundant network storage controller 102 of Figure 1 according to the present invention is shown. In the embodiment of Figure 2, redundant network storage controller 102 includes four interface modules 206 coupled to two redundant memory subsystems 222 by four PCI-X buses 212. In one embodiment, the PCI-X buses and devices described herein conform substantially to the PCI-X 1.0 specification. The four interface modules 206 are referred to individually as interface module-A 206A, interface module-B 206B, interface module-C 206C, and interface module-D 206D. The two memory subsystems 222 are referred to individually as memory subsystem-A 222A and memory subsystem-B 222B. The PCI-X buses 212 are referred to individually as PCI-X bus-A 212A, PCI- X bus-B 212B, PCI-X bus-C 212C, and PCI-X bus-D 212D. Memory subsystem-A 222A is coupled to PCI-X bus-D 212D and PCI-X bus-C 212C. Memory subsystem-B 222B is coupled to PCI-X bus-B 212B and PCI-X bus-A 212A. In one embodiment, the memory subsystems 222 provide arbitration for PCI-X buses 212.
[0030] Each of the interface modules 206 includes two interface controllers 204 and a four-ported PCI-X bus bridge 202. One of the interface controllers 204, also referred to as I O interface circuits 204, is coupled to one of the PCI-X bridge 202 ports via a first PCI-X bus 208A and the other interface controller 204 is coupled to another one , of the PCI-X bridge 202 ports via a second PCI-X bus 208B. In one embodiment, the interface controllers 204 are dual-ported Fibre Channel (FC) to PCI-X controllers. That is, each interface controller 204 includes two FC ports for coupling the redundant network storage controller 102 to hosts 104 and/or storage devices 106 via interfaces 108 and/or 112 of Figure 1. In one embodiment, PCI-X bridge 202 provides arbitration for PCI-X buses 208. Interface controllers 204 perform FC protocol to PCI-X protocol conversion. Interface controllers 204 receive FC frames on their FC ports and responsively initiate PCI-X commands via their PCI-X ports on PCI-X buses 208 targeted at the memory subsystems 222. Conversely, interface controllers 204 are the target of PCI-X commands from PCI-X bus 208 initiated by the memory controllers 222 and retransmitted by PCI-X bridge 202; the interface controllers 204 transmit FC frames on interfaces 108 and/or 112 to hosts 104 and/or storage devices 106 in response to the PCI-X commands. In particular, interface controllers 204 receive FC frames with write data destined for the memory subsystems 222 and initiate PCI-X burst write commands, such as memory write 'or memory write block commands, on their PCI-X ports to the PCI-X bridge 202 for retransmission to each of the memory subsystems 222 in a broadcasted, or mirrored, fashion, as discussed below. [0031] Each PCI-X bridge 202 includes four PCI-X ports for coupling to four PCI-X buses. Two of the PCI-X bridge 202 ports are coupled to the PCI-X port of the two interface controllers 204 via PCI-X buses 208A and 208B. The other two PCI-X bridge 202 ports are coupled to two of the four PCI-X buses 212. In the embodiment shown in Figure 2, the PCI-X bridge 202 of interface module-A 206A is coupled to PCI-X bus-A 212A and PCI-X bus-C 212C; the PCI-X bridge 202 of interface module-B 206B is coupled to PCI- X bus-D 212D and PCI-X bus-B 212B; the PCI-X bridge 202 of interface module-C 206C is coupled to PCI-X bus-C 212C and PCI-X bus-A 212A; and the PCI-X bridge 202 of interface module-D 206D is coupled to PCI-X bus-B 212B and PCI-X bus-D 212D. Hence, each of the PCI-X bridges 202 is coupled to each of the memory subsystems 222. The PCI-X bridges 202 act as the target of PCI-X commands on PCI- X buses 208 and retransmit the commands as initiators on PCI-X buses 212; conversely, the PCI-X bridges 202 act as the target of PCI-X commands on PCI-X buses 212 and retransmit the commands as initiators on PCI-X buses 208, thereby enabling communication between the interface controllers 204 and the memory subsystems 222. In particular, a PCI-X bridge 202 receives a PCI-X burst write command on one of its PCI-X buses 208 on one side of the PCI-X bridge 202 and selectively broadcasts the write data on both of its PCI-X buses 212 on the other side of the PCI-X bridge 202 to each of the redundant memory subsystems 222, as discussed below.
[0032] The memory subsystems 222 each include a memory controller 224 coupled to a memory 226 and a microprocessor 228. The memory controller 224 includes two PCI-X bus ports for interfacing with two of the PCI-X buses 212. The memory controller 224 enables the transfer of data between the two PCI-X buses 212 and the memory 226. The memory controller 224 also provides the processor 228 access to the memory 226. In one embodiment, the memory 226 is double-data-rate (DDR) SDRAM memory. In one embodiment, the microprocessor 228 performs storage device control functions, such as RAID functionality. For example, a memory subsystem 222 may receive a logical disk write command from a host 104 and receive the write command data into its memory 226. In response, the microprocessor 228 determines which sectors of which of the disk drives 106 are implicated by the disk write command and writes the data from the memory 226 to the appropriate disk 106 sectors by initiating PCI-X burst write commands to one or more interface modules 206. In one embodiment, memory subsystems 222 also provide out-of-band configuration and management interfaces for redundant network storage controller 102.
[0033] Referring now to Figure 3, a block diagram of a PCI-X bridge controller 202 of Figure 2 according to the present invention is shown. PCI-X bridge 202 includes four PCI-X interfaces 332, or ports 332, (referred to individually as 332A, 332B, 332C, and 332D), four broadcast bridges 302 (referred to individually as 302A, 302B, 302C, and 302D), and four sets of control logic 334 (referred to individually as 334A, 334B, 334C, and 334D). Control logic 334 includes arbitration and multiplexing logic for selectively coupling the broadcast bridges 302 to the PCI-X interfaces 332 in predetermined combinations, as shown, and as described below. That is, control logic 334 performs arbitration and steers control and data signals of PCI-X buses 208/212 of Figure 2 between the broadcast bridges 302 and the PCI-X interfaces 332, as shown, depending upon which of the broadcast bridges 302 is currently performing a PCI-X transaction on the associated PCI-X bus 208/212, as described below.
[0034] In particular, control logic 334A selectively couples PCI-X signals 322A from broadcast bridge 302A or PCI-X signals 322B from broadcast bridge 302B to PCI-X interface 332A; control logic 334B selectively couples PCI-X signals 322C from broadcast bridge 302C or PCI-X signals 322D from broadcast bridge 302D to PCI-X interface 332B; control logic 334C selectively couples PCI-X signals 324A from broadcast bridge 302A or PCI-X signals 324B from broadcast bridge 302B or PCI-X signals 326C from broadcast bridge 302C or PCI-X signals 326D from broadcast bridge 302D to PCI-X interface 332C; control logic 334D selectively couples PCI-X signals 326A from broadcast bridge 302A or PCI-X signals 326B from broadcast bridge 302B or PCI-X signals 324C from broadcast bridge 302C or PCI-X signals 324D from broadcast bridge 302D to PCI-X interface 332D. PCI-X signals 322A, 322B, 322C, or 322D are referred to generically as PCI-X signals 322; PCI-X signals 324A, 324B, 324C, or 324D are referred to generically as PCI-X signals 324; and PCI-X signals 326A, 326B, 326C, or 326D are referred to generically as PCI-X signals 326.
[0035] PCI-X interfaces 332A and 332B are on the interface module 206 side of PCI-X bridge 202 and are coupled to PCI-X buses 208A and 208B, respectively. PCI-X interfaces 332C and 332D are on the memory subsystem 222 side of PCI-X bridge 202 and are coupled to two of the PCI-X buses 212. Figure 3 illustrates the PCI-X bridge 202 of interface module-A 206A, which is representative of the other PCI-X bridges 202 of Figure 2. Because Figure 3 illustrates the PCI-X bridge 202 of interface module-A 206A, PCI-X interfaces 332C and 332D are shown coupled to PCI-X buses 212A and 212C, respectively, to correspond with Figure 2.
[0036] Each of the broadcast bridges 302 includes send/receive first-in-first-out (FIFO) memories 304, a broadcast FIFO memory 306, and three PCI-X ports, denoted portl 312, portO 314, and port2 316, all of which are described in more detail with respect to Figure 4. The FIFO memories are also referred to as buffers. Port2 316 is capable of being a PCI-X bus master, and portl 312 and portO 314 are capable of being a PCI-X master and a PCI-X target. Portl 312, the inputs of receive FIFOs 304, the outputs of send FIFOs 304, and the input of broadcast FIFO 306 are all coupled to PCI-X signals 322. PortO 314, the outputs of receive FIFOs 304, and the inputs of send FIFOs 304 are all coupled to PCI-X signals 324. Port2 316 and the output of broadcast FIFO 306 are all coupled to PCI-X signals 326. Also, portl 312 is coupled to portO 314 and port2316, as described below with respect to Figure 4.
[0037] PCI-X bridge 202 also includes a CPU bridge 352 coupled to control logic 334C and 334D. CPU bridge 352 is a communication path between the two memory subsystems 222 of Figure 2, and in particular for facilitating communication between the memory subsystem 222 processors 228 of Figure 2 via PCI-X memory commands. In one embodiment, CPU bridge 352 provides address translation between the two memory subsystems 222, so that a desired PCI-X address range in one memory subsystem 222 may be mapped to a different PCI-X address range in the other memory subsystem 222. In one embodiment, the primary memory subsystem 222 of a mirrored write operation employs CPU bridge 352 to update tables in the secondary memory subsystem 222 that specify the presence and location of mirrored data in the secondary memory subsystem 222. Because the mirrored data is used by the secondary memory subsystem 222 only if the primary memory subsystem 222 fails, updating ,the tables via the CPU bridge 352 advantageously alleviates the need for the primary memory subsystem 222 to interrupt the secondary memory subsystem 222 on every transaction.
[0038] The PCI-X bridge 202 also includes a broadcast enable bit 354 contained in a control register coupled to CPU bridge 352. In one embodiment, PCI-X bridge 202 also includes a software-programmable broadcast address range register 356 coupled to CPU bridge 352. In one embodiment, the broadcast address range is predetermined, i.e., hardwired, into the PCI-X bridge 202 rather than being programmable in address range register 356. The broadcast enable bit 354 and broadcast address range are used to selectively enable/disable broadcast writes described herein.
[0039] Advantageously, broadcast, or mirrored, writes may be selectively enabled/disabled. Some application environments, such as video streaming, may prefer the higher performance afforded by two independent memory subsystems over the higher data availability of redundant memory subsystems and choose to disable broadcasted writes. In one embodiment, the memory subsystems 222 program the broadcast enable bit 354 based on whether host software configures the memory subsystems 222 as redundant or non-redundant.
[0040] Referring now to Figure 4, a block diagram of a broadcast bridge apparatus 302 of Figure 3 according to the present invention is shown. The broadcast bridge 302 is includes portl 312, portO 314, port2 316, broadcast FIFO 306, and send/receive FIFOs 304 coupled to PCI-X signals 322, 324, and 326 of Figure 3. The FIFOs 304 and 306 are used to buffer PCI-X transaction data as it is transferred through PCI-X bridge 202 between the PCI-X buses 208 and 212. In one embodiment, each of the send/receive FIFOs 304 and the broadcast FIFO 306 are capable of storing and forwarding up to 4KB of data, which is the maximum allowable PCI-X transfer size. In one embodiment, the FIFOs 304/306 include outputs that indicate how full or empty the FIFOs 304/306 are to enable the PCI-X interfaces 332 know when one or more blocks of data are present in the FIFOs 304/306 and to disconnect on block boundaries if necessary.
[0041] The send/receive FIFOs 304 include a write receive FIFO 492, a read receive FIFO 494, a write send FIFO 496, and a read send FIFO 498. The inputs to write receive FIFO 492 and read receive FIFO 494 are coupled to PCI-X signals 322 and their outputs are coupled to PCI-X signals 324. The inputs to write send FIFO 496 and read send FIFO 498 are coupled to PCI-X signals 324 and their outputs are coupled to PCI-X signals 322. The input to broadcast FIFO 306 is coupled to PCI-X signals 322 and its output is coupled to PCI-X signals 326. [0042] The send FIFOs 496/498 are used to transfer data in the direction from portO 314 to portl 312, i.e., from a PCI-X bus 212 on the memory subsystem 222 side of PCI-X bridge 202 to a PCI-X bus 208 on the interface controller 204 side of PCI-X bridge 202. The receive FIFOs 492/494 are used to transfer data in the direction from portl 312 to portO 314, i.e., from a PCI-X bus 208 on the interface controller 204 side of PCI-X bridge 202 to a PCI-X bus 212 on the memory subsystem 222 side of PCI-X bridge 202. The broadcast FIFO 306 is used to transfer data in the direction from portl 312 to port2 316, i.e., from a PCI-X bus 208 on the interface controller 204 side of PCI-X bridge 202 to a PCI-X bus 212 on the memory subsystem 222 side of PCI-X bridge 202. In particular, when write data is being transferred through write receive FIFO 492 to a primary memory subsystem 222, advantageously broadcast FIFO 306 is selectively used to broadcast a mirrored copy of the write data to a secondary memory subsystem 222, as described below.
[0043] Portl 312 includes a PCI-X target circuit 414 coupled to an address/size register 412. PCI-X target 414 is configured to function as a target of PCI-X commands initiated by the interface controller 204 of Figure 2, which is selectively coupled to portl 312 via control logic 334 and PCI-X interface 332 on PCI-X bus 208. PCI-X target 414 receives from PCI-X signals 322 a PCI-X command 424, such as a PCI- X burst write command, initiated by one of the interface controllers 204 on a PCI-X bus 208. A start byte address and byte transfer count 422 of the burst write command are stored in address/size register 412.
[0044] PCI-X target 414 generates a write_pending signal 432 provided to a PCI-X master circuit 444 in portO 314 to inform portO 314 that portl 312 has received a PCI-X burst write command and that associated write data is being stored into write receive FIFO 492. In response to the assertion of write_pending signal 432, portO 314 PCI-X master 444 loads the PCI-X burst write command address and count into an address/size register 442 of portO 314 from address/size register 412 of portl 312, and initiates a PCI-X burst write command on PCI-X signals 324 to perform a transfer of the write data from the write receive FIFO 492 to the primary memory subsystem 222 of Figure 2, which is selectively coupled to portO 314 via control logic 334 and PCI-X interface 332 on PCI-X bus 212. PortO 314 PCI-X master 444 generates a write_done signal 434 to notify PCI-X target 414 that the PCI-X burst write command transferring the write data to the primary memory subsystem 222 has completed.
[0045] PCI-X target 414 also generates a broadcast_pending signal 436 provided to a PCI-X master circuit 474 in port2 316 to inform port2 316 that portl 312 has received a PCI-X burst write command and that a copy of associated write data is being stored into broadcast FIFO 306. However, PCI-X target 414 only asserts the broadcast_pending signal 436 if portl 312 determines that a broadcast, or mirrored, write is desired.
[0046] Portl 312 determines whether a broadcast write is desired by examining a broadcast_enable input 408 and an in_broadcast_range input 406. The broadcast_enable input 408 indicates the value of the broadcast enable bit 354 of Figure 3. The in_broadcast_range input 406 is the output of range checking logic 402 comprised in portl 312. The range checking logic 402 receives the PCI-X burst write command start byte address in address/size register 412 and a broadcast_address_range signal 404. In one embodiment, the broadcast_address_range signal 404 indicates the value stored in broadcast address range register 356 of Figure 3. In another embodiment, the value of the broadcast_address_range signal 404 is hardwired. The range checking logic 402 compares the start byte address from address/size register 412 with broadcast_address_range signal 404 and generates a true value on in_broadcast_range signal 406 if the start byte address from address/size register 412 is within the address range specified by broadcast_addressjrange signal 404; otherwise, range checking logic 402 generates a false value on in_broadcast_range signal 406.
[0047] In response to the assertion of broadcast jpending 436, port2 316 PCI-X master 474 loads the PCI- X write block command address and count into an address/size register 472 of port2 316 from address/size register 412 of portl 312, and initiates a PCI-X burst write command on PCI-X signals 326 to perform a transfer of the write data from the broadcast FIFO 306 to the secondary memory subsystem 222 of Figure 2, which is selectively coupled to port2 316 via control logic 334 and PCI-X interface 332 on PCI-X bus 212. Port2 316 PCI-X master 474 generates a broadcast_done signal 438 to notify PCI-X target 414 that the PCI-X burst write command transferring the write data to the secondary memory subsystem 222 has completed.
[0048] PCI-X target 414 and PCI-X master 444 operate in a similar manner just described to perform PCI-X read commands initiated by an interface controller 204 coupled to portl 312 to transfer data across PCI-X bridge 202 from a memory subsystem 222 to an interface controller 204 via read send FIFO 498. However, the read command data transfer is not a broadcasted read.
[0049] Portl 312 also includes a PCI-X master circuit 416 coupled to PCI-X target 414 and PCI-X signals 322. PortO 314 also includes a PCI-X target circuit 446 coupled to PCI-X master 444 and PCI-X signals 324. PCI-X target 446 and PCI-X master 416 operate in a manner similar to target 414 and master 444 described above to perform PCI-X commands initiated by a memory subsystem 222 coupled to portO 314 to transfer data across PCI-X bridge 202 between a memory subsystem 222 and an interface controller 204 via write send FIFO 496 and read receive FIFO 494 in a non-broadcasted manner.
[0050] PCI-X target 414 also generates a busy output signal 426 to indicate whether it is currently busy servicing a PCI-X command initiated by an interface controller 204 on PCI-X bus 208. PCI-X target 414 also receives a busy input signal 428 to indicate whether the other paired broadcast bridge 302 is currently busy servicing a PCI-X command initiated by an interface controller 204 on PCI-X bus 208. Thus, for example, with reference to Figure 3, the busy output 426 of portl 312 of broadcast bridge-A 302A is provided as the busy input 428 to portl 312 of broadcast bridge-B 302B. [0051] In the embodiment of Figure 3, the PCI-X bridge 202 includes two broadcast bridges 302 for each PCI-X bus pair 208/212, as shown, which accommodates overlapping PCI-X commands for increased performance over a single broadcast bridge 302 per PCI-X bus pair 208/212 configuration. However, more than two broadcast bridges 302 per PCI-X bus pair 208/212 may be employed in PCI-X bridge 202 depending upon the demands of the application employing the PCI-X bridge 202. In one embodiment, each of the broadcast bridges 302 coupled to a PCI-X interface 332 sees the beginning of a PCI-X command initiated on PCI-X bus 208/212 via control logic 334, and the non-busy broadcast bridges 302 (i.e., those not asserting their busy output 426) respond to the PCI-X command in a round-robin fashion. The control logic 334 couples to the PCI-X interface 332 the PCI-X signals 322 coupled to the non-busy broadcast bridge 302 that is selected to service the PCI-X command.
[0052] In one embodiment, portO 314, portl 312, and port2 316 operate according to different clock sources since each of the four PCI-X interfaces 332 of Figure 3 operate based on independent clock sources. Therefore, synchronization logic is provided to synchronize the signals that communicate between portO 314, portl 312, and ρort2 316, such as writejpending signal 432, write_done signal 434, broadcast_pending signal 436, and broadcast _done signal 438.
[0053] Referring now to Figure 5, a block diagram illustrating selective broadcast, or mirrored, writes to the redundant memory subsystems 222 of Figure 2 based on a broadcast memory range according to the present invention is shown. Figure 5 shows an address space of one of the PCI-X buses 208 of Figure 2 to which one of the interface controllers 204 of Figure 2 is attached. The address space is divided into three ranges: a first non-broadcast address range 502, a broadcast address range 504, and a second non-broadcast range 506. The broadcast address range 504 is stored in, i.e., known to, the PCI-X bridge 202 of Figure 2 and indicated on broadcast_address_range signal 404 of Figure 4. In one embodiment, the broadcast address range 504 is stored in the PCI-X bridge 202 because the broadcast address range 504 is hard-coded into the PCI-X bridge 202. In another embodiment, the broadcast address range 504 is stored in the PCI-X bridge 202 because the PCI-X bridge 202 includes broadcast address range register 356 of Figure 3, which specifies the broadcast address range 504. In one embodiment, broadcast address range register 356 is programmable by software. The first and second non-broadcast address ranges 502 and 506, respectively, represent the remainder of the address space of the PCI-X bus 208.
[0054] Figure 5 also shows an address space of the PCI-X bus 212 of Figure 2 to which one of the memory subsystems 222 of Figure 2 is attached, which will be referred to as the primary memory subsystem 222. The primary memory subsystem 222 address space is also divided into three ranges: a first non-broadcast address range 512 corresponding to first non-broadcast address range 502, a broadcast address range 514 corresponding to broadcast address range 504, and a second non-broadcast range 516 corresponding to second non-broadcast range 506. [0055] Figure 5 also shows an address space of the PCI-X bus 212 of Figure 2 to which the other of the memory subsystems 222 of Figure 2 is attached, which will be referred to as the secondary memory subsystem 222. The secondary memory subsystem 222 address space is also divided into three ranges: a first non-broadcast address range 522 coπesponding to first non-broadcast address range 502, a broadcast address range 524 corresponding to broadcast address range 504, and a second non-broadcast range 526 corresponding to second non-broadcast range 506.
[0056] As shown in Figure 5, write requests on the interface controller 204 side PCI-X bus 208 having an address in broadcast address range 504 are retransmitted by PCI-X bridge 202 to the corresponding address in both the primary broadcast address range 514 and the secondary broadcast address range 524 - assuming the broadcast enable bit 354 is set. However, write requests on the interface controller 204 side PCI-X bus 208 having an address in one of the non-broadcast address ranges 502/506 are retransmitted by PCI-X bridge 202 only to the corresponding address in the primary broadcast address range 512/516.
[0057] In one embodiment, multiple broadcast address ranges may be used, rather than a single broadcast address range. If the PCI-X burst write command start byte address falls within any of the multiple broadcast address ranges, the PCI-X bridge 202 performs a broadcast write.
[0058] Referring now to Figure 6, a flowchart illustrating operation of the redundant network storage controller 102 of Figure 2 according to the present invention is shown. Flow begins at block 602.
[0059] At block 602, one of the interface modules 206 of Figure 2 receives write data from a host 104 of Figure 1. For example, the interface module 206 may receive a Fibre Channel frame containing the write data. In response, the interface controller 204 of Figure 2 receiving the write data generates a PCI-X burst write command, such as a PCI-X memory write block command, on the PCI-X bus 208 of Figure 2 coupled to the interface controller 204. Flow proceeds to block 604.
[0060] At block 604, the PCI-X write command generated at block 602 is conveyed to portl 312 of each of the two broadcast bridges 302 of Figure 3, which are coupled to the PCI-X bus 208 via PCI-X interface 332 and control logic 334 of Figure 3. One of the broadcast bridges 302 that is not busy (as determined by busy inputs 428) according to the round-robin scheme responds to the PCI-X write command and asserts its busy output 426 of Figure 4. Consequently, control logic 334 uncouples the non-responding broadcast bridge 302 from PCI-X signals 332 and continues coupling the responding broadcast bridge 202 to PCI-X signals 322. Flow proceeds to decision block 606.
[0061] At block 606, portl 312 determines whether the address of the PCI-X write command is in the broadcast address range stored in the PCI-X bridge 202 (such as in broadcast address range register 356 in one embodiment), and whether the broadcast enable bit 354 is set. That is, portl 312 determines whether both of the broadcast_enable 408 and in_broadcast_range 406 signals are true. If not, flow proceeds to block 612 to perform a non-broadcast, or non-mirrored, write. Otherwise, flow proceeds in parallel to one flow beginning at block 622 and another flow beginning at block 632 to perform a broadcast, or mirrored, write.
[0062] At block 612, portl 312 notifies portO 314 of Figure 4 of the pending PCI-X write command via write_pending signal 432. That is, portl 312 notifies portO 314 that write data is being written into write receive FIFO 492 and provides the write command address and count to address/size register 442. Flow proceeds to block 614.
[0063] At block 614, portO 314 retransmits the PCI-X write command to the primary memory subsystem 222 on PCI-X bus 212 via PCI-X signals 324, which are selectively coupled to PCI-X bus 212 via control logic 334 and PCI-X interface 332. Retransmitting the PCI-X write command includes providing the write data from write receive FIFO 492 to the primary memory subsystem 222 via PCI-X bus 212. Flow proceeds to block 616.
[0064] At block 616, the memory controller 224 of the primary memory subsystem 222 receives the retransmitted PCI-X burst write command, including the write data from the write receive FIFO 492, and writes the data into its memory 226. Flow proceeds to block 618.
[0065] At block 618, portO 314 notifies portl 312 that the write data has been transmitted to the primary memory subsystem 222, i.e., that the PCI-X write command has completed. Flow proceeds to block 619.
[0066] At block 619, portl 312 deasserts its busy output signal 426 in response to the write command completion. Flow ends at block 619.
[0067] Blocks 622 through 628 are essentially the same as blocks 612 through 618. However, blocks 622 through 628 are part of a mirrored write due to the operations performed in blocks 632 through 638, as described below.
[0068] At block 622, portl 312 notifies portO 314 of the pending PCI-X write command via write_pending signal 432. That is, portl 312 notifies portO 314 that write data is being written into write receive FIFO 492 and provides the write command address and count to address/size register 442. Flow proceeds to block 624.
[0069] At block 624, portO 314 retransmits the PCI-X write command to the primary memory subsystem 222 on PCI-X bus 212 via PCI-X signals 324, which are selectively coupled to PCI-X bus 212 via control logic 334 and PCI-X interface 332. Retransmitting the PCI-X write command includes providing the write data from write receive FIFO 492 to the primary memory subsystem 222 via PCI-X bus 212. Flow proceeds to block 626. [0070] At block 626, the memory controller 224 of the primary memory subsystem 222 receives the retransmitted PCI-X burst write command, including the write data from the write receive FIFO 492, and writes the data into its memory 226. Flow proceeds to block 628.
[0071] At block 628, portO 314 notifies portl 312 that the write data has been transmitted to the primary memory subsystem 222, i.e., that the PCI-X write command has completed. Flow proceeds to block 629.
[0072] At block 632, portl 312 notifies port2 316 of Figure 4 of the pending PCI-X write command via broadcastjending signal 436. That is, portl 312 notifies port2 316 that write data is being written into broadcast FIFO 306 and provides the write command address and count to address/size register 472. Flow proceeds to block 634.
[0073] At block 634, port2 316 retransmits the PCI-X write command to the secondary memory subsystem 222 on PCI-X bus 212 via PCI-X signals 326, which are selectively coupled to PCI-X bus 212 via control logic 334 and PCI-X interface 332. Retransmitting the PCI-X write command includes providing a copy of the write data from broadcast FIFO 306 to the secondary memory subsystem 222 via PCI-X bus 212. Flow proceeds to block 636.
[0074] At block 636, the memory controller 224 of the secondary memory subsystem 222 receives the retransmitted PCI-X burst write command, including the copy of the write data from the broadcast FIFO 306, and writes the data into its memory 226. Flow proceeds to block 638.
[0075] At block 638, port2 316 notifies portl 312 that the write data has been transmitted to the secondary memory subsystem 222, i.e., that the PCI-X write command has completed. Flow proceeds to block 629.
[0076] At block 629, portl 312 has been notified by each of portO 314 and port2 316 that their respective PCI-X write commands to the primary and secondary memory subsystem 222, respectively, have completed, and responsively deasserts its busy output 426. Now the broadcast bridge 302 is ready to receive data associated with another PCI-X write command into its write receive FIFO 492 and its broadcast FIFO 306. Flow ends at block 629.
[0077] Referring now to Figure 7, a block diagram of the redundant network storage controller 102 of Figure 2 illustrating data flow in a broadcast write example according to the present invention is shown. In the example of Figure 7, memory subsystem-A 222A is the primary memory subsystem and memory subsystem-B 222B is the secondary memory subsystem of a broadcast write. The flow of data through redundant network storage controller 102 is indicated in Figure 7 by thick shaded arrows with numbers contained therein. The direction of the arrows shows the direction of data flow. The sequence of the numbers inside the arrows specifies the sequence of data flowing through the redundant network storage controller 102. [0078] A host 104 of Figure 1 transmits write data to redundant network storage controller 102 of Figure 7. The write data is received by the interface controller 204 coupled to interface 108, according to block 602 of Figure 6, as shown by arrow 1. The interface controller 204 generates a PCI-X burst write command on PCI-X bus 208A in response to reception of the host write data, according to block 602, as shown by arrow 2. PCI-X bridge 202 receives the PCI-X write command, according to block 604 of Figur e
[0079] In response to the PCI-X write command, PCI-X bridge 202 determines that the write command address is within the broadcast address range and that broadcast enable bit is set, according to block 606 of Figure 6. Consequently, PCI-X bridge 202 transmits the write data to the primary memory subsystem-A 222A on PCI-X bus 212C, according to blocks 622 and 624 of Figure 6, as shown by arrow 3 A. Concurrently, PCI-X bridge 202 transmits a copy of the write data to the secondary memory subsystem-B 222B on PCI-X bus 212A, according to blocks 632 and 634 of Figure 6, as shown by arrow 3B.
[0080] The primary memory subsystem-A 222A receives the data and writes the data into its cache memory 226, according to block 626 of Figure 6, as shown by arrow 4A. Concurrently, secondary memory subsystem-B 222B receives the copy of the data and writes the data into its cache memory 226, accordmg to block 636 of Figure 6, as shown by arrow 4B.
[0081] Referring now to Figure 8, a block diagram of the PCI-X bridge 202 of Figure 3 illustrating data flow in the broadcast write example of Figure 7 according to the present invention is shown. In the example of Figure 8, the flow of data through PCI-X bridge 202 is indicated by thick shaded arrows denoted 2, 3A, and 3B, corresponding to the data flow example arrows of Figure 7.
[0082] PCI-X interface 332A of Figure 8 receives the PCI-X write command on PCI-X bus 208A from interface controller 204 of Figure 7 and conveys the command to control logic 334A, as shown by arrow 2. In the example, broadcast bridge 302B is the next non-busy broadcast bridge 302 in the round-robin scheme; therefore, control logic 334A selectively couples PCI-X signals 322B to PCI-X interface 332A since portl 312 of broadcast bridge 302B responds to the PCI-X write command and asserts its busy output 426, according to block 604 of Figure 6, as shown by arrow 2.
[0083] Write receive FIFO 492 of Figure 4, included in send/receive FIFOs 304 of Figure 8, provides the write data on PCI-X signals 324B to control logic 334C; control logic 334C selectively provides the write data to PCI-X interface 332C, which in turn provides the data on PCI-X bus 212A, according to blocks 622 and 624 of Figure 6, as shown by arrow 3A. Simultaneously, broadcast FIFO 306 of Figure 8 provides a copy of the write data on PCI-X signals 326B to control logic 334D; control logic 334D selectively provides the write data to PCI-X interface 332D, which in turn provides the data on PCI-X bus 212C, according to blocks 632 and 634 of Figure 6, as shown by arrow 3B. [0084] Referring now to Figure 9, a block diagram of a related art conventional redundant network storage controller 900 for illustrating data flow of a conventional mirrored write is shown. Some elements of the conventional storage controller 900 are similar to elements of the redundant network storage controller 102 of Figure 7, and like elements are numbered the same. Figure 9 may be compared to Figure 7 in order to more fully appreciate the advantages of the present invention over the conventional method. For simplicity and clarity, unlike the redundant network storage controller 102 of Figure 7, the conventional storage controller 900 includes only two interface modules 906, denoted interface module-C 906C and interface module-D 906D, rather than the four interface modules 206 of Figure 7.
[0085] There are three main differences between conventional storage controller 900 and redundant network storage controller 102 of Figure 7. First, the PCI-X ports of the interface controllers 204 of Figure 9 are coupled directly to the memory subsystems 222 via the PCI-X buses 212. That is, one of the interface controllers 204 of interface module 906C is coupled to PCI-X bus 212C and the other interface controller 204 is coupled to PCI-X bus 212A. Similarly, one of the interface controllers 204 of interface module 906D is coupled to PCI-X bus 212B and the other interface controller 204 is coupled to PCI-X bus 212D. Second, each of the interface modules 906 of the conventional storage controller 900 includes a two-ported PCI-X bridge 902 rather than the four-ported PCI-X bridge 202 of Figure 7. PCI-X bridge 902 of interface module 906C couples PCI-X bus 212A and 212C. PCI-X bridge 902 of interface module 906D couples PCI-X bus 212B and 212D. Third, the memory controller 924 of Figure 9 includes a DMA controller for performing data transfers between its memory 226 and its PCI-X buses 212.
[0086] A host transmits write data to conventional storage controller 900 of Figure 9, as shown by arrow 1. The write data is received by the interface controller 204 of interface module 906C, which generates a PCI-X write command on PCI-X bus 212C to the primary memory subsystem-A 222 A in response to reception of the host write data, as shown by arrow 2. The primary memory subsystem-A 222A receives the data and writes the data into its cache memory 226, as shown by arrow 3.
[0087] Subsequently, the processor 228 of the primary memory subsystem-A 222A receives notification of the transfer of the write data into its memory 226 and instructs the memory controller 924 to read the just-written write data from its memory 226, as shown by arrow 4, and to copy the write data out on PCI-X bus 212D to interface module-D 906D, as shown by arrow 5. PCI-X bridge 902 of interface module-D 206D receives the copy of the write data from the primary memory subsystem-A 222A and retransmits the PCI-X write command, including the copy of the write data, on PCI-X bus 212B to the secondary memory subsystem-B 222B, as shown by arrow 6. The secondary memory subsystem-B 222B receives the copy of the data and writes it into its cache memory 226, as shown by arrow 7.
[0088] As may be observed by comparing Figure 7 and Figure 9, the conventional storage controller 900 takes a substantially longer time to perform a mirrored write than the redundant network storage controller 102 of Figure 7 since the conventional storage controller 900 performs its copy of the data to the secondary memory subsystem 222 in series with the write of the data to the primary memory subsystem 222; whereas, the redundant network storage controller 102 broadcasts the data to the secondary memory subsystem 222 simultaneously with the write of the data to the primary memory subsystem 222. In addition, because the redundant network storage controller 102 performs the broadcast write directly to the secondary memory subsystem 222, there is no need for the primary memory subsystem 222 to copy the data to the secondary memory subsystem 222, which makes more efficient use of the bandwidth of the PCI-X buses 212 and the bus between the memory 226 and memory controller 224. As may be observed, twice as much memory 226 bus bandwidth is consumed and 50% more PCI-X bus bandwidth 212 is consumed by the conventional storage controller 900 to perform a mirrored write than the redundant network storage controller 102 of the present invention.
[0089] Although the present invention and its objects, features, and advantages have been described in detail, other embodiments are encompassed by the invention. For example, although an embodiment has been described to broadcast write data to two redundant memory subsystems, i.e., to perform a mirrored write, the invention is adaptable to broadcast data to more than two redundant memory subsystems. In addition, although embodiments have been described with respect to the PCI-X bus, the invention is adaptable to work with other buses, such as the PCI bus, PCI Express bus, PCI-X2 bus, EISA bus, VESA bus, Futurebus, VME bus, MultiBus, RapidlO bus, AGP bus, ISA bus, 3GIO bus, Hypertransport bus, Fibre Channel, Ethernet, ATA, SATA, SCSI, Infiniband, etc. Furthermore, although an embodiment of the bus bridge has been described with a particular number of broadcast bridges, the number of broadcast bridges per bus bridge may be varied to meet the demands of the particular application in which the bus bridge is employed. Still further, although the invention has been described with respect to a single broadcast address range, multiple broadcast address ranges may be employed if desired. Finally, although an embodiment has been described having the write and broadcast FIFOs of a particular size, the size of the FIFOs may vary to meet the demands of the particular application in which the broadcast bridge is employed.
[0090] Those skilled in the art should appreciate that they can readily use the disclosed conception and specific embodiments as a basis for designing or modifying other structures for carrying out the same purposes of the present invention without departing from the spirit and scope of the invention as defined by the appended claims.
We claim:

Claims

1. A broadcast bridge apparatus, comprising:
a first port, for receiving data transmitted on a first local bus;
a second port, coupled to said first port, for receiving said data from said first port and providing said data for retransmission on a second local bus to a first memory subsystem; and
a third port, coupled to said first port, for receiving a copy of said data from said first port and selectively providing said copy of said data for retransmission on a third local bus to a second memory subsystem.
2. The apparatus of claim 1, wherein said third port selectively provides said copy of said data for retransmission on said third local bus to said second memory subsystem substantially concurrently with said first port providing said data for retransmission on said second local bus to said first memory subsystem.
3. The apparatus of claim 1, wherein said first and second memory subsystems are redundant.
4. The apparatus of claim 1, wherein said data originates from a host computer and is destined to be written from one of said first and second memory subsystems to one or more storage devices.
5. The apparatus of claim 1, wherein said first memory subsystem is alleviated from copying said data to said second memory subsystem due to said third port providing said copy of said data.
6. The apparatus of claim 1, wherein said first port is configured to receive an address of said data from said first local bus, wherein said third port selectively provides said copy of said data to said second memory subsystem only if said address lies within one or more address ranges of said first local bus, wherein said one or more address ranges comprise a subset of an address space of said first local bus.
7. The apparatus of claim 6, wherein said one or more address ranges are predetermined.
8. The apparatus of claim 6, wherein said one or more address ranges are programmable.
9. The apparatus of claim 6, further comprising:
a programmable control register, coupled to said third port, for storing a control value, wherein said third port selectively provides said copy of said data to said second memory subsystem only if said control value specifies said providing said copy of said data to said second memory subsystem and said address lies within said address range.
10. The apparatus of claim 1, further comprising:
a control register, coupled to said third port, for storing a control value, wherein said third port selectively provides said copy of said data to said second memory subsystem only if said control value specifies said providing said copy of said data to said second memory subsystem.
11. The apparatus of claim 10, wherein said control register is programmable.
12. The apparatus of claim 1, further comprising:
a first-in-first-out (FIFO) memory, coupling said first port to said third port, for selectively buffering said copy of said data.
13. The apparatus of claim 12, further comprising:
a second FIFO memory, coupling said first port to said second port, for buffering said data.
14. The apparatus of claim 1, further comprising:
communication logic, coupling said second port to said third port, whereby said first memory subsystem informs said second memory subsystem of said retransmission of said copy of said data to said second memory subsystem.
15. The apparatus of claim 1, wherein at least one of said first, second, and third local buses comprise a PCI-X (Peripheral Component Interconnect-X) bus.
16. The apparatus of claim 1, wherein at least one of said first, second, and third local buses comprise a Peripheral Component Interconnect (PCI) bus.
17. A bus bridge apparatus for broadcasting data from a first local bus on one side of the bridge to a plurality of redundant storage controllers coupled to second and third local buses on an opposite side of the bridge to relieve the redundant controllers from copying the data to one another, the apparatus comprising:
a first FIFO memory, coupled to receive data from the first local bus, said data associated with a first write transaction on the first local bus;
first master logic, coupled to said first FIFO memory, for causing a second write transaction on the second local bus to transfer said data from said first FIFO memory to a first of the plurality of redundant storage controllers; a second FIFO memory, coupled to receive said data from the first local bus; and
second master logic, coupled to said second FIFO memory, for causing a thfrd write transaction on the third local bus to transfer said data from said second FIFO memory to a second of the plurality of redundant storage controllers.
18. The apparatus of claim 17, further comprising:
target logic, coupled to said first and second master logic, for receiving a write command of said first write transaction on the first local bus, and for generating first and second signals to said first and second master logic, respectively, to signify a transfer of said data into said first and second FIFO memories, respectively, in response to said first write transaction.
19. The apparatus of claim 18, wherein said first and second master logic generate third and fourth signals, respectively, to signify to said target logic completion of said second and third write transactions, respectively.
20. The apparatus of claim 17, wherein said transfer of said data from said second FIFO memory to said second of the plurality of redundant storage controllers occurs substantially simultaneously with said transfer of said data from said first FIFO memory to said first of the plurality of redundant storage controllers.
21. The apparatus of claim 17, wherein said first and second master logic comprise PCI-X master logic.
22. A PCI-X bus bridge, for bridging a first PCI-X bus to second and third PCI-X buses, comprising:
first, second, and third PCI-X interfaces, coupled to the first, second, and third PCI-X buses, respectively, said first PCI-X interface configured to receive a plurality of write transactions from the first PCI-X bus; and
a plurality of broadcast bridge circuits, coupling said first PCI-X interface to said second and third PCI-X interfaces, each for causing both of said second and third PCI-X interfaces to retransmit a respective one of said plurality of write transactions on the second and third PCI-X buses, respectively.
23. The apparatus of claim 22, further comprising:
first, second, and third multiplexing logic, coupled to said first, second, and third PCI-X interfaces, respectively, for selectively coupling said first, second, and third PCI-X interfaces, respectively, to one of said plurality of broadcast bridge circuits.
24. The apparatus of claim 23, wherein each of said plurality of broadcast bridge cfrcuits includes a busy output for indicating whether said broadcast bridge circuit is currently servicing a PCI-X command.
25. The apparatus of claim 24, wherein said first, second, and third multiplexing logic selectively couples said first, second, and third PCI-X interfaces, respectively, to one of said plurality of broadcast bridge circuits based on said busy outputs.
26. A PCI-X bus bridge, comprising:
a PCI-X target cfrcuit, for receiving a PCI-X write command from a first PCI-X bus coupled to one side of the bus bridge, said PCI-X write command specifying an address of data to be written;
a control input to said PCI-X target circuit, for indicating whether said address is within an address range of an address space of said first PCI-X bus;
a write FIFO, coupled to said PCI-X target circuit, for receiving said data from said first PCI-X bus for retransmission on a second PCI-X bus coupled to a side of the bus bridge opposite said first PCI-X bus; and
a broadcast FIFO, coupled to said PCI-X target circuit, for receiving a copy of said data from said first PCI-X bus for retransmission on a thfrd PCI-X bus coupled to said opposite side of the bus bridge, wherein said broadcast FIFO receives said copy of said data only if said address is within said address range.
27. The PCI-X bus bridge of claim 26, further comprising:
a first PCI-X master circuit, coupled to said PCI-X target circuit, for retransmitting said data on said second PCI-X bus from said write FIFO; and
a second PCI-X master circuit, coupled to said PCI-X target cfrcuit, for retransmitting said copy of said data on said third PCI-X bus from said broadcast FIFO only if said address is within said address range.
28. The PCI-X bus bridge of claim 27, wherein said second PCI-X master circuit retransmits said copy of said data on said third PCI-X bus from said broadcast FIFO substantially simultaneously with said first PCI-X master circuit retransmitting said data on said second PCI-X bus from said write FIFO.
29. A method for selectively performing a broadcast data transfer across a bus bridge to a plurality of ■> redundant memory subsystems in a storage confroller, comprising:
receiving data on a ffrst bus on one side of the bus bridge;
writing said data to a first of the plurality of memory subsystems on a second bus on an opposite side of the bus bridge from said first bus;
determining whether the bus bridge is enabled to perform broadcast data transfers; and
writing a copy of said data to a second of the plurality of memory subsystems on a third bus on said opposite side of the bus bridge only if the bus bridge is enabled to perform broadcast data transfers, wherein the bus bridge writes said copy of said data to said second of the plurality of memory subsystems on said third bus substantially concurrently with said writing said data to said first of the plurality of memory subsystems on said second bus.
30. The method of claim 29, further comprising:
receiving an address of said data on said first bus; and
determining whether said address is within an address range, said address range being a subset of an address space of said first bus;
wherein said writing said copy of said data to said second of the plurality of memory subsystems is performed only if said address is within said address range.
31. The method of claim 29, further comprising:
storing said data into a first buffer in response to said receiving said data on said first bus and prior to said writing said data to said first of the plurality of memory subsystems;
storing said copy of said data into a second buffer concurrently with said storing said data into said ffrst buffer only if the bus bridge is enabled to perform broadcast data transfers.
32. The method of claim 31 , further comprising:
reading said data from said first buffer to provide said data for said writing said data to said first of the plurality of memory subsystems; and
reading said copy of said data from said second buffer to provide said copy of said data for said writing said copy of said data to said second of the plurality of memory subsystems.
33. The method of claim 29, wherein said first, second, and thfrd buses comprise PCI-X buses.
34. A redundant network storage controller, comprising:
at least one I/O interface circuit, for receiving data from a host computer and writing said data to one or more storage devices;
a primary memory subsystem, for buffering said data before being written to said storage devices;
a secondary memory subsystem, for storing a redundant copy of said data; and
a plurality of bus bridges, for bridging a bus coupled to said at least one I/O interface cfrcuit with a plurality of buses coupled to said primary and secondary memory subsystems, configured to write said data received on said bus concurrently to said primary and secondary memory subsystems on first and second of said plurality of buses, respectively.
35. The confroller of claim 34, wherein each of said plurality of bus bridges comprises:
a first port for coupling to said bus coupled to said at least one I O interface cfrcuit;
a second port for coupling to said first of said plurality of buses coupled to said primary memory subsystem; and
a third port for coupling to said second of said plurality of buses coupled to said secondary memory subsystem.
36. The confroller of claim 35, wherein each of said plurality of bus bridges further comprises:
a ffrst buffer, coupling said first and second ports, for buffering said data between said bus and said first of said plurality of buses; and
a second buffer, coupling said first and third ports, for buffering said data between said bus and said second of said plurality of buses.
37. The confroller of claim 34, wherein said primary memory subsystem updates said secondary memory subsystem with information specifying a presence of said data in said secondary subsystem after said data is written to said secondary memory subsystem.
PCT/US2004/004098 2003-02-18 2004-02-12 Broadcast bridge apparatus for transferring data to subsystems in a storage controller WO2004074996A2 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US10/368,688 US7143227B2 (en) 2003-02-18 2003-02-18 Broadcast bridge apparatus for transferring data to redundant memory subsystems in a storage controller
US10/368,688 2003-02-18

Publications (2)

Publication Number Publication Date
WO2004074996A2 true WO2004074996A2 (en) 2004-09-02
WO2004074996A3 WO2004074996A3 (en) 2004-12-29

Family

ID=32907643

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2004/004098 WO2004074996A2 (en) 2003-02-18 2004-02-12 Broadcast bridge apparatus for transferring data to subsystems in a storage controller

Country Status (2)

Country Link
US (1) US7143227B2 (en)
WO (1) WO2004074996A2 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107861846A (en) * 2017-10-19 2018-03-30 曙光信息产业(北京)有限公司 Test the device and method of memory access effective bandwidth

Families Citing this family (49)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7340555B2 (en) 2001-09-28 2008-03-04 Dot Hill Systems Corporation RAID system for performing efficient mirrored posted-write operations
US7143227B2 (en) 2003-02-18 2006-11-28 Dot Hill Systems Corporation Broadcast bridge apparatus for transferring data to redundant memory subsystems in a storage controller
US7146448B2 (en) * 2001-09-28 2006-12-05 Dot Hill Systems Corporation Apparatus and method for adopting an orphan I/O port in a redundant storage controller
US7536495B2 (en) * 2001-09-28 2009-05-19 Dot Hill Systems Corporation Certified memory-to-memory data transfer between active-active raid controllers
US7315911B2 (en) * 2005-01-20 2008-01-01 Dot Hill Systems Corporation Method for efficient inter-processor communication in an active-active RAID system using PCI-express links
US9264384B1 (en) 2004-07-22 2016-02-16 Oracle International Corporation Resource virtualization mechanism including virtual host bus adapters
US7543096B2 (en) * 2005-01-20 2009-06-02 Dot Hill Systems Corporation Safe message transfers on PCI-Express link from RAID controller to receiver-programmable window of partner RAID controller CPU memory
JPWO2006085357A1 (en) * 2005-02-08 2008-06-26 富士通株式会社 Storage control device for disk array device and redundancy recovery method
US7620741B2 (en) * 2005-04-22 2009-11-17 Sun Microsystems, Inc. Proxy-based device sharing
US7613864B2 (en) * 2005-04-22 2009-11-03 Sun Microsystems, Inc. Device sharing
US8223745B2 (en) * 2005-04-22 2012-07-17 Oracle America, Inc. Adding packet routing information without ECRC recalculation
US7565463B2 (en) * 2005-04-22 2009-07-21 Sun Microsystems, Inc. Scalable routing and addressing
US7574536B2 (en) * 2005-04-22 2009-08-11 Sun Microsystems, Inc. Routing direct memory access requests using doorbell addresses
US20060288132A1 (en) * 2005-05-31 2006-12-21 Mccall James A Memory single-to-multi load repeater architecture
US9813283B2 (en) 2005-08-09 2017-11-07 Oracle International Corporation Efficient data transfer between servers and remote peripherals
US7805560B2 (en) * 2005-08-31 2010-09-28 Ati Technologies Inc. Methods and apparatus for translating messages in a computing system
US7970956B2 (en) * 2006-03-27 2011-06-28 Ati Technologies, Inc. Graphics-processing system and method of broadcasting write requests to multiple graphics devices
US7536508B2 (en) * 2006-06-30 2009-05-19 Dot Hill Systems Corporation System and method for sharing SATA drives in active-active RAID controller system
US7681089B2 (en) * 2007-02-20 2010-03-16 Dot Hill Systems Corporation Redundant storage controller system with enhanced failure analysis capability
US8315269B1 (en) 2007-04-18 2012-11-20 Cypress Semiconductor Corporation Device, method, and protocol for data transfer between host device and device having storage interface
US7783822B2 (en) * 2007-07-25 2010-08-24 Hewlett-Packard Development Company, L.P. Systems and methods for improving performance of a routable fabric
TWI448902B (en) * 2007-08-24 2014-08-11 Cypress Semiconductor Corp Bridge device with page-access based processor interface
US8090894B1 (en) 2007-09-21 2012-01-03 Cypress Semiconductor Corporation Architectures for supporting communication and access between multiple host devices and one or more common functions
JP2009110053A (en) 2007-10-26 2009-05-21 Toshiba Corp Memory system
EP2350830A4 (en) * 2008-10-30 2013-05-22 Lsi Corp Storage controller data redistribution
US8140804B1 (en) * 2008-12-17 2012-03-20 Symantec Corporation Systems and methods for determining whether to perform a computing operation that is optimized for a specific storage-device-technology type
US9973446B2 (en) 2009-08-20 2018-05-15 Oracle International Corporation Remote shared server peripherals over an Ethernet network for resource virtualization
TWI587139B (en) * 2010-01-20 2017-06-11 旺玖科技股份有限公司 Driving device and method of accessing data
US8856460B2 (en) 2010-09-15 2014-10-07 Oracle International Corporation System and method for zero buffer copying in a middleware environment
JP5945543B2 (en) * 2010-09-15 2016-07-05 オラクル・インターナショナル・コーポレイション System including middleware machine environment
US9185054B2 (en) * 2010-09-15 2015-11-10 Oracle International Corporation System and method for providing zero buffer copying in a middleware machine environment
US9331963B2 (en) 2010-09-24 2016-05-03 Oracle International Corporation Wireless host I/O using virtualized I/O controllers
WO2012134932A2 (en) 2011-03-25 2012-10-04 Adc Telecommunications, Inc. Event-monitoring in a system for automatically obtaining and managing physical layer information using a reliable packet-based communication protocol
US9081537B2 (en) 2011-03-25 2015-07-14 Adc Telecommunications, Inc. Identifier encoding scheme for use with multi-path connectors
US8949496B2 (en) * 2011-03-25 2015-02-03 Adc Telecommunications, Inc. Double-buffer insertion count stored in a device attached to a physical layer medium
US8880768B2 (en) * 2011-05-20 2014-11-04 Promise Technology, Inc. Storage controller system with data synchronization and method of operation thereof
US8732191B2 (en) 2011-06-27 2014-05-20 Oracle International Corporation System and method for improving application connectivity in a clustered database environment
US9378045B2 (en) 2013-02-28 2016-06-28 Oracle International Corporation System and method for supporting cooperative concurrency in a middleware machine environment
US9110715B2 (en) 2013-02-28 2015-08-18 Oracle International Corporation System and method for using a sequencer in a concurrent priority queue
US10095562B2 (en) 2013-02-28 2018-10-09 Oracle International Corporation System and method for transforming a queue from non-blocking to blocking
US8689237B2 (en) 2011-09-22 2014-04-01 Oracle International Corporation Multi-lane concurrent bag for facilitating inter-thread communication
US9083550B2 (en) 2012-10-29 2015-07-14 Oracle International Corporation Network virtualization over infiniband
US20140118160A1 (en) * 2012-10-30 2014-05-01 Quantitative Sampling Technologies, LLC Controller for supervising data acquisition devices
US8464095B1 (en) * 2012-11-15 2013-06-11 DSSD, Inc. Method and system for multi-dimensional raid reconstruction and defect avoidance
US8924619B2 (en) * 2013-03-15 2014-12-30 Seagate Technology Llc Unified message-based communications
US8924776B1 (en) 2013-12-04 2014-12-30 DSSD, Inc. Method and system for calculating parity values for multi-dimensional raid
WO2015183834A1 (en) * 2014-05-27 2015-12-03 Rambus Inc. Memory module with reduced read/write turnaround overhead
US11232037B2 (en) 2017-10-23 2022-01-25 Seagate Technology Llc Using a first-in-first-out (FIFO) wraparound address lookup table (ALT) to manage cached data
US11327858B2 (en) 2020-08-11 2022-05-10 Seagate Technology Llc Preserving data integrity during controller failure

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5619642A (en) * 1994-12-23 1997-04-08 Emc Corporation Fault tolerant memory system which utilizes data from a shadow memory device upon the detection of erroneous data in a main memory device
EP0800138A1 (en) * 1996-04-04 1997-10-08 Symbios Logic Inc. Control apparatus and method for a RAID storage subsystem
EP0817054A2 (en) * 1996-06-28 1998-01-07 Digital Equipment Corporation Simultaneous, mirror write cache
US6493795B1 (en) * 1998-12-30 2002-12-10 Emc Corporation Data storage system

Family Cites Families (36)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA1118101A (en) 1977-06-02 1982-02-09 Jerry Doniger Digital flight guidance system
US4428044A (en) 1979-09-20 1984-01-24 Bell Telephone Laboratories, Incorporated Peripheral unit controller
US5163131A (en) 1989-09-08 1992-11-10 Auspex Systems, Inc. Parallel i/o network file server architecture
US5301303A (en) 1990-04-23 1994-04-05 Chipcom Corporation Communication system concentrator configurable to different access methods
JP2910303B2 (en) 1990-06-04 1999-06-23 株式会社日立製作所 Information processing device
US5345565A (en) 1991-03-13 1994-09-06 Ncr Corporation Multiple configuration data path architecture for a disk array controller
US5483528A (en) 1994-10-11 1996-01-09 Telex Communications, Inc. TDM digital matrix intercom system
DE19540915A1 (en) 1994-11-10 1996-05-15 Raymond Engineering Redundant arrangement of solid state memory modules
US5881254A (en) * 1996-06-28 1999-03-09 Lsi Logic Corporation Inter-bus bridge circuit with integrated memory port
US5812754A (en) 1996-09-18 1998-09-22 Silicon Graphics, Inc. Raid system with fibre channel arbitrated loop
US6038680A (en) 1996-12-11 2000-03-14 Compaq Computer Corporation Failover memory for a computer system
JP3602293B2 (en) 1997-04-22 2004-12-15 株式会社ソニー・コンピュータエンタテインメント Data transfer method and device
US6094699A (en) 1998-02-13 2000-07-25 Mylex Corporation Apparatus and method for coupling devices to a PCI-to-PCI bridge in an intelligent I/O controller
US6243829B1 (en) 1998-05-27 2001-06-05 Hewlett-Packard Company Memory controller supporting redundant synchronous memories
US6098140A (en) 1998-06-11 2000-08-01 Adaptec, Inc. Modular bus bridge system compatible with multiple bus pin configurations
US6507581B1 (en) 1998-06-12 2003-01-14 Fairchild Semiconductor Corporation Dynamic port mode selection for crosspoint switch
US6230240B1 (en) 1998-06-23 2001-05-08 Hewlett-Packard Company Storage management system and auto-RAID transaction manager for coherent memory map across hot plug interface
US6185652B1 (en) 1998-11-03 2001-02-06 International Business Machin Es Corporation Interrupt mechanism on NorthBay
EP1131719A4 (en) 1998-11-14 2007-10-31 Mti Tech Corp Logical unit mapping in a storage area network (san) environment
US6272533B1 (en) 1999-02-16 2001-08-07 Hendrik A. Browne Secure computer system and method of providing secure access to a computer system including a stand alone switch operable to inhibit data corruption on a storage device
US6502157B1 (en) * 1999-03-24 2002-12-31 International Business Machines Corporation Method and system for perfetching data in a bridge system
JP3843713B2 (en) 1999-08-27 2006-11-08 株式会社日立製作所 Computer system and device allocation method
US6854034B1 (en) 1999-08-27 2005-02-08 Hitachi, Ltd. Computer system and a method of assigning a storage device to a computer
US6801958B2 (en) 1999-12-15 2004-10-05 Texas Instruments Incorporated Method and system for data transfer
US6421769B1 (en) 1999-12-30 2002-07-16 Intel Corporation Efficient memory management for channel drivers in next generation I/O system
US6629179B1 (en) * 2000-07-31 2003-09-30 Adaptec, Inc. Message signaled interrupt generating device and method
US7069368B2 (en) 2000-12-01 2006-06-27 Clearcube Technology, Inc. System of co-located computers in a framework including removable function modules for adding modular functionality
US20020069318A1 (en) 2000-12-01 2002-06-06 Chow Yan Chiew Real time application accelerator and method of operating the same
US6470429B1 (en) 2000-12-29 2002-10-22 Compaq Information Technologies Group, L.P. System for identifying memory requests as noncacheable or reduce cache coherence directory lookups and bus snoops
US6718408B2 (en) 2001-01-18 2004-04-06 Hewlett-Packard Development Company, L.P. Interchangeable and configurable input/output module for a computing deviceco
US6950895B2 (en) 2001-06-13 2005-09-27 Intel Corporation Modular server architecture
US7143227B2 (en) 2003-02-18 2006-11-28 Dot Hill Systems Corporation Broadcast bridge apparatus for transferring data to redundant memory subsystems in a storage controller
US7062591B2 (en) 2001-09-28 2006-06-13 Dot Hill Systems Corp. Controller data sharing using a modular DMA architecture
US7146448B2 (en) 2001-09-28 2006-12-05 Dot Hill Systems Corporation Apparatus and method for adopting an orphan I/O port in a redundant storage controller
US6839788B2 (en) 2001-09-28 2005-01-04 Dot Hill Systems Corp. Bus zoning in a channel independent storage controller architecture
US7437493B2 (en) 2001-09-28 2008-10-14 Dot Hill Systems Corp. Modular architecture for a network storage controller

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5619642A (en) * 1994-12-23 1997-04-08 Emc Corporation Fault tolerant memory system which utilizes data from a shadow memory device upon the detection of erroneous data in a main memory device
EP0800138A1 (en) * 1996-04-04 1997-10-08 Symbios Logic Inc. Control apparatus and method for a RAID storage subsystem
EP0817054A2 (en) * 1996-06-28 1998-01-07 Digital Equipment Corporation Simultaneous, mirror write cache
US6493795B1 (en) * 1998-12-30 2002-12-10 Emc Corporation Data storage system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107861846A (en) * 2017-10-19 2018-03-30 曙光信息产业(北京)有限公司 Test the device and method of memory access effective bandwidth
CN107861846B (en) * 2017-10-19 2020-09-25 曙光信息产业(北京)有限公司 Device and method for testing memory access effective bandwidth

Also Published As

Publication number Publication date
US7143227B2 (en) 2006-11-28
US20040177126A1 (en) 2004-09-09
WO2004074996A3 (en) 2004-12-29

Similar Documents

Publication Publication Date Title
US7143227B2 (en) Broadcast bridge apparatus for transferring data to redundant memory subsystems in a storage controller
US7266633B2 (en) System and method for communicating the synchronization status of memory modules during initialization of the memory modules
US7613890B1 (en) Consistent replication across multiple storage devices
US6675253B1 (en) Dynamic routing of data across multiple data paths from a source controller to a destination controller
US6370611B1 (en) Raid XOR operations to synchronous DRAM using a read buffer and pipelining of synchronous DRAM burst read data
US7536495B2 (en) Certified memory-to-memory data transfer between active-active raid controllers
EP0768607B1 (en) Disk array controller for performing exclusive or operations
US6978397B2 (en) Memory controller supporting redundant synchronous memories
US7971011B2 (en) Remote copy method and storage system
US8880833B2 (en) System and method for read synchronization of memory modules
US6009481A (en) Mass storage system using internal system-level mirroring
US7526592B2 (en) Interrupt control system and storage control system using the same
US7315911B2 (en) Method for efficient inter-processor communication in an active-active RAID system using PCI-express links
US7062591B2 (en) Controller data sharing using a modular DMA architecture
US7055054B2 (en) Fail-over of multiple memory blocks in multiple memory modules in computer system
US6182267B1 (en) Ensuring accurate data checksum
KR100258079B1 (en) The duplicated device by extention of memory bus in a tightly coupled fault tolerance system
US6493785B1 (en) Communication mode between SCSI devices
JPH08335204A (en) Data-processing system with bidirectional synchronous multidrop data bus
KR20100075847A (en) An optical solution to control data channels
US6385674B1 (en) Systems and methods for dynamic alignment of associated portions of a code word from a plurality of asynchronous sources
JP2008544421A (en) Proven memory-to-memory data transfer between active-active RAID controllers
US6370616B1 (en) Memory interface controller for datum raid operations with a datum multiplier
US20040064660A1 (en) Multiplexed bus with multiple timing signals
US5375217A (en) Method and apparatus for synchronizing disk drive requests within a disk array

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A2

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A2

Designated state(s): BW GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
122 Ep: pct application non-entry in european phase