WO1999005809A1 - System for reducing bus overhead for communications with a network interface - Google Patents

System for reducing bus overhead for communications with a network interface Download PDF

Info

Publication number
WO1999005809A1
WO1999005809A1 PCT/US1998/010652 US9810652W WO9905809A1 WO 1999005809 A1 WO1999005809 A1 WO 1999005809A1 US 9810652 W US9810652 W US 9810652W WO 9905809 A1 WO9905809 A1 WO 9905809A1
Authority
WO
WIPO (PCT)
Prior art keywords
buffer
computer system
data
network interface
command
Prior art date
Application number
PCT/US1998/010652
Other languages
French (fr)
Inventor
Nestor A. Fesas, Jr.
Original Assignee
Jato Technologies, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jato Technologies, Inc. filed Critical Jato Technologies, Inc.
Priority to AU75973/98A priority Critical patent/AU7597398A/en
Publication of WO1999005809A1 publication Critical patent/WO1999005809A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/40Bus networks
    • H04L12/40006Architecture of a communication node
    • H04L12/40032Details regarding a bus interface enhancer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/28Data switching networks characterised by path configuration, e.g. LAN [Local Area Networks] or WAN [Wide Area Networks]
    • H04L12/40Bus networks
    • H04L12/40006Architecture of a communication node

Definitions

  • the present invention relates to a device for connecting a computer system to a computer network, and more particularly to a method and an apparatus for reducing bus overhead in communications between a computer system and a network interface device through which the computer system communicates with a high speed packet-switched network.
  • NICs network interface controllers
  • the interface to the computer system typically connects to a bus within the computer system, such as a PCI bus, through which data is transferred between the computer system memory and the NIC.
  • PIO Programmable I/O
  • Shared memory can be used to facilitate communications between the NIC and the computer system.
  • the NIC and the computer system communicate by writing to and reading from a shared memory that exists in both the address space of the computer system and the address space of the NIC. This again leads to a simple hardware and operating system implementation, and a clean interface between the computer system and the NIC. However, it again places a burden on the application program to explicitly manage communications between the computer system and the NIC.
  • DMA direct memory access
  • DMA transfers between computer systems and NICs are commonly accomplished using the scatter-gather technique.
  • scatter-gather a bus master device in the NIC is first instructed to obtain a command block from the memory of a host computer system.
  • the command block contains a list of physical addresses for blocks within the host system memory that are to be copied to the DMA device.
  • the command block also contains a count of the number of fragments in the command block and the overall length of the data contained in the fragments pointed to by the command block.
  • the DMA device parses the command block, extracting the address of each fragment, and transfers the fragments from the host memory to the DMA device. This process is repeated for each fragment listed in the command block until all of the data described by the command block is copied to the DMA device.
  • a significant performance bottleneck in using the scatter-gather technique for transferring data to a high speed network is the translation from virtual to physical addresses.
  • Peripheral devices such as a NIC, cannot use virtual memory addresses to effect the transfers, because the hardware to implement the virtual-to-physical address translation is typically located inside the CPU. This means that conversion between virtual and physical addresses must take place before transfers between a computer system and a NIC can take place. This conversion can take a great deal of time and consume a significant amount of the computer system's processing power.
  • the driver When data is passed to a device driver for transmission to the NIC, the driver first performs a virtual -to- physical address conversion for each buffer fragment passed down to it from the application layers above. It is possible for each buffer fragment to straddle physical pages of the memory system.
  • each virtual address converted may correspond to more than one physical address converted. Consequently, several virtual-to-physical address conversions may be required for each buffer of data that is transferred from the computer system to the NIC. This can be very time- consuming because each virtual-to-physical address translation can take from tens to hundreds of CPU cycles to accomplish.
  • Peripheral devices such as
  • NICs typically connect to computer systems through a peripheral interconnect bus, such as the PCI bus.
  • a peripheral interconnect bus such as the PCI bus.
  • devices connected to the bus contend for control of the bus. Once a device is granted control of the bus, it drives bus signal lines to transfer data to or from the computer system.
  • the performance impediment stems from the number of times a NIC must contend for the peripheral interconnect bus when transferring data using the scatter-gather technique.
  • bus contention to transfer data between a NIC and an attached computer system will occur three times per buffer transferred: first, when the computer system informs the NIC that a buffer is available for its use; second when the NIC reads the command block describing the buffer; and third when the NIC transfers data to or from the buffer.
  • at least two buffer fragments will be described in each command block.
  • What is needed is a method for performing DMA between a computer system and a NIC which is free from the overhead of performing virtual to physical address translations and minimizes the number of bus transactions required to initiate the DMA transfer process.
  • the present invention provides a method and an apparatus for transferring data between a computer system and a network interface card that avoids virtual-to-physical address translations.
  • the computer system allocates blocks of memory during system initialization for storing data in transit between the computer system and the NIC, and the physical addresses of these blocks of memory are stored in a table on the NIC. Consequently, address conversion is performed only once, when the memory is allocated.
  • the device driver copies the data from the upper layers into the next available memory block.
  • the device driver then formats a command and passes it to the NIC for processing.
  • PDC packet descriptor command
  • the PDC contains a small ordinal value that indexes a table in the NIC, which includes a set of physical addresses of buffers preallocated by the computer system in the computer system memory. These buffers are used for storing data in transit to the NIC.
  • the PDC also contains the length of the buffer to be copied to or from the NIC.
  • the present invention also allows for multiple packets to be formatted into buffers and then subsequently transferred to the NIC in a single I/O operation. The present invention provides a number of advantages. First, virtual- to-physical address translation is avoided at run time.
  • the present invention incurs additional overhead because the processor must move data from the application program into the data buffers in the computer system's memory before this data is transferred to the NIC. At first glance, this double copy operation appears to incur a great amount of additional processor overhead. However, this additional overhead is considerably smaller than the overhead involved in performing virtual-to-physical address translations. Each translation requires many tens (if not hundreds) of CPU cycles, and many such translations may be required for a single transfer operation. Consequently, the present invention provides a significant performance advantage for small data transfers, which represent a significant percentage of all data transfers. Hundreds of bytes can be moved to the preallocated buffer in the time it takes to perform just one virtual-to-physical address translation. Moreover, as microprocessors move to 64 and 128 bit architectures, their capacity to move data per clock will increase thereby further widening the performance advantage of the present invention over conventional scatter-gather DMA.
  • CPU utilization may not be the primary bottleneck.
  • bus utilization may be the largest bottleneck.
  • favoring bus utilization at the expense of CPU utilization is often a desirable tradeoff to make.
  • the present invention can be characterized as an apparatus for facilitating communications between a computer system, including a memory and a bus, and a packet-switched network, comprising: a bus interface coupled to the bus, for communicating across the bus; a transmit buffer, for storing data to be transmitted on the packet-switched network; a transmit data path, coupled to the bus interface and the transmit buffer, for transferring data from the bus interface to the transmit buffer; a receive buffer, for storing data received from the packet-switched network; a receive data path, coupled to the bus interface and the receive buffer, for transferring data from the receive buffer to the bus interface; a buffer address table, coupled to the bus interface, for storing at least one address of at least one buffer in the memory of the computer system, the at least one buffer being preallocated by the computer system and used to store data in transit between the computer system and one of the transmit buffer and the receive buffer; and a controller coupled to the transmit buffer, the receive buffer and the buffer address table, for controlling the transfer of data from the computer system to
  • the apparatus includes: a transmit command queue coupled to the bus interface and the controller, for storing transmit commands from the computer system; and a transmit execution queue, coupled to the bus interface, the transmit command queue and the controller, for storing and processing commands from the transmit command queue, and command blocks from the computer system which are referenced by commands from the transmit command queue.
  • the apparatus includes a receive command queue coupled to the bus interface and the controller, for storing receive commands from the computer system; and a receive execution queue, coupled to the bus interface, the receive command queue and the controller, for storing and processing commands from the receive command queue and command blocks from the computer system that are referenced by commands from the receive command queue.
  • the controller includes a mechanism to transfer a plurality of packets in a single operation between the at least one buffer preallocated by the computer system and the transmit buffer. According to another aspect of the present invention, the controller includes a mechanism to transfer a plurality of packets in a single operation between the receive buffer and the at least one buffer preallocated by the computer system.
  • the present invention can also be characterized as a method for transferring data between a computer system and a network interface device, the network interface device being coupled to a packet-switched network, and the computer system including a memory and a communication channel, the communication channel being coupled to the network interface device, the method comprising: receiving at the network interface device at least one address of a preallocated buffer in the memory; storing in the network interface device the at least one address of the preallocated buffer; receiving a command from the computer system through the communication channel, the command indicating that a transfer between the network interface device and the computer system is to take place; retrieving an address from the at least one address of a preallocated buffer stored in the network interface device; using the address to transfer data from the preallocated buffer in the memory to the network interface device if the command is a transmit command; and using the address to transfer data from the network interface device to the preallocated buffer in the memory if the command is a receive command.
  • FIG. 1 is a block diagram illustrating some of the major functional components of a system for coupling host computer system 190 with a high speed network 160 in accordance with an aspect of the present invention.
  • FIG. 2 is a block diagram illustrating the format of a packet descriptor list in accordance with an aspect of the present invention.
  • FIG. 3 is a block diagram illustrating the structure of a simplified packet descriptor command for initiating a transfer of data between a NIC and a computer system in accordance with an aspect of the present invention.
  • FIG. 4 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from host computer system 190 to a NIC in accordance with an aspect of the present invention.
  • FIG. 5 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from host computer system 150 to a NIC in accordance with an aspect of the present invention.
  • FIG. 6 illustrates the sequence of commands and data transfers involved in transferring data from a NIC to a host computer system 150 in accordance with an aspect of the present invention.
  • FIG. 7 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from a NIC to a host computer system 150 in accordance with an aspect of the present invention.
  • FIG. 8 is a block diagram illustrating some of the major functional components within a NIC in accordance with an aspect of the present invention.
  • FIG. 9 is a block diagram illustrating the structure of a preallocated buffer, including a plurality of packets for transmission to a NIC, in accordance with an aspect of the present invention.
  • FIG. 1 is a block diagram illustrating some of the major functional components of a host computer system 190, which connects to a high speed network 160 through network interface 195, in accordance with an aspect of the present invention.
  • Host computer system 190 includes interconnect bus 130, host bridge 110, host bus 140, host processor 120 and memory 180.
  • Host processor 120 connects to host bus 140, which also connects to host bridge 110.
  • Host processor 120 can be any type of processor system including a device controller, a microprocessor, or a mainframe computing system.
  • Host bus 140 is a bus which connects host processor 120 to host bridge 110.
  • Host bridge 110 includes cache controller 112 which connects to memory 180.
  • Memory 180 includes pre-allocated buffers 182, which are buffers pre-allocated by host processor 120 during system initialization. These buffers are used to store data in transit between host computer system 190 and high speed network 160.
  • Host bridge 110 additionally connects to interconnect bus 130.
  • Interconnect bus 130 is used to connect host computer system 190 to peripheral devices, such as network interface 195.
  • Interconnect bus 130 may be any type of commonly used interconnection bus, such as a PCI bus.
  • High speed network 160 is any type of high speed data network, including 100 megabit and gigabit Ethernet networks for example.
  • Network interface 195 may be implemented on a separate computer card, or it may be integrated into a computer system motherboard. It may also be integrated into a single silicon chip.
  • Network interface 195 includes physical layer interface 150 and controller 100.
  • Controller 100 is coupled to interconnect bus 130 within host computer system 190. Controller 100 additionally connects to physical layer interface 150, which connects to high speed network 160. Controller 100 performs the DMA functions involved in transferring data between memory 180 within host computer system 190, and high speed network 160. Controller 100 includes buffer address table 105, which includes a plurality of addresses of buffers in preallocated buffers 182 within memory 180. Physical layer interface 150 includes resources for performing communications across high speed network 160.
  • FIG. 2 is a diagram illustrating the structure of a packet descriptor list (PDL) in accordance with an aspect of the present invention.
  • the PDL illustrated in FIG. 2 specifies a transmission of data including the transmission of N separate fragments scattered through memory 180 within host computer system 190.
  • the PDL includes status field 200, which contains information regarding the status of the data transfer corresponding to the PDL. It also includes a number of fragments field 210, which indicates the number of fragments associated with the data transfer specified by the PDL.
  • the PDL also includes a packet length 220, which indicates the length of the entire transfer, including the plurality of associated fragments.
  • the PDL additionally includes a number of address/length pairs. Address of fragment 1 230 includes the physical address for fragment 1 within memory 180 of host computer system 190.
  • Length of fragment 1 235 includes the length of fragment 1.
  • Address of fragment 2 240 includes the address within memory 180 of fragment 2.
  • Length of fragment 2 245 includes the length of fragment 2.
  • address of fragment N 250 which contains the address of fragment N within memory 180.
  • the PDL additionally contains length of fragment N 255, which includes the length of fragment N.
  • FIG. 3 illustrates the format for a simplified command sent from host computer system 190 to network interface 195 in accordance with an aspect of the present invention.
  • the command is known as a packet description command (PDC) and fits within a single word of memory.
  • PDC packet description command
  • Options field 300 which is a field indicating the processing options for a transmission between host computer system 190 and network interface 195.
  • block index field 310 which indexes a buffer address table within controller 100, containing the physical address of a preallocated buffer within memory 180 in host computer system 190.
  • buffer length 320 which is the length of the data within the pre-allocated buffer that is to be transferred between network interface 195 and host computer system 190.
  • FIG. 4 illustrates the sequence of operations involved in transferring data from host computer system 190 to controller 100 within network interface 195 using a prior art packet descriptor list command format in accordance with an aspect of the present invention.
  • host computer system 190 writes a packet descriptor address to controller 100.
  • controller 100 uses this packet descriptor address to retrieve a command block 410 from host computer system 190.
  • controller 100 performs a series of retrieval operations 420, 430 and 440, to retrieve individual fragments from host computer system 190 into controller 100.
  • FIG. 5 illustrates the sequence of operations required to move data from host computer system 190 to controller 100 using a packet descriptor command format in accordance with an aspect of the present invention. The greatly simplified sequence of operations in this example results from the simplified command format and the preallocation of buffers within memory 180.
  • host computer system 190 writes a packet descriptor command 500 to controller
  • Controller 100 uses the block index 310 within this packet descriptor command as well as buffer length 320 to retrieve a buffer 510 within memory 180.
  • FIG. 6 illustrates the sequence of operations required to transfer data from controller 100 to host computer system 190 using a prior art packet descriptor list command format in accordance with an aspect of the present invention.
  • a packet descriptor address is pre-loaded 600 into controller 100 sometime before the incoming data is received at controller 100 from high speed network 160.
  • the packet descriptor address is used to retrieve a command block 610, including a packet descriptor list from host computer system 190.
  • a series of transfers, 620, 630 and 640 take place between controller 100 and host computer system 190 to transfer all of the constituent fragments to host computer system 190.
  • FIG. 6 illustrates the sequence of operations required to transfer data from controller 100 to host computer system 190 using a prior art packet descriptor list command format in accordance with an aspect of the present invention.
  • a packet descriptor command is preloaded 700 into controller 100 from host computer system 190 before data is received at controller 100.
  • this data is transferred to a buffer within host computer system 190 in a single transaction.
  • Controller 100 can buffer data until such time that host computer system 190 loads a packet descriptor command into controller 100.
  • FIG. 8 is a block diagram illustrating some of the major functional components of controller 100 in FIG. 1 in accordance with an aspect of the present invention.
  • Controller 100 includes bus master controller 800, which is coupled to transmit buffer 830 and receive buffer 835. Transmit buffer 830 and receive buffer 835 are used to store data to be transmitted and received from high speed network 160 pictured in FIG. 1.
  • Bus master controller 800 additionally connects to bus interface 820, which implements bus interface functions for a connection onto interconnect bus 130 within host computer system 190 in FIG. 1.
  • Bus master controller 800 includes transmit command FIFO 850, which stores transmit commands from host computer system 190.
  • Transmit command FIFO 850 is coupled to transmit execution queue 860.
  • Transmit execution queue 860 contains expanded commands from transmit command FIFO 850. If the command in transmit command FIFO 850 is a packet descriptor address, the corresponding command block is retrieved and placed into transmit execution queue 860. If the command is a packet descriptor command, the command is directly transferred to transmit execution queue 860.
  • Transmit execution queue 860 is additionally coupled to transmit buffer address table 870, which contains physical addresses of preallocated buffers within memory 180 in host computer system 190.
  • Bus master controller 800 additionally includes receive command FIFO
  • Bus master controller 800 also includes receive buffer address table 872, which contains a table of physical addresses of pre-allocated buffers for storing data received from high speed network 160.
  • Bus master controller 800 additionally includes byte aligner endian mode circuitry 840 and byte aligner endian mode circuitry 842.
  • Byte aligner endian mode circuitry 840 is coupled between bus interface 820 and transmit buffer 830. It performs byte alignment and endian mode reversal functions for control information associated with data transmissions from bus interface 820 to transmit buffer 830.
  • Byte aligner and endian mode circuitry 842 is coupled between receive buffer 835 and bus interface 820, and provides the same byte alignment and endian mode reversal functions for status information associated with data transmitted from receive buffer 835 and bus interface 820 in accordance with an aspect of the present invention.
  • Bus master controller 800 additionally includes bus master state machine
  • Bus master state machine 810 coordinates actions of the components within FIG. 8 to transfer data between bus interface 820 and transmit and receive buffers 830 and 835.
  • Bus master controller 800 is responsible for data flow between transmit buffer 830, receive buffer 835 and bus interface 820. It includes five major components: bus master state machine 810, byte aligner endian mode circuitry 840 and 842, transmit buffer address table 870, receive buffer address table 872, receive command FIFO 852 with receive execution queue 862, and transmit command FIFO 850 with transmit execution queue 860.
  • Bus master controller 800 supports three modes of operation: programmed I/O (PIO), packet descriptor list (PDL), and packet descriptor command (PDC).
  • PDC mode is also known as PROPULSION(tm) technology.
  • Bus master controller 800 decodes and controls transactions and routing of data required by the operating modes. As a shorthand for references to the address of a PDL, the term PDA is used throughout the remainder of the text.
  • Command execution queues 860 and 862 are used in the PDL and PDC modes of operation to hold either a packet descriptor address or a packet descriptor command.
  • a PDA provides the address where a corresponding packet descriptor list is obtained, while a PDC is used to execute a PROPULSION(tm) transaction.
  • the command execution queue contains the loaded PDL and/or PDC instructions from the command FIFO.
  • the PDL and PDC commands are executed from the command execution queues.
  • Packet descriptor lists are the data structures used to communicate information about transmit and receive packets. Both transmit and receive PDLs use the same format, shown in FIG. 2. Each PDL contains a packet descriptor header and one or more fragment descriptors describing the location and length of the packet data in host memory.
  • Packet length field 220 includes the sum of the length fields in the fragment descriptors.
  • the length field is also the sum of the fragment lengths, however this field is overwritten with the actual length of the packet after the packet is received.
  • bus master controller 800 scatters the packet across the locations described by each fragment descriptor. Note that the fragment lengths are not overwritten, so the last fragment transferred may contain less data than is indicated by the corresponding fragment length field.
  • the header length field also indicates how many fragments are completely filled and how much data is in the last fragment. If the buffer described by the receive PDL is not large enough to hold the complete packet, a receive overflow error is generated and the remaining data is discarded.
  • Transmit execution queue 860 and receive execution queue 862 are independent FIFOs containing 32 and 64 entries, respectively, in a preferred embodiment. They contain either a PDA or a PDC instructions.
  • the host software transfers control of the buffer to bus master controller 800.
  • the host computer system 190 writes a PDA or PDC to a command FIFO, the FIFO's command count register is incremented.
  • bus master controller 800 has processed the PDL pointed to by a PDA or the PDC, the PDA or the PDC is removed from the ring and the command count register is decremented.
  • the host software uses a command FIFO count register to determine how many PDA or PDC commands are currently owned by bus master controller 800. If the host is capable of writing commands, and thereby transferring control of the PDL/PDC to bus master controller 800, faster than bus master controller 800 uses the PDLs or PDCs, efficient pipelining of packets occurs and bus transactions overlap with network transactions. Because of the large size of FIFOs 850 and 852, some of the bursty nature of bus accesses can be smoothed out.
  • Bus master controller 800 uses command execution queues 860 and 862 as scratch memory while transferring packet data between host memory 180 and transmit and receive buffers 830 and 835. Bus master controller 800 copies the PDL pointed to by the PDA into a command FIFO in the corresponding command execution queue. Bus master controller 800 uses this PDL to program bus interface 820 with the location and length of each fragment to be transferred. Loading the complete PDL into scratch memory improves utilization of the interconnect bus 130 because, in most cases, the complete PDL can be transferred in one bus transaction. If bus master controller 800 were to read each fragment descriptor separately, performance would suffer because each fragment descriptor read would require a separate bus transaction including the associated arbitration latency.
  • Bus master controller 800 also uses command execution queues 860 and 862 to hold PDC instructions. PDC instructions are directly transferred from the command FIFO. PDCs are executed out of the command execution queues to maintain PDL/PDC ordering and to maximize the use of the command FIFO. Receive and transmit PDC instructions use the same word format, shown in FIG. 3.
  • the format contains buffer length 320, block index 310, and options field 300.
  • Block index 310 is an address into the corresponding buffer address table which contains the physical address at which data is to be transferred to or from the host memory.
  • Buffer length field 320 specifies the number of bytes to be transferred during transmit operations, or the allocated host memory space required for a receive operation.
  • Bus master controller 800 fills the allocated space, sets a receive overflow flag, and then discards the remaining amount of the packet.
  • Options field 300 is used to communicate special processing options to bus master controller 800, such as whether or not an interrupt is desired immediately upon the completion of a data transfer between controller 100 and host computer system 190.
  • Bus master state machine 810 coordinates and controls all activity associated with transferring packet data between host memory 180 and transmit and receive buffers 830 and 835 during PDL and PDC modes. Transmit and receive operations are independently described. However, they are actually performed by the same state machine 810 and are interleaved as necessary.
  • Bus master state machine 810 can be configured to give priority to a receive operation. If data reception is occurring fast enough, it will perform up to eight receive cycles for each transmit cycle.
  • a transmit PDL transaction is generated by the host software, which creates a PDL describing the packet in host memory, and transfers control to bus master controller 800 by writing the PDL Address (PDA) to transmit command
  • FIFO 850 Writing the PDA to transmit command FIFO 850 causes the transmit command count register to increment. When the command count register is greater than zero, bus master controller 800 extracts the next command from the FIFO. Since every PDL must have at least one fragment, bus master controller 800 programs bus interface 820 to read the PDL header and one fragment into transmit execution queue 860. If there is more than one fragment, bus master controller 800 reads the header to determine this fact and adjusts the number of fragments to be transferred to the execution queue. Once the complete PDL has been copied into the command execution queue, that PDA is discarded.
  • bus master controller 800 checks transmit buffer 830 to ensure there is enough room to load another packet. If there is not enough room, bus master controller 800 waits until transmit buffer 830 has enough room for a new packet.
  • Bus master controller 800 then proceeds to interpret each fragment descriptor and programs bus interface 820 to copy each fragment from host memory 180 to transmit buffer 830. When all fragments have been copied to transmit buffer 830, bus master controller 800 discards the PDL and checks the command queue to determine if another PDL is available. If so, bus master controller 800 executes the new PDL and repeats the operation described above. The operation of bus master controller 800 during receive transfers is similar to the transmit case with a few subtle differences. The host software creates a receive PDL, which describes the buffer in which to transfer the received data. A corresponding PDA is then written to receive command FIFO 852, transferring control to bus master controller 800.
  • Bus master controller 800 transfers the PDL into receive execution queue 862, even if no received packets are available in the receive data buffers. By transferring the PDL before it is actually needed, bus master controller 800 attempts to reduce the latency between receiving a packet and transferring it to host memory.
  • bus master controller 800 uses the preloaded PDL to determine how to scatter the received packet into host memory 180.
  • Bus master controller 800 programs the necessary transactions into bus interface 820 to copy each fragment of the received packet into host memory 180 as described by the PDL fragment descriptors.
  • the first word of the received packet in receive buffer 835 contains the total packet length and the receive status. This word is saved by bus master controller 800 and is transferred to the receive PDL in host memory 180 after all packet data has been copied to host memory 180.
  • PDL becoming non-zero indicates that bus master controller 800 has transferred control of the buffer and associated PDL back to host computer system 190. This process is repeated until all received packets have been transferred or the PDA receive command FIFO is exhausted.
  • Transmit PDC mode provides increased performance by reducing the number of bus acquisitions required. This mode requires that physical addresses of preallocated buffers be loaded into transmit buffer address table 870 during system initialization. An operation starts with the host software gathering data fragments into a predefined contiguous memory space in one of the preallocated buffers. Once this is complete, the information needed for a PDC instruction is known. The PDC instruction is then created and transferred to the transmit command FIFO 850, and the command count register is incremented.
  • bus master controller 800 If an instruction is at the beginning of transmit command FIFO 850, it is transferred to transmit execution queue 860. The command FIFO count is then decremented and the execution queue count is incremented. If the execution queue count is non-zero, control passes to bus master controller 800, which begins execution of the instruction. Bus master controller 800 decodes the instruction for the length and base index information while checking the transmit buffer flags for available packet space. With the proper information and status, bus master controller 800 configures bus interface 820 to commence transfer to the data buffer. Once the transfer is complete, the PDC instruction is discarded, the command queue count is decremented, and a complete transfer flag is set.
  • a receive PDC transaction is initiated by the receive buffer 835.
  • receive buffer address table 872, receive execution queue 862, and receive command FIFO 852 are preloaded. With all of this information loaded in advance, the data transfer occurs with minimal overhead. This information can also be reloaded during period of non-use, or when empty, to better distribute the workload over time.
  • Bus master controller 800 accesses the PDC instruction on top of receive command execution queue 862, and examines the block index and length. Bus master controller 800 uses this information to program bus interface 820 to request a bus transaction. At the same time, bus master controller 800 determines if the initial data packet will fit in the allocated memory space. If the data packet will not fit, bus master controller 800 sends data until the host memory space is filled, sets the receive overflow flag, and signals receive buffer 835 to discard the remaining portion of the data packet. If the memory space is greater than one packet, bus master controller 800 determines if another packet can be transferred, and sends additional packets until the allocated memory space is filled.
  • bus master controller 800 breaks the transfer into multiple transactions. Once the transfer is complete, the command execution count is decremented and another command is loaded. If another packet is ready in receive buffer 835, this cycle is repeated.
  • PIO programmed I/O
  • PIO programmed I/O
  • This mode requires, very little or no action from bus master controller 800.
  • PIO transfers one double word (32 bits) of data at a time directly to transmit buffer 830.
  • Each transfer requires a separate request for interconnect bus 130, thus increasing total bus acquisition latency.
  • Bus master controller 800 is only responsible for routing data to transmit buffer 830. The host software makes certain that packet space is available, and indicates to transmit buffer 830 that the packet transfer is complete by setting appropriate flags.
  • a receive PIO transfer is initiated by the host software.
  • the host software ensures that a packet is available prior to transferring the packet to host computer system 190. Double word transfers are performed across interconnect bus 130 until all packet data has been transferred.
  • the host software is also responsible for maintaining packet and data integrity.
  • the packet transfer process minimizes the number of interrupts necessary to interact with bus master controller 800.
  • host computer system 190 receives and transfers packets without ever taking an interrupt.
  • the host software can program bus master controller 800 to generate an interrupt in the following cases: after each packet has been transferred to the transmit buffer 830; when the transmit command FIFO is exhausted; or upon any transmit error.
  • Receive interrupts are generated after each receive packet has been transferred to host memory 180, when the receive command FIFO 852 is exhausted, or when receive errors occurs.
  • bus master controller 800 implements a "lying send" transmit policy in which a successful packet transmission is signaled to the host software as soon as possible after bus master controller 800 completes the data transfer between host memory 180 and transmit buffer 830 or receive buffer 835.
  • the packet is considered to be "transmitted” the moment bus master controller 800 has a complete copy of the packet. It is the responsibility of protocols above the driver level to ensure that packets are successfully transmitted to remote stations. If a packet is lost during transmission by bus master controller 800, the protocol must recognize that the packet is lost and take a corrective action, such as a retransmission.
  • Errors such as CRC, runt packet and long packet errors are detected by bus master controller 800 and signaled to the host software by specific bits in the receive PDL header.
  • the header also contains additional information bits pertaining to the inbound packet. Padding of packets that are shorter than the minimum legal length for transmission is the responsibility of software on host computer system 190.
  • Bus master controller 800 supports the use of PDL and PDC data transfer methods simultaneously.
  • software on host computer system 190 indicates the required transfer method by placing an appropriate command in transmit command FIFO 850 or receive command FIFO 852.
  • software on host computer system 190 initiates the process by writing a PDC to the appropriate command FIFO. If host computer system 190 wishes to transmit a packet using conventional bus master DMA, it writes a PDA to the command
  • intermixing PDC and PDL data transfer methods can be accomplished with just one index variable for the PDC queue, one for the PDL queue and a counter variable reflecting the free space in transmit command FIFO 850.
  • This technique works as long as each queue accommodates at least as many entries as transmit command FIFO 850.
  • CMD # of entries that can be accommodated in the command FIFO
  • PDL # of entries that can be accommodated in the PDL queue
  • the PDL and PDC variables with the "used"' subscript indicate the entries in the respective rings that contain transmit requests.
  • FIG. 9 is a diagram illustrating two packets packed into a single preallocated buffer within memory 180 in accordance with an aspect of the present invention.
  • FIG. 9 includes a first packet, including flags 901 and length 902.
  • Flags 901 contain status information for the first packet.
  • Length field 902 contains the length of the first packet.
  • the first packet also includes data 903, which contains all of the data associated with the packet.
  • the second packet includes flags 911 and length 912.
  • Flags 911 contain status information for the second packet.
  • Length field 912 contains the length of the second packet.
  • the second packet also includes data 913, which is the data associated with the second packet. As indicated by the ellipsis, additional packets may be included into a single preallocated buffer.

Abstract

The present invention provides a method and an apparatus for transferring data between a computer system (190) and a network interface card (195) that avoids virtual-to-physical address translations. The computer system (190) allocates blocks of memory (180) during system initialization for storing data in transit between the computer system (190) and the NIC (195), and the physical addresses of the blocks of memory (180) are stored in a table (105) on the NIC (195). Consequently, address conversion is performed only once, when the memory (180) is allocated. When a request to transfer data to the NIC (195) is received from the upper layers, the device driver copies the data from the upper layers into the next available memory block. The device driver then formats a command and passes into the NIC (195) for processing. Data transfer commands are communicated to the NIC (195) through a packet descriptor command (PDC), which is a 32-bit value subdivided into fields that completely describe the data transfer operation. The PDC contains a small ordinal value that indexes a table (105) in the NIC (195), which includes a set of physical addresses of buffers (182) preallocated by the computer system (190) in the computer system memory (180). These buffers (182) are used for storing data in transit to the NIC (195). The present invention also allows for multiple packets to be formatted into buffers and then subsequently transferred to the NIC (195) in a single I/O operation.

Description

SYSTEM FOR REDUCING BUS OVERHEAD FOR
COMMUNICATIONS WITH
A NETWORK INTERFACE
BACKGROUND
Field of the Invention
The present invention relates to a device for connecting a computer system to a computer network, and more particularly to a method and an apparatus for reducing bus overhead in communications between a computer system and a network interface device through which the computer system communicates with a high speed packet-switched network.
Related Art
The advent of computer networking has given rise to devices that connect computer systems to packet-switched data networks. These devices (known as network interface controllers, or NICs) typically include interfaces to both the computer system and the packet-switched data network, as well as a buffer memory for buffering packets of data in transit between the computer system and the packet-switched data network. The interface to the computer system typically connects to a bus within the computer system, such as a PCI bus, through which data is transferred between the computer system memory and the NIC. As computer networks and NICs greatly increase in performance, communications across this bus can become an impediment to achieving high performance in communications between the computer system and the packet- switched data network.
Three methods can be used to communicate between a computer system and a device such as a NIC. (1) Programmed I/O (PIO) operates by including explicit I/O commands in the application programs executed by the computer system. PIO can be implemented with a simple hardware and operating system design. However, it places a tremendous burden on the application program to explicitly manage communications between the computer system and the NIC.
(2) Shared memory can be used to facilitate communications between the NIC and the computer system. In a shared memory system, the NIC and the computer system communicate by writing to and reading from a shared memory that exists in both the address space of the computer system and the address space of the NIC. This again leads to a simple hardware and operating system implementation, and a clean interface between the computer system and the NIC. However, it again places a burden on the application program to explicitly manage communications between the computer system and the NIC.
(3) Finally, direct memory access (DMA) can be used to transfer data between the NIC and the memory of the computer system. DMA operates by allowing the NIC to perform bus operations to directly access the memory of the computer system in order to transfer data between the computer system and the NIC. A DMA system requires considerable complexity in hardware and operating system design. However, it relieves the application program of the burden of explicitly managing communications between the computer system and the NIC.
DMA transfers between computer systems and NICs are commonly accomplished using the scatter-gather technique. In scatter-gather, a bus master device in the NIC is first instructed to obtain a command block from the memory of a host computer system. At a minimum, the command block contains a list of physical addresses for blocks within the host system memory that are to be copied to the DMA device. The command block also contains a count of the number of fragments in the command block and the overall length of the data contained in the fragments pointed to by the command block. The DMA device parses the command block, extracting the address of each fragment, and transfers the fragments from the host memory to the DMA device. This process is repeated for each fragment listed in the command block until all of the data described by the command block is copied to the DMA device. A significant performance bottleneck in using the scatter-gather technique for transferring data to a high speed network is the translation from virtual to physical addresses. Peripheral devices, such as a NIC, cannot use virtual memory addresses to effect the transfers, because the hardware to implement the virtual-to-physical address translation is typically located inside the CPU. This means that conversion between virtual and physical addresses must take place before transfers between a computer system and a NIC can take place. This conversion can take a great deal of time and consume a significant amount of the computer system's processing power. When data is passed to a device driver for transmission to the NIC, the driver first performs a virtual -to- physical address conversion for each buffer fragment passed down to it from the application layers above. It is possible for each buffer fragment to straddle physical pages of the memory system. Thus, more than one physical address may correspond to each virtual address converted. Consequently, several virtual-to-physical address conversions may be required for each buffer of data that is transferred from the computer system to the NIC. This can be very time- consuming because each virtual-to-physical address translation can take from tens to hundreds of CPU cycles to accomplish.
Another significant performance impediment associated with the scatter-gather technique is its command block nature. Peripheral devices such as
NICs typically connect to computer systems through a peripheral interconnect bus, such as the PCI bus. In order to transfer data to or from the computer system, devices connected to the bus contend for control of the bus. Once a device is granted control of the bus, it drives bus signal lines to transfer data to or from the computer system. The performance impediment stems from the number of times a NIC must contend for the peripheral interconnect bus when transferring data using the scatter-gather technique. Under ideal circumstances for scatter-gather, bus contention to transfer data between a NIC and an attached computer system will occur three times per buffer transferred: first, when the computer system informs the NIC that a buffer is available for its use; second when the NIC reads the command block describing the buffer; and third when the NIC transfers data to or from the buffer. In typical scenarios, at least two buffer fragments will be described in each command block. As a result, there will be at least four contentions instead of three. These additional contentions create opportunities for other devices to obtain control of the bus and thus delay transfers initiated by the NIC.
What is needed is a method for performing DMA between a computer system and a NIC which is free from the overhead of performing virtual to physical address translations and minimizes the number of bus transactions required to initiate the DMA transfer process.
SUMMARY
The present invention provides a method and an apparatus for transferring data between a computer system and a network interface card that avoids virtual-to-physical address translations. The computer system allocates blocks of memory during system initialization for storing data in transit between the computer system and the NIC, and the physical addresses of these blocks of memory are stored in a table on the NIC. Consequently, address conversion is performed only once, when the memory is allocated. When a request to transfer data to the NIC is received from the upper layers, the device driver copies the data from the upper layers into the next available memory block. The device driver then formats a command and passes it to the NIC for processing. Data transfer commands are communicated to the NIC through a packet descriptor command (PDC), which is a 32-bit value subdivided into fields that completely describe the data transfer operation. The PDC contains a small ordinal value that indexes a table in the NIC, which includes a set of physical addresses of buffers preallocated by the computer system in the computer system memory. These buffers are used for storing data in transit to the NIC. The PDC also contains the length of the buffer to be copied to or from the NIC. The present invention also allows for multiple packets to be formatted into buffers and then subsequently transferred to the NIC in a single I/O operation. The present invention provides a number of advantages. First, virtual- to-physical address translation is avoided at run time. Second, the formatting of a packet descriptor list is greatly simplified. Third, the amount of control data transferred to the NIC by the computer system is greatly reduced. Finally, multiple packets can be transferred to the NIC in a single I/O operation, thereby making more efficient use of bandwidth on the interconnect bus.
The present invention incurs additional overhead because the processor must move data from the application program into the data buffers in the computer system's memory before this data is transferred to the NIC. At first glance, this double copy operation appears to incur a great amount of additional processor overhead. However, this additional overhead is considerably smaller than the overhead involved in performing virtual-to-physical address translations. Each translation requires many tens (if not hundreds) of CPU cycles, and many such translations may be required for a single transfer operation. Consequently, the present invention provides a significant performance advantage for small data transfers, which represent a significant percentage of all data transfers. Hundreds of bytes can be moved to the preallocated buffer in the time it takes to perform just one virtual-to-physical address translation. Moreover, as microprocessors move to 64 and 128 bit architectures, their capacity to move data per clock will increase thereby further widening the performance advantage of the present invention over conventional scatter-gather DMA.
Furthermore, CPU utilization may not be the primary bottleneck. In systems which move around large amounts of data, bus utilization may be the largest bottleneck. Hence, favoring bus utilization at the expense of CPU utilization is often a desirable tradeoff to make.
Thus, the present invention can be characterized as an apparatus for facilitating communications between a computer system, including a memory and a bus, and a packet-switched network, comprising: a bus interface coupled to the bus, for communicating across the bus; a transmit buffer, for storing data to be transmitted on the packet-switched network; a transmit data path, coupled to the bus interface and the transmit buffer, for transferring data from the bus interface to the transmit buffer; a receive buffer, for storing data received from the packet-switched network; a receive data path, coupled to the bus interface and the receive buffer, for transferring data from the receive buffer to the bus interface; a buffer address table, coupled to the bus interface, for storing at least one address of at least one buffer in the memory of the computer system, the at least one buffer being preallocated by the computer system and used to store data in transit between the computer system and one of the transmit buffer and the receive buffer; and a controller coupled to the transmit buffer, the receive buffer and the buffer address table, for controlling the transfer of data from the computer system to the transmit buffer, and from the receive buffer to the computer system.
According to an aspect of the present invention, the apparatus includes: a transmit command queue coupled to the bus interface and the controller, for storing transmit commands from the computer system; and a transmit execution queue, coupled to the bus interface, the transmit command queue and the controller, for storing and processing commands from the transmit command queue, and command blocks from the computer system which are referenced by commands from the transmit command queue. According to another aspect of the present invention, the apparatus includes a receive command queue coupled to the bus interface and the controller, for storing receive commands from the computer system; and a receive execution queue, coupled to the bus interface, the receive command queue and the controller, for storing and processing commands from the receive command queue and command blocks from the computer system that are referenced by commands from the receive command queue.
According to another aspect of the present invention, the controller includes a mechanism to transfer a plurality of packets in a single operation between the at least one buffer preallocated by the computer system and the transmit buffer. According to another aspect of the present invention, the controller includes a mechanism to transfer a plurality of packets in a single operation between the receive buffer and the at least one buffer preallocated by the computer system. The present invention can also be characterized as a method for transferring data between a computer system and a network interface device, the network interface device being coupled to a packet-switched network, and the computer system including a memory and a communication channel, the communication channel being coupled to the network interface device, the method comprising: receiving at the network interface device at least one address of a preallocated buffer in the memory; storing in the network interface device the at least one address of the preallocated buffer; receiving a command from the computer system through the communication channel, the command indicating that a transfer between the network interface device and the computer system is to take place; retrieving an address from the at least one address of a preallocated buffer stored in the network interface device; using the address to transfer data from the preallocated buffer in the memory to the network interface device if the command is a transmit command; and using the address to transfer data from the network interface device to the preallocated buffer in the memory if the command is a receive command.
DESCRIPTION OF THE FIGURES
FIG. 1 is a block diagram illustrating some of the major functional components of a system for coupling host computer system 190 with a high speed network 160 in accordance with an aspect of the present invention.
FIG. 2 is a block diagram illustrating the format of a packet descriptor list in accordance with an aspect of the present invention.
FIG. 3 is a block diagram illustrating the structure of a simplified packet descriptor command for initiating a transfer of data between a NIC and a computer system in accordance with an aspect of the present invention. FIG. 4 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from host computer system 190 to a NIC in accordance with an aspect of the present invention.
FIG. 5 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from host computer system 150 to a NIC in accordance with an aspect of the present invention.
FIG. 6 illustrates the sequence of commands and data transfers involved in transferring data from a NIC to a host computer system 150 in accordance with an aspect of the present invention. FIG. 7 is a diagram illustrating the sequence of commands and data transfers involved in transferring data from a NIC to a host computer system 150 in accordance with an aspect of the present invention.
FIG. 8 is a block diagram illustrating some of the major functional components within a NIC in accordance with an aspect of the present invention. FIG. 9 is a block diagram illustrating the structure of a preallocated buffer, including a plurality of packets for transmission to a NIC, in accordance with an aspect of the present invention.
DESCRIPTION The following description is presented to enable any person skilled in the art to make and use the invention, and is provided in the context of a particular application and its requirements. Various modifications to the disclosed embodiments will be readily apparent to those skilled in the art, and the general principles defined herein may be applied to other embodiments and applications without departing from the spirit and scope of the present invention. Thus, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and features disclosed herein.
FIG. 1 is a block diagram illustrating some of the major functional components of a host computer system 190, which connects to a high speed network 160 through network interface 195, in accordance with an aspect of the present invention. Host computer system 190 includes interconnect bus 130, host bridge 110, host bus 140, host processor 120 and memory 180. Host processor 120 connects to host bus 140, which also connects to host bridge 110. Host processor 120 can be any type of processor system including a device controller, a microprocessor, or a mainframe computing system. Host bus 140 is a bus which connects host processor 120 to host bridge 110. Host bridge 110 includes cache controller 112 which connects to memory 180. Memory 180 includes pre-allocated buffers 182, which are buffers pre-allocated by host processor 120 during system initialization. These buffers are used to store data in transit between host computer system 190 and high speed network 160. Host bridge 110 additionally connects to interconnect bus 130. Interconnect bus 130 is used to connect host computer system 190 to peripheral devices, such as network interface 195. Interconnect bus 130 may be any type of commonly used interconnection bus, such as a PCI bus. High speed network 160 is any type of high speed data network, including 100 megabit and gigabit Ethernet networks for example. Network interface 195 may be implemented on a separate computer card, or it may be integrated into a computer system motherboard. It may also be integrated into a single silicon chip. Network interface 195 includes physical layer interface 150 and controller 100. Controller 100 is coupled to interconnect bus 130 within host computer system 190. Controller 100 additionally connects to physical layer interface 150, which connects to high speed network 160. Controller 100 performs the DMA functions involved in transferring data between memory 180 within host computer system 190, and high speed network 160. Controller 100 includes buffer address table 105, which includes a plurality of addresses of buffers in preallocated buffers 182 within memory 180. Physical layer interface 150 includes resources for performing communications across high speed network 160.
FIG. 2 is a diagram illustrating the structure of a packet descriptor list (PDL) in accordance with an aspect of the present invention. The PDL illustrated in FIG. 2 specifies a transmission of data including the transmission of N separate fragments scattered through memory 180 within host computer system 190. The PDL includes status field 200, which contains information regarding the status of the data transfer corresponding to the PDL. It also includes a number of fragments field 210, which indicates the number of fragments associated with the data transfer specified by the PDL. The PDL also includes a packet length 220, which indicates the length of the entire transfer, including the plurality of associated fragments. The PDL additionally includes a number of address/length pairs. Address of fragment 1 230 includes the physical address for fragment 1 within memory 180 of host computer system 190. Length of fragment 1 235 includes the length of fragment 1. Address of fragment 2 240 includes the address within memory 180 of fragment 2. Length of fragment 2 245 includes the length of fragment 2. Next, there are a number of intervening fragments, and then address of fragment N 250, which contains the address of fragment N within memory 180. The PDL additionally contains length of fragment N 255, which includes the length of fragment N.
FIG. 3 illustrates the format for a simplified command sent from host computer system 190 to network interface 195 in accordance with an aspect of the present invention. The command is known as a packet description command (PDC) and fits within a single word of memory. It includes options field 300, which is a field indicating the processing options for a transmission between host computer system 190 and network interface 195. It also includes block index field 310, which indexes a buffer address table within controller 100, containing the physical address of a preallocated buffer within memory 180 in host computer system 190. Finally, it includes buffer length 320, which is the length of the data within the pre-allocated buffer that is to be transferred between network interface 195 and host computer system 190. Note that this simplified command format does not require a "number of fragments" field because only one fragment is sent. It also does not require a separate address and length for each fragment because multiple fragments are concatenated together within a single pre-allocated buffer to be transferred in a single operation. FIG. 4 illustrates the sequence of operations involved in transferring data from host computer system 190 to controller 100 within network interface 195 using a prior art packet descriptor list command format in accordance with an aspect of the present invention. First, host computer system 190 writes a packet descriptor address to controller 100. Next, controller 100 uses this packet descriptor address to retrieve a command block 410 from host computer system 190. Once this command block is retrieved, controller 100 performs a series of retrieval operations 420, 430 and 440, to retrieve individual fragments from host computer system 190 into controller 100. FIG. 5 illustrates the sequence of operations required to move data from host computer system 190 to controller 100 using a packet descriptor command format in accordance with an aspect of the present invention. The greatly simplified sequence of operations in this example results from the simplified command format and the preallocation of buffers within memory 180. First, host computer system 190 writes a packet descriptor command 500 to controller
100. Controller 100 uses the block index 310 within this packet descriptor command as well as buffer length 320 to retrieve a buffer 510 within memory 180.
FIG. 6 illustrates the sequence of operations required to transfer data from controller 100 to host computer system 190 using a prior art packet descriptor list command format in accordance with an aspect of the present invention. First, a packet descriptor address is pre-loaded 600 into controller 100 sometime before the incoming data is received at controller 100 from high speed network 160. Next, the packet descriptor address is used to retrieve a command block 610, including a packet descriptor list from host computer system 190. Next, when the incoming data is received from high speed network 160, a series of transfers, 620, 630 and 640, take place between controller 100 and host computer system 190 to transfer all of the constituent fragments to host computer system 190. In contrast, FIG. 7 presents a greatly simplified series of transactions required to move data from controller 100 to host computer system 190 using a packet descriptor command format in accordance with an aspect of the present invention. First, a packet descriptor command is preloaded 700 into controller 100 from host computer system 190 before data is received at controller 100. Next, when data is finally received at controller 100 from high speed network 160, this data is transferred to a buffer within host computer system 190 in a single transaction.
Although optimum performance is attained when the packet descriptor command 700 is preloaded into controller 100, this sequence is not a requirement. Controller 100 can buffer data until such time that host computer system 190 loads a packet descriptor command into controller 100.
FIG. 8 is a block diagram illustrating some of the major functional components of controller 100 in FIG. 1 in accordance with an aspect of the present invention. Controller 100 includes bus master controller 800, which is coupled to transmit buffer 830 and receive buffer 835. Transmit buffer 830 and receive buffer 835 are used to store data to be transmitted and received from high speed network 160 pictured in FIG. 1. Bus master controller 800 additionally connects to bus interface 820, which implements bus interface functions for a connection onto interconnect bus 130 within host computer system 190 in FIG. 1. Bus master controller 800 includes transmit command FIFO 850, which stores transmit commands from host computer system 190.
Transmit command FIFO 850 is coupled to transmit execution queue 860. Transmit execution queue 860 contains expanded commands from transmit command FIFO 850. If the command in transmit command FIFO 850 is a packet descriptor address, the corresponding command block is retrieved and placed into transmit execution queue 860. If the command is a packet descriptor command, the command is directly transferred to transmit execution queue 860. Transmit execution queue 860 is additionally coupled to transmit buffer address table 870, which contains physical addresses of preallocated buffers within memory 180 in host computer system 190. Bus master controller 800 additionally includes receive command FIFO
852, which contains receive commands from host computer system 190 that are preloaded into receive command FIFO 852. Receive command FIFO 852 is coupled to receive execution queue 862, which contains expanded commands from receive command FIFO 852. Again, packet descriptor addresses within receive command FIFO 852 are expanded into corresponding command blocks which are loaded into receive execution queue 862. Packet descriptor commands are directly loaded into receive execution queue 862. Bus master controller 800 also includes receive buffer address table 872, which contains a table of physical addresses of pre-allocated buffers for storing data received from high speed network 160. Bus master controller 800 additionally includes byte aligner endian mode circuitry 840 and byte aligner endian mode circuitry 842. Byte aligner endian mode circuitry 840 is coupled between bus interface 820 and transmit buffer 830. It performs byte alignment and endian mode reversal functions for control information associated with data transmissions from bus interface 820 to transmit buffer 830. Byte aligner and endian mode circuitry 842 is coupled between receive buffer 835 and bus interface 820, and provides the same byte alignment and endian mode reversal functions for status information associated with data transmitted from receive buffer 835 and bus interface 820 in accordance with an aspect of the present invention. Bus master controller 800 additionally includes bus master state machine
810, which is coupled to all of the functional components within bus master controller 800, and is additionally coupled to bus interface 820, transmit buffer 830 and receive buffer 835. Bus master state machine 810 coordinates actions of the components within FIG. 8 to transfer data between bus interface 820 and transmit and receive buffers 830 and 835.
Bus master controller 800 is responsible for data flow between transmit buffer 830, receive buffer 835 and bus interface 820. It includes five major components: bus master state machine 810, byte aligner endian mode circuitry 840 and 842, transmit buffer address table 870, receive buffer address table 872, receive command FIFO 852 with receive execution queue 862, and transmit command FIFO 850 with transmit execution queue 860. Bus master controller 800 supports three modes of operation: programmed I/O (PIO), packet descriptor list (PDL), and packet descriptor command (PDC). The PDC mode is also known as PROPULSION(tm) technology. Bus master controller 800 decodes and controls transactions and routing of data required by the operating modes. As a shorthand for references to the address of a PDL, the term PDA is used throughout the remainder of the text.
Command execution queues 860 and 862 are used in the PDL and PDC modes of operation to hold either a packet descriptor address or a packet descriptor command. A PDA provides the address where a corresponding packet descriptor list is obtained, while a PDC is used to execute a PROPULSION(tm) transaction. The command execution queue contains the loaded PDL and/or PDC instructions from the command FIFO. The PDL and PDC commands are executed from the command execution queues. Packet descriptor lists are the data structures used to communicate information about transmit and receive packets. Both transmit and receive PDLs use the same format, shown in FIG. 2. Each PDL contains a packet descriptor header and one or more fragment descriptors describing the location and length of the packet data in host memory. In the case of a transmit PDL, the PDL describes the location and length of fragments that comprise the total packet. Packet length field 220 includes the sum of the length fields in the fragment descriptors. For receive PDLs, the length field is also the sum of the fragment lengths, however this field is overwritten with the actual length of the packet after the packet is received. When transferring a received packet from receive buffer 835 to host memory 180, bus master controller 800 scatters the packet across the locations described by each fragment descriptor. Note that the fragment lengths are not overwritten, so the last fragment transferred may contain less data than is indicated by the corresponding fragment length field. The header length field also indicates how many fragments are completely filled and how much data is in the last fragment. If the buffer described by the receive PDL is not large enough to hold the complete packet, a receive overflow error is generated and the remaining data is discarded.
Transmit execution queue 860 and receive execution queue 862 are independent FIFOs containing 32 and 64 entries, respectively, in a preferred embodiment. They contain either a PDA or a PDC instructions.
By writing a PDA or PDC to either a transmit command FIFO 850 or receive command FIFO 852, the host software transfers control of the buffer to bus master controller 800. Each time the host computer system 190 writes a PDA or PDC to a command FIFO, the FIFO's command count register is incremented. After bus master controller 800 has processed the PDL pointed to by a PDA or the PDC, the PDA or the PDC is removed from the ring and the command count register is decremented.
The host software uses a command FIFO count register to determine how many PDA or PDC commands are currently owned by bus master controller 800. If the host is capable of writing commands, and thereby transferring control of the PDL/PDC to bus master controller 800, faster than bus master controller 800 uses the PDLs or PDCs, efficient pipelining of packets occurs and bus transactions overlap with network transactions. Because of the large size of FIFOs 850 and 852, some of the bursty nature of bus accesses can be smoothed out.
Bus master controller 800 uses command execution queues 860 and 862 as scratch memory while transferring packet data between host memory 180 and transmit and receive buffers 830 and 835. Bus master controller 800 copies the PDL pointed to by the PDA into a command FIFO in the corresponding command execution queue. Bus master controller 800 uses this PDL to program bus interface 820 with the location and length of each fragment to be transferred. Loading the complete PDL into scratch memory improves utilization of the interconnect bus 130 because, in most cases, the complete PDL can be transferred in one bus transaction. If bus master controller 800 were to read each fragment descriptor separately, performance would suffer because each fragment descriptor read would require a separate bus transaction including the associated arbitration latency.
Bus master controller 800 also uses command execution queues 860 and 862 to hold PDC instructions. PDC instructions are directly transferred from the command FIFO. PDCs are executed out of the command execution queues to maintain PDL/PDC ordering and to maximize the use of the command FIFO. Receive and transmit PDC instructions use the same word format, shown in FIG. 3. The format contains buffer length 320, block index 310, and options field 300. Block index 310 is an address into the corresponding buffer address table which contains the physical address at which data is to be transferred to or from the host memory. Buffer length field 320 specifies the number of bytes to be transferred during transmit operations, or the allocated host memory space required for a receive operation. If the packet data for a receive operation is larger than the allocated host memory space, bus master controller 800 fills the allocated space, sets a receive overflow flag, and then discards the remaining amount of the packet. Options field 300 is used to communicate special processing options to bus master controller 800, such as whether or not an interrupt is desired immediately upon the completion of a data transfer between controller 100 and host computer system 190. Bus master state machine 810 coordinates and controls all activity associated with transferring packet data between host memory 180 and transmit and receive buffers 830 and 835 during PDL and PDC modes. Transmit and receive operations are independently described. However, they are actually performed by the same state machine 810 and are interleaved as necessary. Bus master state machine 810 can be configured to give priority to a receive operation. If data reception is occurring fast enough, it will perform up to eight receive cycles for each transmit cycle.
A transmit PDL transaction is generated by the host software, which creates a PDL describing the packet in host memory, and transfers control to bus master controller 800 by writing the PDL Address (PDA) to transmit command
FIFO 850. Writing the PDA to transmit command FIFO 850 causes the transmit command count register to increment. When the command count register is greater than zero, bus master controller 800 extracts the next command from the FIFO. Since every PDL must have at least one fragment, bus master controller 800 programs bus interface 820 to read the PDL header and one fragment into transmit execution queue 860. If there is more than one fragment, bus master controller 800 reads the header to determine this fact and adjusts the number of fragments to be transferred to the execution queue. Once the complete PDL has been copied into the command execution queue, that PDA is discarded.
Because the PDL header has the total length of the packet data, bus master controller 800 checks transmit buffer 830 to ensure there is enough room to load another packet. If there is not enough room, bus master controller 800 waits until transmit buffer 830 has enough room for a new packet.
Bus master controller 800 then proceeds to interpret each fragment descriptor and programs bus interface 820 to copy each fragment from host memory 180 to transmit buffer 830. When all fragments have been copied to transmit buffer 830, bus master controller 800 discards the PDL and checks the command queue to determine if another PDL is available. If so, bus master controller 800 executes the new PDL and repeats the operation described above. The operation of bus master controller 800 during receive transfers is similar to the transmit case with a few subtle differences. The host software creates a receive PDL, which describes the buffer in which to transfer the received data. A corresponding PDA is then written to receive command FIFO 852, transferring control to bus master controller 800. Bus master controller 800 transfers the PDL into receive execution queue 862, even if no received packets are available in the receive data buffers. By transferring the PDL before it is actually needed, bus master controller 800 attempts to reduce the latency between receiving a packet and transferring it to host memory.
This procedure repeats until: 1) a receive packet is available or 2) receive execution queue 862 is full. Even with receive execution queue 862 full, receive command FIFO 852 can load PDA/PDC instructions in advance to further reduce transaction latency. When a complete packet is available in receive buffer 835, bus master controller 800 uses the preloaded PDL to determine how to scatter the received packet into host memory 180. Bus master controller 800 programs the necessary transactions into bus interface 820 to copy each fragment of the received packet into host memory 180 as described by the PDL fragment descriptors.
The first word of the received packet in receive buffer 835 contains the total packet length and the receive status. This word is saved by bus master controller 800 and is transferred to the receive PDL in host memory 180 after all packet data has been copied to host memory 180. The receive status field of the
PDL becoming non-zero indicates that bus master controller 800 has transferred control of the buffer and associated PDL back to host computer system 190. This process is repeated until all received packets have been transferred or the PDA receive command FIFO is exhausted. Transmit PDC mode provides increased performance by reducing the number of bus acquisitions required. This mode requires that physical addresses of preallocated buffers be loaded into transmit buffer address table 870 during system initialization. An operation starts with the host software gathering data fragments into a predefined contiguous memory space in one of the preallocated buffers. Once this is complete, the information needed for a PDC instruction is known. The PDC instruction is then created and transferred to the transmit command FIFO 850, and the command count register is incremented. If an instruction is at the beginning of transmit command FIFO 850, it is transferred to transmit execution queue 860. The command FIFO count is then decremented and the execution queue count is incremented. If the execution queue count is non-zero, control passes to bus master controller 800, which begins execution of the instruction. Bus master controller 800 decodes the instruction for the length and base index information while checking the transmit buffer flags for available packet space. With the proper information and status, bus master controller 800 configures bus interface 820 to commence transfer to the data buffer. Once the transfer is complete, the PDC instruction is discarded, the command queue count is decremented, and a complete transfer flag is set.
A receive PDC transaction is initiated by the receive buffer 835. Prior to the data transfer, receive buffer address table 872, receive execution queue 862, and receive command FIFO 852 are preloaded. With all of this information loaded in advance, the data transfer occurs with minimal overhead. This information can also be reloaded during period of non-use, or when empty, to better distribute the workload over time.
When receive buffer 835 indicates a packet has been received from high speed network 160, control is transferred to bus master controller 800. Bus master controller 800 accesses the PDC instruction on top of receive command execution queue 862, and examines the block index and length. Bus master controller 800 uses this information to program bus interface 820 to request a bus transaction. At the same time, bus master controller 800 determines if the initial data packet will fit in the allocated memory space. If the data packet will not fit, bus master controller 800 sends data until the host memory space is filled, sets the receive overflow flag, and signals receive buffer 835 to discard the remaining portion of the data packet. If the memory space is greater than one packet, bus master controller 800 determines if another packet can be transferred, and sends additional packets until the allocated memory space is filled. If the memory space allocated is greater in size than a bus transaction can deliver in one transaction, bus master controller 800 breaks the transfer into multiple transactions. Once the transfer is complete, the command execution count is decremented and another command is loaded. If another packet is ready in receive buffer 835, this cycle is repeated.
The simplest form of transmit occurs during programmed I/O (PIO) mode (PIO) transfers. This mode requires, very little or no action from bus master controller 800. PIO transfers one double word (32 bits) of data at a time directly to transmit buffer 830. Each transfer requires a separate request for interconnect bus 130, thus increasing total bus acquisition latency. Bus master controller 800 is only responsible for routing data to transmit buffer 830. The host software makes certain that packet space is available, and indicates to transmit buffer 830 that the packet transfer is complete by setting appropriate flags.
A receive PIO transfer is initiated by the host software. The host software ensures that a packet is available prior to transferring the packet to host computer system 190. Double word transfers are performed across interconnect bus 130 until all packet data has been transferred. The host software is also responsible for maintaining packet and data integrity.
The packet transfer process minimizes the number of interrupts necessary to interact with bus master controller 800. In many cases, host computer system 190 receives and transfers packets without ever taking an interrupt. The host software can program bus master controller 800 to generate an interrupt in the following cases: after each packet has been transferred to the transmit buffer 830; when the transmit command FIFO is exhausted; or upon any transmit error. Receive interrupts are generated after each receive packet has been transferred to host memory 180, when the receive command FIFO 852 is exhausted, or when receive errors occurs.
In one embodiment, bus master controller 800 implements a "lying send" transmit policy in which a successful packet transmission is signaled to the host software as soon as possible after bus master controller 800 completes the data transfer between host memory 180 and transmit buffer 830 or receive buffer 835. The packet is considered to be "transmitted" the moment bus master controller 800 has a complete copy of the packet. It is the responsibility of protocols above the driver level to ensure that packets are successfully transmitted to remote stations. If a packet is lost during transmission by bus master controller 800, the protocol must recognize that the packet is lost and take a corrective action, such as a retransmission.
Errors such as CRC, runt packet and long packet errors are detected by bus master controller 800 and signaled to the host software by specific bits in the receive PDL header. The header also contains additional information bits pertaining to the inbound packet. Padding of packets that are shorter than the minimum legal length for transmission is the responsibility of software on host computer system 190.
Bus master controller 800 supports the use of PDL and PDC data transfer methods simultaneously. When transferring data, software on host computer system 190 indicates the required transfer method by placing an appropriate command in transmit command FIFO 850 or receive command FIFO 852. For packet transmission using the PDC data transfer method, software on host computer system 190 initiates the process by writing a PDC to the appropriate command FIFO. If host computer system 190 wishes to transmit a packet using conventional bus master DMA, it writes a PDA to the command
FIFO instead.
From the perspective of host computer system 190, intermixing PDC and PDL data transfer methods can be accomplished with just one index variable for the PDC queue, one for the PDL queue and a counter variable reflecting the free space in transmit command FIFO 850. This technique works as long as each queue accommodates at least as many entries as transmit command FIFO 850. Mathematically, we say that
CMD = # of entries that can be accommodated in the command FIFO PDL = # of entries that can be accommodated in the PDL queue
PDC = # of entries that can be accommodated in the PDC queue
FREE = # of unused entries in the command FIFO
CMD = PDL = PDC
CMD = FREE + PDLused + PDCused
The PDL and PDC variables with the "used"' subscript indicate the entries in the respective rings that contain transmit requests.
The system guarantees the relationship of used PDC/PDL entries to the total number of command FIFO entries with a free count variable. Initially, the counter is set to the size of the command FIFO. Each time a transmit request is submitted to bus master controller 800, the command FIFO free count is decremented. When the counter reaches zero, the host software reloads the counter from a bus master status register, thereby obtaining the most recent free count. Since the system guarantees the relationship in the above equation, we see that FREE = CMD - PDLused - PDCused. FIG. 9 is a diagram illustrating two packets packed into a single preallocated buffer within memory 180 in accordance with an aspect of the present invention. FIG. 9 includes a first packet, including flags 901 and length 902. Flags 901 contain status information for the first packet. Length field 902 contains the length of the first packet. The first packet also includes data 903, which contains all of the data associated with the packet. The second packet includes flags 911 and length 912. Flags 911 contain status information for the second packet. Length field 912 contains the length of the second packet. The second packet also includes data 913, which is the data associated with the second packet. As indicated by the ellipsis, additional packets may be included into a single preallocated buffer.
The foregoing description of embodiments of the invention have been presented for purposes of illustration and description only. They are not intended to be exhaustive or to limit the invention to the forms disclosed. Obviously, many modifications and variations will be apparent to practitioners skilled in the art.

Claims

What Is Claimed Is:
1. An apparatus for facilitating communications between a computer system, including a memory and a bus, and a data network, comprising: a bus interface, coupled to the bus, for communicating across the bus; a transmit buffer, coupled to the bus interface, for storing data to be transmitted on the data network; a receive buffer, coupled to the bus interface, for storing data received from the data network; a high speed network interface, coupled to the receive buffer and the transmit buffer, for communicating across the data network; a buffer address table, coupled to the bus interface, for storing at least one address of at least one buffer in the memory of the computer system, the at least one buffer being preallocated by the computer system and used to store data in transit between the computer system and one of the transmit buffer and the receive buffer; and a controller, coupled to the transmit buffer, the receive buffer and the buffer address table, for controlling the transfer of data from the computer system to the transmit buffer, and from the receive buffer to the computer system.
2. The apparatus of claim 1, including a transmit command queue coupled to the bus interface and the controller, for storing transmit commands from the computer system, one type of transmit command stored in the transmit command queue including, a buffer index field, for indexing an entry in the buffer address table containing an address of a buffer in the at least one buffer pre-allocated by the computer system; and a packet length, field for indicating the length of a packet of data to be transferred from the pre-allocated buffer to the transmit buffer.
3. The apparatus of claim 1, including a receive command queue coupled to the bus interface and the controller, for storing receive commands from the computer system, one type of command stored in the receive command queue including, a buffer index field for indexing an entry in the buffer address table containing an address of a buffer from the at least one buffer pre-allocated by the computer system; a packet length field for indicating the length of a packet of data to be transferred from the receive buffer to the pre-allocated buffer.
4. The apparatus of claim 1, including: a transmit command queue coupled to the bus interface and the controller, for storing transmit commands from the computer system; and a transmit execution queue coupled to the bus interface, the controller and the transmit command queue, for storing commands from the transmit command queue, and command blocks from the computer system which are referenced by commands from the transmit command queue.
5. The apparatus of claim 1, including: a receive command queue coupled to the bus interface and the controller, for storing receive commands from the computer system; and a receive execution queue coupled to the bus interface, the controller and the receive command queue, for storing commands from the receive command queue and command blocks from the computer system which are referenced by commands from the receive command queue.
6. The apparatus of claim 1 , wherein the buffer address table includes: a transmit buffer address table for storing at least one address of at least one transmit buffer in the memory of the computer system, the at least one transmit buffer being preallocated by the computer system and used to store data in transit between the computer system and the transmit buffer; and a receive buffer address table for storing at least one address of at least one receive buffer in the memory of the computer system, the at least one receive buffer being preallocated by the computer system and used to store data in transit between the receive buffer and the computer system.
7. The apparatus of claim 1 , wherein the controller includes a mechanism to transfer a plurality of packets in a single operation between the at least one buffer preallocated by the computer system and the transmit buffer.
8. The apparatus of claim 1, wherein the controller includes a mechanism to transfer a plurality of packets in a single operation between the receive buffer and the at least one buffer preallocated by the computer system.
9. The apparatus of claim 1, wherein the apparatus is implemented on a single silicon chip.
10. The apparatus of claim 1 , wherein the bus includes a PCI bus.
11. A method for transferring data between a computer system and a network interface device, the network interface device being coupled to a data network, and the computer system including a memory and a communication channel, the communication channel being coupled to the network interface device, the method comprising: receiving at the network interface device at least one address of a preallocated buffer in the memory; storing in the network interface device the at least one address of the preallocated buffer; receiving a command from the computer system through the communication channel, the command indicating that a transfer between the network interface device and the computer system is to take place; retrieving an address from the at least one address of a preallocated buffer stored in the network interface device; using the address to transfer data from the preallocated buffer in the memory to the network interface device if the command is a transmit command; and using the address to transfer data from the network interface device to the preallocated buffer in the memory if the command is a receive command.
12. The method of claim 11 , wherein the command received from the communication channel includes a length field indicating an amount of data to be transferred between the network interface device and the preallocated buffer, and including using the length field to facilitate the transferring of data between the network interface device and the preallocated buffer in the memory.
13. The method of claim 11 , wherein the command received from the communication channel includes an index for indexing the address from the at least one address of a preallocated buffer, and including using the index to facilitate the retrieving of an address from the at least one address of a preallocated buffer stored in the network interface device.
14. The method of claim 11 , wherein the using of the address to transfer data from the network interface device to the preallocated buffer in the memory if the command is a receive command, includes transferring a plurality of packets in a single operation between the network interface device and the preallocated buffer.
15. The method of claim 11 , wherein the using of the address to transfer data from the preallocated buffer in the memory to the network interface device if the command is a transmit command, includes transferring a plurality of packets in a single operation between the preallocated buffer and the network interface device.
16. A method for transferring data between a computer system and a network interface device, the network interface device being coupled to a data network, the computer system including a memory and a communication channel, the communication channel being coupled to the network interface device, the method comprising: preallocating at least one preallocated buffer in the memory of the computer system; transmitting to the network interface device at least one address of the at least one preallocated buffer, so that the network interface device may store the at least one address locally; transmitting a command from the computer system to the network interface device through the communication channel, the command indicating that a transfer between the network interface device and the computer system is to take place; transferring data from the preallocated buffer in the memory to the network interface device if the command is a transmit command; and transferring data from the network interface device to the preallocated buffer in the memory if the command is a receive command.
17. The method of claim 16, wherein: the transferring of data from the preallocated buffer in the memory to the network interface device is initiated by a DMA command received from the network interface device; and the transferring of data from the network interface device to the preallocated buffer in the memory is initiated by a DMA command received from the network interface device.
18. The method of claim 16, wherein the command transmitted to the network interface device includes a length field indicating an amount of data to be transferred between the network interface device and the preallocated buffer, and including using the length field to facilitate the transferring of data between the network interface device and the preallocated buffer in the memory.
19. The method of claim 16, wherein the command transmitted to the network interface device includes an index for indexing an address from the at least one address of a preallocated buffer stored locally at the network interface device.
20. The method of claim 16, wherein transferring data from the network interface device to the preallocated buffer in the memory if the command is a receive command, includes transferring a plurality of packets in a single operation between the network interface device and the preallocated buffer.
21. The method of claim 16, wherein the transferring of data from the preallocated buffer in the memory to the network interface device if the command is a transmit command, includes transferring a plurality of packets in a single operation between the preallocated buffer and the network interface device.
22. A method for transferring data between a computer system and a network interface device, the network interface device being coupled to a data network, and the computer system including a memory and a communication channel, the communication channel being coupled to the network interface device, the method comprising: preallocating at least one preallocated buffer in the memory of the computer system; assembling a plurality of fragments of data from a plurality of locations in the memory into a buffer in the at least one preallocated buffer; and transferring data from the preallocated buffer in the memory to the network interface device.
23. A method for transferring data between a computer system and a network interface device, the network interface device being coupled to a data network, and the computer system including a memory and a communication channel, the communication channel being coupled to the network interface device, the method comprising: preallocating at least one preallocated buffer in the memory of the computer system; receiving data from the network interface device into the preallocated buffer in the memory, the data including a plurality of fragments of data; and distributing the plurality of fragments to a plurality of locations in the memory.
PCT/US1998/010652 1997-07-24 1998-05-26 System for reducing bus overhead for communications with a network interface WO1999005809A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU75973/98A AU7597398A (en) 1997-07-24 1998-05-26 System for reducing bus overhead for communications with a network interface

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US08/899,435 1997-07-24
US08/899,435 US6397316B2 (en) 1997-07-24 1997-07-24 System for reducing bus overhead for communication with a network interface

Publications (1)

Publication Number Publication Date
WO1999005809A1 true WO1999005809A1 (en) 1999-02-04

Family

ID=25410967

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1998/010652 WO1999005809A1 (en) 1997-07-24 1998-05-26 System for reducing bus overhead for communications with a network interface

Country Status (3)

Country Link
US (1) US6397316B2 (en)
AU (1) AU7597398A (en)
WO (1) WO1999005809A1 (en)

Families Citing this family (61)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040236877A1 (en) * 1997-12-17 2004-11-25 Lee A. Burton Switch/network adapter port incorporating shared memory resources selectively accessible by a direct execution logic element and one or more dense logic devices in a fully buffered dual in-line memory module format (FB-DIMM)
US8135842B1 (en) 1999-08-16 2012-03-13 Nvidia Corporation Internet jack
US6907488B1 (en) * 1999-09-14 2005-06-14 Siemens Aktiengesellschaft Serial data transmission via a bus system
US6721835B1 (en) * 2000-09-19 2004-04-13 Intel Corporation Method and apparatus for minimizing bus contention for I/O controller read operations
US7039717B2 (en) * 2000-11-10 2006-05-02 Nvidia Corporation Internet modem streaming socket method
US20020083150A1 (en) * 2000-12-27 2002-06-27 Linden Minnick Accessing information from memory
US7379475B2 (en) * 2002-01-25 2008-05-27 Nvidia Corporation Communications processor
WO2002084499A1 (en) * 2001-04-11 2002-10-24 Chelsio Communications, Inc. Multi-purpose switching network interface controller
US20020184566A1 (en) * 2001-06-01 2002-12-05 Michael Catherwood Register pointer trap
US20030086485A1 (en) * 2001-11-08 2003-05-08 John Lin Master to multi-slave asynchronous transmit fifo
US7454456B2 (en) * 2002-02-14 2008-11-18 International Business Machines Corporation Apparatus and method of improving network performance using virtual interfaces
US20030208630A1 (en) * 2002-04-19 2003-11-06 Twomey John M. Pre-assembled message buffers for a network processor
US6826634B2 (en) * 2002-06-10 2004-11-30 Sun Microsystems, Inc. Extended message block for network device drivers
EP1552409B1 (en) * 2002-08-19 2013-07-24 Broadcom Corporation One-shot rdma
US6944728B2 (en) * 2002-12-23 2005-09-13 Intel Corporation Interleaving memory access
JP4218034B2 (en) * 2003-01-29 2009-02-04 富士通株式会社 Data communication system, data communication method, and data communication program
US7689738B1 (en) 2003-10-01 2010-03-30 Advanced Micro Devices, Inc. Peripheral devices and methods for transferring incoming data status entries from a peripheral to a host
US6963946B1 (en) 2003-10-01 2005-11-08 Advanced Micro Devices, Inc. Descriptor management systems and methods for transferring data between a host and a peripheral
US7826614B1 (en) 2003-11-05 2010-11-02 Globalfoundries Inc. Methods and apparatus for passing initialization vector information from software to hardware to perform IPsec encryption operation
US20050132081A1 (en) * 2003-12-11 2005-06-16 International Business Machines Corporation Efficient protocol processing to increase bandwidth in interrupt mode
US8065439B1 (en) 2003-12-19 2011-11-22 Nvidia Corporation System and method for using metadata in the context of a transport offload engine
US8176545B1 (en) 2003-12-19 2012-05-08 Nvidia Corporation Integrated policy checking system and method
US7899913B2 (en) * 2003-12-19 2011-03-01 Nvidia Corporation Connection management system and method for a transport offload engine
US8549170B2 (en) * 2003-12-19 2013-10-01 Nvidia Corporation Retransmission system and method for a transport offload engine
US7533154B1 (en) 2004-02-04 2009-05-12 Advanced Micro Devices, Inc. Descriptor management systems and methods for transferring data of multiple priorities between a host and a network
US7249306B2 (en) * 2004-02-20 2007-07-24 Nvidia Corporation System and method for generating 128-bit cyclic redundancy check values with 32-bit granularity
US7206872B2 (en) * 2004-02-20 2007-04-17 Nvidia Corporation System and method for insertion of markers into a data stream
GB0404696D0 (en) * 2004-03-02 2004-04-07 Level 5 Networks Ltd Dual driver interface
US7698413B1 (en) 2004-04-12 2010-04-13 Nvidia Corporation Method and apparatus for accessing and maintaining socket control information for high speed network connections
GB0408876D0 (en) 2004-04-21 2004-05-26 Level 5 Networks Ltd User-level stack
US7945705B1 (en) 2004-05-25 2011-05-17 Chelsio Communications, Inc. Method for using a protocol language to avoid separate channels for control messages involving encapsulated payload data messages
US7957379B2 (en) * 2004-10-19 2011-06-07 Nvidia Corporation System and method for processing RX packets in high speed network applications using an RX FIFO buffer
US7562366B2 (en) * 2005-02-03 2009-07-14 Solarflare Communications, Inc. Transmit completion event batching
US7831749B2 (en) * 2005-02-03 2010-11-09 Solarflare Communications, Inc. Including descriptor queue empty events in completion events
JP2006259898A (en) * 2005-03-15 2006-09-28 Toshiba Corp I/o controller, signal processing system and data transferring method
US7819763B2 (en) * 2005-04-21 2010-10-26 Campbell Steven S Baseball batting trainer
US7496699B2 (en) * 2005-06-17 2009-02-24 Level 5 Networks, Inc. DMA descriptor queue read and cache write pointer arrangement
US7724658B1 (en) 2005-08-31 2010-05-25 Chelsio Communications, Inc. Protocol offload transmit traffic management
US7660306B1 (en) 2006-01-12 2010-02-09 Chelsio Communications, Inc. Virtualizing the operation of intelligent network interface circuitry
US7660264B1 (en) 2005-12-19 2010-02-09 Chelsio Communications, Inc. Method for traffic schedulign in intelligent network interface circuitry
US7616563B1 (en) 2005-08-31 2009-11-10 Chelsio Communications, Inc. Method to implement an L4-L7 switch using split connections and an offloading NIC
US7715436B1 (en) 2005-11-18 2010-05-11 Chelsio Communications, Inc. Method for UDP transmit protocol offload processing with traffic management
US7596644B2 (en) * 2006-01-11 2009-09-29 Solarflare Communications, Inc. Transmit rate pacing system and method
US8660137B2 (en) * 2005-09-29 2014-02-25 Broadcom Israel Research, Ltd. Method and system for quality of service and congestion management for converged network interface devices
US7760733B1 (en) 2005-10-13 2010-07-20 Chelsio Communications, Inc. Filtering ingress packets in network interface circuitry
US7984180B2 (en) * 2005-10-20 2011-07-19 Solarflare Communications, Inc. Hashing algorithm for network receive filtering
US7773630B2 (en) * 2005-11-12 2010-08-10 Liquid Computing Corportation High performance memory based communications interface
US7721023B2 (en) * 2005-11-15 2010-05-18 International Business Machines Corporation I/O address translation method for specifying a relaxed ordering for I/O accesses
CA2655500A1 (en) * 2006-06-19 2007-12-27 Liquid Computing Corporation Token based flow control for data communication
US7873964B2 (en) * 2006-10-30 2011-01-18 Liquid Computing Corporation Kernel functions for inter-processor communications in high performance multi-processor systems
US8935406B1 (en) 2007-04-16 2015-01-13 Chelsio Communications, Inc. Network adaptor configured for connection establishment offload
US8589587B1 (en) 2007-05-11 2013-11-19 Chelsio Communications, Inc. Protocol offload in intelligent network adaptor, including application level signalling
US8060644B1 (en) 2007-05-11 2011-11-15 Chelsio Communications, Inc. Intelligent network adaptor with end-to-end flow control
US7826350B1 (en) 2007-05-11 2010-11-02 Chelsio Communications, Inc. Intelligent network adaptor with adaptive direct data placement scheme
US7831720B1 (en) 2007-05-17 2010-11-09 Chelsio Communications, Inc. Full offload of stateful connections, with partial connection offload
US8412862B2 (en) 2008-12-18 2013-04-02 International Business Machines Corporation Direct memory access transfer efficiency
US8954704B2 (en) 2011-08-12 2015-02-10 International Business Machines Corporation Dynamic network adapter memory resizing and bounding for virtual function translation entry storage
US20130042238A1 (en) 2011-08-12 2013-02-14 International Business Machines Corporation Optimized Virtual Function Translation Entry Memory Caching
US9824004B2 (en) 2013-10-04 2017-11-21 Micron Technology, Inc. Methods and apparatuses for requesting ready status information from a memory
US10108372B2 (en) 2014-01-27 2018-10-23 Micron Technology, Inc. Methods and apparatuses for executing a plurality of queued tasks in a memory
US9454310B2 (en) 2014-02-14 2016-09-27 Micron Technology, Inc. Command queuing

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4887075A (en) * 1987-06-19 1989-12-12 Kabushiki Kaisha Toshiba Local area network system with a multi-computer system coupled thereto and method for controlling the same
US5539736A (en) * 1992-12-31 1996-07-23 Unisys Corporation Method for providing LAN address discovery and terminal emulation for LAN-connected personal computer (PCs) using xerox network system (XNS)
US5586269A (en) * 1989-09-18 1996-12-17 Matsushita Electric Industrial Co., Ltd. Communication control device and method for automatically determining a self-address

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4419728A (en) 1981-06-22 1983-12-06 Bell Telephone Laboratories, Incorporated Channel interface circuit providing virtual channel number translation and direct memory access
US4888691A (en) 1988-03-09 1989-12-19 Prime Computer, Inc. Method for disk I/O transfer
US5315708A (en) * 1990-02-28 1994-05-24 Micro Technology, Inc. Method and apparatus for transferring data through a staging memory
US5522050A (en) 1993-05-28 1996-05-28 International Business Machines Corporation Bus-to-bus bridge for a multiple bus information handling system that optimizes data transfers between a system bus and a peripheral bus
JP3531074B2 (en) * 1994-11-28 2004-05-24 富士通株式会社 Storage device control system
US5893926A (en) * 1995-12-08 1999-04-13 International Business Machines Corporation Data buffering technique in computer system
US5937436A (en) * 1996-07-01 1999-08-10 Sun Microsystems, Inc Network interface circuit including an address translation unit and flush control circuit and method for checking for invalid address translations
US5983332A (en) * 1996-07-01 1999-11-09 Sun Microsystems, Inc. Asynchronous transfer mode (ATM) segmentation and reassembly unit virtual address translation unit architecture

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4887075A (en) * 1987-06-19 1989-12-12 Kabushiki Kaisha Toshiba Local area network system with a multi-computer system coupled thereto and method for controlling the same
US5586269A (en) * 1989-09-18 1996-12-17 Matsushita Electric Industrial Co., Ltd. Communication control device and method for automatically determining a self-address
US5539736A (en) * 1992-12-31 1996-07-23 Unisys Corporation Method for providing LAN address discovery and terminal emulation for LAN-connected personal computer (PCs) using xerox network system (XNS)

Also Published As

Publication number Publication date
US20020009075A1 (en) 2002-01-24
US6397316B2 (en) 2002-05-28
AU7597398A (en) 1999-02-16

Similar Documents

Publication Publication Date Title
US6397316B2 (en) System for reducing bus overhead for communication with a network interface
US6065087A (en) Architecture for a high-performance network/bus multiplexer interconnecting a network and a bus that transport data using multiple protocols
EP1896965B1 (en) Dma descriptor queue read and cache write pointer arrangement
KR100437146B1 (en) Intelligent network interface device and system for accelerating communication
JP2539614B2 (en) Apparatus and method for generating pointer address
US7464199B2 (en) Method, system, and program for handling Input/Output commands
US6526451B2 (en) Method and network device for creating circular queue structures in shared memory
EP1358562B1 (en) Method and apparatus for controlling flow of data between data processing systems via a memory
JP3863912B2 (en) Automatic start device for data transmission
US6219736B1 (en) Universal serial bus (USB) RAM architecture for use with microcomputers via an interface optimized for integrated services device network (ISDN)
US6049842A (en) Efficient data transfer mechanism for input/output devices
US5745684A (en) Apparatus and method for providing a generic interface between a host system and an asynchronous transfer mode core functional block
EP0498201A2 (en) Generic high bandwidth adapter architecture
US20020196806A1 (en) Method and apparatus for multi context direct memory access
US6823403B2 (en) DMA mechanism for high-speed packet bus
US6128674A (en) Method of minimizing host CPU utilization in driving an adapter by residing in system memory a command/status block a soft interrupt block and a status block queue
US20030120835A1 (en) Single descriptor scatter gather data transfer to or from a host processor
EP0752665B1 (en) Method and apparatus for coordinating data transfer between hardware and software
US7761529B2 (en) Method, system, and program for managing memory requests by devices
US5581741A (en) Programmable unit for controlling and interfacing of I/O busses of dissimilar data processing systems
CN110188059B (en) Flow control type FIFO (first in first out) cache device and method for unified configuration of data valid bits
US6856619B1 (en) Computer network controller
EP0772831B1 (en) Bidirectional parallel signal interface
US5923852A (en) Method and system for fast data transmissions in a processing system utilizing interrupts
US6801963B2 (en) Method, system, and program for configuring components on a bus for input/output operations

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GE GH GM GW HU ID IL IS JP KE KG KP KR KZ LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
NENP Non-entry into the national phase

Ref country code: KR

REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: CA