|Publication number||US6195721 B1|
|Application number||US 08/632,183|
|Publication date||Feb 27, 2001|
|Filing date||Apr 15, 1996|
|Priority date||Feb 3, 1994|
|Also published as||DE19501674A1|
|Publication number||08632183, 632183, US 6195721 B1, US 6195721B1, US-B1-6195721, US6195721 B1, US6195721B1|
|Inventors||Steven E. Rice|
|Original Assignee||Tektronix, Inc.|
|Export Citation||BiBTeX, EndNote, RefMan|
|Patent Citations (5), Referenced by (7), Classifications (6), Legal Events (4)|
|External Links: USPTO, USPTO Assignment, Espacenet|
This is a continuation of application Ser. No. 08/190,969, filed Feb. 3, 1994 and now abandoned.
The present invention relates to data transfer management between data processors, and more particularly to inter-processor data transfer management for increasing overall throughput for both processors.
When transferring data from one data processor to another, each data processor having its own bus, delays in the transfer are unpredictable in that the bus for the responding data processor may not be immediately available to the requesting data processor. This problem is increased where the two processors run asynchronously, and is even worse where the processors run at different clock rates. The stalling of the requesting processor while waiting for the availability of the responding processor is further increased where large amounts of data are to be transferred.
Accordingly the present invention provides a system for inter-processor data transfer management that uses an interface circuit between the processors. The interface circuit receives a data transfer request from a requesting processor and acknowledges the request so that the requesting processor may continue to run without waiting for completion of the request. The interface circuit then arbitrates for the bus of the responding processor and completes the data transfer between the responding processor and the interface circuit. At the completion of the transfer the interface circuit advises the requesting processor that it is ready for another request, and upon receipt of another request the interface circuit completes the prior request with the requesting processor.
The objects, advantages and novel features of the present invention are apparent from the following detailed description when read in conjunction with the appended claims and attached drawing.
FIG. 1 is a block diagram view of a system for inter-processor data transfer management according to the present invention.
FIG. 2 is a block diagram view of an interface circuit for the system of FIG. 1 according to the present invention.
Referring now to FIG. 1 a first processor 10 has an associated bus 12 for performing a first set of program applications. A second processor 20 has an associated bus 22 for performing a second set of program applications. Coupled between the buses 12, 22 is an interface circuit 30. Address, data and control lines that make up the first bus 12 are coupled to one side of the interface circuit 30, and address, data and control lines that make up the second bus 22 are coupled to the other side of the interface circuit.
As shown in FIG. 2 the interface circuit 30 has an address buffer 32 coupled between the address lines of the buses 12, 22 for storing and transferring an address from one processor 10 to the other 20, a data buffer 34, which may be a FIFO buffer for multi-word burst transfers, coupled between the data lines of the buses, and a control logic circuit 36 coupled between the control lines of the buses 12, 22 for communicating with the processors 10, 20 and for controlling the address and data buffers 32, 34. The address buffer 32 also may be a FIFO in a system where burst accesses are not supported, as considerable advantage is often gained by performing multiple data transfers per bus arbitration. A FIFO as the address buffer 32 has the additional advantage of permitting scatter/gather transfers, i.e., sequential data transferred to/from non-sequential addresses. Alternatively the address buffer 32 may contain both an address register and an address generator, allowing multiple data transfers without the hardware overhead of a FIFO.
The requesting processor 10 places a request on its bus 12 that includes a destination address and data (if the request is a write request) for the responding processor bus 22. The control logic circuit 36 provides an acknowledge response to the requesting processor 10 and stores the address and data in the respective address and data buffers 32, 34, or provides the acknowledge response together with the data from the data buffer if the request is a read request. The requesting processor 10, upon receipt of the acknowledge response, continues to run its applications rather than wait for completion of the request.
Once the request from the requesting processor 10 is acknowledged by the interface circuit 30, the control logic circuit 36 then arbitrates for control of the responding processor bus 22 by sending a bus request to the responding processor 20. Once the responding processor 20 is ready to accept the request from the interface circuit 30, a bus grant signal is returned to the control logic circuit 36. The address is transferred from the address buffer 32 to the responding processor bus 22. The data also is transferred from the data buffer 34 if the request is a write request. For a read operation the responding processor bus 22 transfers the data at the address from the address buffer 32 to the data buffer 34. At completion of the transfer the control logic circuit 36 releases the responding processor bus 22 and alerts the requesting processor 10 that the interface circuit 30 is ready for another request.
The interface circuit 30 serves to decouple the transfers of data between the processors 10, 20 by providing storage and control capabilities. Full benefit is gained when the requesting processor 10 pipelines the transfers and allows the interface circuit 30 to control the timing of the transfer. In more detail the requesting processor 10 accesses the interface circuit 30 with a transfer request, indicating a desire to either read data from or write data to the memory of the other processor 20. The interface circuit 30 latches the address in the address buffer 32 and the data, if a write request, in the data buffer 34, and terminates the cycle with an acknowledgement to the requesting processor 10. The control logic circuit 36 asserts an access request to the arbitration logic of the other processor 20. When the access is granted, the interface circuit 30 performs the requested operation and relinquishes the responding processor's bus 22. The control logic circuit 36 then asserts an interrupt or direct memory access (DMA) request to the requesting processor 10. The requesting processor 10 receives the data (on a read) or supplies the next data word to transfer (on a write), along with the address of the next location to be accessed. Transfers continue with minimal throughput degradation for both processors. If the operation was a read request, the requesting processor 10 transmits a dummy request to the control logic circuit 36 to transfer the last data for the request from the data buffer 34 to the requesting processor.
For one particular implementation a Motorola 96002 digital signal processor (DSP) has two built-in DMA controllers and runs at 33 MHz. The DMA controllers are capable of operating concurrently with other processing. When initiating a DMA transfer from main memory of another computer 20, such as a Motorola 68040 computer processing unit (CPU) running at 25 MHz, to any memory space of the DSP where the DSP is the requesting computer 10, a number of steps are taken:
1. The source address in CPU memory is set to the address of the first location to be transferred.
2. The destination address in DSP memory is set to one less than the address of the first location to be filled with valid data.
3. The transfer count is set to one more than the number of data words to be transferred.
4. The DMA controller mode is set such that it initiates a transfer of one word of a block of data to be moved upon signal from the interface circuit 30.
5. The interface circuit is signalled to assert the first request, beginning the transfer sequence. The first transfer causes the interface circuit to latch the CPU memory source address and return a data word to the DMA which is ignored, which is the reason the destination address is set to one less than the address of the first location to be filled with valid data.
6. The interface circuit then issues an access request to CPU memory. When access is granted, the latched address is used to retrieve one word (or four words in burst mode) from the CPU memory. The data is latched in the data buffer, and the interface circuit signals the DMA controller in the DSP to assert the next request.
7. For the rest of the DMA transfer the data word corresponding to the previous memory address is retrieved from the data buffer, the next address is latched by the address buffer, and the control logic circuit initiates an access to retrieve the data word. For a transfer of 100 data words from the CPU memory to the DSP memory, 101 requests are initiated by the DMA controller since the first data word is ignored and the last source address for the CPU is a dummy.
When going in the other direction it is not necessary to transfer an extra data word. For DMA transfers from the memory space of the DSP to CPU memory, the following steps are taken:
1. The source address in DSP memory is set to the address of the first location to be transferred.
2. The destination address in CPU memory is set to the address of the first location to be filled with valid data.
3. The transfer count is set to the number of data words to be transferred.
4. The DMA controller mode is set such that it initiates a transfer of one word (or four words if in burst mode) of the block to be moved upon signal.
5. The interface circuit is signalled to assert the first request, beginning the transfer sequence. The first transfer causes the interface circuit to latch the CPU memory destination address and the data word(s) from the DMA. The control logic circuit then issues an access request to the CPU. When access is granted, the latched address and data are used to store the data word(s) into CPU memory, and the control logic circuit signals the DMA controller to assert the next request.
The 96002 has an address space four times larger than that of the 68040, so the DSP address space may be segmented into four address spaces, one of which may be used as the DMA space. Four words of data may be buffered and moved to and from CPU main memory in four word bursts, as indicated above, while presenting a single word per transfer interface to the DSP. This reduces the CPU bus loading caused by DMA transfers to and from the DSP by reducing the number of arbitration requests by about seventy-five percent (75%), i.e., one arbitration request per four data words transferred instead of four arbitration requests. Also the data transfer time, i.e., the actual CPU bus access time required to transfer four data words, is reduced by approximately fifty percent (50%) in the burst transfer mode. In the burst transfer mode the transfers are on a four-word boundary. Also to ensure complete transfer of the data the blocks to transfer are a multiple of four words long.
If the requesting processor 10 has an address space smaller than that of the responding processor, all that is required is the addition of a latch to hold the required upper address bits for the transfer. The latch could be either external or internal to the interface circuit 30 when implemented as an application specific integrated circuit (ASIC).
Although the particular implementation described above uses a resident DMA controller in the requesting processor 10, transfers may also be controlled by an interrupt process, as is well known in the art. Further either processor 10, 20 may be the requesting or responding processor for full bi-directional operation. For simultaneous bi-directional operation the data and address buffers 32, 34 are multiple parallel buffers that provide two separate paths so both processors 10, 20 may access the interface circuit 30 via the separate paths. Additionally the interface circuit 30 could arbitrate for both buses at the same time so that transfers routinely occur in both directions at the same time. Since the data is stored internally in the data buffer 34, only one port needs to be active at a time for each transfer.
Thus the present invention provides an inter-processor data transfer management system that allows data transfer between processors without stalling the requesting processor by using an interface circuit which has storage and control capabilities for decoupling the transfers.
|Cited Patent||Filing date||Publication date||Applicant||Title|
|US4926375 *||May 5, 1987||May 15, 1990||Ge Fanuc Automation North America, Inc.||Multiple nodes broadcast communication method with receiver identification by bit position in transferred massage|
|US5045998 *||Jun 1, 1989||Sep 3, 1991||International Business Machines Corporation||Method and apparatus for selectively posting write cycles using the 82385 cache controller|
|US5309567 *||Jan 24, 1992||May 3, 1994||C-Cube Microsystems||Structure and method for an asynchronous communication protocol between master and slave processors|
|US5327570 *||Jul 22, 1991||Jul 5, 1994||International Business Machines Corporation||Multiprocessor system having local write cache within each data processor node|
|US5410654 *||Jul 22, 1991||Apr 25, 1995||International Business Machines Corporation||Interface with address decoder for selectively generating first and second address and control signals respectively in response to received address and control signals|
|Citing Patent||Filing date||Publication date||Applicant||Title|
|US6708246 *||Oct 20, 2000||Mar 16, 2004||Renesas Technology Corp.||Signal processing device with bus ownership control function|
|US7590764 *||Jul 29, 2003||Sep 15, 2009||Micron Technology, Inc.||System and method for dynamic buffer allocation|
|US8209478 *||Jun 26, 2012||Himax Technologies Limited||Single-port SRAM and method of accessing the same|
|US20020099890 *||Nov 20, 2001||Jul 25, 2002||Stmicroelectronics S.A.||Apparatus and method for processing interruptions in a data transmission over a bus|
|US20040054823 *||Jul 29, 2003||Mar 18, 2004||Rooney Jeffrey Jay||Dynamic buffer allocation for a computer system|
|US20100228910 *||Sep 9, 2010||Chun-Yu Chiu||Single-Port SRAM and Method of Accessing the Same|
|US20140229645 *||Feb 10, 2013||Aug 14, 2014||Mellanox Technologies Ltd.||Credit-based low-latency arbitration with data transfer|
|U.S. Classification||710/310, 709/201|
|International Classification||G06F15/167, G06F13/40|
|Dec 19, 2000||AS||Assignment|
Owner name: TEKTRONIX, INC., OREGON
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:RICE, STEVEN E.;REEL/FRAME:011407/0985
Effective date: 19940128
|Jul 14, 2004||FPAY||Fee payment|
Year of fee payment: 4
|Jul 17, 2008||FPAY||Fee payment|
Year of fee payment: 8
|Aug 23, 2012||FPAY||Fee payment|
Year of fee payment: 12