|Publication number||USRE41413 E1|
|Application number||US 11/351,220|
|Publication date||Jul 6, 2010|
|Filing date||Feb 10, 2006|
|Priority date||Jul 1, 1997|
|Also published as||EP0935797A1, US6118462, US6690379, US20010040580, US20030103056, US20120007873, WO1999013451A1|
|Publication number||11351220, 351220, US RE41413 E1, US RE41413E1, US-E1-RE41413, USRE41413 E1, USRE41413E1|
|Original Assignee||Neal Margulis|
|Export Citation||BiBTeX, EndNote, RefMan|
|Patent Citations (62), Non-Patent Citations (30), Classifications (34), Legal Events (3)|
|External Links: USPTO, USPTO Assignment, Espacenet|
More than one reissue application has been filed for the reissue of U.S. Pat. No. 6,690,379. The reissue applications are application Ser. No. 11/351,220, filed Feb. 2, 2006 (the present application), and reissue application Ser. No. 12/562,983, filed Sep. 18, 2009, which is a continuation reissue of present reissue application Ser. No. 11/351,220.
This patent application is a continuation application of U.S. patent application Ser. No. 09/541,413, filed on Mar. 31, 2000 now abandoned entitled “Computer System Controller Having Internal Memory and External Memory Control,” naming Neal Margulis as inventor, which is a continuation application of U.S. patent application Ser. No. 08/926,666, filed on Sep. 9, 1997, now U.S. Pat. No. 6,118,462, which resulted from a continuation-in-part application of U.S. patent application Ser. No. 08/886,237, filed on Jul. 1, 1997, entitled “Computer System Having a Common Display Memory and Main Memory,” naming Neal Margulis as inventor, now U.S. Pat. No. 6,057,862, the disclosures of which are incorporated by reference.
1. Field of the Invention
The present invention relates generally to a memory architecture for computer systems and more particularly to a memory subsystem comprised of internal memory and control for external memory.
2. Discussion of Prior Art
A typical personal computer system has a central processing unit (CPU) with an external main memory and has a graphics display subsystem with its own memory subsystem. Part of this memory subsystem is a frame buffer that provides the output to the display, and part of this subsystem may be used for off-screen operations. However, the graphics display subsystem memory and the main system's pool of memory do not share data efficiently or move data efficiently from one memory subsystem to the other.
Another typical personal computer system has a single memory subsystem for both the CPU and the graphics subsystem. The performance of this type of computer system is lower than that of computer systems that have separate memory subsystems for the graphics display subsystem and for the CPU. Even though these single external memory systems can support a cache memory for the CPU, their overall performance is still lower because the memory bandwidth is shared between the graphics and CPU subsystems. These computer systems are very limited in their ability to achieve good performance for both the CPU and graphics subsystems. In order to be cost effective, these systems typically use a lower cost main memory that is not optimized for the special performance needs of graphics operations.
For systems that use a single external memory subsystem to perform all of their display refresh and drawing operations, performance is compromised by the memory bandwidth for these operations being shared with the memory bandwidth for the CPU. “Refresh” is the general term for taking the information contained in a frame buffer memory and sequentially transferring the information by rows to a palette digital-to-analog converter (DAC) to be displayed on an output device such as a monitor, TV or flat panel display. The frame buffer's entire contents needs to be transferred to the output device continuously for the displayed image to be visible. In the case of a monitor, this refresh is performed typically between 75 and 95 times per second. For high-resolution color systems, the refresh process consumes an appreciable portion of the total bandwidth available from the memory.
In addition to the refresh bandwidth, the graphics subsystem performs drawing operations that also consume an appreciable amount of bandwidth. In the case of 2-D graphics acceleration the drawing operations include Bit-BLt (Bit Block Transfers), line drawing and other operations that use the same common pool of memory.
Intel and other companies in the PC industry have designed an advanced peripheral port (AGP) bus and an associated system architecture for combining graphics and chipsets. AGP is a second private bus between the main memory controller chipset and the graphics display subsystems. AGP and the associated system architecture allow the storage of 3-D texture memory in the main memory that can be accessed by the graphics subsystem. This is one limited use of shared main memory for a graphics function. However, because there is a single bus between the graphics subsystem and the main memory controller chipset, this bus limits the system performance. This single bus is shared by all CPU commands to the graphics controller, any CPU direct reads or writes of display data, all texture fetches from main memory and any other transfers of display information that is generated or received from the CPU or I/O subsystems (i.e. video data from a capture chip or a decoder).
AGP is designed to overcome the above-described performance limitations from using the main memory subsystem for display refresh and drawing operations. AGP systems overcome these limitations by a brute force requirement that the graphics subsystem on the AGP bus have a separate frame buffer memory subsystem for screen refresh and drawing operations. Using frame buffer memory is a good solution for eliminating the performance penalties associated with drawing and refresh operations. Meanwhile, as a frame buffer is always required, AGP systems do not allow for screen refresh to be performed from the main system memory. This does not allow the optimization of refreshing all or part of the screen from main memory.
Additionally, the drawing operations must be performed in the graphics display memory and are therefore performed by the graphics subsystem controller. Also limiting the dedicated frame buffer system flexibility, the graphics subsystem controller can not efficiently draw into the main system memory.
Separating the frame buffer memory from the main system memory duplicates the input/output (I/O) system data. For example, this occurs in a system where video data enters the system over an I/O bus through a system controller and then is stored in the main system memory. If the data is displayed, it needs to be copied into the frame buffer. This creates a second copy of the data, transfer of which requires additional bandwidth.
Another alternative is to have a peripheral bus associated with the graphics controller where the I/O data is transferred to the frame buffer. While this allows display of the data without additional transfers over a system bus, the data remains local to the display subsystem. The CPU or main I/O systems can not access the data without using a system bus. For systems with a shared memory subsystem, the I/O data enters a shared memory region. It is then available to either the display subsystem or the CPU.
What is needed is an integrated system controller that supports a memory architecture which combines internal and external memory in which common memory can be used for display memory and main memory, without having inadequate bandwidth access to the common memory to impair performance.
The present invention resides in a memory architecture having one or more high bandwidth memory subsystems where some of the memory subsystems are external to the controller and some of the memory subsystems are internal. Each of the high bandwidth memory subsystems is shared and connected over a plurality of buses to a display subsystem, a central processing unit (CPU) subsystem, input/output (I/O) buses and other controllers. A display subsystem is configured to receive various video and graphics type data from the high-speed memory subsystems and to process it for display refresh. Additional buffers and caches are used for the subsystems to optimize system performance. The display refresh path includes processing of the data from the memory subsystem for output to the display, where the data enters the shared memory subsystems from an I/O subsystem, from the CPU subsystem or from the graphics subsystem.
The present invention resides in a memory architecture having one or more shared high-bandwidth memory subsystems that are both internal and external to the system controller. Each of the high-bandwidth memory subsystems is connected over a plurality of buses to the display subsystem, the central processing unit (CPU) subsystem, the input/output (I/O) buses and other controllers. The display subsystem is configured to receive various video and graphics data types for processing and display refresh from the high-speed shared memory. Additional buffers and caches are used for the subsystems to optimize the system.
A low cost multimedia personal computer system is achieved by optimizing a system with respect to memory bandwidth to share one or more common memory subsystems for aspects of display memory and main system memory. The
There are two data buses in the
This implementation shows a shared address and control (A&C) bus 424. Arbitration and control unit 408 is responsible for responding to requests from CPU subsystem controller 402, graphics drawing and display subsystem 404 and peripheral and I/O control unit 440, and scheduling their memory accesses. Arbitration and control unit 408 includes a set of configuration and state registers (not shown) that processes requests intelligently. Additionally, the request protocol specifies the amount of data required by the requester. Arbitration and control unit 408 processes the requests with the objectives of maximizing concurrency of the two data buses, optimizing for the length of the transfers and assuring that the latency for requests does not compromise system performance.
To meet these conflicting objectives, arbitration and control unit 408 tracks the state of the memory channels as well as the latency of the requests. Arbitration and control unit 408 breaks a single request from a subsystem into multiple requests to the memory channels. By doing this, the latency and memory bursts are optimized. Also, the requesting subsystems request very long bursts of data without concern for unbalancing the system throughput and without having to reuse the A&C bus 424.
The integrated processor 510 included in
A crossbar switch can be designed to be bi-directional or unidirectional. In the case of unidirectional switches, both a set of read switches and a set of write switches may be needed. Not all switches in a system need to be as complex as a crossbar switch. Much simpler switches and MUX based switches can be used and still achieve good overall performance. In the simplest case, a switch may be a connection point between a subsystem channel and a memory channel. A simpler switch architecture is particularly useful for the multi-bank and multiple row buffer configurations shown later in
For example, if subsystem A is accessing channel MC3, the switch labeled S3A is active. Concurrently, subsystem B may be accessing channel MC4 with switch S4B closed, and subsystem C may access channel MC1 with switch S1C, while subsystem D accesses channel MC2 through switch S2D. If a subsystem needs to connect to a memory channel that is in use by another subsystem, it is blocked and must wait.
The configuration registers 802 are set to reflect the nature of the subsystem controller. These characteristics can include the burst lengths, the latency tolerance and other addressing information. Configuration information is also required for the memory channel information. The status registers 804 track both pending requests from the switch subsystem controllers 808, 810 and 812 and the status of the memory channels 818, 820, 822 and 824.
Arbitration controller unit 814 receives memory requests from each of subsystems 808, 810 and 812. By using the configuration register 802 information as the status information, arbitration controller unit 814 acknowledges requests at appropriate times and signals memory channel request unit 806 and switch subsystem controllers 808, 810 and 812 to cycle through the memory requests.
Arbitration controller unit 814 ensures that the subsystems that have maximum latency tolerances are not compromised. Additionally, arbitration controller unit 814 maximizes the total bandwidth of the system to achieve the best performance. In some cases bursts are not broken up so that they can complete the use of a memory channel. In other cases, a single subsystem controller request is broken up and filled with multiple memory channel accesses.
The MSC 960 must handle various size data requests. The IRAM bank width can be independent from the width of the IMC data path 902. The MSC 960 uses the MUX 910 logic to ensure that the appropriate data is transferred in the appropriate order to the IMC 902. This is an effective means for the MSC 960 to take advantage of the wide data paths available from IRAM banks 920 through 950. Multiple data transfers on the IMC 902 are accommodated by proportionally fewer IRAM bank accesses.
Additionally, the configuration of the memory bank allows fast sequential accesses. A bank of memory is defined as a row-column array of storage cells. Typically in DRAM, an entire row of the array is enabled with a single access. This allows any data within that row to be acessed quickly. If an access to a different row address within the same bank of IRAM occurs, a “pre-charge” penalty is incurred and the access is delayed. To avoid the likelihood of this occurrence, this example shows multiple banks employed in the memory subsystem.
While an internal memory subsystem can be designed as a singular bank, there are performance advantages to using multiple banks of memory.
In the case of DRAM, the IRAM banks (920 through 950) are interleaved on a bank basis both to take advantage of the page mode access within a bank and to hide the page miss penalty by changing banks when crossing a page boundary. The memory sequencer for the IRAM subsystem manages the banks to maximize bandwidth based on the memory access patterns. This involves either pre-charging the DRAM bank whenever a new bank is accessed or keeping a page active in each bank of memory.
The data bus 902 may be connected directly to a processing or IO subsystem data bus instead of going through an additional switch. This saves an additional level of switching. In order to allow the IRAM bank data to be shared in this type of configuration, the IRAM banks can also be connected to additional MUXs (not shown). Each additional MUX connects the IRAM banks to a separate processing or I/O subsystem data bus.
When the MSC 1022 receives a new read request, it accesses the IDRAM array 1002 storing the requested data. The complete row of data from the IDRAM array is then transferred to a row buffer and then from the row buffer through optional MUX 1020 onto line 1026 to the IMC. In the case of a request for a series of data, the row buffer data is routed so that the request is filled in a burst manner on the IMC 1026. All of the row data remains in the row buffer.
The MSC 1022 fulfills subsequent data requests to different rows in the same manner without affecting the data stored in the other row buffers. These requests can be to the same or different IMCs. When a data read occurs to an address where the corresponding data already residues in the row buffer, the row buffer fulfills the read request directly without needing an additional IDRAM bank 1002 access. Having multiple rows of data in the row buffers for fast access achieves very high performance for typical access patterns to a memory subsystem.
MSC 1022 handles the control of writes to the memory subsystem in a similar manner. One skilled in the art of cache controller design is familiar with the following complications that result from having the IDRAM data temporarily cached in row buffers 1004 through 1018. If a data write occurs to a row of data that is already present in a row buffer, the write is simply done to the row buffer, and that row buffer is tagged as having the most recent copy of the data. This tag, referred to as “dirty,” is significant as it requires that data be stored to the IDRAM array at some time and any subsequent reads to that row of data must be fulfilled with the most recent “dirty” data and not the “stale” data existing in the array.
There are further implementation tradeoffs when dirty data is written back to the array. Similarly, there is a need to design implementation tradeoffs for data writes to addresses not currently contained within a row buffer. The primary options are “allocation on write” where the complete row is read out of the array so that writes can occur to the row buffer. A simpler implementation simply “writes through” data writes to the IDRAM bank 1002 for locations that are not currently present in a row buffer.
An implementation detail for the allocation of row buffers corresponding to the memory locations is the tradeoff between performance and simplicity of implementation. In the simplest case, a row buffer is “direct mapped” to a fixed number of potential memory array rows. In the most flexible and most complex case, any row buffer corresponds to any IDRAM row and is said to be “fully associative.” Intermediate complexity of design of a “set associative” mapping is possible where more than one row buffer corresponds to each fixed set of IDRAM rows.
Another complexity results from the set and fully associative mapping schemes where a row buffer replacement algorithm must be implemented. Since more than one row buffer can contain the data for a given row access, an algorithm is needed to choose which row buffer to replace for the new access. The preferred embodiment employs a type of “Least Recently Used” (LRU) replacement algorithm.
Designing a single bank of IDRAM 1002 may have some advantages as compared to a multi-bank design for area and power savings. To achieve greater performance from a single bank IDRAM 1002, temporary row buffers 1004 through 1018 are used to store memory reads and writes. These temporary row buffers 1004 through 1018 multi-port the memory bank.
Multi-porting is an extension of the dual-port approach that has long been used in specialty video RAMs (VRAMs). VRAMs include both a random access port and a serial access port. The serial access port uses data from a serial access memory (SAM) that is loaded in a single cycle from a RAM array. The VRAMs allow simultaneously accessing both the SAM data and the random data. VRAMs also allow data to be input serially into the SAM and then transferred in a single cycle into the main RAM.
The row buffers accomplish the same general function as a SAM does. The row buffers, like a SAM register, allow the contents an entire very wide row of RAM to be transferred in a single cycle into the row buffer. Unlike serial accesses to the SAM in a VRAM system, with the row buffers on-chip, the data path to the internal memory channel can be arbitrarily wide. Additionally, data steering logic is included in the data path so that data from the DRAM bank is transferred on the most optimal data lines of the IMC 1026.
Different subsystems use row buffers differently. For a function such as display refresh, the refresh controller makes a memory address request. The corresponding row of memory is transferred into a row buffer. The memory controller transfers the requested amount of data from the row buffer to the refresh controller. The memory transfer typically requires less data than the complete row buffer contents. When the refresh controller performs the next sequential request, the data is already in the row buffer ready to transfer.
The CPU subsystem in a non-graphics application performs forms a cache line fill from a memory address corresponding to an IDRAM bank. The IDRAM row is transferred to the row buffer and the cache line data is transferred through to the cache data channel. The row buffer is presumably larger than the cache-line size such that any additional cache line fills corresponding to the same row buffer address range are filled without needing to re-access the IDRAM bank.
Furthermore, multiple row buffers contain valid data at a given time. Accesses to different row buffers occur sequentially without losing the ability to return to active row buffers that contain valid data. Using the two examples above, a partial read of row buffer 1 (RB1) occurs on line 1026 to the IMC as part of screen refresh. Next the CPU performs a cache line fill over the IMC 1026 from RB2. The refresh then continues from RB1 as the next burst of transfers over the IMC 1026.
The IMC data buses 1026-1032 could be connected directly to a processing or I/O subsystem data bus instead of going through an additional switch. This saves an additional level of switching. Similarly, the row buffer data lines 1040-1054 could optionally be connected directly to a processing or subsystem data bus instead of going through the optional MUX 1020. Alternatively row buffer data lines 1040-1054 could be directly connected to the system data switch instead of going through the optional MUX 1020.
The improvement over the previous embodiments is the hybrid approach of combining multiple IDRAM banks each with a multitude of row buffers. As shown in
Also shown within each IDRAM memory subsystem 1102, 1104 is an optional data manipulator (DM) e.g., 1160. The data manipulator 1160 contains storage elements that act as a second level of caching, as well as a simple Arithmetic Logic Unit (ALU), and is managed by the MSC 1130. The advantage of having the data manipulator 1160 within the IDRAM memory subsystem 1102 is the higher performance that is achieved. The data manipulator 1160 is the full width of the row buffers, or wider, without the need to increase the width of the IMC 1112, 1114 or the data switch 1110, and operates at data rates higher than the rates of data passing through the data switch 1110. This local optimization improves the performance for operations that occur within an IDRAM bank. Any operations that involve data in more than one IDRAM bank still need to utilize the data switch 1110 data paths.
The MSC 1130 can control the DM 1160 such that operations over the IMC 1112 that would be read-modify-write operations can be satisfied within the IDRAM memory subsystem with a simple write operation. U.S. Pat. No. 5,544,306, which is incorporated by reference, describes techniques for achieving this, where a Frame Buffer Dynamic Random Access Memory converts read-modify-write operations such as Z-Buffer compare and red-blue-green (RGB) alpha blending into a write-only operation.
The GDPs operate in parallel to manipulate image data for display. Each GDP may have local registers, buffers and cache memory. The GDPs can each operate on different IRAM subsystem data, or multiple GDPs may operate on data in one IRAM subsystem. The GDPs may each be responsible for the complete graphics pipeline of operations such as transform, lighting, set-up and rendering. Alternatively, each GDP may perform one of the stages of the graphics pipeline. Ideally the GDPs will be flexible enough that, depending on the particular application being performed, the system will operate in the most efficient configuration.
In the case where multiple GDPs are rendering data, the rendered data is not always in a regular structure representing a frame buffer. The Display Processor Subsystem (DPS) can be provided with the mapping information and reconstruct the display information from the various stored rendering information. The DPS reconstructs the image scan line-by-scan line so that the data can be sent out and displayed properly. The DPS also performs operations such as scaling and filtering that are better suited to being performed in this back end path than by the GDPs.
The path to the main memory data switch may be used by both the GDPs and the DPS. In the case of the GDPs, large textures or other elements requiring large amounts of storage can be read in by the GDPs and processed. In some cases the raw or processed data is cached in the IRAM subsystems or the data is simply used and only the resulting data stored locally. The display processor subsystem utilizes the path to main memory for constructing the output display. The output consists of data, from both the GDPs as well as from other elements, such as video data that are stored in the main system memory. The DPS constructs the output scan-line by scan-line from the data stored in either IRAM subsystems or main memory.
The architecture shown in
An enhanced system with a common display memory and main memory preferably includes separate controls for each memory subsystem, an arbitration controller that takes the requests from multiple processor or peripheral subsystems, and a memory data path so that by a memory subsystem provides memory data to a processor or peripheral subsystem without preventing additional processor peripheral subsystems from accessing other memory subsystems.
An enhanced system can include a partial drawing buffer where a graphics engine can write a portion of the display output data and transfer the portion of the display output data to a common memory subsystem for use during subsequent display updates after a display frame has been processed. An enhanced system preferably includes a complete drawing buffer where a graphics engine can store the complete display output data and transfer the display output data for subsequent display updates.
An enhanced system preferably includes a graphics controller to perform 3-D graphics functions, a texture cache to provide data for the graphics controller, and an order buffer where the graphics controller can fetch data.
For a 3-D graphics controller, one of the key aspects of 3-D processing is determining which objects, and subsequently which pixels of which objects, are visible for a given frame. Many objects of a given 3-D image may be occluded from a viewpoint by another object's pixels. To insure that the pixels from the proper object are in front and properly displayed, the 3-D system includes what is generally referred to as a Z-buffer or an order buffer. The order buffer is used to determine if the triangles or pixels of a new object are to be displayed for a given frame based on their position relative to the viewpoint. The earlier in a graphics pipeline that the ordering is performed, the less computation is needed to render pixels that will not ultimately be visible for a scene. However, it is sometimes just simpler to perform the complete rendering of a triangle and then on a pixel-by-pixel basis decide whether or not to update the display based on the value in the order buffer.
For systems with a single 3-D controller, accessing the order buffer is a key bandwidth consideration. Therefore, as with textures, it is advantageous to have a cache or buffer for the ordering information. For systems with multiple 3-D controllers, each 3-D controller may be permitted to operate asynchronously to balance the computation load and increase the system throughput. An order buffer that is accessible to each of the controllers allows asynchronous processing to occur and still be sure that the proper pixels from each object will end up in view.
Those skilled in the art will recognize that this invention can be implemented with additional subsystems connected in series or in parallel to the disclosed subsystems, depending on the application. Therefore, the present invention is limited only by the following claims.
|Cited Patent||Filing date||Publication date||Applicant||Title|
|US4649516||Jun 1, 1984||Mar 10, 1987||International Business Machines Corp.||Dynamic row buffer circuit for DRAM|
|US4791639 *||Aug 6, 1986||Dec 13, 1988||Mitel Corporation||Communications switching system|
|US5142638||Apr 8, 1991||Aug 25, 1992||Cray Research, Inc.||Apparatus for sharing memory in a multiprocessor system|
|US5182801||Jun 9, 1989||Jan 26, 1993||Digital Equipment Corporation||Apparatus and method for providing fast data transfer between multiple devices through dynamic reconfiguration of the memory space of the devices|
|US5243447||Jun 19, 1992||Sep 7, 1993||Intel Corporation||Enhanced single frame buffer display system|
|US5335321||Jun 19, 1992||Aug 2, 1994||Intel Corporation||Scalable multimedia platform architecture|
|US5450355||Feb 5, 1993||Sep 12, 1995||Micron Semiconductor, Inc.||Multi-port memory device|
|US5450542 *||Nov 30, 1993||Sep 12, 1995||Vlsi Technology, Inc.||Bus interface with graphics and system paths for an integrated memory system|
|US5454107||Nov 30, 1993||Sep 26, 1995||Vlsi Technologies||Cache memory support in an integrated memory system|
|US5459835||Dec 28, 1992||Oct 17, 1995||3D Labs Ltd.||Graphics rendering systems|
|US5471672||Sep 2, 1994||Nov 28, 1995||Intel Corporation||Method for implementing a high speed computer graphics bus|
|US5473370 *||Dec 9, 1993||Dec 5, 1995||Fuji Photo Film Co., Ltd.||Electronic still-video camera, and playback apparatus thereof being capable of storing image data when the storage capacity of a memory card is exceeded|
|US5490112||Oct 14, 1994||Feb 6, 1996||Micron Technology, Inc.||Multi-port memory device with multiple sets of columns|
|US5544306||May 3, 1994||Aug 6, 1996||Sun Microsystems, Inc.||Flexible dram access in a frame buffer memory and system|
|US5572655||Jan 27, 1995||Nov 5, 1996||Lsi Logic Corporation||High-performance integrated bit-mapped graphics controller|
|US5574847||Sep 29, 1993||Nov 12, 1996||Evans & Sutherland Computer Corporation||Computer graphics parallel system with temporal priority|
|US5598542 *||Aug 8, 1994||Jan 28, 1997||International Business Machines Corporation||Method and apparatus for bus arbitration in a multiple bus information handling system using time slot assignment values|
|US5613146||Jun 7, 1995||Mar 18, 1997||Texas Instruments Incorporated||Reconfigurable SIMD/MIMD processor using switch matrix to allow access to a parameter memory by any of the plurality of processors|
|US5650955||Aug 16, 1996||Jul 22, 1997||Neomagic Corporation||Graphics controller integrated circuit without memory interface|
|US5666521||Nov 20, 1996||Sep 9, 1997||Intel Corporation||Method and apparatus for performing bit block transfers in a computer system|
|US5671373||Jun 8, 1995||Sep 23, 1997||Hewlett-Packard Company||Data bus protocol for computer graphics system|
|US5680591||Mar 28, 1995||Oct 21, 1997||Cirrus Logic, Inc.||Method and apparatus for monitoring a row address strobe signal in a graphics controller|
|US5715437||Nov 10, 1994||Feb 3, 1998||Brooktree Corporation||System for, and method of, processing in hardware commands received from software without polling of the hardware by the software|
|US5720019||Jun 8, 1995||Feb 17, 1998||Hewlett-Packard Company||Computer graphics system having high performance primitive clipping preprocessing|
|US5734328 *||Dec 14, 1994||Mar 31, 1998||Canon Kabushiki Kaisha||Apparatus for switching communication method based on detected communication distance|
|US5748921||Dec 11, 1995||May 5, 1998||Advanced Micro Devices, Inc.||Computer system including a plurality of multimedia devices each having a high-speed memory data channel for accessing system memory|
|US5790110||Jan 15, 1997||Aug 4, 1998||Brooktree Corporation||System and method for generating video in a computer system|
|US5790138||Jan 16, 1996||Aug 4, 1998||Monolithic System Technology, Inc.||Method and structure for improving display data bandwidth in a unified memory architecture system|
|US5815167||Jun 27, 1996||Sep 29, 1998||Intel Corporation||Method and apparatus for providing concurrent access by a plurality of agents to a shared memory|
|US5867180||Mar 13, 1997||Feb 2, 1999||International Business Machines Corporation||Intelligent media memory statically mapped in unified memory architecture|
|US5892964||Jun 30, 1997||Apr 6, 1999||Compaq Computer Corp.||Computer bridge interfaces for accelerated graphics port and peripheral component interconnect devices|
|US5911051||Sep 27, 1996||Jun 8, 1999||Intel Corporation||High-throughput interconnect allowing bus transactions based on partial access requests|
|US5936635||Jun 28, 1996||Aug 10, 1999||Cirrus Logic, Inc.||System and method of rendering polygons into a pixel grid|
|US6041010||Jun 26, 1997||Mar 21, 2000||Neomagic Corporation||Graphics controller integrated circuit without memory interface pins and associated power dissipation|
|US6041400||Oct 26, 1998||Mar 21, 2000||Sony Corporation||Distributed extensible processing architecture for digital signal processing applications|
|US6057862||Jul 1, 1997||May 2, 2000||Memtrax Llc||Computer system having a common display memory and main memory|
|US6076139||Sep 30, 1997||Jun 13, 2000||Compaq Computer Corporation||Multimedia computer architecture with multi-channel concurrent memory access|
|US6081279||Oct 21, 1997||Jun 27, 2000||Alliance Semiconductor Corporation||Shared memory graphics accelerator system|
|US6101584||May 2, 1997||Aug 8, 2000||Mitsubishi Denki Kabushiki Kaisha||Computer system and semiconductor device on one chip including a memory and central processing unit for making interlock access to the memory|
|US6104417||Sep 13, 1996||Aug 15, 2000||Silicon Graphics, Inc.||Unified memory computer architecture with dynamic graphics memory allocation|
|US6108015||Nov 2, 1995||Aug 22, 2000||Cirrus Logic, Inc.||Circuits, systems and methods for interfacing processing circuitry with a memory|
|US6118462||Sep 9, 1997||Sep 12, 2000||Memtrax Llc||Computer system controller having internal memory and external memory control|
|US6167486||Nov 18, 1996||Dec 26, 2000||Nec Electronics, Inc.||Parallel access virtual channel memory system with cacheable channels|
|US6173381||Aug 8, 1997||Jan 9, 2001||Interactive Silicon, Inc.||Memory controller including embedded data compression and decompression engines|
|US6215497||Aug 12, 1998||Apr 10, 2001||Monolithic System Technology, Inc.||Method and apparatus for maximizing the random access bandwidth of a multi-bank DRAM in a computer graphics system|
|US6232990||Jun 11, 1998||May 15, 2001||Hewlett-Packard Company||Single-chip chipset with integrated graphics controller|
|US6240437||Nov 10, 1997||May 29, 2001||Texas Instruments Incorporated||Long instruction word controlling plural independent processor operations|
|US6247084||Oct 5, 1998||Jun 12, 2001||Lsi Logic Corporation||Integrated circuit with unified memory system and dual bus architecture|
|US6295074||Mar 21, 1996||Sep 25, 2001||Hitachi, Ltd.||Data processing apparatus having DRAM incorporated therein|
|US6690379||Nov 21, 2002||Feb 10, 2004||Memtrax Llc||Computer system controller having internal memory and external memory control|
|DE19619464A1||May 14, 1996||Dec 12, 1996||Hewlett Packard Co||Datenbusprotokoll für ein Computergraphiksystem|
|EP0613098A2||Feb 14, 1994||Aug 31, 1994||Kabushiki Kaisha Toshiba||Image processing apparatus and method of controlling the same|
|EP0747872A1||May 13, 1996||Dec 11, 1996||International Business Machines Corporation||Video processor with addressing mode control|
|JPH0954835A||Title not available|
|JPH01266651A||Title not available|
|JPH08221319A||Title not available|
|JPH11510620A||Title not available|
|JPS6143359A||Title not available|
|WO1995015528A1||Nov 23, 1994||Jun 8, 1995||Vlsi Technology Inc||A reallocatable memory subsystem enabling transparent transfer of memory function during upgrade|
|WO1996013775A1||Oct 13, 1995||May 9, 1996||Flamepoint Inc||Simultaneous processing by multiple components|
|WO1997006523A1||Aug 5, 1996||Feb 20, 1997||Cirrus Logic Inc||Unified system/frame buffer memories and systems and methods using the same|
|WO1997026604A1||Jan 15, 1997||Jul 24, 1997||Monolothic System Technology I||Method and structure for improving display data bandwidth in a unified memory architecture system|
|1||"Accelerated Graphics Port Interface Specification", Revision 1.0, Intel Corporation, Jul. 31, 1996.|
|2||"Plato/PX Integrated Platform Accelerator," S3 Incorporated, Santa Clara, California, Jan. 1997.|
|3||Donovan et al "Pixel Processing in a Memory Controller" Sun Microsystems IEEE Computer Graphics and Application pp. 51-61.|
|4||Foley, et al, "Computer Graphic Principles and Practice," Addison-Wesley Publishing Company, 2.sup.nd Edition, 1990, pp. 165-179, 856-862.|
|5||Gillett, Richard B., "Memory Channel Network for PCI", IEEE Micro, Feb. 1996, pp. 12-18.|
|6||International Search Report for Application No. PCT/US98/13569 Mailed Nov. 19, 1998, 3 pages.|
|7||International Search Report, mailed May 1, 1999 for application PCT/US98/17223.|
|8||Japan Patent Office. Notification of Reasons for Refusal. Office Action dated Mar. 13, 2008. Japan Patent Application No. H11-515542. English Language Translation. 3 pages.|
|9||Japan Patent Office. Notification of Reasons for Refusal. Office Action dated Mar. 13, 2008. Japan Patent Application No. H11-515542. Japanese Language. 3 pages.|
|10||Jay Torborg & Jim Kajiya, "Talisman: Commodity Realtime 3D Graphics for the PC," SIGGRAPH 96.|
|11||JP Office Action, mailed Mar. 13, 2008 for application 11-515542.|
|12||JP Office Action, mailed Sep. 22, 2008 for application 11-515542.|
|13||K. Curt: "UMA Lowers Overall System Costs" Electronic Design., vol. 43, No. 18, Sep. 5, 1995, p. 118, 120, 122 XP000535285 Hasbrouck Heights, New Jersey US see p. 118, left-hand column, paragraph 1-middle column, paragraph 2; figure 1.|
|14||K. Curt: "UMA Lowers Overall System Costs" Electronic Design., vol. 43, No. 18, Sep. 5, 1995, p. 118, 120, 122 XP000535285 Hasbrouck Heights, New Jersey US see p. 118, left-hand column, paragraph 1—middle column, paragraph 2; figure 1.|
|15||Notice of Allowability, mailed Oct. 22, 1999 for U.S. Appl. No. 08/886,237.|
|16||Notice of Allowability, mailed Oct. 22, 1999 for U.S. Appl. No. 08/926,666.|
|17||Office Action, mail Dec. 1, 2009, for JP Patent Application 11-515542, 4 pages.|
|18||Office Action, mailed Apr. 7, 2003 for U.S. Appl. No. 10/042,751.|
|19||Office Action, mailed Aug. 31, 1999 for U.S. Appl. No. 08/926,666.|
|20||Office Action, mailed Jun. 20, 2001 for U.S. Appl. No. 09/541,413.|
|21||Office Action, mailed Mar. 13, 2001 for U.S. Appl. No. 09/541,413.|
|22||Office Action, mailed Mar. 18, 1999 for U.S. Appl. No. 08/886,237.|
|23||Office Action, mailed Mar. 2, 1999 for U.S. Appl. No. 08/926,666.|
|24||Office Action, mailed Mar. 20, 2002 for U.S. Appl. No. 09/541,413.|
|25||Office Action, mailed Oct. 1, 2002 for U.S. Appl. No. 10/201,492.|
|26||Office Action, mailed Oct. 2, 1998 for U.S. Appl. No. 08/886,237.|
|27||Office Action, mailed Oct. 5, 1998 for U.S. Appl. No. 08/926,666.|
|28||Park et al. "A High Performance Parallel Computing System for Imaging and Graphics" IEEE Pacific Rim Conference on Communications, Computers and Signal Processing, May 9-10, 1991.|
|29||Patterson, et al, "A Case for Intelligent RAM," IEEE Micro, Mar./Apr., 1997 pp. 34-44.|
|30||Yoki Eda, "UMA Reduces PC-installed Memory by Common Use of Frame Buffer and Main Memory," Nikkel Electronics, Japan, Kikkel Business Publications, Inc., Mar. 11, 1996, No. 657, 16 pages.|
|U.S. Classification||345/535, 345/531, 345/542, 345/536|
|International Classification||G09G5/393, G09G5/00, G06F12/00, H04N7/50, G06T11/00, G09G5/39, H04N7/26, G06F13/16, G09G5/36, G06F13/18|
|Cooperative Classification||H04N19/61, H04N19/423, G06F13/1605, G09G2360/12, G09G5/001, G09G2360/126, G09G3/003, G09G5/393, G09G2360/122, G09G2360/121, G09G5/39, G09G2360/125, G09G5/363|
|European Classification||H04N7/26L2, G09G5/36C, G09G5/00A, G09G3/00B4, G09G5/39, H04N7/50, G06F13/16A|
|Jun 1, 2007||AS||Assignment|
Owner name: MEMTRAX LLC, CALIFORNIA
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MARGULIS, NEAL;REEL/FRAME:019366/0743
Effective date: 19990525
Owner name: XTREMA LLC, NEVADA
Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MEMTRAX LLC;REEL/FRAME:019366/0894
Effective date: 20050324
|Jul 21, 2011||FPAY||Fee payment|
Year of fee payment: 8
|Jul 28, 2015||FPAY||Fee payment|
Year of fee payment: 12