WO2000011614A2 - Tangent space lighting in a deferred shading architecture - Google Patents
Tangent space lighting in a deferred shading architecture
- Publication number
- WO2000011614A2 (PCT/US1999/019036)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- texture
- vector
- fragment
- block
- bump
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/50—Lighting effects
- G06T15/80—Shading
- G06T15/87—Gouraud shading
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/30—Clipping
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T1/00—General purpose image data processing
- G06T1/20—Processor architectures; Processor configuration, e.g. pipelining
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T11/00—2D [Two Dimensional] image generation
- G06T11/40—Filling a planar surface by adding surface attributes, e.g. colour or texture
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/005—General purpose rendering architectures
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/04—Texture mapping
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/20—Perspective computation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/10—Geometric effects
- G06T15/40—Hidden part removal
- G06T15/405—Hidden part removal using Z-buffer
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/50—Lighting effects
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
- G06T15/00—3D [Three Dimensional] image rendering
- G06T15/50—Lighting effects
- G06T15/80—Shading
- G06T15/83—Phong shading
Definitions
- This invention relates generally to computing systems, more particularly to three-dimensional computer graphics, and most particularly to a structure and method for performing tangent space lighting in a three-dimensional graphics processor implementing deferred shading features.
- Computer graphics is the art and science of generating pictures with a computer. Generation of pictures, or images, is commonly called rendering.
- Generally, in three-dimensional (3D) computer graphics, geometry that represents surfaces (or volumes) of objects in a scene is translated into pixels stored in a frame buffer, and then displayed on a display device.
- Real-time display devices, such as CRTs used as computer monitors, refresh the display by continuously displaying the image over and over. This refresh usually occurs row-by-row, where each row is called a raster line or scan line. In this document, raster lines are numbered from bottom to top, but are displayed in order from top to bottom.
- In a 3D animation, a sequence of images is displayed, giving the illusion of motion in three-dimensional space.
- Interactive 3D computer graphics allows a user to change his viewpoint or change the geometry in real-time, thereby requiring the rendering system to create new images on- the-fly in real-time.
- In 3D computer graphics, each renderable object generally has its own local object coordinate system, and therefore needs to be translated (or transformed) from object coordinates to pixel display coordinates.
- this is a 4-step process: 1) translation (including scaling for size enlargement or shrinkage) from object coordinates to world coordinates, which is the coordinate system for the entire scene; 2) translation from world coordinates to eye coordinates, based on the viewing point of the scene; 3) translation from eye coordinates to perspective translated eye coordinates, where perspective scaling (farther objects appear smaller) has been performed; and 4) translation from perspective translated eye coordinates to pixel coordinates, also called screen coordinates.
- Screen coordinates are points in three-dimensional space, and can be in either screen-precision (i.e., pixels) or object-precision (high precision numbers, usually floating-point), as described later. These translation steps can be compressed into one or two steps by precomputing appropriate translation matrices before any translation occurs.
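The precomputation of the translation steps can be written compactly (a sketch using conventional 4x4 homogeneous matrices; the matrix names are illustrative, not taken from the patent):

```latex
% Object-to-screen transform folded into one precomputed matrix product.
% M_model: object -> world, M_view: world -> eye,
% M_proj: eye -> perspective-translated eye, M_screen: -> pixel coordinates.
\[
  p_{screen} = \underbrace{M_{screen}\, M_{proj}\, M_{view}\, M_{model}}_{\text{precomputed once per object}}\; p_{obj},
  \qquad p_{obj} = (x_{obj},\, y_{obj},\, z_{obj},\, 1)^{T}
\]
```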
- Many techniques are used for generating pixel color values, including Gouraud shading, Phong shading, and texture mapping.
- Figure 1 shows a three-dimensional object, a tetrahedron, with its own coordinate axes (x_obj, y_obj, z_obj).
- the three-dimensional object is translated, scaled, and placed in the viewing point's coordinate system, based on (x_eye, y_eye, z_eye).
- the object is projected onto the viewing plane, thereby correcting for perspective.
- the object appears to have become two-dimensional; however, the object's z-coordinates are preserved so they can be used later by hidden surface removal techniques.
- the object is finally translated to screen coordinates, based on (x_screen, y_screen, z_screen).
- the geometry representing the surfaces closest to the scene viewing point must be determined.
- the visible surfaces within the volume subtended by the pixel's area determine the pixel color value, while hidden surfaces are prevented from affecting the pixel.
- Non-opaque surfaces closer to the viewing point than the closest opaque surface (or surfaces, if an edge of geometry crosses the pixel area) affect the pixel color value, while all other non-opaque surfaces are discarded.
- the term "occluded" is used to describe geometry which is hidden by other non-opaque geometry.
- the depth complexity of a scene is a measure of the wasted processing. For example, for a scene with a depth complexity of ten, 90% of the computation is wasted on hidden pixels.
- This wasted computation is typical of hardware renderers that use the simple Z-buffer technique (discussed later herein), generally chosen because it is easily built in hardware. Methods more complicated than the Z-buffer technique have heretofore generally been too complex to build in a cost-effective manner.
- An important feature of the method and apparatus presented here is the avoidance of this wasted computation by eliminating hidden portions of geometry before they are rasterized, while still being simple enough to build in cost-effective hardware.
- When a point on a surface (frequently a polygon vertex) is translated to screen coordinates, the point has three coordinates: 1) the x-coordinate in pixel units (generally including a fraction); 2) the y-coordinate in pixel units (generally including a fraction); and 3) the z-coordinate of the point in either eye coordinates, distance from the virtual screen, or some other coordinate system which preserves the relative distance of surfaces from the viewing point.
- positive z- coordinate values are used for the "look direction" from the viewing point, and smaller values indicate a position closer to the viewing point.
- a surface is approximated by a set of planar polygons
- the vertices of each polygon are translated to screen coordinates.
- the screen coordinates are interpolated from the coordinates of vertices, typically by the processes of edge walking and span interpolation.
- a z-coordinate value is generally included in each pixel value (along with the color value) as geometry is rendered.
- the Deering Reference includes a diagram of a generic 3D graphics pipeline (i.e., a renderer, or a rendering system) that it describes as "truly generic, as at the top level nearly every commercial 3D graphics accelerator fits this abstraction", and this pipeline diagram is reproduced here as Figure 2.
- Such pipeline diagrams convey the process of rendering, but do not describe any particular hardware.
- This document presents a new graphics pipeline that shares some of the steps of the generic 3D graphics pipeline. Each of the steps in the generic 3D graphics pipeline will be briefly explained here, and they are also shown in the method flow diagram of Figure 3. Processing of polygons is assumed throughout this document, but other methods for describing 3D geometry could be substituted. For simplicity of explanation, triangles are used as the type of polygon in the described methods.
- the first step within the floating-point intensive functions of the generic 3D graphics pipeline after the data input is the transformation step (Step 214), which was described above.
- the transformation step is also shown in Figure 3 as the first step in the outer loop of the method flow diagram, and also includes "get next polygon".
- the second step, the clip test, checks the polygon to see if it is at least partially contained in the view volume (sometimes shaped as a frustum) (Step 216). If the polygon is not in the view volume, it is discarded; otherwise, processing continues.
- the third step is face determination, where polygons facing away from the viewing point are discarded (Step 218). Generally, face determination is applied only to objects that are closed volumes.
- the fourth step, lighting computation, generally includes the set up for Gouraud shading and/or Phong shading (Step 222).
- the fifth step, clipping, deletes any portion of the polygon that is outside of the view volume because that portion would not project within the rectangular area of the viewing plane (Step 224).
- polygon clipping is done by splitting the polygon into two smaller polygons that both project within the area of the viewing plane. Polygon clipping is computationally expensive.
- the sixth step, perspective divide, does perspective correction for the projection of objects onto the viewing plane (Step 226). At this point, the points representing vertices of polygons are converted to pixel space coordinates by step seven, the screen space conversion step (Step 228).
- the eighth step (Step 230), set up for incremental render, computes the various begin, end, and increment values needed for edge walking and span interpolation (e.g.: x, y, and z-coordinates; RGB color; texture map space u and v-coordinates; and the like).
- edge walking incrementally generates horizontal spans for each raster line of the display device by incrementing values from the previously generated span (in the same polygon), thereby "walking" vertically along opposite edges of the polygon.
- span interpolation (Step 234) "walks" horizontally along a span to generate pixel values, including a z-coordinate value indicating the pixel's distance from the viewing point.
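Span interpolation can be sketched as follows (a minimal illustration, not the patent's hardware; the start values and per-pixel increments are assumed to come from the set-up step (Step 230) and edge walking (Step 232)):

```cpp
// Per-pixel values produced by span interpolation (Step 234).
struct PixelValue { float z; float r, g, b; };

// Interpolate one horizontal span: start values and per-pixel increments
// come from the incremental-render set-up and edge walking steps.
void interpolate_span(int x_start, int x_end,
                      float z, float dz_dx,
                      float r, float g, float b,
                      float dr_dx, float dg_dx, float db_dx,
                      PixelValue* row /* indexed by x */) {
    for (int x = x_start; x <= x_end; ++x) {
        row[x] = PixelValue{z, r, g, b};      // candidate fragment for (x, y)
        z += dz_dx;                           // walk z across the span
        r += dr_dx; g += dg_dx; b += db_dx;   // walk Gouraud colors across the span
    }
}
```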
- the z-buffered blending, also referred to as Testing and Blending (Step 236), generates a final pixel color value.
- the pixel values also include color values, which can be generated by simple Gouraud shading (i.e., interpolation of vertex color values) or by more computationally expensive techniques such as texture mapping (possibly using multiple texture maps blended together), Phong shading (i.e., per-fragment lighting), and/or bump mapping (perturbing the interpolated surface normal).
- By comparing the generated z-coordinate value to the corresponding value stored in the Z Buffer, the z-buffered blend either keeps the new pixel values (if it is closer to the viewing point than the previously stored value for that pixel location) by writing it into the frame buffer, or discards the new pixel values (if it is farther). At this step, antialiasing methods can blend the new pixel color with the old pixel color.
- the z-buffered blend generally includes most of the per-fragment operations, described below.
- the generic 3D graphics pipeline includes a double buffered frame buffer, so a double buffered MUX is also included.
- An output lookup table is included for translating color map values.
- digital to analog conversion makes an analog signal for input to the display device.
- a major drawback to the generic 3D graphics pipeline is that its drawing intensive functions are not deterministic at the pixel level given a fixed number of polygons. That is, given a fixed number of polygons, more pixel-level computation is required as the average polygon size increases.
- the floating-point intensive functions are proportional to the number of polygons, and independent of the average polygon size. Therefore, it is difficult to balance the amount of computational power between the floating-point intensive functions and the drawing intensive functions because this balance depends on the average polygon size.
- Prior art Z Buffers are based on conventional Random Access Memory (RAM or DRAM), Video RAM (VRAM), or special purpose DRAMs.
- One example of a special purpose DRAM is presented in "FBRAM: A New Form of Memory Optimized for 3D Graphics", by Deering, Schlapp, and Lavelle, pages 167 to 174 of SIGGRAPH 94 Proceedings, 24-29 July 1994, Computer Graphics Proceedings, Annual Conference Series, published by ACM SIGGRAPH, New York, 1994, Softcover ISBN 0201607956.
- OpenGL is a software interface to graphics hardware which consists of several hundred functions and procedures that allow a programmer to specify objects and operations to produce graphical images.
- the objects and operations include appropriate characteristics to produce color images of three-dimensional objects.
- Most of OpenGL (Version 1.2) assumes or requires that the graphics hardware include a frame buffer, even though the object may be a point, line, polygon, or bitmap, and the operation may be an operation on that object.
- the general features of OpenGL (just one example of a graphical interface) are described in the reference "The OpenGL Graphics System: A Specification (Version 1.2)", edited by Mark Segal and Kurt Akeley, Version 1.2, March 1998, hereby incorporated by reference.
- the invention is not limited to structures, procedures, or methods which are compatible or consistent with OpenGL.
- the inventive structure and method may be implemented in a manner that is consistent with OpenGL, or another standard graphical interface, so that a data set prepared for one of the standard interfaces may be processed by the inventive structure and method without modification.
- the inventive structure and method provides some features not provided by OpenGL, and even when such generic input/output is provided, the implementation is provided in a different manner.
- pipeline state does not have a single definition in the prior art.
- the OpenGL specification sets forth the type and amount of the graphics rendering machine or pipeline state in terms of items of state and the number of bits and bytes required to store that state information.
- pipeline state tends to include object vertex pertinent information including, for example, the vertices themselves, the vertex normals, and color, as well as "non-vertex" information.
- When information is sent into a graphics renderer, at least some object geometry information is provided to describe the scene.
- the object or objects are specified in terms of vertex information, where an object is modeled, defined, or otherwise specified by points, lines, or polygons (object primitives) made up of one or more vertices.
- In simple terms, a vertex is a location in space and may be specified, for example, by a three-space (x,y,z) coordinate relative to some reference origin.
- Associated with each vertex is other information, such as a surface normal, color, texture, transparency, and similar information pertaining to the characteristics of the vertex. This information is essentially "per-vertex" information.
- a color value may be specified in the data stream for a particular vertex and then not respecified in the data stream until the color changes for a subsequent vertex.
- the color value may still be characterized as per-vertex data even though a color value is not explicitly included in the incoming data stream for each vertex.
- Texture mapping presents an interesting example of information or data which could be considered as either per-vertex information or pipeline state information.
- one or more texture maps may be specified, each texture map being identified in some manner, such as with a texture coordinate or coordinates.
- One may consider the texture map to which one is pointing with the texture coordinate as part of the pipeline state while others might argue that it is per-vertex information.
- Other information used by the renderer that is not related on a one-to-one basis to the geometry object primitives, such as lighting location and intensity, material settings, reflective properties, and other overall rules on which the renderer is operating, may more accurately be referred to as pipeline state.
- Parameters considered to be renderer (pipeline) state in OpenGL are identified in Section 6.2 of the aforereferenced OpenGL Specification (Version 1.2, at pages 193-217).
- A frame buffer is assumed by graphics APIs such as OpenGL (Open Graphics Library) and D3D (Direct3D).
- a frame buffer stores a set of pixels as a two-dimensional array.
- Each picture element, or pixel, stored in the frame buffer is simply a set of some number of bits. The number of bits per pixel may vary depending on the particular GL implementation or context.
- Corresponding bits from each pixel in the framebuffer are grouped together into a bitplane, each bitplane containing a single bit from each pixel.
- the bitplanes are grouped into several logical buffers referred to as the color, depth, stencil, and accumulation buffers.
- the color buffer in turn includes what is referred to under OpenGL as the front left buffer, the front right buffer, the back left buffer, the back right buffer, and some additional auxiliary buffers.
- the values stored in the front buffers are the values typically displayed on a display monitor while the contents of the back buffers and auxiliary buffers are invisible and not displayed.
- Stereoscopic contexts display both the front left and the front right buffers, while monoscopic contexts display only the front left buffer.
- the color buffers must have the same number of bitplanes, but particular implementations or contexts may not provide right buffers, back buffers, or auxiliary buffers at all, and an implementation or context may additionally provide or not provide stencil, depth, or accumulation buffers.
- the color buffers consist of either unsigned integer color indices or R, G, B, and, optionally, A unsigned integer values; and the number of bitplanes in each of the color buffers, the depth buffer (if provided), the stencil buffer (if provided), and the accumulation buffer (if provided), is fixed and window dependent. If an accumulation buffer is provided, it should have at least as many bitplanes per R, G, and B color component as do the color buffers.
- a fragment produced by rasterization with window coordinates of (x_w, y_w) modifies the pixel in the framebuffer at that location based on a number of tests, parameters, and conditions. The several tests that are typically performed sequentially, beginning with a fragment and its associated data and finishing with the final output stream to the frame buffer, are, in the order performed (and with some variation among APIs): 1) pixel ownership test; 2) scissor test; 3) alpha test; 4) color test; 5) stencil test; 6) depth test; 7) blending; 8) dithering; and 9) logicop. Note that OpenGL does not provide for an explicit "color test" between the alpha test and stencil test. Per-fragment operations under OpenGL are applied after all the color computations. Each of these tests or operations is briefly described below.
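This ordering can be sketched as a chain of early-out tests (a schematic sketch, not the patent's pipeline; the stubbed tests always pass and stand in for the state-dependent comparisons described below):

```cpp
struct Fragment { int xw, yw; float z, r, g, b, a; };

// Placeholder tests -- each would consult pipeline state and framebuffer
// contents; here they simply pass so that the ordering itself is the point.
inline bool pixel_ownership_test(const Fragment&) { return true; } // 1
inline bool scissor_test(const Fragment&)         { return true; } // 2
inline bool alpha_test(const Fragment&)           { return true; } // 3
inline bool color_test(const Fragment&)           { return true; } // 4 (not core OpenGL)
inline bool stencil_test(const Fragment&)         { return true; } // 5
inline bool depth_test(const Fragment&)           { return true; } // 6
inline void blend_dither_logicop(const Fragment&) {}               // 7-9

// A fragment survives to the framebuffer only if every earlier test passes.
inline void per_fragment_ops(const Fragment& f) {
    if (!pixel_ownership_test(f)) return;
    if (!scissor_test(f))         return;
    if (!alpha_test(f))           return;
    if (!color_test(f))           return;
    if (!stencil_test(f))         return;
    if (!depth_test(f))           return;
    blend_dither_logicop(f);
}
```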
- the pixel ownership test determines if the pixel at location (x_w, y_w) in the framebuffer is currently owned by the GL context. If it is not, the window system decides the fate of the incoming fragment. Possible results are that the fragment is discarded or that some subset of the subsequent per-fragment operations are applied to the fragment. This pixel ownership test allows the window system to properly control the GL's behavior.
- the associated window defines the pixels the process wants to write or render to.
- the window associated with one process may be in front of the window associated with another process, behind that window, or both windows may be entirely visible. Since there is only a single frame buffer for the entire display screen or desktop, the pixel ownership test involves determining which process and associated window owns each of the pixels. If a particular process does not "own" a pixel, it fails the pixel ownership test relative to the frame buffer and that pixel is thrown away.
- the pixel ownership test is run by each process, and for a given pixel location in the frame buffer, that pixel may pass the pixel ownership test for one of the processes and fail the pixel ownership test for the other process. Furthermore, in general, a particular pixel can pass the ownership test for only one process, because only one process can own a particular frame buffer pixel at the same time.
- in some cases the pixel ownership test may not be particularly relevant, for example, if the scene is being rendered to an off-screen buffer and subsequently block transferred (or "blitted") to the desktop.
- if the pixel is not owned by that process, then there is no need to write a pixel value to that location, and all subsequent processing for that pixel may be ignored.
- all the data associated with a particular pixel on the screen is read during rasterization. All information for any polygon that feeds that pixel is read, including information as to the identity of the process that owns that frame buffer pixel, as well as the z-buffer, the color value, the old color value, the alpha value, stencil bits, and so forth. If a process owns the pixel, then the other downstream operations (for example, the scissor test, alpha test, and the like) are executed. On the other hand, if the process does not own the pixel and fails the ownership test for that pixel, the process need not consider that pixel further, and that pixel is skipped for subsequent tests.
- the scissor test determines if (x_w, y_w) lies within a scissor rectangle defined by four coordinate values corresponding to a left bottom (left, bottom) coordinate, a width of the rectangle, and a height of the rectangle. The values are set with the procedure "void Scissor(int left, int bottom, sizei width, sizei height)" under OpenGL. If left <= x_w < left + width and bottom <= y_w < bottom + height, then the scissor test passes; otherwise the scissor test fails and the particular fragment being tested is discarded. Various initial states are provided and error conditions monitored and reported.
- a rectangle defines a window which may be an on-screen or off-screen window.
- the window is defined by an x-left, x-right, y-top, and y-bottom coordinate (even though it may be expressed in terms of a point and height and width dimensions from that point).
- This scissor window is useful in that only pixels from a polygon fragment that fall in that screen aligned scissor window will change. In the event that a polygon straddles the scissor window, only those pixels that are inside the scissor window may change.
- the pipeline calculates everything it needs to in order to determine the z-value and color of that pixel. Once z-value and color are determined, that information is used to determine what information should be placed in the frame buffer (thereby determining what is displayed on the display screen).
- the scissor test provides means for discarding pixels and/or fragments before they actually get to the frame buffer to cause the output to change.
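A minimal sketch of the scissor comparison (parameter names follow the glScissor signature quoted above; the struct is illustrative):

```cpp
// Scissor test: the rectangle is given as a left-bottom corner plus
// width and height, matching OpenGL's Scissor parameters.
struct ScissorRect { int left, bottom, width, height; };

inline bool scissor_test(int xw, int yw, const ScissorRect& s) {
    return xw >= s.left   && xw < s.left   + s.width &&
           yw >= s.bottom && yw < s.bottom + s.height;
    // Fragments outside the rectangle are discarded by the caller.
}
```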
- Alpha Test: Color is defined by four values, red (R), green (G), blue (B), and alpha (A).
- the RGB values define the contribution from each of the primary colors, and alpha is related to the transparency. Typically, color is a 32-bit value, 8 bits for each component, though such representation is not limited to 32 bits.
- Alpha test compares the alpha value of a given pixel to an alpha reference value. The type of comparison may also be specified, so that for example the comparison may be a greater-than operation, a less-than operation, and so forth. If the comparison is a greater-than operation, then the pixel's alpha value has to be greater than the reference to pass the alpha test.
- Alpha test is a per-fragment operation and happens after all of the fragment coloring calculations and lighting and shading operations are completed. Each of these per-fragment operations may be thought of as part of the conventional z-buffer blending operations.
- Color test is similar to the alpha test described hereinbefore, except that rather than performing the magnitude or logical comparisons between the pixel alpha (A) value and a reference value, the color test performs a magnitude or logical comparison between one or a combination of the R, G, or B color components and reference value(s).
- the comparison test may be, for example, greater-than, less-than, equal-to, greater-than-or-equal-to, "greater-than c_1 and less-than c_2" where c_1 and c_2 are some predetermined reference values, and so forth.
- Color test might, for example, be useful to provide blue-screen functionality.
- the comparison test may also be performed on a single color component or on a combination of color components.
- while one typically has one reference value for each component in the alpha test, for the color test there are effectively two values per component, a maximum value and a minimum value.
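The alpha and color tests described above can be sketched as follows (a schematic sketch; the enum and the two-reference "blue-screen" style range check are illustrative, and the color test is an extension rather than core OpenGL):

```cpp
// Selectable comparison, as used by the alpha test.
enum class Cmp { Never, Less, LessEqual, Equal, GreaterEqual, Greater, NotEqual, Always };

inline bool compare(Cmp op, float value, float ref) {
    switch (op) {
        case Cmp::Never:        return false;
        case Cmp::Less:         return value <  ref;
        case Cmp::LessEqual:    return value <= ref;
        case Cmp::Equal:        return value == ref;
        case Cmp::GreaterEqual: return value >= ref;
        case Cmp::Greater:      return value >  ref;
        case Cmp::NotEqual:     return value != ref;
        case Cmp::Always:       return true;
    }
    return false;
}

// Alpha test: one reference value per fragment alpha.
inline bool alpha_test(float a, Cmp op, float ref) { return compare(op, a, ref); }

// Color test: "greater-than c_min and less-than c_max" per component,
// e.g. to provide blue-screen functionality.
inline bool color_test(float c, float c_min, float c_max) {
    return c > c_min && c < c_max;
}
```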
- stencil test conditionally discards a fragment based on the outcome of a comparison between a value stored in a stencil buffer at location (x_w, y_w) and a reference value.
- Several stencil comparison functions are permitted such that the stencil test passes never, always, if the reference value is less than, less than or equal to, equal to, greater than or equal to, greater than, or not equal to the masked stored value in the stencil buffer.
- Under OpenGL, if the stencil test fails, the incoming fragment is discarded.
- the reference value and the comparison value can have multiple bits, typically 8 bits so that 256 different values may be represented.
- a tag having the stencil bits is also written into the frame buffer. These stencil bits are part of the pipeline state.
- the type of stencil test to perform can be specified at the time the geometry is rendered.
- the stencil bits are used to implement various filtering, masking or stenciling operations. For example, if a particular fragment ends up affecting a particular pixel in the frame buffer, then the stencil bits can be written to the frame buffer along with the pixel information.
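The stencil comparison can be sketched as follows (a schematic sketch; per the text, the reference value is compared against the masked stored value, and 8-bit values give 256 levels):

```cpp
#include <cstdint>

enum class StencilFunc { Never, Less, LessEqual, Equal, GreaterEqual, Greater, NotEqual, Always };

// Compare the (masked) reference value against the (masked) stored stencil value.
inline bool stencil_test(uint8_t stored, uint8_t ref, uint8_t mask, StencilFunc f) {
    const uint8_t s = stored & mask, r = ref & mask;  // 8 bits: 256 values
    switch (f) {
        case StencilFunc::Never:        return false;
        case StencilFunc::Less:         return r <  s;
        case StencilFunc::LessEqual:    return r <= s;
        case StencilFunc::Equal:        return r == s;
        case StencilFunc::GreaterEqual: return r >= s;
        case StencilFunc::Greater:      return r >  s;
        case StencilFunc::NotEqual:     return r != s;
        case StencilFunc::Always:       return true;
    }
    return false;
}
```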
- the depth buffer test discards the incoming fragment if a depth comparison fails.
- the comparison is enabled or disabled with the generic Enable and Disable commands using the OpenGL symbolic constant DEPTH_TEST.
- if the depth test is disabled, the depth comparison and subsequent possible updates to the depth buffer value are bypassed and the fragment is passed to the next operation.
- the stencil bits are also involved and may be modified even if the depth test is bypassed; in that case the stencil value is modified as if the depth buffer test passed. If the depth test is enabled, the depth comparison takes place and the depth buffer and stencil value may subsequently be modified. The manner in which the depth test is implemented in OpenGL is described in greater detail in the OpenGL specification at page 145.
- Depth comparisons are implemented in which possible outcomes are as follows: the depth buffer test passes never, always, or if the incoming fragment's z_w value is less than, less than or equal to, equal to, greater than, greater than or equal to, or not equal to the depth value stored at the location given by the incoming fragment's (x_w, y_w) coordinates. If the depth buffer test fails, the incoming fragment is discarded. The stencil value at the fragment's (x_w, y_w) coordinates is updated according to the function currently in effect for depth buffer test failure. Otherwise, the fragment continues to the next operation and the value of the depth buffer at the fragment's (x_w, y_w) location is set to the fragment's z_w value. In this case the stencil value is updated according to the function currently in effect for depth buffer test success.
- the necessary OpenGL state is an eight-valued integer and a single bit indicating whether depth buffering is enabled or disabled.
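The behavior described above can be sketched as follows (a schematic sketch; the stencil "fail"/"pass" operations are left as comments, and the enum and function names are illustrative):

```cpp
enum class DepthFunc { Never, Less, LessEqual, Equal, GreaterEqual, Greater, NotEqual, Always };

inline bool depth_compare(DepthFunc f, float frag_z, float stored_z) {
    switch (f) {
        case DepthFunc::Never:        return false;
        case DepthFunc::Less:         return frag_z <  stored_z;
        case DepthFunc::LessEqual:    return frag_z <= stored_z;
        case DepthFunc::Equal:        return frag_z == stored_z;
        case DepthFunc::GreaterEqual: return frag_z >= stored_z;
        case DepthFunc::Greater:      return frag_z >  stored_z;
        case DepthFunc::NotEqual:     return frag_z != stored_z;
        case DepthFunc::Always:       return true;
    }
    return false;
}

// Returns true if the fragment survives; updates the depth buffer on success.
inline bool depth_test(bool enabled, DepthFunc f, float frag_z, float& depth_buf) {
    if (!enabled) return true;  // comparison and depth update are bypassed
    if (!depth_compare(f, frag_z, depth_buf)) return false; // stencil "fail" op applies
    depth_buf = frag_z;         // store fragment depth; stencil "pass" op applies
    return true;
}
```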
- blending combines the incoming fragment's R, G, B, and A values with the R, G, B, and A values stored in the framebuffer at the incoming fragment's (x_w, y_w) location.
- This blending is typically dependent on the incoming fragment's alpha value (A) and that of the corresponding frame buffer stored pixel.
- Cs refers to the source color for an incoming fragment
- Cd refers to the destination color at the corresponding framebuffer location
- Cc refers to a constant color in the GL state.
- Individual RGBA components of these colors are denoted by subscripts of s, d, and c respectively.
- Blending is basically an operation that takes color in the frame buffer and the color in the fragment, and blends them together.
- the manner in which blending is achieved, that is the particular blending function, may be selected from various alternatives for both the source and destination.
- Blending is described in the OpenGL specification at pages 146-149, which are hereby incorporated by reference.
- Various blend equations are available under OpenGL.
- the blending equation is evaluated separately for each color component and its corresponding weighting coefficient.
- Each of the four R, G, B, A components has its own weighting factor.
- the blending test (or blending equation) is part of pipeline state and can potentially change for every polygon, but more typically would change only for the object made up of several polygons.
- blending is only performed once other tests such as the pixel ownership test and stencil test have been passed so that it is clear that the pixel or fragment under consideration would or could have an effect in the output.
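One common blend equation can be sketched as follows (a minimal sketch of the source-alpha weighting case, evaluated separately per component as described above; the function name is illustrative):

```cpp
struct RGBA { float r, g, b, a; };

// Blend using the incoming fragment's alpha (Cs.a) and one-minus-alpha,
// i.e. the OpenGL SRC_ALPHA / ONE_MINUS_SRC_ALPHA factor pair, applied
// separately to each of the R, G, B, A components.
inline RGBA blend_src_alpha(const RGBA& Cs /* fragment */, const RGBA& Cd /* framebuffer */) {
    const float ws = Cs.a, wd = 1.0f - Cs.a;
    return RGBA{ Cs.r * ws + Cd.r * wd,
                 Cs.g * ws + Cd.g * wd,
                 Cs.b * ws + Cd.b * wd,
                 Cs.a * ws + Cd.a * wd };
}
```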
- dithering selects between two color values or indices.
- In RGBA mode, consider the value of any of the color components as a fixed-point value with m bits to the left of the binary point, where m is the number of bits allocated to that component in the framebuffer; call each such value c.
- dithering selects a value d such that d ∈ { max{0, ⌈c⌉ - 1}, ⌈c⌉ }. This selection may depend on the x_w and y_w coordinates of the pixel.
- In color index mode, the same rule applies with c being a single color index. The value of c must not be larger than the maximum value representable in the framebuffer for either the component or the index.
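The selection rule can be sketched as follows (a schematic sketch; the 2x2 ordered-dither threshold pattern is illustrative, since the rule above only requires that the selection may depend on the window coordinates):

```cpp
#include <cmath>

// Choose between the two fixed-point values bracketing c, with the choice
// driven by a position-dependent threshold.
inline float dither(float c, int xw, int yw) {
    static const float threshold[2][2] = { {0.25f, 0.75f}, {1.00f, 0.50f} };
    const float lo = std::fmax(0.0f, std::ceil(c) - 1.0f); // max{0, ceil(c)-1}
    const float hi = std::ceil(c);
    const float frac = c - lo;                              // distance toward hi
    return frac >= threshold[yw & 1][xw & 1] ? hi : lo;
}
```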
- Logicop: Under OpenGL, there is a final logical operation applied between the incoming fragment's color or index values and the color or index values stored in the frame buffer at the corresponding location. The result of the logical operation replaces the values in the framebuffer at the fragment's (x, y) coordinates.
- Various logical operations may be implemented between source (s) and destination (d), including for example: clear, set, and, noop, xor, or, nor, nand, invert, copy, inverted and, equivalence, reverse or, reverse and, inverted copy, and inverted or.
- the logicop arguments and corresponding operations, as well as additional details of the OpenGL logicop implementation, are set forth in the OpenGL specification at pages 150-151.
- Logical operations are performed independently for each color index buffer that is selected for writing, or for each red, green, blue, and alpha value of each color buffer that is selected for writing.
- the required state is an integer indicating the logical operation, and two bits indicating whether the logical operation is enabled or disabled.
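A few of the listed operations, sketched over the bit patterns stored in a color buffer (a minimal sketch; the enum covers only a subset of the operations named above):

```cpp
#include <cstdint>

enum class LogicOp { Clear, Set, Copy, Noop, And, Or, Xor, Nor, Nand, Invert };

// Combine source (fragment) and destination (framebuffer) bit patterns.
inline uint32_t logicop(LogicOp op, uint32_t s, uint32_t d) {
    switch (op) {
        case LogicOp::Clear:  return 0u;        // all zeros
        case LogicOp::Set:    return ~0u;       // all ones
        case LogicOp::Copy:   return s;
        case LogicOp::Noop:   return d;
        case LogicOp::And:    return s & d;
        case LogicOp::Or:     return s | d;
        case LogicOp::Xor:    return s ^ d;
        case LogicOp::Nor:    return ~(s | d);
        case LogicOp::Nand:   return ~(s & d);
        case LogicOp::Invert: return ~d;
    }
    return d;
}
```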
- pixels are referred to as the smallest individually controllable element of the display device. But, because images are quantized into discrete pixels, spatial aliasing occurs.
- a typical aliasing artifact is a "staircase" effect caused when a straight line or edge cuts diagonally across rows of pixels.
- Some rendering systems reduce aliasing effects by dividing pixels into subpixels, where each sub- pixel can be colored independently. When the image is to be displayed, the colors for all sub-pixels within each pixel are blended together to form an average color for the pixel.
- a renderer that uses up to 16 sub-pixels per pixel is described in "RealityEngine Graphics", by Akeley, pages 109 to 116 of SIGGRAPH 93 Proceedings, 1-6 August 1993, Computer Graphics Proceedings, Annual Conference Series, published by ACM SIGGRAPH, New York, 1993, Softcover ISBN 0-201-58889-7 and CD-ROM ISBN 0-201-56997-3 (hereinafter referred to as the Akeley Reference).
- An A-Buffer can also be used to perform blending; this technique is also described in the Akeley Reference.
- the A-buffer is an antialiasing technique that reduces aliasing by keeping track of the percent coverage of a pixel by a rendered polygon.
- the main drawback to this technique is the need to sort polygons front-to-back (or back-to-front) at each pixel in order to get acceptable antialiased polygons.
- A Magnitude Comparison Content Addressable Memory (MCCAM) is defined here as any CAM where the stored data are treated as numbers, and arithmetic magnitude comparisons (i.e., less-than, greater-than, less-than-or-equal-to, and the like) are performed on the data in parallel. This is in contrast to ordinary CAM, which treats stored data strictly as bit vectors, not as numbers.
- An MCCAM patent, incorporated herein by reference, is U.S. Patent Number 4,996,666, by Jerome F. Duluk Jr., entitled "Content-Addressable Memory System Capable of Fully Parallel Magnitude Comparisons", granted February 26, 1991 (hereinafter the Duluk Patent).
- Duluk Patent Structures within the Duluk Patent specifically referenced shall include the prefix "Duluk Patent” (for example, “Duluk Patent MCCAM Bit Circuit”).
- the basic internal structure of an MCCAM is a set of memory bits organized into words, where each word can perform one or more arithmetic magnitude comparisons between the stored data and input data.
- a parallel search comparison operation is called a "query" of the stored data.
- the invention described herein is a system and method for performing tangent space lighting in a deferred shading architecture.
- floating point-intensive lighting computations are performed only after hidden surfaces have been removed from the graphics pipeline. This can result in dramatically fewer lighting computations than in the conventional approach described in reference to Figure 2, where shading computations ( Figure 2, 222) are performed for nearly all surfaces before hidden pixels are removed in the z-buffered blending operation ( Figure 2, 236).
- SGI (Silicon Graphics, Inc.)
- lighting computations generate for each pixel of a surface an RGBA color value that accounts for the surface's color, orientation, and material properties; the orientation and properties of the surface illumination; and the viewpoint from which the illuminated surface is observed.
- the material properties can include: fog, emissive color, reflective properties (ambient, diffuse, specular) and bump effects.
- the illumination properties can include for one or more lights: color (global ambient, light ambient, light diffuse, light specular) and attenuation, spotlight and shadow effects.
- Referring to Figure 3, there is shown a diagram illustrating the elements employed in the lighting computations of both the conventional approach and the present invention. This figure does not illustrate the elements used in bump mapping calculations, which are shown in Figure 4. The elements shown in Figure 3 are defined below.
- V = (V_x, V_y, V_z) the position of the fragment to be illuminated, in eye coordinates.
- N = (N_x, N_y, N_z) the unit normal vector at the fragment.
- P_L = (P_Lx, P_Ly, P_Lz) the location of the light source in eye coordinates. For a light at infinity, P_L represents the coordinates of a unit vector from the origin to the light.
- P_E the location of the viewer (viewpoint). In eye coordinates the viewpoint is at either the origin or at infinity in the z direction.
- L the unit vector from the vertex to the light, P_L, defined as: L = (P_L - V) / |P_L - V|
- H the unit vector halfway between E and L, defined as: H = (E + L) / |E + L|
- S_d the unit vector in the direction of the spotlight. It is a Lighting Source Parameter and is provided as a unit vector.
- S_c the cosine of the angle that defines the spotlight cone. It is a Lighting Source Parameter.
- Emissive Color The color given to a surface by its self illuminating material property without a light.
- Ambient Color The color given to a surface due to a light's ambient intensity, scaled by the material's ambient reflective property. Ambient Color is not dependent on the position of the light or the viewer. Two types of ambient lights are provided, a Global Ambient Scene Light, and the ambient light intensity associated with individual lights.
- Diffuse Color The color given to a surface due to a light's diffuse intensity and scaled by the material's diffuse reflective property and the direction of the light with respect to the surface's normal. Because the diffuse light reflects in all directions, the position of the viewpoint has no effect on a surface's diffuse color.
- Specular Color The color given to a surface due to a light's specular intensity, scaled by the material's specular reflective property and the directions of the light and the viewpoint with respect to the surface's normal.
- the rate at which a material's specular reflection fades off is an exponential factor and is specified as the material's shininess factor.
- Attenuation The amount that a color's intensity from a light source fades away as a function of the distance from the surface to the light. Three factors are specified per light, a constant coefficient, a linear coefficient, and a quadratic coefficient.
- Spotlight A feature per light source that defines the direction of the light and its cone of illumination.
- a spotlight has no effect on a surface that lies outside its cone.
- the illumination by the spotlight inside the cone depends on how far the surface is from the center of the cone and is specified by a spotlight exponent factor.
- the ambient attribute of a material, A_cm, is used to scale the Global Scene Ambient Light, A_cs, to determine the global ambient effect, i.e., A_cm * A_cs.
- Each light may also have a spotlight attribute and an attenuation factor, which are expressed as follows.
- Each light can be specified to act as a spotlight.
- the result of a spotlight is to diminish the effect that a light has on a vertex based upon the distance of the vertex from the direction that the spotlight is pointed. If the light is not a spotlight then there is no effect and the spotlight factor is one.
- the parameters needed to specify a spotlight are the position of the light, P_L, the spotlight direction, S_d, the cosine of the spotlight cone angle, S_c, and the spotlight exponent.
- the ambient effect of local lights is the light's ambient intensity, A_cl, scaled by the ambient attribute of the material, A_cm.
- the diffuse light effect is determined by the position of the light with respect to the normal of the surface. It does not depend on the position of the viewpoint. It is determined by the diffuse attribute of the material, D_cm, the diffuse attribute of the light, D_cl, the position of the light, P_L, and the surface normal, N.
- L is the unit length vector from the vertex to the light position. If the light position is at infinity, then L is constant for every vertex and independent of the vertex position.
- the diffuse effect can be described as D_cl, the diffuse light, scaled by D_cm, the diffuse material attribute, and by the cosine of the angle between the light direction and the surface normal: D_cm * D_cl * max(N · L, 0).
- the cosine of the angle between the direction ofthe light and the surface normal is limited between 0 and 1. If the cosine is negative, then the diffuse effect is 0.
- the specular light effect is determined by the position of the light with respect to the normal of the surface and the position of the viewpoint. It is determined by the specular color of the material, S_cm, the specular exponent (shininess) of the material, S_rm, the specular attribute of the light, S_cl, the position of the light, P_L, the unit eye vector E (described below), and the surface normal, N.
- L is the unit length vector from the vertex to the light position. If the light position is at infinity, then L is constant for every vertex and independent of the vertex position.
- E is the unit length vector from the vertex to the viewpoint. If the viewpoint position is at infinity, E is simply the constant unit vector in the z direction, Z.
- H is the unit length vector halfway between L and E .
- when both the light and the viewpoint are at infinity, the halfway vector, H, is independent of the vertex position and is provided as a light parameter.
- the specular effect can be described as S_cl, the specular light, scaled by S_cm, the specular material attribute, and by the cosine of the angle between the surface normal and the halfway vector raised to the shininess power: S_cm * S_cl * max(N · H, 0)^S_rm.
- a light's position can be defined as having a distance of infinity from the origin but still have a vector pointing to its position. This definition is used in simplifying the calculation needed to determine the vector from the vertex to the light (in other APIs, which do not define the light's position in this way, this simplification cannot be made). If a light is at infinity, then this vector is independent of the position of the vertex, is constant for every vertex, and does not need the vertex's eye coordinates. This simplification is used for spotlights, diffuse color, and specular color.
- the viewpoint is defined as being at the origin or at infinity in the z direction. This is used to simplify the calculation for specular color. If the viewer is at infinity then the vector from the vertex to the viewpoint is independent of the position of the vertex, is constant for every vertex, and does not need the vertex's eye coordinates. This vector is then just the unit vector in the z direction, Z.
- Table 1 summarizes the calculations needed for lighting depending on whether local or infinite light position and viewer are specified.
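As a concrete illustration of the terms above, the following sketch evaluates one light's ambient, diffuse and specular contribution for a local light and local viewer (a minimal single-channel sketch; the function and helper names are illustrative, and the attenuation and spotlight factors are omitted for brevity):

```cpp
#include <cmath>
#include <algorithm>

struct Vec3 { float x, y, z; };
inline Vec3  operator-(Vec3 a, Vec3 b) { return {a.x - b.x, a.y - b.y, a.z - b.z}; }
inline Vec3  operator+(Vec3 a, Vec3 b) { return {a.x + b.x, a.y + b.y, a.z + b.z}; }
inline float dot(Vec3 a, Vec3 b)       { return a.x*b.x + a.y*b.y + a.z*b.z; }
inline Vec3  normalize(Vec3 a) { float l = std::sqrt(dot(a, a)); return {a.x/l, a.y/l, a.z/l}; }

// One light's contribution to a fragment at V with unit normal N; a single
// float stands in for each RGBA color attribute named in the text.
float light_one(Vec3 V, Vec3 N, Vec3 P_L, Vec3 P_E,
                float Acm, float Acl,                 // ambient material / light
                float Dcm, float Dcl,                 // diffuse material / light
                float Scm, float Scl, float Srm) {    // specular material / light / shininess
    Vec3 L = normalize(P_L - V);             // unit vector fragment -> light
    Vec3 E = normalize(P_E - V);             // unit vector fragment -> viewpoint
    Vec3 H = normalize(E + L);               // halfway vector
    float diff = std::max(dot(N, L), 0.0f);  // clamped cosine, as in the text
    float spec = diff > 0.0f ? std::pow(std::max(dot(N, H), 0.0f), Srm) : 0.0f;
    return Acm * Acl + Dcm * Dcl * diff + Scm * Scl * spec;
}
```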
- bump mapping produces more realistic lighting by simulating the shadows and highlights resulting from illumination of a surface on which the effect of a three dimensional texture is imposed/mapped.
- An example of such a textured surface is the pebbled surface of a basketball or the dimpled surface of a golf ball.
- In bump mapping, a texture map (e.g., a representation of the pebbled basketball surface) is used to perturb the surface normal N.
- Bump mapping requires extensions to the OpenGL standard. The theoretical basis of bump mapping is now described with reference to Figure 4. This basis is common to the two most widely used bump mapping methods: the SGI approach and the Blinn approach.
- Bump Mapping Background: Bump Mapping is defined as a perturbation of the Normal Vector, N, resulting in a perturbed Normal Vector, N'.
- the perturbed vector can be calculated by defining V' to be the location of a point, V, after it has been moved ("bumped") a distance h in the direction of the Normal, N. Defining the unit vector in the Normal direction as n = N / |N|, the bumped point is:
V' = V + h * n
- the Normal Vector can be defined as the cross product of the surface tangents, N = V_s x V_t, where V_s and V_t are the partial derivatives of the surface position V with respect to the surface parameters s and t.
- the Perturbed Normal can be defined as the cross product of the surface tangents of the bumped point:
N' = V'_s x V'_t, where V'_s = V_s + h_s * n + h * n_s and V'_t = V_t + h_t * n + h * n_t
- since h * n_s and h * n_t are relatively small, they are dropped, giving:
N' = (V_s + h_s * n) x (V_t + h_t * n)
- Basis Vectors can be calculated using the equation above.
- Basis Vectors This calculation for Basis Vectors is the one proposed by Blinn and requires Surface Tangents, a unit Normal Vector, and a cross product.
- the vertices V1 and V2 of a triangle can be described relative to V0 as:
V1 = V_s * s1 + V_t * t1
V2 = V_s * s2 + V_t * t2
where (s1, t1) and (s2, t2) are the texture space coordinates of V1 and V2 relative to V0.
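These two equations can be solved for the surface tangents V_s and V_t by inverting the 2x2 system (a sketch assuming a non-degenerate texture mapping; the names are illustrative):

```cpp
struct Vec3 { float x, y, z; };
inline Vec3 operator*(Vec3 v, float k) { return {v.x*k, v.y*k, v.z*k}; }
inline Vec3 operator-(Vec3 a, Vec3 b)  { return {a.x - b.x, a.y - b.y, a.z - b.z}; }

// Solve  V1 = Vs*s1 + Vt*t1,  V2 = Vs*s2 + Vt*t2  for Vs and Vt by
// Cramer's rule, given the triangle edges V1, V2 (relative to V0) and
// their texture-space deltas. A zero determinant means a degenerate
// (s,t) mapping and is not handled in this sketch.
void tangents_from_triangle(Vec3 V1, Vec3 V2,
                            float s1, float t1, float s2, float t2,
                            Vec3& Vs, Vec3& Vt) {
    const float inv_det = 1.0f / (s1 * t2 - s2 * t1);
    Vs = (V1 * t2 - V2 * t1) * inv_det;
    Vt = (V2 * s1 - V1 * s2) * inv_det;
}
```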
- SGI Bump Mapping Referring to Figure 5A, there is shown a functional flow diagram illustrating a bump mapping approach proposed by Silicon Graphics (SGI).
- the functional blocks include: "compute perturbed normal" SGI10, "store texture map” SGI12, “perform lighting computations” SGI14 and "transform eye space to tangent space” SGI16.
- steps SGI10 and SGI12 are performed in software and the steps SGI14 and SGI16 are performed in 3D graphics hardware.
- the step SGI16 is performed using the same hardware that is optimized to perform Phong shading.
- the SGI approach is documented in the Peercy reference.
- a key aspect of the SGI approach is that all lighting and bump mapping computations are performed in tangent space, which is a space defined for each surface/object by orthonormal vectors comprising a unit surface normal (N) and two unit surface tangents (T and B).
- the basis vectors could be explicitly defined at each vertex by an application program or could be derived by the graphics processor from a reference frame that is local to each object.
- Once the tangent space is defined, the components of the basis vectors are given in eye space.
- a standard theorem from linear algebra states that the matrix used to transform from coordinate system A (e.g., eye space) to system B (e.g., tangent space) can be formed from the coordinates of the basis vectors of system B in system A.
- a matrix M whose columns comprise the basis vectors N, T and B represented in eye space coordinates can be used to transform eye space vectors into corresponding tangent space vectors.
- this transformation is used in the SGI pipeline to enable the lighting and bump mapping computations to be done in tangent space.
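A sketch of that eye-to-tangent transformation for an orthonormal basis (T, B, N) given in eye coordinates (for an orthonormal basis this is multiplication by the transpose of the column matrix M; the helper names are illustrative):

```cpp
struct Vec3 { float x, y, z; };
inline float dot(Vec3 a, Vec3 b) { return a.x*b.x + a.y*b.y + a.z*b.z; }

// Transform an eye-space vector (e.g. the light vector L or the half-angle
// vector H) into tangent space by projecting it onto the basis vectors.
inline Vec3 eye_to_tangent(Vec3 v_eye, Vec3 T, Vec3 B, Vec3 N) {
    return Vec3{ dot(v_eye, T),    // component along the tangent
                 dot(v_eye, B),    // component along the binormal
                 dot(v_eye, N) };  // component along the normal
}
```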
- the elements employed in the illustrated SGI approach include the following:
- u, v the two coordinates of tangent space in the plane of the surface.
- f(u,v) an input texture map, a 1-, 2- or 3-dimensional array of values that define a height field in (u,v) space.
- f_u(u,v) partial derivative along the u axis of the input texture map, computed at each point of the texture map (the height field is converted to a collection of partial derivatives f_u(u,v), f_v(u,v) that gives the gradient in two directions (u and v) for each point of the height field).
- f_v(u,v) partial derivative along the v axis of the input texture map, computed at each point of the texture map (see discussion of f_u(u,v)).
- B unit surface binormal, defined as the cross product of N and T.
- in step SGI10, an input texture map comprising a set of partial derivatives f_u(u,v), f_v(u,v) is used in combination with the surface normal (N), the surface tangents (P_u, P_v) and the basis vectors B and T to compute the perturbed normal in tangent space (N'_TS) at each point of the height field.
- the coefficients a, b and c are the unnormalized components of the perturbed normal N'_TS in tangent space (i.e., the coefficient c is in the normal direction and the coefficients a and b represent perturbations to the normal in the u and v directions).
- these coefficients are stored as a texture map TMAP, which is provided to the SGI 3D hardware in a format specified by an appropriate API (e.g., OpenGL).
- the light and half angle vectors (L, H) are transformed to the tangent space using a matrix M whose columns comprise the eye space (i.e., x, y and z) coordinates of the tangent, binormal and normal (T, B, N) (SGI16):
M = [ T_x B_x N_x ]
    [ T_y B_y N_y ]
    [ T_z B_z N_z ]
- the resulting tangent space versions L_TS and H_TS of the light and half angle vectors are output to the Phong lighting and bump mapping step (SGI14) along with the input normal N and the texture map TMAP.
- the graphics hardware performs all lighting computations in tangent space using the tangent space vectors previously described.
- When bump mapping is enabled, the SGI system employs the perturbed vector N'_TS (represented by the texture map TMAP components) in the lighting computations. Otherwise, the SGI system employs the input surface normal N in the lighting computations.
- the step SGI14 involves:
- a disadvantage of the SGI approach is that it requires a large amount of unnecessary information to be computed (e.g., for vertices associated with pixels that are not visible in the final graphics image). This information includes:
- N' for each vertex of each surface
- L for each vertex of each surface
- the SGI approach is also described in the SGI OpenGL extension SGIX_fragment_lighting_space, which is incorporated herein by reference.
- Figure 5B shows a hypothetical hardware implementation of the SGI bump mapping/Phong shading approach that is proposed in the Peercy reference.
- the surface normal N and the transformed light and half-angle vectors L_TS, H_TS are interpolated at the input of the block SGI14.
- the L_TS and H_TS interpolations could be done multiple times, once for each of the active lights.
- the switch S is used to select the perturbed normal N'_TS when bump mapping is in effect or the unperturbed surface normal N when bump mapping is not in effect.
- the resulting normal and the interpolated light and half-angle vectors are then normalized, and the normalized vectors are input to the illumination computation, which outputs a corresponding pixel value.
- Referring to Figure 6A, there is shown a functional flow diagram illustrating the Blinn bump mapping approach.
- the functional blocks include: "generate gradients" B10, "compute perturbed normal" B12 and "perform lighting computations" B14.
- the step B10 is performed in software and the steps B12 and B14 are performed in dedicated bump mapping hardware.
- the Blinn approach is described in the Blinn and Peercy references.
- the elements employed in the illustrated Blinn approach include the following:
- s, t the two coordinates of the bump space grid.
- h(s,t) an input texture map, a 1-, 2- or 3-dimensional array of values that define a height field in (s,t) space.
- h_s(s,t) partial derivative along the s axis of the bump height field, computed at each point of the texture map (the API converts the height field to a collection of partial derivatives h_s(s,t), h_t(s,t) that gives the gradient in two directions (s and t) at each point of the height field).
- h_t(s,t) partial derivative along the t axis of the bump height field, computed at each point of the texture map (see discussion of h_s(s,t)).
- H half angle vector in eye space.
- b_s basis vector enabling bump gradients h_s to be mapped to eye space.
- b_t basis vector enabling bump gradients h_t to be mapped to eye space.
- the Blinn approach presumes that a texture to be applied to a surface is initially defined by a height field h(s, t).
- the Blinn approach does not directly use this height field, but requires that the texture map representing the height field be provided by the API as a set of gradients h_s(s,t) and h_t(s,t) (B10). That is, rather than providing the perturbed normal N' (as in the SGI approach), the Blinn texture map provides two scalar values h_s, h_t that represent offsets/perturbations to the normal.
- in step B12, the Blinn bump mapping approach perturbs the Normal vector N according to the following equation:
N' = N + h_s * b_s + h_t * b_t
- Figure 6B shows a hypothetical hardware implementation of the Blinn bump mapping approach that is proposed in the Peercy reference.
- disadvantages of this implementation include the multiple vector cross-products that must be computed and the required number of interpolations and normalizations.
- the extra operations are required in the Blinn approach to derive the basis vectors at each pixel (i.e., for each illumination calculation).
- the three interpolation operations applied to the cross-products are required to be wide floating point operations (i.e., 32-bit operations) due to the possibly large range of the cross-product values.
- the invention provides structure and method for performing lighting in a graphics processor.
- the invention specifically provides structure and method for performing tangent space lighting in a deferred shading architecture.
- Embodiments of the invention may also provide variable scale bump mapping, automatic basis generation, automatic gradient-field generation, and normal interpolation by doing angle and magnitude computations separately.
- the invention provides a bump mapping method for use in a deferred graphics pipeline processor comprising: receiving for a pixel fragment associated with a surface for which bump effects are to be computed: a surface tangent, binormal and normal defining a tangent space relative to the surface associated with the fragment; and a texture vector representing perturbations to the surface normal in the directions of the surface tangent and binormal caused by the bump effects at the surface position associated with the pixel fragment; computing a set of basis vectors from the surface tangent, binormal and normal that define a transformation from the tangent space to eye space in view of the orientation of the texture vector; computing a perturbed, eye space, surface normal reflecting the bump effects by performing a matrix multiplication in which the texture vector is multiplied by a transformation matrix whose columns comprise the basis vectors, giving a result that is the perturbed, eye space, surface normal; and performing lighting computations for the pixel fragment using the perturbed, eye space, surface normal, giving an apparent color for the pixel fragment that accounts for the bump effects.
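The matrix multiplication at the heart of this method can be sketched as follows (a schematic sketch, not the patent's hardware; the names and the column layout are illustrative, with the texture vector's components taken as perturbations along the tangent, binormal and normal):

```cpp
struct Vec3 { float x, y, z; };
inline Vec3 operator*(Vec3 v, float k) { return {v.x*k, v.y*k, v.z*k}; }
inline Vec3 operator+(Vec3 a, Vec3 b)  { return {a.x + b.x, a.y + b.y, a.z + b.z}; }

// Multiply the texture vector Tb = (tb_s, tb_t, tb_n) by the matrix whose
// columns are the basis vectors bs, bt, bn expressed in eye space; the
// result is the perturbed eye-space normal used in the lighting step.
inline Vec3 perturbed_eye_normal(Vec3 bs, Vec3 bt, Vec3 bn, Vec3 Tb) {
    return bs * Tb.x + bt * Tb.y + bn * Tb.z;   // column matrix times vector
}
```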
- a variable scale bump mapping method for shading a computer graphics image comprising steps of: receiving, for a vertex of a polygon associated with a surface to which bump effects are to be mapped, geometry vectors (V_s, V_t, N) and a texture vector (Tb); separating the geometry vectors into unit basis vectors (b_s, b_t, n) and magnitudes (m_bs, m_bt, m_bn); multiplying the magnitudes and the texture vector to form a texture-magnitude vector (mTb'); scaling components of the texture-magnitude vector by a vector s to form a scaled texture-magnitude vector (mTb''); and multiplying the scaled texture-magnitude vector and the unit basis vectors to provide a perturbed unit normal (N') in eye space for a pixel location, whereby the need to specify surface tangents and binormal at the pixel location to perform lighting computations to give the pixel fragment bump effects is eliminated.
- this method is further defined such that the step of multiplying the magnitudes and the texture-magnitude vector produces a transformation matrix, which enables fixed point multiplication hardware to be used. In another embodiment, this method is further defined such that the step of multiplying the magnitudes and the texture-magnitude vector produces a transformation matrix that defines a transformation from different tangent space coordinate systems to an eye space coordinate system. In still another variation, this method is performed such that the different tangent space coordinate systems are selected from known coordinate systems, including the Blinn coordinate system.
- the invention provides automatic gradient field generation.
- a variable scale bump mapping method for shading a computer graphics image comprising steps of: receiving a gray scale image for which bump effects are to be computed; taking a derivative relative to a gray scale intensity for a pixel fragment associated with the gray scale image; and computing from the derivative a perturbed unit normal in eye space to give the pixel fragment bump effects.
- In this method, the step of computing from the derivative a perturbed unit normal in eye space may optionally comprise the step of forming a transformation matrix that defines a transformation of the derivative of the gray scale intensity to an eye space coordinate system.
- the method for bump mapping for shading a computer graphics image comprises: receiving for a pixel fragment associated with a surface for which bump effects are to be computed: a magnitude vector (m), a bump vector (Tb), and a unit transformation matrix (M); multiplying the magnitude vector and the bump vector to form a texture-magnitude vector (mTb'); scaling components of the texture-magnitude vector by a vector s to form a scaled texture-magnitude vector (mTb''); multiplying the scaled texture-magnitude vector and the unit transformation matrix to provide a perturbed normal (N'); re-scaling components of the perturbed normal to form a rescaled vector (N''); and normalizing the rescaled vector to provide a unit perturbed normal that is used to perform lighting computations to give the pixel fragment bump effects.
- the step of scaling the components of the texture-magnitude vector comprises the step of selecting the scalars so that the resulting vector can be represented as a fixed-point vector.
- the unit transformation matrix also comprises fixed-point values, and wherein the step of multiplying the scaled texture-magnitude vector and the unit transformation matrix comprises the step of multiplying using fixed-point multiplication hardware.
- the step of re-scaling components of the perturbed normal comprises the step of multiplying by a reciprocal of the vector s (1/(s_s, s_t, s_n)) to re-establish a correct relationship between their values.
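A literal transcription of the claimed steps, sketched in floating point (the actual hardware is described as fixed point, and the choice of the scale vector s is left to the claim's "selecting the scalars" step); all names are illustrative:

```python
# Sketch of the scale / multiply / re-scale / normalize sequence.
def bump_normal(m, tb, m_unit, s):
    mtb = [m[i] * tb[i] for i in range(3)]           # mTb'  (component-wise)
    mtb2 = [mtb[i] * s[i] for i in range(3)]         # mTb'' (scaled by s)
    # N' = M x mTb'' where the columns of m_unit are the unit basis vectors.
    n_p = [sum(m_unit[r][c] * mtb2[c] for c in range(3)) for r in range(3)]
    # Per the claim, s must be chosen so this per-component re-scale
    # re-establishes the correct relationship between the components.
    n_pp = [n_p[i] / s[i] for i in range(3)]         # N''   (re-scaled by 1/s)
    mag = sum(c * c for c in n_pp) ** 0.5
    return [c / mag for c in n_pp]                   # unit perturbed normal
```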
- Computer graphics is the art and science of generating pictures or images with a computer. This picture generation is commonly referred to as rendering.
- The appearance of motion, for example in a 3-dimensional animation, is achieved by displaying a sequence of images.
- Interactive 3-Dimensional (3D) computer graphics allows a user to change his or her viewpoint or to change the geometry in real-time, thereby requiring the rendering system to create new images on-the-fly in real-time. Therefore, real-time performance in color, with high quality imagery is becoming increasingly important.
- the invention is directed to a new graphics processor and method and encompasses numerous substructures including specialized subsystems, subprocessors, devices, architectures, and corresponding procedures.
- Embodiments of the invention may include one or more of deferred shading, a tiled frame buffer, and multiple-stage hidden surface removal processing, as well as other structures and/or procedures.
- this graphics processor is hereinafter referred to as the DSGP (for Deferred Shading Graphics Processor), or the DSGP pipeline, but is sometimes referred to as the pipeline.
- Embodiments of the present invention are designed to provide high-performance 3D graphics with Phong shading, subpixel anti-aliasing, and texture- and bump-mapping in hardware.
- the DSGP pipeline provides these sophisticated features without sacrificing performance.
- the DSGP pipeline can be connected to a computer via a variety of possible interfaces, including, for example, an Advanced Graphics Port (AGP) and/or a PCI bus interface. VGA and video output are generally also included.
- Embodiments of the invention support both OpenGL and Direct3D APIs.
- the OpenGL specification entitled "The OpenGL Graphics System: A Specification (Version 1.2)" by Mark Segal and Kurt Akeley, edited by Jon Leech, is hereby incorporated by reference.
- Each frame (also called a scene or user frame) of 3D graphics primitives is rendered into a 3D window on the display screen.
- a window consists of a rectangular grid of pixels, and the window is divided into tiles (hereinafter tiles are assumed to be 16x16 pixels, but could be any size). If tiles are not used, then the window is considered to be one tile.
- Each tile is further divided into stamps (hereinafter stamps are assumed to be 2x2 pixels, thereby resulting in 64 stamps per tile, but stamps could be any size within a tile).
- Each pixel includes one or more samples, where each sample has its own color values and z-value (hereinafter, pixels are assumed to include four samples, but any number could be used).
- a fragment is the collection of samples covered by a primitive within a particular pixel. The term "fragment" is also used to describe the collection of visible samples.
- the renderer calculates the color value (RGB or RGBA) and z value for each pixel of each primitive, then compares the z value of the new pixel with the current z value in the Z-buffer. If the z value comparison indicates the new pixel is "in front of" the existing pixel in the frame buffer, the new pixel overwrites the old one; otherwise, the new pixel is thrown away.
- Z-buffer rendering works well and requires no elaborate hardware. However, it typically results in a great deal of wasted processing effort if the scene contains many hidden surfaces.
- the renderer may calculate color values for ten or twenty times as many pixels as are visible in the final picture. This means the computational cost of any per-pixel operation —such as Phong shading or texture-mapping — is multiplied by ten or twenty.
- the number of surfaces per pixel, averaged over an entire frame, is called the depth complexity of the frame.
- the depth complexity is a measure of the renderer's inefficiency when rendering a particular frame.
- the HSR (hidden surface removal) process can be complicated by other operations (that is, by operations other than the depth test) that can discard primitives.
- These other operations include: pixel ownership test, scissor test, alpha test, color test, and stencil test (as described elsewhere in this specification).
- Some of these operations discard a primitive based on its color (such as alpha test), which is not determined in a deferred shading pipeline until after the HSR process (this is because alpha values are often generated by the texturing process, which is part of pixel fragment coloring). For example, a primitive that would normally obscure a more distant primitive (generally one at a greater z-value) can be discarded by alpha test, thereby causing it not to obscure the more distant primitive.
- A conventional 3D graphics pipeline is illustrated in Figure 2, and the inventive 3D Deferred Shading Graphics Pipeline Version 1 (DSGPv1) is illustrated in Figure 8.
- the inventive pipeline (Figure 8) has been obtained from the generic conventional pipeline (Figure 2) by replacing the drawing intensive functions 231 with: (1) a scene memory 250 for storing the pipeline state and primitive data describing each primitive; (2) an exact hidden surface removal process 251; (3) a fragment coloring process 252; and (4) a blending process 253.
- the scene memory 250 stores the primitive data for a frame, along with their attributes, and also stores the various settings of pipeline state throughout the frame.
- Primitive data includes vertex coordinates, texture coordinates, vertex colors, vertex normals, and the like
- primitive data also includes the data generated by the setup for incremental render, which includes spatial, color, and edge derivatives.
- the scene memory 250 can be double buffered, thereby allowing the HSR process to perform computations on one frame while the floating-point intensive functions perform computations on the next frame.
- the scene memory can also be triple buffered.
- the scene memory could also be a scratchpad for the HSR process, storing intermediate results for the HSR process, allowing the HSR process to start before all primitives have been stored into the scene memory.
- every primitive is associated with the pipeline state information that was valid when the primitive was input to the pipeline. The simplest way to associate the pipeline state with each primitive is to include the entire pipeline state within each primitive.
- the preferred way to store information in the scene memory is to keep separate lists: one list for pipeline state settings and one list for primitives. Furthermore, the pipeline state information can be split into a multiplicity of sub-lists, and additions to each sub-list occur only when part of the sub-list changes.
- the preferred way to store primitives is to store a series of vertices, along with the connectivity information needed to re-create the primitives. This way of storing primitives eliminates redundant vertices that would otherwise occur in polygon meshes and line strips.
- the HSR process described relative to DSGPv1 is required to be an exact hidden surface removal (EHSR) process because it is the only place in the DSGPv1 where hidden surface removal is done.
- the exact hidden surface removal (EHSR) process 251 determines precisely which primitives affect the final color of the pixels in the frame buffer. This process accounts for changes in the pipeline state, which introduces various complexities into the process. Most of these complications stem from the per-fragment operations (ownership test, scissor test, alpha test, and the like), as described above. These complications are solved by the innovative conservative hidden surface removal (CHSR) process, described later, so that exact hidden surface removal is not required.
- This process is different from edge walk 232 and span interpolation 234 because this process must be able to efficiently generate colors for subsections of primitives. That is, a primitive may be partially visible, and therefore colors need to be generated for only some of its pixels, whereas edge walk and span interpolation assume the entire primitive must be colored.
- the HSR process may generate a multiplicity of visible subsections of a primitive, and these may be interspersed in time amongst visible subsections of other primitives.
- the fragment coloring process 252 should be capable of generating color values at random locations within a primitive without needing to do incremental computations along primitive edges or along the x-axis or y-axis.
- the blending process 253 of the inventive embodiment combines the fragment colors together to generate a single color per pixel. In contrast to the conventional z-buffered blend process 236, this blending process 253 does not include z-buffer operations because the exact hidden surface removal process 251 has already determined which primitives are visible at each sample.
- the blending process 253 may keep separate color values for each sample, or sample colors may be blended together to make a single color for the entire pixel. If separate color values are kept per sample and are stored separately into the Frame Buffer 240, then final pixel colors are generated from sample colors during the scan out process as data is sent to the digital-to-analog converter 242.
- the pipeline renders primitives, and the invention is described relative to a set of renderable primitives that include: 1) triangles, 2) lines, and 3) points.
- Polygons with more than three vertices are divided into triangles in the Geometry block, but the DSGP pipeline could be easily modified to render quadrilaterals or polygons with more sides. Therefore, since the pipeline can render any polygon once it is broken up into triangles, the inventive renderer effectively renders any polygon primitive.
- To identify what part of a 3D window on the display screen a given primitive may affect, the pipeline divides the 3D window being drawn into a series of smaller regions, called tiles and stamps. The pipeline performs deferred shading, in which pixel colors are not determined until after hidden-surface removal.
- the CHSR processes each primitive in time order and, for each sample that a primitive touches, makes a conservative decision based on various API state variables, such as the depth test and alpha test.
- One of the important features of the CHSR process is that color computation does not need to be done during hidden surface removal, even though non-depth-dependent tests from the API, such as alpha test, color test, and stencil test, can be performed by the DSGP pipeline.
- the CHSR process can be considered a finite state machine (FSM) per sample.
- each per-sample FSM is called a sample finite state machine (SFSM).
- Each SFSM maintains per-sample data including: (1) z-coordinate information; (2) primitive information (any information needed to generate the primitive's color at that sample or pixel); and (3) one or more sample state bits (for example, these bits could designate the z-value or z-values to be accurate or conservative). While multiple z-values per sample can be easily used, multiple sets of primitive information per sample would be expensive.
- the SFSM maintains primitive information for one primitive.
- the SFSM may also maintain transparency information, which is used for sorted transparencies, described in the next section.
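A greatly simplified sketch of one per-sample decision, assuming depth function "less" and treating alpha test as the only non-depth-dependent test; the state layout and names are one plausible reading, not the patent's actual SFSM:

```python
# One heavily simplified SFSM step (illustrative reading of CHSR).
def new_sfsm():
    return {"z": float("inf"), "prim": None,
            "conservative": False, "pending": []}

def sfsm_step(state, new_z, new_prim, alpha_test_on):
    if new_z < state["z"]:
        if alpha_test_on:
            # Alpha is unknown until after shading, so visibility cannot be
            # decided here; conservatively keep both candidate primitives.
            state["conservative"] = True
            state["pending"].append(new_prim)
        else:
            state["z"], state["prim"] = new_z, new_prim
            state["conservative"], state["pending"] = False, []
    return state
```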
- the DSGP can operate in two distinct modes: 1) Time Order Mode, and 2) Sorted Transparency Mode.
- Time Order Mode is described above, and is designed to preserve, within any particular tile, the same temporal sequence of primitives.
- the Sorted Transparency mode is described immediately below.
- the control of the pipeline operating mode is done in the Sort Block.
- the Sort Block is located in the pipeline between the Mode Extraction (MEX) unit and the Setup (STP) unit.
- the Sort Block operates primarily to take geometry scattered around the display window and sort it into tiles.
- the Sort Block also manages the Sort Memory, which stores all the geometry from the entire scene before it is rasterized, along with some mode information.
- Sort Memory comprises a double-buffered list of vertices and modes. One page collects a scene's geometry (vertex by vertex and mode by mode), while the other page is sending its geometry (primitive by primitive and mode by mode) down the rest of the pipeline.
- in time ordered mode, the time order of vertices and modes is preserved within each tile, where a tile is a portion of the display window bounded horizontally and vertically.
- by time order preserved, we mean that, for a given tile, vertices and modes are read in the same order as they are written.
- in sorted transparency mode, reading of each tile is divided into multiple passes, where, in the first pass, guaranteed opaque geometry is output from the sort block and, in subsequent passes, potentially transparent geometry is output from the sort block.
- time ordering is preserved, and mode data is inserted in its correct time-order location.
- Sorted transparency mode may be performed in either back-to-front or front-to-back order. In the preferred embodiment, the sorted transparency method is performed jointly by the Sort Block and the Cull Block.
- Each vertex includes a color pointer, and as vertices are received, the vertices including the color pointer are stored in sort memory data storage.
- the color pointer is a pointer to a location in the polygon memory vertex storage that includes a color portion of the vertex data.
- the MLM (Material-Lighting-Mode) pointer set includes six main pointers plus two other pointers, as described below.
- Each of the six main pointers comprises an address into the polygon memory state storage, which is a sequential store of all of the state that has changed in the pipeline (for example, changes to the texture, the pixel, the lighting, and so forth), so that, whenever the need arises, the state needed to render a vertex (or the object formed from one or more vertices) can be recreated from the MLM pointers associated with the vertex, by looking up the MLM pointers and retrieving from the polygon memory state storage the state that existed at that time.
- the Mode Extraction Block is a logic block between Geometry and Sort that collects temporally ordered state change data, stores the state in Polygon memory, and attaches appropriate pointers to the vertex data it passes to Sort Memory.
- In the normal OpenGL pipeline, and in embodiments of the inventive pipeline up to the Sort block, geometry and state data is processed in the order in which it was sent down the pipeline. State changes for material type, lighting, texture, modes, and stipple affect the primitives that follow them. For example, each new object will be preceded by a state change to set the material parameters for that object. In the inventive pipeline, on the other hand, fragments are sent down the pipeline in tile order after the Cull block.
- the Mode Injection Block figures out how to preserve state in the portion of the pipeline that processes data in spatial (tile) order instead of time order.
- the Mode Extraction Block sends a subset of the Mode data (cull_mode) down the pipeline for use by Cull.
- Cull_mode packets are produced in Geometry Block.
- Mode Extraction Block inserts the appropriate color pointer in the Geometry packets.
- Pipeline state is broken down into several categories to minimize storage, as follows: (1) Spatial pipeline state includes data headed for Sort that changes every vertex; (2) Cull_mode state includes data headed for Cull (via Sort) that changes infrequently; (3) Color includes data headed for Polygon memory that changes every vertex; (4) Material includes data that changes for each object; (5) TextureA includes a first set of state for the Texture Block for textures 0 and 1; (6) TextureB includes a second set of state for the Texture Block for textures 2 through 7; (7) Mode includes data that hardly ever changes; (8) Light includes data for Phong; and (9) Stipple includes data for polygon stipple patterns. Material, Texture, Mode, Light, and Stipple data are collectively referred to as MLM data (for Material, Light and Mode). We are particularly concerned with the MLM pointers for state preservation.
- Color data, along with the appropriate pointers to MLM data, is also written to Polygon Memory.
- the spatial data is sent to Sort, along with a pointer into Polygon Memory (the color pointer).
- Color and MLM data are all stored in Polygon memory. Allocation of space for these records can be optimized in the micro-architecture definition to improve performance.
- Each primitive entry in Sort Memory contains a Color Pointer to the corresponding Color entry in Polygon Memory.
- the Color Pointer includes a Color Address, Color Offset and Color Type that allows us to construct a point, line, or triangle and locate the MLM pointers.
- the Color Address points to the final vertex in the primitive. Vertices are stored in order, so the vertices in a primitive are adjacent, except in the case of triangle fans.
- The first dualoct in the vertex list contains pointers to the MLM data for the points, lines, strip, or fan in the vertex list.
- the subsequent dualocts in the vertex list contain Color data entries. For triangle fans, the three vertices for the triangle are at Color Address, (Color Address-1), and (Color Address - Color Offset +1). Note that this is not quite the same as the way pointers are stored in Sort memory.
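The stated address arithmetic for triangle fans can be captured directly; a small sketch (integer dualoct addresses, illustrative names):

```python
# Vertex addresses for a fan triangle, per the Color Pointer arithmetic above.
def fan_triangle_vertices(color_address, color_offset):
    return (color_address,                      # final vertex
            color_address - 1,                  # previous vertex
            color_address - color_offset + 1)   # fan's shared first vertex
```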
- State is a time varying entity, and MEX accumulates changes in state so that state can be recreated for any vertex or set of vertices.
- the MIJ block is responsible for matching state with vertices downstream. Whenever a vertex comes into MEX and certain indicator bits are set, a subset of the pipeline state information needs to be saved. Only the states that have changed are stored, not all states, since the complete state can be created from the cumulative changes to state.
- the six MLM pointers for Material, TextureA, TextureB, Mode, Light, and Stipple identify address locations where the most recent changes to the respective state information are stored. Each change in one of these states is identified by an additional entry at the end of a sequentially ordered state storage list stored in a memory. Effectively, all state changes are stored, and when the particular state corresponding to a point in time (or receipt of a vertex) is needed, the state is reconstructed from the pointers.
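A minimal sketch of this delta-list scheme, assuming each MLM category keeps a sequential list of partial-state records and each pointer is simply the index of the most recent record (names are illustrative):

```python
# State reconstruction from cumulative change lists (illustrative sketch).
state_lists = {"Material": [], "TextureA": [], "TextureB": [],
               "Mode": [], "Light": [], "Stipple": []}

def record_change(category, partial_state):
    """Append only the changed fields; the returned index is the MLM pointer."""
    state_lists[category].append(dict(partial_state))
    return len(state_lists[category]) - 1

def reconstruct(category, mlm_pointer):
    """Fold all records up to and including the pointed-to one."""
    full = {}
    for rec in state_lists[category][:mlm_pointer + 1]:
        full.update(rec)
    return full
```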
- These packets of mode data that are saved are referred to as mode packets, although the phrase is used to refer both to the mode data changes that are stored and to the larger sets of mode data that are retrieved or reconstructed by MIJ prior to rendering.
- Polygon memory vertex storage stores just the color portion.
- Polygon memory stores the part of the pipeline state that is not needed for hidden surface removal, and it also stores the part of the vertex data which is not needed for hidden surface removal (predominantly the items needed to compute colors).
- the inventive structure and method may advantageously make use of trilinear mapping of multiple layers (resolutions) of texture maps.
- Texture maps are stored in a Texture Memory which may generally comprise a single-buffered memory loaded from the host computer's memory using the AGP interface.
- a single polygon can use up to four textures.
- Textures are MIP-mapped. That is, each texture comprises a series of texture maps at different levels of detail or resolution, each map representing the appearance of the texture at a given distance from the eye point.
- the Texture block performs tri-linear interpolation from the texture maps, to approximate the correct level of detail.
- the Texture block can alternatively perform other interpolation methods, such as anisotropic interpolation.
- the Texture block supplies interpolated texture values (generally as RGBA color values) to the Phong block on a per-fragment basis.
- Bump maps represent a special kind of texture map. Instead of a color, each texel of a bump map contains a height field gradient.
- the multiple layers are MIP layers, and interpolation is within and between the MIP layers.
- the first interpolation is within each layer; then one interpolates between the two adjacent layers, one nominally having resolution greater than required and the other having resolution less than required, so that the interpolation is done 3-dimensionally to generate an optimum resolution.
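A sketch of the tri-linear lookup described here, assuming each MIP level is a 2D grid of scalar texels and the desired level of detail may be fractional (names are illustrative):

```python
# Tri-linear MIP interpolation: bilinear within two levels, linear between them.
def bilinear(tex, u, v):
    w, h = len(tex[0]), len(tex)
    x, y = u * (w - 1), v * (h - 1)
    x0, y0 = int(x), int(y)
    x1, y1 = min(x0 + 1, w - 1), min(y0 + 1, h - 1)
    fx, fy = x - x0, y - y0
    top = tex[y0][x0] * (1 - fx) + tex[y0][x1] * fx
    bot = tex[y1][x0] * (1 - fx) + tex[y1][x1] * fx
    return top * (1 - fy) + bot * fy

def trilinear(mip_levels, u, v, level):
    lo = min(int(level), len(mip_levels) - 1)
    hi = min(lo + 1, len(mip_levels) - 1)
    f = level - lo
    return bilinear(mip_levels[lo], u, v) * (1 - f) + \
           bilinear(mip_levels[hi], u, v) * f
```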
- the inventive pipeline includes a texture memory which includes a texture cache, really a texel reuse register, so called because the structure and operation are different from conventional caches.
- the host also includes storage for texture, which may typically be very large, but in order to render a texture, it must be loaded into the texture cache, which is also referred to as texture memory.
- S and T texture coordinates are associated with each VSP.
- the inventive structure provides a set of eight content addressable (memory) caches running in parallel. In one embodiment, the cache identifier is one of the content addressable tags, and that is the reason the tag part of the cache and the data part of the cache are located separately. Conventionally, the tag and data are co-located so that a query on the tag gives the data.
- the tags and data are split up and indices are sent down the pipeline.
- the data and tags are stored in different blocks, and the content addressable lookup is a lookup or query of an address, where the "data" stored at that address is itself an index that references the actual data, which is stored in a different block.
- the indices are determined, and sent down the pipeline so that the data referenced by the index can be determined.
- the tag is in one location
- the texture data is in a second location
- the indices provide a link between the two storage structures.
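A sketch of this split organization, assuming a simple dictionary stands in for the content addressable tag store; only the index travels down the pipeline (names are illustrative):

```python
# Split tag/index/data texture cache (illustrative sketch).
class SplitTextureCache:
    def __init__(self):
        self.tag_to_index = {}   # content-addressable tag store (one block)
        self.data = []           # texel data store (a different block)

    def lookup(self, tag, fetch_texel):
        idx = self.tag_to_index.get(tag)
        if idx is None:                       # miss: fetch and insert
            idx = len(self.data)
            self.data.append(fetch_texel(tag))
            self.tag_to_index[tag] = idx
        return idx                            # index is sent down the pipeline

    def resolve(self, idx):
        return self.data[idx]                 # downstream block reads the texel
```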
- Texel Reuse Detection Registers comprise a multiplicity of associative memories, generally located on the same integrated circuit as the texel interpolator.
- the texel reuse detection method is performed in the Texture Block.
- an object in some orientation in space is rendered.
- the object has a texture map on it, and it is represented by many triangle primitives.
- the procedure implemented in software will instruct the hardware to load the particular object texture into a texture memory.
- more than one texture map may be retrieved and stored in the memory; for example, two or several maps may be stored, depending on the available memory, the size of the texture maps, the need to store or retain multiple texture maps, and the sophistication of the management scheme.
- in conventional systems, spatial object coherence is of primary importance. At least for an entire single object, and typically for groups of objects using the same texture map, all of the triangles making up the object are processed together. The phrase spatial coherency is applied to such a scheme because the triangles form the object and are connected in space, and are therefore spatially coherent.
- in the inventive deferred shader structure and method, we do not necessarily rely on or derive appreciable benefit from this type of spatial object coherence.
- Embodiments of the inventive deferred shader operate on tiles instead. Any given tile might have an entire object, a plurality of objects, some entire objects, or portions of several objects, so that spatial object coherence over the entire tile is typically absent.
- the pipeline and texture block are advantageously capable of changing the texture map on the fly, in real-time, in response to the texture required for the object primitive (e.g. triangle) received. Any requirement to repeatedly retrieve the texture map from the host in order to process the particular object primitive (for example, a single triangle) just received, and then dispose of that texture when the next object primitive needing a different texture map arrives, would be problematic to say the least and would preclude fast operation.
- a sizable memory is supported on the card.
- 128 megabytes are provided, but more or fewer megabytes may be provided.
- 32 MB, 64 MB, 256 MB, 512 MB, or more may be provided, depending upon the needs of the user, the real estate available on the card for memory, and the density of memory available.
- the inventive structure and method stores texture maps and reuses them when there is a reasonable chance they will be needed again.
- the invention reuses texels that have been read over and over: once one fragment requiring a particular texture map has been seen, chances are good that, for some period of time afterward while we are in the same tile, another fragment from the same object will need the same texture. So those texels are saved in this cache, and the needed ones are looked up from the cache (texture reuse register) on the fly. If there is a cache miss, for example when a fragment and texture map are encountered for the first time, that texture map is retrieved and stored in the cache.
- Fragment coloring is performed for two-dimensional display space and involves an interpolation of the color from, for example, the three vertices of a triangle primitive to the sampled sub-sample of the displayed pixel.
- fragment coloring involves applying an interpolation function to the colors at the three fragment vertices to determine a color for a location spatially located between or among the three vertices. Typically, but optionally, some account will be taken of the perspective correctness in performing the interpolation.
- surface normals are interpolated based on linear interpolation of the input normals.
- linear interpolation of the composite surface normals may provide adequate accuracy; however, considering a two-dimensional interpolation example, when one vector (surface normal) has, for example, a larger magnitude than the other vector, but a comparable angular change, the resultant vector will be overly influenced by the larger magnitude vector in spite of the comparable angular difference between the two vectors. This may result in objectionable error; for example, some surface shading or lighting calculation may produce an anomalous result and detract from the output scene.
- the magnitude is interpolated separately from the direction or angle.
- the interpolated magnitudes are computed, and then the direction vectors, which are of equal (unit) size, are interpolated.
- the separately interpolated magnitudes and directions are then recombined, and the direction is normalized. While ideal angular interpolation would provide the greatest accuracy, the interpolation involves three points on the surface of a sphere and various great-circle calculations. This sort of mathematical complexity is not well suited for real-time fast pipeline processing.
- the single step linear interpolation is much easier but is susceptible to greater error.
- the inventive surface normal interpolation procedure has greater accuracy than conventional linear interpolation, and lower computational complexity than conventional angular interpolation.
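A sketch of the separate magnitude/direction interpolation between two vertex normals with blend weight t (illustrative names; the pipeline interpolates per fragment, typically from three vertices):

```python
# Interpolate magnitude and direction separately, then recombine.
import math

def interp_normal(n0, n1, t):
    m0 = math.sqrt(sum(c * c for c in n0))
    m1 = math.sqrt(sum(c * c for c in n1))
    mag = m0 * (1 - t) + m1 * t                 # interpolated magnitude
    d = [a / m0 * (1 - t) + b / m1 * t          # blend the unit directions
         for a, b in zip(n0, n1)]
    dm = math.sqrt(sum(c * c for c in d))
    return [c * mag / dm for c in d]            # normalize, re-apply magnitude
```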
- variable scale bump mapping involves one or both of two separate procedures: automatic basis generation and automatic gradient field generation.
- automatic gradient field generation takes a derivative, relative to gray scale intensity, of a gray scale image, and uses that derivative as a surface normal perturbation to generate a bump for a bump map.
- Automatic basis generation saves computation, memory storage in polygon memory, and input bandwidth in the process.
- an s, t pair and a surface normal are specified. But the s and t are not colors; rather, they are two-dimensional surface normal perturbations to the texture map, which is therefore a texture bump map.
- the s and t are used to specify the directions in which to perturb the surface normals in order to create a usable bump map.
- the s,t give us an implied coordinate system and reference from which we can specify perturbation direction.
- Use of the s,t coordinate system at each pixel eliminates any need to specify the surface tangent and the bi-normal at the pixel location. As a result, the inventive structure and method save computation, memory storage and input bandwidth.
- the background describes two exemplary approaches to performing bump mapping in a conventional 3D graphics system. These approaches compute for each vertex of a surface a perturbed surface normal N' that accounts for bump effects and then employ in lighting computations the perturbed normal N' instead of the input surface normal N.
- One of the approaches attempts to reduce the number of bump mapping computations by storing in a texture map precomputed components of the perturbed normals N' of the surfaces involved in the lighting computation.
- the components of the perturbed surface normals N' are defined in "tangent space", which differs from the "eye space" in which many elements of the lighting equation are defined.
- the SGI approach performs all lighting computations in tangent space. This allows the perturbed normals N' to be used directly from the texture map. However, this also requires that vectors used in the lighting equation (e.g., the light and half-angle vectors L and H) first be transformed from eye space to tangent space. As described in the background, this transformation is done for each vertex using a transformation matrix comprising surface tangent, binormal and normal vectors (T, B, N).
- the SGI approach performs all graphics processing steps prior to the final pixel output step one primitive (i.e., polygon, triangle, etc.) at a time.
- Another result of this approach is that unnecessary, numerically intensive tangent space transformations and lighting computations are likely to be performed for hidden surfaces whose pixels will be discarded in the z-buffer removal step.
- Another result of this approach is that in the SGI pipeline there is no need to retain any of the lighting state for primitives other than the one being currently processed.
- implementing the SGI approach in a DSGP would require the graphics pipeline to retain the lighting state for all visible surfaces. Retaining this lighting state could require significant storage per fragment. For this reason, it would not be practical to implement the SGI approach in a deferred shading environment.
- the present invention is a system and method for performing tangent space lighting in a DSGP.
- the present invention is a system and method for performing bump mapping and lighting computations in eye space using texture information represented in tangent space.
- One embodiment encompasses blocks of the DSGP that preprocess data (referred to collectively as the preprocessor hereinafter) and a Phong shader (implemented as hardware or software).
- the preprocessor receives texture maps specified in a variety of formats and converts those texture maps to a common format for use by the Phong shader.
- the preprocessor also provides basis vectors (b_s, b_t, n), a vector Tb that represents texture/bump data in tangent/object space, light data, material data, eye coordinates, and other information used by the Phong shader to perform the lighting and bump mapping computations.
- the data from the preprocessor is provided for each fragment for which lighting effects need to be computed.
- the Phong shader computes the RGBA value for the pixels in a fragment using the information provided by the preprocessor.
- the Phong shader performs all lighting computations in eye space, which requires it first to transform bump data from tangent space to eye space.
- the Phong hardware does this by multiplying a matrix M, whose columns comprise the eye space basis vectors (b_s, b_t, n), and the vector Tb of bump map data.
- the eye space basis vectors are defined by the DSGP preprocessor so that the multiplication (M x Tb) gives the perturbed normal N' in eye space in accordance with the Blinn bump mapping equation given below.
- the Phong shader uses the resulting perturbed normal N' in the lighting equations.
- One advantage of this approach over the prior art is that it is necessary to transform only a single vector (the perturbed normal) to eye space whereas, in the SGI approach, it is necessary to transform both the light and half angle vectors (L, H) to tangent space for multiple lights.
- the preprocessor provides the basis vectors (b_s, b_t, n) as a set of unit vectors together with their magnitudes (m_bs, m_bt, m_bn), so that the perturbed normal can be written as:
- N' = m_bn n + m_bs h_s b_s - m_bt h_t b_t
- the Phong shader performs the bump mapping and lighting computations using floating point hardware.
- the Phong shader is optimized to store each component of the matrix M' as a fixed point value.
- This enables the Phong shader to be configured to perform all or a substantial portion of the matrix multiplication M x Tb using fixed point hardware, which reduces hardware complexity.
- a significant advantage of the present invention is that the Phong shader does not need to interpolate any vectors (e.g., the tangent space perturbed normal N', light L, or half angle H vectors). Instead, the preprocessor performs whatever vertex interpolations are necessary and provides the interpolated vectors to the Phong shader, referenced to the (s, t) bump grid, along with a fragment located at the same grid position. This greatly reduces the complexity of the bump operations, which, as a result, can be integrated with the Phong shader whether implemented in hardware or software.
- the preprocessor performs vector interpolation by separating each vector into a unit vector and an associated magnitude, interpolating the unit vectors and magnitudes separately, and combining the interpolated unit vector and magnitude.
- This procedure is more accurate and produces fewer artifacts than when non-normalized vectors are directly interpolated, as in the prior art.
- one artifact that results from normalizing non-unit vectors is an approximation error directly related to the magnitudes of the vectors being interpolated.
- the preprocessor passes the Phong shader at least one packet of texture information (a texel) for each fragment to be illuminated.
- a texel provides the bump mapping data to be used for the fragment.
- the information content of a texel used to provide bump mapping data depends on the format of the texture information provided to the DSGP. For example, when the texture information is provided in the SGI format, the texel vector Tb provides the components n'_x, n'_y, n'_z of the perturbed surface normal. When the input is provided in the Blinn format, the texel vector Tb provides the surface gradients h_s, h_t of the unperturbed surface normal.
- the Phong hardware determines the perturbed normal in eye space by multiplying the matrix M by a vector Tb that comprises the three texel components (n'_x, n'_y, n'_z).
- the Phong hardware determines the perturbed normal in eye space by multiplying the matrix M by a vector Tb that comprises the two texel components h_s, h_t and a third component that is 1.
- the third component that is 1 accounts for the fact that the Blinn approach applies the height gradients (h_s, h_t) to the unperturbed surface normal.
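Forming the texel vector Tb from the two stated formats is then a small dispatch; a sketch (the format tags and field names are illustrative):

```python
# Build Tb from a texel, per the SGI and Blinn input formats described above.
def make_tb(texel, fmt):
    if fmt == "SGI":      # texel carries the perturbed normal components
        return [texel["nx"], texel["ny"], texel["nz"]]
    if fmt == "Blinn":    # texel carries height gradients; third component is 1
        return [texel["hs"], texel["ht"], 1.0]
    raise ValueError("unknown texture format: " + fmt)
```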
- the preprocessor passes the Phong hardware the following fragment information for each fragment being illuminated: tangent space components n_x, n_y, n_z of the surface normal unit vector; magnitude m_n of the surface normal; surface tangent unit vector b_s along the s tangent space axis; surface tangent unit vector b_t along the t tangent space axis; surface tangent b_s magnitude; surface tangent b_t magnitude; and eye coordinates x, y, z.
- Figure 1 is a diagrammatic illustration showing a tetrahedron, with its own coordinate axes, a viewing point's coordinate system, and screen coordinates.
- Figure 2 is a diagrammatic illustration showing a conventional generic renderer for a 3D graphics pipeline.
- Figure 3 is a diagrammatic illustration showing elements of a lighting computation performed in a 3D graphics system.
- Figure 4 is a diagrammatic illustration showing elements of a bump mapping computation performed in a 3D graphics system.
- Figure 5A is a diagrammatic illustration showing a functional flow diagram of portions of a 3D graphics pipeline that performs SGI bump mapping.
- Figure 5B is a diagrammatic illustration showing a functional block diagram of portions of a 3D graphics pipeline that performs SGI (Silicon Graphics Computer Systems) bump mapping.
- Figure 6A is a diagrammatic illustration showing a functional flow diagram of a generic 3D graphics pipeline that performs "Blinn" bump mapping.
- Figure 6B is a diagrammatic illustration showing a functional block diagram of portions of a 3D graphics pipeline that performs Blinn bump mapping.
- Figure 7 is a diagrammatic illustration showing an embodiment of the inventive 3- Dimensional graphics pipeline, particularly showing the relationship of the Geometry Engine 3000 with other functional blocks and the Application executing on the host and the Host Memory.
- Figure 8 is a diagrammatic illustration showing a first embodiment of the inventive 3-Dimensional Deferred Shading Graphics Pipeline (DSGPv1).
- Figure 9 is a diagrammatic illustration showing an exemplary block diagram of an embodiment of the pipeline, highlighting the major functional units in the front-end Command Fetch and Decode Block (CFD) 2000.
- Figure 10 shows the flow of data through one embodiment of the DSGP 1000.
- Figure 11 shows an example of how the Cull block produces fragments from a partially obscured triangle.
- Figure 12 demonstrates how the Pixel block processes a stamp's worth of fragments.
- Figure 13 is a diagrammatic illustration highlighting the manner in which one embodiment of the Deferred Shading Graphics Processor (DSGP) transforms vertex coordinates.
- Figure 14 is a diagrammatic illustration highlighting the manner in which one embodiment of the Deferred Shading Graphics Processor (DSGP) transforms normals, tangents, and binormals.
- Figure 15 is a diagrammatic illustration showing a functional block diagram of the Geometry Block (GEO).
- Figure 16 is a diagrammatic illustration showing relationships between functional blocks on semiconductor chips in a three-chip embodiment of the inventive structure.
- Figure 17 is a diagrammatic illustration showing exemplary data flow in one embodiment of the Mode Extraction Block (MEX).
- Figure 18 is a diagrammatic illustration showing packets sent to and from an exemplary Mode Extraction Block.
- Figure 19 is a diagrammatic illustration showing an embodiment of the on-chip state vector partitioning of the exemplary Mode Extraction Block.
- Figure 20 is a diagrammatic illustration showing aspects of a process for saving information to polygon memory.
- Figure 21 is a diagrammatic illustration showing DSGP triangles arriving at the STP Block, which can be rendered in the aliased or anti-aliased mode.
- Figure 22 is a diagrammatic illustration showing the manner in which DSGP renders lines by converting them into quads and various quads generated for the drawing of aliased and anti-aliased lines of various orientations.
- Figure 23 is a diagrammatic illustration showing the manner in which the user specified point is adjusted to the rendered point in the Geometry Unit.
- Figure 24 is a diagrammatic illustration showing the manner in which anti-aliased line segments are converted into a rectangle in the CUL unit scan converter that rasterizes the parallelograms and triangles uniformly.
- Figure 25 is a diagrammatic illustration showing the manner in which the end points of aliased lines are computed using a parallelogram, as compared to a rectangle in the case of anti-aliased lines.
- Figure 26 is a diagrammatic illustration showing an aspect of how Setup represents lines and triangles, including the vertex assignment.
- Figure 27 is a diagrammatic illustration showing an aspect of how Setup represents lines and triangles, including the slope assignments.
- Figure 28 is a diagrammatic illustration showing an aspect of how Setup represents lines and triangles, including the quadrant assignment based on the orientation of the line.
- Figure 29 is a diagrammatic illustration showing how Setup represents lines and triangles, including the naming of the clip descriptors and the assignment of clip codes to vertices.
- Figure 30 is a diagrammatic illustration showing an aspect of how Setup represents lines and triangles, including aspects of how Setup passes particular values to CUL.
- Figure 31 is a diagrammatic illustration of exemplary embodiments of tag caches which are fully associative and use Content Addressable Memories (CAMs) for cache tag lookup.
- Figure 32 is a diagrammatic illustration showing the manner in which mode data flows and is cached in portions of the DSGP pipeline.
- Figure 33 is a diagrammatic illustration of an exemplary embodiment of the Fragment Block.
- Figure 34 is a diagrammatic illustration showing examples of VSPs with the pixel fragments formed by various primitives.
- Figure 35 is a diagrammatic illustration showing aspects of Fragment Block interpolation using perspective corrected barycentric interpolation for triangles.
- Figure 36 shows an example of how interpolating between vectors of unequal magnitude may result in uneven angular granularity and why the inventive structure and method does not interpolate normals and tangents this way.
- Figure 37 is a diagrammatic illustration showing how the fragment x and y coordinates used to form the interpolation coefficients in the Fragment Block are formed.
- Figure 38 is a diagrammatic illustration showing an overview of texture array addressing.
- Figure 39 is a diagrammatic illustration showing the Phong unit position in the pipeline and relationship to adjacent blocks.
- Figure 40 is a diagrammatic illustration showing the flow of information packets to Phong 14000 from Fragment 11000 and Texture 12000, and from Phong to Pixel 15000.
- Figure 41 is a diagrammatic illustration showing a block diagram of Phong comprising several sub-units.
- Figure 42 is a diagrammatic illustration showing a functional flow diagram of processing performed by the Texture Computation block 14114 of Figure 41.
- Figure 43 is a diagrammatic illustration of a portion of the inventive DSGP involved with computation of bump and lighting effects, emphasizing computations performed in the Phong block 14000.
- Figure 44 is a diagrammatic illustration showing the functional flow of a bump computation performed by one embodiment of the bump unit 14130 of Figure 43.
- Figure 45 is a diagrammatic illustration showing the functional flow of a method used to compute a perturbed surface normal within one embodiment of the bump unit 14130 that can be implemented using fixed-point operations.
- Figure 46 is a diagrammatic illustration showing a block diagram of the PIX block.
- Figure 47 is a diagrammatic illustration showing the BackEnd Block (BKE) and units interfacing to it.
- Figure 48 is a diagrammatic illustration showing external client units that perform memory read and write through the BKE.
- the pipeline takes data from the host computer's I/O bus, processes it, and sends it to the computer's display.
- the pipeline is divided into twelve blocks, plus three memory stores and the frame buffer.
- Figure 15 shows the flow of data through the pipeline 1000. The blocks that make up the pipeline are discussed below.
- Command Fetch and Decode (CFD) 2000 handles communication with the host computer through the I/O bus. It converts its input into a series of packets, which it passes to the Geometry block. Most of the input stream consists of geometrical data — lines, points, and polygons. The descriptions of these geometrical objects can include colors, surface normals, texture coordinates, and so on.
- the input stream also contains rendering information, such as lighting, blending modes, and buffer functions.
- the Geometry block 3000 handles four major tasks: transforms, decomposition of all polygons into triangles, clipping, and per-vertex lighting calculations needed for Gouraud shading.
- the Geometry block transforms incoming graphics primitives into a uniform coordinate space ("world space"). Then it clips the primitives to the viewing volume, or frustum. In addition to the six planes that define the viewing volume (left, right, top, bottom, front and back), the DSGP pipeline provides six user-definable clipping planes.
- the Geometry block breaks polygons with more than three vertices into sets of triangles, to simplify processing.
- the Geometry block calculates the vertex colors that the Fragment block uses to perform the shading.
- c. Mode Extraction (MEX)
- the Mode Extraction block 4000 separates the data stream into two parts: 1) vertices, and 2) everything else. Vertices are sent to the Sort block. The "everything else” — lights, colors, texture coordinates, and so on — is stored in a special buffer called the Polygon Memory, where it can be retrieved by the Mode Injection block.
- the Polygon Memory is double buffered, so the Mode Injection block can read data for one frame, while the Mode Extraction block is storing data for the next frame.
- the mode data stored in the Polygon Memory falls into three major categories: per-frame data (such as lighting), per-primitive data (such as material properties) and per-vertex data (such as color).
- the Mode Extraction block sends the Sort block a packet containing the vertex data and a pointer into the Polygon Memory.
- the pointer is called the color pointer, which is somewhat misleading, since it is used to retrieve all sorts of other information besides color.
- the packet also contains fields indicating whether the vertex represents a point, the endpoint of a line, or the corner of a triangle.
- the vertices are sent in a strict time sequential order, the same order in which they were fed into the pipeline.
- the packet also specifies whether the current vertex forms the last one in a given primitive (i.e., "completes" the primitive). In the case of triangle strips or fans, and line strips or loops, the vertices are shared between adjacent primitives. In this case, the packets indicate how to identify the other vertices in each primitive.
- the Sort block 6000 receives vertices from the Mode Extraction block and sorts the resulting points, lines, and triangles by tile.
- in the double-buffered Sort Memory 7000, it maintains a list of vertices representing the graphic primitives, and a set of Tile Pointer Lists, one list for each tile in the frame.
- for each tile a primitive touches, the Sort block adds a pointer to the vertex to that tile's Tile Pointer List.
- When the Sort block has finished sorting all the geometry in a frame, it sends the data to Setup.
- Each Sort block output packet represents a complete primitive. Sort sends its output in tile-by-tile order: all of the primitives that touch a given tile, then all of the primitives that touch the next tile, and so on. Note that this means that Sort may send the same primitive many times, once for each tile it touches.
- the Setup block 8000 calculates spatial derivatives for lines and triangles. It processes one tile's worth of data, one primitive at a time. When it's done with a primitive, it sends the data on to the Cull block.
- the Setup block also breaks stippled lines into separate line segments (each a rectangular region), and computes the minimum z value for each primitive within the tile.
- Each primitive packet output from Setup represents one primitive: a triangle, line segment or point.
- the Cull block 9000 is one of the more complex blocks, and processing is divided into two steps: Magnitude Comparison Content Addressable Memory (MCCAM) Cull, and Subpixel Cull.
- The Cull block accepts data one tile's worth at a time.
- the MCCAM Cull discards primitives that are hidden completely by previously processed geometry.
- the Subpixel Cull takes the remaining primitives (which are partly or entirely visible), and determines the visible fragments.
- the Subpixel Cull outputs one stamp's worth of fragments at a time, called a Visible Stamp Portion (VSP).
- Figure 16 shows an example of how the Cull block produces fragments from a partially obscured triangle.
- a Visible Stamp Portion produced by the Cull block contains fragments from only a single primitive, even if multiple primitives touch the stamp. Therefore, in the diagram, the output VSP contains fragments from only the gray triangle. The fragment formed by the tip of the white triangle is sent in a separate VSP, and the colors of the two VSPs are combined later, in the Pixel block.
- Each pixel in a VSP is divided up into a number of samples to determine how much of the pixel is covered by a given fragment.
- the Pixel block uses this information when it blends the fragments to produce the final color for the pixel.
- the Mode Injection block 10000 retrieves mode information —such as colors, material properties, and so on — from the Polygon Memory 5000 and passes it downstream as required. To save bandwidth, the individual downstream blocks cache recently used mode information.
- the Mode Injection block keeps track of what information is cached downstream, and only sends information as necessary.
- the Fragment block 11000 is somewhat misleadingly named, since its main work is interpolation. It interpolates color values for Gouraud shading, surface normals for Phong shading, and texture coordinates for texture mapping. It also interpolates surface tangents for use in the bump mapping algorithm, if bump maps are in use.
- the Fragment block performs perspective corrected interpolation using barycentric coefficients.
- the Texture block 12000 applies texture maps to the pixel fragments. Texture maps are stored in the Texture Memory 13000. Unlike the other memory stores described previously, the Texture Memory is single-buffered. It is loaded from the host computer's memory using the AGP interface. A single polygon can use up to four textures.
- Textures are mip-mapped. That is, each texture comprises a series of texture maps at different levels of detail, each map representing the appearance of the texture at a given distance from the eye point. To produce a texture value for a given pixel fragment, the Texture block performs tri-linear interpolation from the texture maps, to approximate the correct level of detail. The Texture block also performs other interpolation methods, such as anisotropic interpolation.
- the Texture block supplies interpolated texture values (generally as RGBA color values) to the Phong block on a per-fragment basis.
- Bump maps represent a special kind of texture map. Instead of a color, each texel of a bump map contains a height field gradient.
- the Phong block 14000 performs Phong shading for each pixel fragment. It uses the material and lighting information supplied by the Mode Injection block, the texture colors from the Texture block, and the surface normal generated by the Fragment block to determine the fragment's apparent color. If bump mapping is in use, the Phong block uses the interpolated height field gradient from the Texture block to perturb the fragment's surface normal before shading.
- the Pixel block 15000 receives VSPs, where each fragment has an independent color value.
- the Pixel block performs pixel ownership test, scissor test, alpha test, stencil operations, depth test, blending, dithering, and logic operations on each sample in each pixel (see OpenGL Spec 1.1, Section 4.1, "Per-Fragment Operations," p. 109).
- the Pixel block When the Pixel block has accumulated a tile's worth of finished pixels, it blends the samples within each pixel (thereby performing antialiasing of pixels) and sends them to the Backend, to be stored in the framebuffer.
- Figure 17 demonstrates how the Pixel block processes a stamp's worth of fragments.
- the Pixel block receives two VSPs, one from a gray triangle and one from a white triangle. It then blends the fragments and the background color to produce the final pixels. It weights each fragment according to how much of the pixel it covers, or, to be more precise, by the number of samples it covers.
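A sketch of this coverage-weighted blend, assuming the fragments within a pixel cover disjoint samples (names are illustrative):

```python
# Blend fragments into a final pixel color by sample coverage.
def blend_pixel(fragments, background, samples_per_pixel=4):
    covered = sum(f["samples"] for f in fragments)
    color = [0.0, 0.0, 0.0]
    for f in fragments:
        w = f["samples"] / samples_per_pixel    # weight = fraction of samples
        color = [c + w * fc for c, fc in zip(color, f["rgb"])]
    # Uncovered samples contribute the background color.
    w_bg = (samples_per_pixel - covered) / samples_per_pixel
    return [c + w_bg * bc for c, bc in zip(color, background)]
```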
- the Pixel Processing block performs stencil testing, alpha blending, and antialiasing of pixels. When it accumulates a tile's worth of finished pixels, it sends them to the Backend, to be stored in the framebuffer.
- the Backend 16000 receives a Tile's worth of pixels at a time from the Pixel block, and stores them into the framebuffer 17000.
- the Backend also sends a Tile's worth of pixels back to the Pixel block, because specific framebuffer values can survive from frame to frame. For example, stencil bit values can remain constant over many frames, but can be used in all those frames.
- the Backend performs 2D drawing and sends the finished frame to the output devices. It provides the interface between the framebuffer and the computer monitor and video output.
- the AGI block is responsible for implementing all the functionality mandated by the AGP and/or PCI specifications in order to send and receive data to host memory or the CPU. This block should completely encapsulate the asynchronous boundary between the AGP bus and the rest of the chip.
- the AGI block should implement the optional Fast Write capability in the AGP 2.0 spec in order to allow fast transfer of commands by PIO.
- the AGI block is connected to the Read/Write Controller, the DMA Controller and the Interrupt Control Registers on CFD.
- the CFD block is the unit between the AGP interface and the hardware that actually draws pictures. It contains many control and data movement units, with little to no math. Most of what the CFD block does is route data for other blocks. Commands and textures for the 2D, 3D, Backend, and Ring come across the AGP bus and are routed by the front end to the units which consume them.
- CFD does some decoding and unpacking of commands, manages the AGP interface, gets involved in DMA transfers, and retains some state for context switches. It is one of the least similar, but most essential, components of the DSGP system.
- Figure 9 shows a block diagram of the pipeline showing the major functional units in the CFD block 2000.
- the front end of the DSGP graphics system is broken into two sub-units, the AGI block and the CFD block.
- the rest of this section will be concerned with describing the architecture of the CFD block. References will be made to AGI, but they will be in the context of requirements which CFD has in dealing with AGI.
- the GEO block is the first computation unit at the front end of the graphical pipeline. It deals mainly with per-vertex operations, like the transformation of vertex coordinates and normals.
- the Frontend (i.e., the AGI and CFD blocks) deals with fetching and decoding the Graphics Hardware Commands.
- the Frontend loads the necessary transform matrices, material and light parameters and other mode settings into the input registers of the GEO block.
- the GEO block sends transformed vertex coordinates, normals, generated and/or transformed texture coordinates, and per-vertex colors, to the Mode Extraction and Sort blocks.
- Mode Extraction stores the "color" data and modes in the Polygon memory. Sort organizes the per-vertex "spatial" data by Tile and writes it into the Sort Memory.
- Operation Modes
- the pipeline can operate in maximum performance mode when only a certain subset of its features is in use. In this mode, the GEO block carries out only a subset of all possible operations for each primitive. As more features are enabled, the pipeline moves through a series of lower-performance modes. The Geometry engine reuses the available computational elements to process primitives at a slower rate for the non-performance mode settings. The mapping of features to performance modes is described in the following sections.
- the GEO block operates on vertices that define geometric primitives: points, lines, triangles, quads, and polygons. It performs coordinate transformations and Gouraud shading operations on a per-vertex basis. Only during the Primitive Assembly phase does it group vertices together into lines and triangles (in the process, it breaks down quads and polygons into sets of triangles). It performs clipping and surface tangent generation for each primitive.
- Vertex Coordinate Transformation
- each vertex is specified by a set of object coordinates (Xo, Yo, Zo, Wo). The addition of the fourth coordinate enables the vertices to be expressed in homogeneous coordinates. In a homogeneous system, a series of transformations involving rotation, scaling and translation can be combined in a single transform matrix called the Model-View matrix. The vertex object coordinates are transformed to vertex eye coordinates by multiplying them with the 4x4 Model-View matrix:
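- the multiplication referred to above has the standard homogeneous form (subscripts o and e denote object and eye coordinates; the matrix entries combine the rotation, scaling, and translation):

$$\begin{pmatrix} X_e \\ Y_e \\ Z_e \\ W_e \end{pmatrix} = M_{\mathrm{Model\text{-}View}} \begin{pmatrix} X_o \\ Y_o \\ Z_o \\ W_o \end{pmatrix}$$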
- Figure 13 summarizes how the DSGP transforms vertex coordinates.
- the GEO block may have to process a current normal, current texture coordinates, and current color for each vertex. Normals affect the lighting calculations.
- the current normal is a three-dimensional vector (Nxo, Nyo, Nzo). Texture coordinates determine how a texture image is mapped onto a primitive.
- the GEO block transforms and renormalizes these as it does the normal. It can also generate these vectors if the user doesn't supply them.
- the GEO block generates the tangent using the texture coordinates and the vertex eye coordinates, and the binormal from a cross product of the normal and the tangent.
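- a minimal per-triangle sketch of this construction (in C++; the helper names are hypothetical and degenerate texture mappings are not handled, so this is illustrative rather than the GEO block's actual datapath):

    struct Vec3 { float x, y, z; };

    static Vec3 sub(Vec3 a, Vec3 b)    { return {a.x - b.x, a.y - b.y, a.z - b.z}; }
    static Vec3 add(Vec3 a, Vec3 b)    { return {a.x + b.x, a.y + b.y, a.z + b.z}; }
    static Vec3 scale(Vec3 a, float s) { return {a.x * s, a.y * s, a.z * s}; }
    static Vec3 cross(Vec3 a, Vec3 b) {
        return {a.y * b.z - a.z * b.y, a.z * b.x - a.x * b.z, a.x * b.y - a.y * b.x};
    }

    // Derive a tangent from the vertex eye coordinates p[] and the (s, t)
    // texture coordinates at the three vertices, then take the binormal as
    // the cross product of the normal and the tangent, as the text states.
    void tangentBasis(const Vec3 p[3], const float s[3], const float t[3],
                      Vec3 normal, Vec3& tangent, Vec3& binormal) {
        Vec3 e1 = sub(p[1], p[0]), e2 = sub(p[2], p[0]);
        float ds1 = s[1] - s[0], ds2 = s[2] - s[0];
        float dt1 = t[1] - t[0], dt2 = t[2] - t[0];
        float inv = 1.0f / (ds1 * dt2 - ds2 * dt1);  // assumes a non-degenerate mapping
        // The tangent points along the direction of increasing s on the surface.
        tangent  = scale(add(scale(e1, dt2), scale(e2, -dt1)), inv);
        binormal = cross(normal, tangent);
    }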
- the GEO block produces tangents and binormals needed for bump mapping at half rate.
- Figure 14 summarizes how DSGP transforms normals, tangents, and binormals.
- GEO Geometry Block
- Figure 16 is a diagrammatic illustration showing relationships between functional blocks on semiconductor chips in a three-chip embodiment of the inventive structure.
- the current color determines the vertex color.
- the GEO block uses the vertex normal, lighting and material parameters to evaluate the vertex color.
- the material colors can also be derived optionally from the current color. Colors are specified as four values: R, G, B, and A; or a single color index value. Colors are converted by CFD to floating point numbers before they are used in the GEO block. At the end of the vertex lighting evaluation, the resulting colors are clamped back into eight-bit fixed point representing a range of 0.0 to 1.0, inclusive.
- Texture Coordinate Processing
- texture coordinates can also be generated using vertex coordinates or the normal instead of being provided by the user.
- a transformation matrix can be optionally applied to the texture coordinates. Texture coordinates are specified using the homogeneous coordinates named s, t, r, and q.
- the transformation matrix is a 4x4 matrix. In the performance case, the resulting q is 1, r is ignored, and s and t are used to access the texture map. At reduced performance, q is used to divide the texture coordinates for perspective scaling.
- the texture coordinate r is used for three dimensional textures and shadows. Up to eight sets of texture coordinates are supported in the GEO block. Two texture coordinates can be generated and transformed at half performance. Five texture coordinates can be handled at one-third of the full performance rate. Finally, all eight texture coordinates can be generated and transformed at quarter performance rate.
- the GEO block compares vertex clip coordinates to the clip planes to generate outcodes. It uses these outcodes to reject primitives that are outside the view volume (for example, if all of the vertices in a primitive are above the top clipping plane, the primitive is rejected). Some primitives cannot be trivially rejected even if they are completely outside of the view volume. If the outcodes indicate that the primitive is entirely inside the view volume and doesn't intersect any clipping planes, the primitive is accepted and no further clipping calculations are required.
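- a sketch of the outcode test (in C++; the bit assignments are illustrative, assuming the canonical view volume -w <= x, y, z <= w):

    typedef unsigned int uint;
    enum : uint { OUT_LEFT = 1, OUT_RIGHT = 2, OUT_BOTTOM = 4,
                  OUT_TOP = 8, OUT_NEAR = 16, OUT_FAR = 32 };

    // One bit per clip plane that the vertex is outside of.
    uint outcode(float x, float y, float z, float w) {
        uint code = 0;
        if (x < -w) code |= OUT_LEFT;   if (x > w) code |= OUT_RIGHT;
        if (y < -w) code |= OUT_BOTTOM; if (y > w) code |= OUT_TOP;
        if (z < -w) code |= OUT_NEAR;   if (z > w) code |= OUT_FAR;
        return code;
    }

    // Trivial reject: all vertices lie outside the same plane.
    bool triviallyRejected(uint c0, uint c1, uint c2) { return (c0 & c1 & c2) != 0; }
    // Trivial accept: every vertex is inside all planes; no clipping needed.
    bool triviallyAccepted(uint c0, uint c1, uint c2) { return (c0 | c1 | c2) == 0; }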
- the window coordinates of the current vertex and previous vertices are used to determine the face direction of polygons and optionally perform back face culling.
- the primary color includes the Ambient, the Emissive and the Diffuse components of the color, attenuated and highlighted by spotlights. It has Red, Green, Blue, and Alpha components (RGBA). All lights and the current material settings contribute to the primary color.
- the Fragment block interpolates the primary and secondary colors separately.
- the primary color is blended with the texture color before the secondary color is applied for a given fragment to determine the final pixel color.
- the GEO block does not do any extra work.
- the DSGP pipeline supports both Phong and Gouraud shading simultaneously for separate lights. This significantly increases the total number of lights (using Gouraud shading) while improving the quality of the lighting with up to eight Phong lights.
- Phong uses the GEO block Primary and Secondary color output as the "current" colors for color material.
- the Mode Extraction block (MEX) in conjunction with the Mode Injection (MIJ) block is responsible for the management of graphics state related information.
- the state changes are incremental; that is, the value of a state parameter remains in effect until it is changed. Therefore, the applications only need to update the parameters that change.
- the rendering is linear; that is, primitives are rendered in the order received. Points, lines, triangle strips, triangle fans, polygons, quads, and quad strips are examples of graphical primitives.
- state changes are accumulated until the spatial information for a primitive is received, and those accumulated states are in effect during the rendering of that primitive.
- the Geometry (GEO) block receives the primitives in order, performs all vertex operations (transformations, vertex lighting, clipping, and primitive assembly), and sends the data down the pipeline.
- the Sort block receives the time ordered data and bins it by the tiles it touches. (Within each tile, the list is in time order.)
- the CUL block receives the data from the SRT block in tile order, and culls out parts of the primitives that definitely do not contribute to the rendered images.
- the CUL block generates the VSPs.
- a VSP corresponds to the visible portion of a polygon on the stamp.
- the TEX and PHG units receive the VSPs and are responsible for the texturing and lighting of the fragments respectively.
- the last block, i.e. the Pixel block consumes the VSPs and the fragment colors to generate the final picture.
- MEX is a logic block between Geometry and Sort blocks that collects and saves the temporally ordered state change data, and attaches appropriate pointers to the primitive vertices in order to associate the correct state with the primitive when it is rendered.
- the Mode Injection (MIJ) block is responsible for the retrieval of the state and any other information associated with the state pointer (in this document, generally called the MLM Pointer) when it is needed. It is also responsible for the repackaging of the information as appropriate. An example of the repackaging occurs when the vertex data in polygon memory is retrieved and bundled into triangle input packets for the Fragment block.
- the graphics state affects the appearance of the rendered primitives. Different parts of the DSGP pipeline use different state information. Here, we are only concerned with the pipeline stages downstream from the GEO block. DSGP breaks up the graphics state into several categories based on how that state information is used by the various pipeline stages. The proper partitioning of the state is very important. It can affect the performance (by becoming bandwidth and access limited), size of the chips (larger caches and/or logic complications), and the pin count.
- the MEX block is responsible for the following:
- the state saved in Polygon memory is the one used by the blocks downstream from MIJ, e.g. Fragment, Texture, Phong and Pixel blocks. This state is partitioned as described elsewhere in this description.
- the MIJ is responsible for the following:
- 1. Forwarding control packets, such as BeginTile, to the Fragment and Pixel units. 2. Associating the state with each VSP received from the CUL block.
- Mode injection thus deals with the retrieval of state as well as the per-vertex data needed for computing the final colors for each fragment in the VSP.
- the VertexModes packet contains the mode information generated by the host computer (i.e., software) that MEX attaches to each spatial packet before it is passed on to the Sort block.
- the VertexModes packet includes: line width, point size, line stipple information, and depth test operation control bits.
- the Spatial packet contains the window coordinates of the vertex and other per-vertex information generated by the Geometry block such as the start bit for the stipple pattern for line primitives.
- the spatial packet includes: window coordinates of the vertex, polygon winding, vertex reuse in polygon fans and strips, edge flags, and blending operation control bits (such as alpha test and alpha blending).
- the vertex modes are generated by software.
- Geometry block receives the cull modes and vertex modes from software. It sends cull and vertex modes to MEX as described above.
- MEX constructs a spatial packet for Sort by attaching the vertex modes to the spatial packet.
- MEX block also attaches state MLM Pointers to this packet before passing it on to the Sort block.
- the MEX block collapses the line width and point width parameters into one parameter, since the primitive cannot be both a point and a line at the same time. It uses the Sort primitive type to determine if the primitive is a point, a line, or a polygon. If the primitive is a point, it sends the point width down to Sort; otherwise it sends the line width. Other fields are left untouched.
- Texturing has many parameters, especially when multiple textures are included, so it is advantageous to have a multiplicity of texture packets.
- the texture parameter packets contain information needed for retrieval and filtering of texels. This document assumes there are eight possible textures assigned to each vertex. TexA parameter packet contains parameters for the first two textures and TexB parameter packet contains the same (per-texture) information for up to 6 additional textures.
- Per-texture information includes: texture ID, number of texture dimensions (i.e., 1D, 2D, or 3D), texture size (i.e., width, height, and depth), texture border information, texture format, texture filter control bits, texture wrapping control bits, texture clamping control bits, level of detail control bits, and texture comparison operation control bits.
- the TexA packet contains one or two of these entries and the TexB packet can contain up to 6 entries.
- TexA and TexB packets are generated by the software and sent to MEX via the GEO block.
- MEX manages TexA and TexB as two state partitions, and saves them in the Polygon memory.
- Each TexA and TexB state partition has a pointer associated with it.
- Mode Injection block retrieves these packets as needed later on. Geometry block does not use any of this information.
- Given the texture id, its (s, t, r) coordinates, and the mipmap level, the Texture block is responsible for retrieving the texels, and unpacking and filtering the texel data as needed. The Fragment block sends the texture id, s, t, r, and mip level, as well as the texture mode information, to the Texture block. Note that s, t, r, and mip level coming from Fragment are floating point values. For each texture, the TEX block outputs one 36 bit texel value to PHG. The Texture block does not combine the fragment and texture colors; that happens in the Phong block. The Texture block needs the texture parameters and the texture coordinates. Texture parameters are obtained from the two texture parameter caches in the Texture block. The Fragment block uses the texture width and height parameters in the miplevel computation. Fragment uses the TextureDimension field to determine the texture dimension and whether it is enabled (0 means that the texture is disabled), and the TexCoordSet field to associate a coordinate set with it.
- the "lighting" partition of the state contains information for a multiplicity of lights (hereinafter, this document assumes a maximum of 8 lights) used in fragment lighting computations as well as the global state affecting the lighting of a fragment such as the fog parameters etc.
- Light cache packet includes the following per-light information: light type, attenuation constants, spotlight parameters, light positional information, and light color information (including ambient, diffuse, and specular colors).
- the light cache packet also includes the following global lighting information: global ambient lighting, fog parameters, and number of lights in use.
- a light cache entry is about 300 bytes (approximately 300 bits for each of the eight lights, plus 120 bits of global light modes).
- the LightCache packet is generated by the software and sent to MEX via the GEO block.
- MEX manages the LightCache packet as one of the state partitions, and saves it in the Polygon memory when necessary.
- the LightCache state partition has a pointer associated with it.
- Mode injection block retrieves this packet from polygon memory as needed later on. Geometry block does not use any of this information.
- per-light cache entries could be used rather than caching the entire lighting state. This would allow less data to be transmitted down the pipeline when there is a light parameter cache miss.
- application programs would be provided
- the material partition of the graphics state contains all the information about the material used in fragment lighting computation. Note that the fragment material state is different from the material state attached to the vertex of a primitive. The fragment-material state information is not used during the vertex lighting computations performed in the GEO block.
- This packet includes: texture enable control bits (selection of active textures), texture environment parameters, material color parameters (emissive, ambient, diffuse, and specular colors, and shininess), shininess cutoff value, and color material parameters.
- Pixel modes affect the per-fragment operations in the PIX block.
- Software creates the pixel mode packet and it is sent to MEX via GEO.
- MEX saves the packet in Polygon memory.
- MIJ retrieves the packet, and sends it to the PIX block.
- Pixel modes include the following information: frame buffer write masks (depth, color, and stencil masks), blending operations, depth function, stencil function, and scissor operations.
- the stipple packet specifies the polygon stipple pattern. It is efficient for the stipple pattern to be cached separately because it is not used often, and when used, does not change often. It is a large number of bytes (usually 128 bytes, due to the need for a 32 x 32 bit pattern), so including it in any other parameter cache would add a large additional overhead to the associated packet.
- the fragment block interpolates the supplied per-vertex data and generates the information needed for the blocks downstream from the Fragment block.
- the interpolated parameters may consist of some or all of the possible parameters depending on the state pointer attached to the VSP.
- the packet size stored into Polygon Memory is variable, depending on the number and type of parameters used for a particular vertex.
- These parameters include: primitive type, vertex reuse to construct polygon fans and strips, unclipped vertex x, y, and 1/w values, vertex eye coordinates (Xeye, Yeye, Zeye), inverse perspective term, vertex primary and secondary colors, vertex normal vector, tangent vector, binormal vector, and up to 8 sets of texture coordinates.
- the normal, tangent, and binormal vectors can each be represented as either a single vector or as a unit vector (i.e., the vector's direction) and a corresponding magnitude.
- Unclipped vertex x, y, and 1/w values are particularly useful because interpolated primitive parameters (such as colors, normals, texture coordinates, etc.) can be generated from the original vertex parameters of the primitive, even if the primitive gets clipped to the display screen.
- new vertices are created in order to keep all primitives on-screen. This would usually require all vertex parameters to be interpolated at these new vertex locations (along the display screen edges), which is an expensive set of operations.
- the interpolation of these parameters at clip-generated vertices is avoided by storing clipped values into Sort Memory (i.e., the spatial x, y, and z values), but storing unclipped vertex parameters into Polygon Memory.
- the Geo block generates per-vertex information that is stored in polygon memory.
- the MIJ block is responsible for retrieving the needed state and vertices from the polygon memory in order to reconstruct the primitive that includes the VSP.
- triangle vertex texture coordinates are sent to Fragment unit and not the texture unit.
- the texture unit receives the interpolated and perspective corrected texture coordinates for each fragment from the Fragment block.
- MEX receives a sequence of packets from GEO. For each primitive, MEX first receives the relevant state packets and then it receives the geometry packets. (Color vertex information is received before the sort vertex information.)
- the sort vertex data consists of the information needed for sorting and culling of primitives such as the clipped window coordinates.
- the VtxMode packet contains information about the depth test, etc. The information in the CullMode, VtxMode and sort vertex packets is sent to the Sort-Setup-Cull part of the pipeline.
- the "color" vertex data consists of information needed for lighting and texturing of primitive fragments such as the vertex eye-coordinates, vertex normals, texture coordinates etc and is saved in polygon memory to be retrieved later.
- the Sort-Setup-Cull part of the pipeline converts the primitives into VSPs. These VSPs are then textured and lit by the Fragment-Texture-Phong part of the pipeline.
- the VSPs output from the Cull block to MIJ block are not necessarily ordered by primitives. In most cases, they will be in the VSP scan order on the tile, i.e. the VSPs for different primitives may be interleaved.
- the Fragment-Texture-Phong part of the pipeline needs to know which primitive a particular VSP belongs to, as well as the graphics state at the time that primitive was first introduced.
- MEX associates a "color pointer" with each Sort Vertex (which is then passed on to each VSP in this primitive).
- MIJ decodes the pointer, and retrieves the needed information from the polygon memory.
- MEX thus needs to accumulate any state changes that have happened since the last state save.
- the state changes become effective as soon as a vertex is encountered.
- MEX keeps a state vector on chip. This state vector has 10 partitions as shown in Figure 19.
- MEX needs nearly 1170 bytes of on-chip memory to store the state vector.
- VertexModes are held in a register in MEX and are appended to the vertices passed on to the Sort-Setup-Cull part of the pipeline.
- the CullModes are sent to Sort as the Mex2SrtCullModePkt packet.
- MEX keeps a dirty bit and a pointer (into polygon memory) for each partition in the state vector. Thus there are 10 dirty bits and 9 mode pointers, since cull modes do not get saved in the polygon memory and therefore do not require a pointer. Every time MEX receives an input packet corresponding to a state partition from the Geo block, it updates that partition in the state vector. MEX also sets the dirty bit corresponding to that partition.
- when MEX receives a color vertex, it examines the dirty bits to see if any part of the state has been updated since the last save. All state partitions that have been updated and are relevant to the rendering of the current primitive are saved to the polygon memory and their pointers updated. Their dirty bits are also cleared. Note that the dirty bits are only cleared for the partitions that are saved to the polygon memory. Which TextureA, TextureB, and Material gets saved to the polygon memory depends on the "face" of the primitive and the dirty bits. This is schematically outlined in Figure 20.
- MEX constructs a composite color pointer called the MLM Pointer containing the pointer to the last saved location of the applicable TextureA, TextureB, Material, Light, Stipple, and PixelMode.
- This pointer is attached to the vertices passed on to the Sort block.
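- the dirty-bit bookkeeping can be sketched as follows (in C++; the partition count, the stub allocator, and the relevance test are illustrative placeholders for the mechanisms this specification defines elsewhere):

    #include <array>

    constexpr int kPartitions = 9;  // partitions saved to Polygon memory
                                    // (cull modes are not saved)

    struct MexState {
        std::array<bool, kPartitions>         dirty{};
        std::array<unsigned int, kPartitions> pointer{};  // last saved location
        unsigned int nextFree = 0;

        // Stub: in hardware this writes the partition's current contents
        // into Polygon memory and returns the new address.
        unsigned int saveToPolygonMemory(int) { return nextFree += 16; }

        // On a color vertex: save each dirty, relevant partition, update its
        // pointer, and clear its dirty bit; the set of last-saved pointers
        // then forms the composite MLM Pointer attached to the vertex.
        std::array<unsigned int, kPartitions> buildMlmPointer(
                const std::array<bool, kPartitions>& relevant) {
            for (int p = 0; p < kPartitions; ++p) {
                if (dirty[p] && relevant[p]) {
                    pointer[p] = saveToPolygonMemory(p);
                    dirty[p] = false;  // cleared only for saved partitions
                }
            }
            return pointer;
        }
    };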
- Sort Block
- i. Functional Overview of the SRT Block
- the Sort Block is located in the pipeline between Mode Extraction (MEX) and Setup (STP).
- the primary function of the Sort Block is to take geometry scattered around the display window and sort it into tiles.
- the Sort Block manages the Sort Memory, which stores all the geometry for an entire scene before it is rasterized, along with a small amount of mode information.
- the Sort Memory is a double buffered list of vertices and modes. One page collects a scene's geometry while the other page is read, tile by tile, and sent down the pipeline.
- the window (the display area on the screen) is divided horizontally and vertically into a set of tiles, and Sort keeps an ordered list for each tile.
- vertices and modes are written sequentially into the Sort Memory as they are received by the Sort Block.
- when a page of Sort Memory is read, it is read on a tile-by-tile basis.
- the read process operates in two modes: 1) Time Order Mode; and 2) Sorted Transparency Mode.
- in Time Order Mode, the time order of vertices and modes is preserved within each tile. That is, for a given tile, vertices and modes are read in the same order as they are written.
- in Sorted Transparency Mode, the reading of each tile is divided into multiple passes, where, in the first pass, guaranteed opaque geometry is output from the Sort Block, and, in subsequent passes, potentially transparent geometry is output from the Sort Block.
- within each pass, the time ordering is preserved, and mode data is inserted in its correct time-order location.
- the beginning of a frame is designated by the reception of a MEX Output Begin Frame Packet, and always corresponds to the start of a user frame (that is, the application is starting to draw a new picture). These begin frame packets are passed from Sort down the pipeline to Setup when Sort Memory Pages are swapped.
- the ending of a frame is designated by the reception of a MEX Output End Frame Packet, but only corresponds to the end of a user frame if a memory overflow did not occur and software did not force the user frame to split. A memory overflow occurs when either Sort Memory or Polygon Memory becomes full.
- the Sort Block receives and outputs Sort Primitives, which are: points, lines, and triangles.
- a Sort Primitive triangle can be either a filled triangle or a line mode triangle.
- primitives are sorted according to Cull Primitives, which include: points, lines, filled triangles, and lines that are edges of triangles.
- edges of line mode triangles are considered separate primitives. If a line mode triangle is received by the Sort Block, it is sorted according to the tiles its edges touch. Any edge of the triangle (that has its LineFlag TRUE) causes the entire triangle to be sorted into the tiles that the edge touches, but a triangle with multiple edges in the same tile only causes one Pointer Entry per tile. This reduces the number of primitives per tile, because, for example, if a large line mode triangle surrounds several tiles without any of its edges touching the tiles, no Cull Primitives are read for this triangle in these tiles.
- the Cull Primitive is further described in the Setup Block document, but the CullType parameter is essentially the SortPrimitiveType parameter with an additional bit to choose amongst the three edges of a line mode triangle.
- the Setup (STP) block receives a stream of packets from the Sort (SRT) block. These packets have spatial information about the primitives to be rendered. The output of the STP block goes to the Cull (CUL) block.
- the primitives received from SRT can be filled triangles, line triangles, lines, stippled lines, and points. Each of these primitives can be rendered in aliased or antialiased mode.
- the SRT block sends primitives to STP (and other pipeline stages downstream) in tile order. Within each tile the data is organized in time order or in sorted transparency order.
- the CUL block receives data from the STP block in tile order (in fact in the order that STP receives primitives from SRT), and culls out parts of the primitives that definitely do not contribute to the rendered images. This is accomplished in two stages.
- the first stage, MCCAM Cull, allows detection of those elements in a rectangular memory array whose content is greater than a given value.
- the second stage refines this search by doing a sample-by-sample content comparison.
- the STP block prepares the incoming primitives for processing by the CUL block.
- STP produces a tight bounding box and minimum depth value Zmin for the part of the primitive intersecting the tile for MCCAM culling.
- MCCAM cull stage marks the stamps in the bounding box that may contain depth values less than Zmin.
- the Z cull stage takes these candidate stamps, and if they are a part of the primitive, computes the actual depth value for samples in that stamp. This more accurate depth value is then used for comparison and possible discard on a sample by sample basis.
- STP also computes the depth gradients, line slopes, and other reference parameters such as depth and primitive intersection points with the tile edge for the Z cull stage.
- the CUL unit produces the VSPs used by the other pipeline stages.
- Polygons arriving at the STP block are essentially triangles.
- the triangles can be rendered in the aliased or anti-aliased mode.
- Figure 21 shows DSGP triangles.
- the STP unit processes the aliased and anti-aliased triangles identically.
- the pipeline units downstream render aliased triangles by locating all samples at the center of the pixel.
- the sample locations are determined by the SampleLocSel parameters passed down with one of the control packets.
- a sample belongs to the triangle if it falls within the geometric boundary of the triangle. If the sample falls exactly on the edge of the triangle, then the inclusion rules are used to determine whether or not that sample belongs to the triangle.
- DSGP renders lines by converting them into quads.
- Figure 22 shows various quads generated for the drawing of aliased and anti-aliased lines of various orientations.
- the width of the lines is rounded to the nearest supported width.
- the width adjustment needs to be done prior to the SORT stage. It can be done by the software. STP does not modify the incoming line widths.
- quads are generated differently for aliased and anti-aliased lines.
- quad vertices also depend on whether the line is x-major or y-major.
- DSGP renders anti-aliased points as circles and aliased points as squares.
- the circles are centered at the user specified position.
- the diameter of the circle is the width specified by the user rounded to the nearest supported width.
- the user specified position of the point is snapped to the center of the pixel or rounded to a corner of the pixel depending on whether the resulting width is odd or even respectively.
- the adjustment of point size and position should happen in the pipeline prior to the SORT block. Since the position of the point is subject to transformations, Geometry unit seems like the right place to do this.
- Figure 23 shows the rendered point.
- the user specified point is indicated by the circle.
- Setup converts the line segments into parallelograms, which consist of four vertices.
- a triangle has three vertices.
- Setup describes each primitive with a set of four points. Note that not all values are needed for all primitives.
- Setup uses top, bottom, and either left or right corner, depending on the triangle's orientation.
- a line segment is treated as a parallelogram, so Setup uses all four points.
- Figures 26-30 show how Setup represents triangles and lines. Note that while the triangle's vertices are the same as the original vertices, Setup generates new vertices to represent the lines as quads.
- the unified representation of primitives uses primitive descriptors which are assigned to the original set of vertices in the window coordinates.
- flags which indicate which descriptors have valid and meaningful values: VtxYmin, VtxYmax, VtxLeftC, VtxRightC, LeftCorner, and RightCorner.
- these descriptors are obtained by sorting the triangle vertices by their y coordinates. For line segments these descriptors are assigned when the line quad vertices are generated.
- VtxYmin is the vertex with the minimum y value.
- VtxYmax is the vertex with the maximum y value.
- VtxLeftC is the vertex that lies to the left of the long y-edge (the edge of the triangle formed by joining the vertices VtxYmin and VtxYmax) in the case of a triangle, and to the left of the diagonal formed by joining the vertices VtxYmin and VtxYmax for parallelograms.
- VtxRightC is the vertex that lies to the right of the long y-edge in the case of a triangle, and to the right of the diagonal formed by joining the vertices VtxYmin and VtxYmax for parallelograms. If the triangle is such that the long edge is also the right edge, then the flag RightCorner is FALSE (0) indicating that the VtxRightC is invalid.
- the corresponding descriptors for the x-sorted representation are: VtxXmin, VtxXmax, VtxTopC, VtxBotC, TopCorner, and BottomCorner.
- VtxXmin is the vertex with the minimum x value.
- VtxXmax is the vertex with the maximum x value.
- VtxTopC is the vertex that lies above the long x-edge (the edge joining vertices VtxXmin and VtxXmax) in the case of a triangle, and above the diagonal formed by joining the vertices VtxXmin and VtxXmax for parallelograms. If the triangle is such that the long x-edge is also the top edge, then the flag TopCorner is FALSE (0), indicating that the VtxTopC is invalid. Similarly, VtxBotC is the vertex that lies below the long x-edge in the case of a triangle, and below the diagonal formed by joining the vertices VtxXmin and VtxXmax for parallelograms. If the long x-edge is also the bottom edge, then the flag BottomCorner is FALSE (0), indicating that VtxBotC is invalid.
- Figure 26 shows the vertex assignment graphically.
- the slopes (∂x/∂y) of the four polygon edges are represented as {SlYmaxLeft, SlYmaxRight, SlLeftYmin, SlRightYmin}, and the inverses of the slopes (∂y/∂x) as {rSlXminTop, rSlXminBot, rSlTopXmax, rSlBotXmax}.
- the slope naming convention used is SlStrtEnd, where Sl stands for slope, Strt is the first vertex identifier, and End is the second vertex identifier of the edge.
- SlYmaxLeft is the slope of the left edge, connecting VtxYmax and VtxLeftC. When the long edge is also the left edge, SlYmaxLeft is the slope of the long edge.
- the letter r in front indicates that the slope is reciprocal, i.e. represents (∂y/∂x) instead of (∂x/∂y).
- Figure 27 shows the slope assignments graphically.
- Setup starts with a set of vertices, (x0, y0, z0), (x1, y1, z1), and (x2, y2, z2).
- the three indices i0, i1, and i2 for the vertices sorted by y (in ascending order) are determined, as are the indices j0, j1, and j2 for the vertices sorted by x (in ascending order).
- indices i0, i1, and i2 are used to compute a set of (∂x/∂y) derivatives.
- indices j0, j1, and j2 are used to compute the (∂y/∂x) derivatives for the edges.
- edge-on triangles, i.e. triangles having two edges with equal slopes, are also handled. Whether the middle vertex is on the left or the right is determined by comparing the slope dx2/dy of the line formed by vertices v[i2] and v[i1] with the slope dx0/dy of the line formed by vertices v[i2] and v[i0]. If (dx2/dy > dx0/dy), then the middle vertex is to the right of the long edge; otherwise it is to the left of the long edge. The computed values are then assigned to the primitive descriptors. Assigning the x descriptors is similar. We thus have the edge slopes and vertex descriptors we need for the processing of triangles.
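- the y-sort and the left/right test can be sketched as follows (in C++; ties and degenerate edges are ignored for brevity):

    #include <algorithm>

    struct Vtx { float x, y, z; };

    // Sort vertex indices by ascending y, then decide whether the middle
    // vertex lies to the right of the long edge by comparing dx/dy slopes,
    // exactly as described above.
    void classifyTriangle(const Vtx v[3], int& i0, int& i1, int& i2,
                          bool& middleOnRight) {
        int idx[3] = {0, 1, 2};
        std::sort(idx, idx + 3, [&](int a, int b) { return v[a].y < v[b].y; });
        i0 = idx[0]; i1 = idx[1]; i2 = idx[2];
        float dx2dy = (v[i1].x - v[i2].x) / (v[i1].y - v[i2].y);  // edge v[i2]-v[i1]
        float dx0dy = (v[i0].x - v[i2].x) / (v[i0].y - v[i2].y);  // long edge v[i2]-v[i0]
        middleOnRight = (dx2dy > dx0dy);
    }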
- Depth gradients are the partial derivatives of z along the x- and y-axes. We use the following equations:

$$\frac{\partial z}{\partial x} = \frac{(z_1 - z_0)(y_2 - y_0) - (z_2 - z_0)(y_1 - y_0)}{(x_1 - x_0)(y_2 - y_0) - (x_2 - x_0)(y_1 - y_0)}$$

$$\frac{\partial z}{\partial y} = \frac{(x_1 - x_0)(z_2 - z_0) - (x_2 - x_0)(z_1 - z_0)}{(x_1 - x_0)(y_2 - y_0) - (x_2 - x_0)(y_1 - y_0)}$$
- Setup receives 26 bits (s25) for each vertex z-value from Sort unit.
- the partial derivatives are computed as 1.24.10 precision values.
- the x, y coordinates are 14 bit integers with precision corresponding to the (8x8) sub-raster grid per pixel.
- the partial derivatives are computed on the scale of the sub-raster grid.
- the "factor” is passed down to from the SRT block.
- the (r * unit) offset part is taken care of in GEO.
- the depth values are represented as 24 bit integers. This offset is added to each of the vertex z values.
- the computed offset is clamped to 24 bits (in fact s24) before being added to the z values.
- Figure 28 shows the quadrant assignment based on the orientation of the line. Which quadrant the line lies in is determined by looking at the relative position of (x1, y1) with respect to (x0, y0).
- the xhw, yhw and the primitive descriptors for each quadrant are determined.
- CUL Cull Block
- the Cull unit is responsible for: 1) pre-shading hidden surface removal; and 2) breaking down primitive geometry entities (triangles, lines and points) to stamp based geometry entities called Visible Stamp Portions (VSPs).
- the Cull unit does, in general, a conservative culling of hidden surfaces. Cull can only conservatively remove hidden surfaces because it does not handle some "fragment operations" such as alpha test and stencil test.
- the Cull block's sample z-buffer can hold two depth values, but the Cull block can only store the attributes of one primitive per sample. Thus, whenever a sample requires blending colors from two pieces of geometry, Cull has to send the first primitive (using time order) down the pipeline, even though there may be later geometry that hides both pieces of the blended geometry.
- the Cull Unit receives Setup Output Primitive Packets that each describe, on a per tile basis, either a triangle, a line or a point.
- Sort is the unit that bins the incoming geometry entities to tiles.
- Setup is the unit that pre-processes the primitives to provide more detailed geometric information for Cull to do the hidden surface removal. Setup will pre-calculate the slope value for all the edges, the bounding box of the primitive within the tile, the minimum depth value (front most) of the primitive within the tile, and other relevant data.
- prior to Sort, Mode Extraction has already extracted the color, light, texture, and related mode data; Cull only gets the mode data that is relevant to Cull and a pointer, called the Color Pointer, that points to the color, light and texture data stored in Polygon Memory.
- the Cull Unit sends one Visible Stamp Portion (VSP) at a time to the Mode Injection unit.
- a VSP is a visible portion of a geometry entity within a stamp.
- Mode Injection reconnects the VSP with its color, light and texture data and sends it to Fragment and later stages in the pipeline.
- the Cull Unit performs two main functions.
- the primary function is to remove geometry that is guaranteed to not affect the final results in the frame buffer (i.e., a conservative form of hidden surface removal).
- the second function is to break primitives into units of stamp portions (SPs).
- a stamp portion is the intersection of a primitive with a given stamp.
- the portion amount is determined by sampling. Any stamp will have 16 predetermined sample points (actually each pixel within a stamp has 4 predetermined sample points).
- the portion "size" is then given by the number and the set of sample points covered by a primitive in a given stamp.
- Cull processes primitives one tile at a time.
- the pipeline is in one of two modes: 1) Time Order Mode; or 2) Sorted Transparency Mode.
- in Time Order Mode, the time order of vertices and modes is preserved within each tile, and the tile is processed in a single pass through the data. That is, for a given tile, vertices and modes are read in the same order as they are written, but are skipped if they do not affect the current tile.
- in Sorted Transparency Mode, the processing of each tile is divided into multiple passes, where, in the first pass, guaranteed opaque geometry is processed (the Sort Block only sends non-transparent geometry for this pass). In subsequent passes, potentially transparent geometry is processed (the Sort Block repeatedly sends all the transparent geometry for each pass). Within each pass, the time ordering is preserved, and mode data is inserted in its correct time-order location.
- MIJ Mode Injection Block
- the Mode Injection (MIJ) block in conjunction with the Mode Extraction block is responsible for the management of graphics state related information.
- state changes are incremental, i.e. the value of a state parameter remains in effect until it is changed.
- the applications only need to update the parameters that change.
- the rendering is linear, i.e. primitives are rendered in the order received. Points, lines, triangle strips, triangle fans, polygons, quads, and quad strips are examples of graphical primitives.
- all state changes accumulated until the spatial information about a primitive is received are effective during the rendering of that primitive.
- rendering is tile based.
- the Geometry (GEO) block receives the primitives in order, performs all vertex operations (transformations, vertex lighting, clipping, and primitive assembly), and sends the data down the pipeline.
- the Sort block receives the time ordered data and bins it by the tiles it touches. (Within each tile, the list is in time order.)
- the CUL block receives the data from the SRT block in tile order, and culls out parts of the primitives that definitely do not contribute to the rendered images.
- the CUL block generates the VSPs.
- a VSP corresponds to the visible portion of a polygon on the stamp.
- a stamp is a 2x2 pixel area of the image.
- the TEX and PHG units receive the VSPs and are responsible for the texturing and lighting of the fragments respectively.
- the last block, i.e. the Pixel block, consumes the VSPs and the fragment colors to generate the final picture.
- a primitive may touch many tiles and therefore, unlike traditional rendering pipelines, may be visited many times (once for each tile it touches) during the course of rendering the frame.
- the pipeline must remember the graphics state in effect at the time the primitive entered the pipeline, and recall it every time it is visited by the pipeline stages downstream from SRT.
- MEX is a logic block between Geometry and Sort blocks that collects and saves the temporally ordered state change data, and attaches appropriate pointers to the primitive vertices in order to associate the correct state with the primitive when it is rendered.
- the Mode Injection (MIJ) block is responsible for the retrieval of the state and any other information associated with the state pointer (aka the MLM Pointer) when it is needed. It is also responsible for the repackaging of the information as appropriate. An example of the repackaging occurs when the vertex data in polygon memory is retrieved and bundled into primitive (triangle, line, point) input packets for fragment.
- MIJ receives VSP packets from the CUL block.
- Each VSP packet corresponds to the visible portion of a primitive on the 2x2 pixel stamp.
- the VSPs output from the Cull block to MIJ block are not necessarily ordered by primitives. In most cases, they will be in the VSP scan order on the tile, i.e. the VSPs for different primitives may be interleaved.
- the pipeline stages downstream from the MIJ block need information about the type of the primitive (i.e., point, line, or triangle).
- MEX also attaches ColorPointers ⁇ ColorAddress, ColorOffset, and ColorType ⁇ to each primitive sent to Sort, which is in turn passed on to each of the VSPs of that primitive. MIJ decodes this pointer to retrieve the necessary information from the polygon memory.
- MIJ starts working on a frame after it receives a BeginFrame packet from CUL.
- the VSP processing for the frame begins when CUL is done with the first tile in the frame and MIJ receives the first VSP for that tile.
- Color Pointer Decode
- the color pointer consists of three parts: the ColorAddress, ColorOffset, and ColorType. (We refer the reader to the Mode Extraction Architecture Specification for details of the ColorPointer and the MLM_Pointer.)
- the ColorAddress points to the ColorVertex that completes the primitive.
- ColorOffset provides the number of vertices separating the ColorAddress from the dualoct that contains the MLM_Pointer.
- ColorType contains information about the type of the primitive, size of each ColorVertex, and the enabled edges for line mode triangles. The ColorVertices making up the primitive may be 2, 4, 6, or 9 dualocts long.
- MIJ decodes the ColorPointer to obtain the addresses of the dualocts containing the MLM_Pointer and all the ColorVertices that make up the primitive.
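- the decode step can be sketched as follows (in C++; the address arithmetic is illustrative only, since the actual packing of the ColorPointer fields is defined in the Mode Extraction specification):

    struct ColorPointer {
        unsigned int colorAddress;  // dualoct address of the completing ColorVertex
        unsigned int colorOffset;   // vertices between that address and the MLM_Pointer
        unsigned int colorType;     // primitive type, vertex size, line-mode edges
    };

    // Walk back from the completing vertex to the other vertices of the
    // primitive (nBack = 0, 1, 2 for a triangle), given the per-vertex size
    // in dualocts decoded from ColorType.
    unsigned int vertexAddress(const ColorPointer& cp, unsigned int vtxDualocts,
                               int nBack) {
        return cp.colorAddress - nBack * vtxDualocts;
    }

    // Hypothetical placement: the MLM_Pointer dualoct precedes the vertices
    // it governs, ColorOffset vertices before the ColorAddress.
    unsigned int mlmPointerAddress(const ColorPointer& cp, unsigned int vtxDualocts) {
        return cp.colorAddress - cp.colorOffset * vtxDualocts - 1;
    }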
- the MLM_Pointer contains the dualoct address of the six state packets in polygon memory.
- the MIJ block is responsible for making sure that the Fragment, Texture, Phong and Pixel blocks have all the information they need for processing the fragments in the VSP, before the VSP arrives at that stage.
- the ColorVertices of the primitive as well as the six state packets pointed to by the pointers in the MLM_Pointer need to be resident in the blocks that need them, before the VSP fragments can be processed.
- if MIJ were to retrieve the MLM_Pointer, the state packets, and the ColorVertices for each of the VSPs, it would amount to nearly 1 KB of data per VSP. This is equivalent to 125 GB/sec of polygon memory bandwidth for reading the data, and as much again for writing out the data to the FRG and PIX blocks.
- there is considerable coherence in the primitives, i.e. we are likely to get a sequence of VSPs corresponding to the same primitive.
- the VSPs do not arrive at MIJ in primitive order. Instead, they are in the VSP scan order on the tile, i.e. the VSPs for different primitives crossing the scan-line may be interleaved. For this reason, a caching scheme based on the current and previous VSP alone will cut down the bandwidth by approximately 80%.
- MIJ manages seven caches for the downstream blocks: one for FRG (ColorData Cache 10016) and two each for TEX (TexA 10018, TexB 10020), PHG (Light 10024, Material 10022), and PIX (PixelMode and Stipple).
- the Mode Injection block resides between the CUL block and the rest of the pipeline downstream from CUL.
- MIJ receives the control and VSP packets from the CUL block.
- MIJ interfaces with the Fragment and Pixel blocks.
- the MIJ is responsible for the following:
- Polygon memory stores per-vertex data.
- MIJ retrieves the required vertices (3 for a triangle, 2 for a line, and 1 for point primitives) from the polygon memory.
- Mode injection thus deals with the retrieval of state as well as the per-vertex data needed for computing the final colors for each fragment in the VSP.
- the Fragment block is located after Cull and Mode Injection and before Texture, Phong, and Bump. It receives Visible Stamp Portions (VSPs) that consist of up to 4 fragments that need to be shaded.
- the fragments in a VSP always belong to the same primitive, therefore the fragments share the primitive data defined at vertices including all the mode settings.
- a sample mask, sMask, defines which subpixel samples of the VSP are active. If one or more of the four samples for a given pixel is active, a fragment is needed for the pixel, and the vertex-based data for the primitive will be interpolated to make fragment-based data.
- the active subpixel sample locations are used to determine the corresponding x and y coordinates of the fragment.
- the Fragment block caches the color data to be reused by multiple VSPs belonging to the same primitive.
- Mode Injection identifies if the color cache contains the required data. If it is a hit, Mode Injection sends the VSP, which includes an index into the cache. On a cache miss, Mode Injection replaces an entry from the cache with the new color data, prior to sending the VSP packet with the Color cache index pointing to the new entry.
- all modes, materials, texture info, and light info settings are cached in the blocks in which they are used. An index for each of these caches is also included in the VSP packet.
- the Fragment block caches some texture and mode info.
- Figure 32 shows the flow and caching of mode data in the last half of the DSGP pipeline.
- the Fragment block's main function is the interpolation of the polygon information provided at the vertices for all active fragments in a VSP.
- the Fragment block can perform the interpolations of a given fragment in parallel and fragments within a VSP can be done in an arbitrary order. Fully interpolated stamps are forwarded to the Texture, Phong and Bump blocks in the same order as received.
- the Fragment block generates Level of Detail (LOD or λ) values for up to four textures and sends them to the Texture block.
- the Fragment block will have an adequately sized FIFO in its input to smooth variable stamp processing time and the Color cache fill latency.
- Figure 33 shows a block diagram of the Fragment block.
- the Fragment block can be divided into six sub-blocks. Namely: 1. The cache fill sub-block 11050 2. The Color cache 11052
- the first block handles Color cache misses. New polygon data replaces old data in the cache.
- the Color cache index, CCIX, points to the entry to be replaced.
- the block doesn't write all of the polygon data directly into the cache. It uses the vertex coordinates, the reciprocal of the w coordinate, and the optional texture q coordinate to calculate the barycentric coefficients. It writes the barycentric coefficients into the cache, instead of the info used to calculate them.
- the second sub-block implements the Color cache.
- Fragment receives a VSP packet (hit)
- the cache entry pointed to by CCIX is read to access the polygon data at the vertices and the associated barycentric coefficients.
- the third sub-block prepares the interpolation coefficients for the first fragment of the VSP.
- the coefficients are expressed in plane equation form for the numerator and the denominator to facilitate incremental computation of the next fragment's coefficients.
- the total area of the triangle divides both the numerator and the denominator, and therefore can be cancelled to simplify the computation.
- additional storage and bandwidth is saved by only providing two out of three sets of barycentric coordinates along with the denominator. As a non-performance case, texture coordinates with a q other than 1 will be interpolated using 3 more coefficients for the denominator.
- the x and y coordinates given per stamp correspond to the lower left pixel in the stamp. Only the position of the stamp in a tile is determined by these coordinates. A separate packet provides the coordinates of the tile that subsequent stamps belong to. A lookup table is used with the corresponding bits in sMask to determine the lower bits of the fragment x and y coordinates at subpixel accuracy. This choosing of an interpolation location at an active sample location ensures that the interpolation coefficients will always be positive with their sum being equal to one.
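- the coordinate assembly can be sketched as follows (in C++; the assumed 16x16-pixel tile, 2x2-pixel stamp, and 8x8 sub-pixel grid give the shift amounts, and the sample-position table, left zeroed here, would be filled from the SampleLocSel pattern):

    typedef unsigned int uint;

    static const unsigned char kSampleX[16] = {0};  // sub-pixel x within the stamp
    static const unsigned char kSampleY[16] = {0};  // sub-pixel y within the stamp

    // Tile coordinates form the most significant bits, the stamp position the
    // middle bits, and the per-sample lookup the least significant bits.
    void fragmentCoords(uint tileX, uint tileY, uint stampX, uint stampY,
                        int sampleId, uint& fx, uint& fy) {
        fx = (tileX << 7) | (stampX << 4) | kSampleX[sampleId];  // 16 px * 8 = 128
        fy = (tileY << 7) | (stampY << 4) | kSampleY[sampleId];  // 2 px * 8 = 16
    }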
- the fourth sub-block interpolates the colors, normals, texture coordinates, eye coordinates, and Bump tangents for each covered pixel.
- the interpolators are divided in four groups according to their precision.
- the first group interpolates 8 bit fixed point color fractions. The values are between 0 and 1; the binary representation of the value 1 has all the bits set to one.
- the second set interpolates sixteen bit, fixed point, unit vectors for the normals and the surface tangent directions.
- the third set interpolates 24 bit floating point numbers with sixteen bit mantissas. The vertex eye coordinates and the magnitudes of the normals and surface tangents fall into this category.
- the last group interpolates the texture coordinates which are also 24 bit FP numbers but may have different interpolation coefficients. All interpolation coefficients are generated as 24 bit FP values but fewer bits or fixed point representation can be used when interpolating 8 bit or 16 bit fixed point values.
- the fifth sub-block re-normalizes the normal and surface tangents.
- the magnitudes obtained during this process are discarded.
- the original magnitudes are interpolated separately before being forwarded to the Phong and Bump block.
- the texture map u, v coordinates and Level of Detail (LOD) are evaluated in the sixth sub-block.
- the barycentric coefficients are used in determining the texture LOD. Up to four separate textures associated with two texture coordinates are supported. Therefore the unit can produce up to four LODs.
- Figure 34 shows examples of VSPs with the pixel fragments formed by various primitives.
- a copy of the sMask is also sent directly to the Pixel block, bypassing the shading blocks (Fragment, Texture, Phong and Bump).
- the bypass packet also includes the z values, the Mode and Polygon
- $V_0$, $V_1$, and $V_2$ are the vertices of the triangle.
- $A_0$, $A_1$, and $A_2$ can be found as:

$$A_0(x,y) = \frac{\mathrm{Area}(P, V_1, V_2)}{\mathrm{Area}(V_0, V_1, V_2)}, \quad A_1(x,y) = \frac{\mathrm{Area}(V_0, P, V_2)}{\mathrm{Area}(V_0, V_1, V_2)}, \quad A_2(x,y) = \frac{\mathrm{Area}(V_0, V_1, P)}{\mathrm{Area}(V_0, V_1, V_2)}$$

where $P = (x, y)$ is the fragment position.
- Area(i, j, k) denotes the area in window coordinates of the triangle with vertices i, j, and k:

$$\mathrm{Area}(V_0, V_1, V_2) = \tfrac{1}{2}\,(x_{w1}\,y_{w2} - x_{w2}\,y_{w1} + x_{w2}\,y_{w0} - x_{w0}\,y_{w2} + x_{w0}\,y_{w1} - x_{w1}\,y_{w0})$$

- $w_{c0}$, $w_{c1}$, and $w_{c2}$ are the clip w coordinates of $V_0$, $V_1$, and $V_2$, respectively.
- $A_0$, $A_1$, and $A_2$ are the barycentric coordinates of the fragment for which the data are produced.
- Vertex 2 is assumed to hold the data. In case q is not equal to one the s, t, and r coordinates need to be divided by q.
- the normal and surface tangents may have a magnitude associated with directional unit vectors.
- Figure 36 shows how interpolating between vectors of unequal magnitude results in uneven angular granularity, which is why we do not interpolate normals and tangents this way.
- Figure 37 shows how the fragment x and y coordinates used to form the interpolation coefficients are formed.
- the tile x and y coordinates, set at the beginning of a tile processing form the most significant bits.
- the sample mask (sMask) is used to find which fragments need to be processed.
- a lookup table provides the least significant bits of the coordinates at sub-pixel accuracy. We may be able to reduce the size of the LUT if we can get away with 2 bits of sample location select.
- $x_{w0}$, $x_{w1}$, and $x_{w2}$ are the window x-coordinates of the three triangle vertices; $y_{w0}$, $y_{w1}$, and $y_{w2}$ are the window y-coordinates of the triangle vertices.
- the denominator components can be formed by adding the individual constants in the numerator:

$$D_x = C_{x0} + C_{x1} + C_{x2}, \qquad D_y = C_{y0} + C_{y1} + C_{y2}, \qquad D_k = C_{k0} + C_{k1} + C_{k2}$$
- the above calculations need to be done only once per triangle.
- the color memory cache is used to save the coefficients for the next VSP of the same triangle. On a cache miss the coefficients need to be re-evaluated.
- the numerator terms are evaluated as plane equations of the form:

$$G_i(x,y) = C_{xi}\,x + C_{yi}\,y + C_{ki}, \qquad G_2(x,y) = D(x,y) - G_0(x,y) - G_1(x,y)$$

where $D(x,y) = D_x\,x + D_y\,y + D_k$, and which can be stepped incrementally:

$$G_0(x+1,y) = G_0(x,y) + C_{x0}, \qquad G_2(x+1,y) = G_2(x,y) + C_{x2}$$

- the color components are interpolated with the normalized weights $L_i(x,y) = G_i(x,y)/D(x,y)$, for example:

$$R_{\mathrm{Diff}}(x,y) = L_0(x,y)\,R_{\mathrm{Diff}0} + L_1(x,y)\,R_{\mathrm{Diff}1} + L_2(x,y)\,R_{\mathrm{Diff}2}$$
$$A_{\mathrm{Diff}}(x,y) = L_0(x,y)\,A_{\mathrm{Diff}0} + L_1(x,y)\,A_{\mathrm{Diff}1} + L_2(x,y)\,A_{\mathrm{Diff}2}$$
$$G_{\mathrm{Spec}}(x,y) = L_0(x,y)\,G_{\mathrm{Spec}0} + L_1(x,y)\,G_{\mathrm{Spec}1} + L_2(x,y)\,G_{\mathrm{Spec}2}$$
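- a sketch of this incremental evaluation (in C++; the once-per-triangle setup of the C coefficients is assumed to have been done as above):

    struct Plane { float cx, cy, ck; };  // G(x, y) = cx*x + cy*y + ck

    // Full evaluation, done once for the first fragment of a VSP.
    float evalAt(const Plane& p, float x, float y) { return p.cx * x + p.cy * y + p.ck; }

    // Stepping one pixel costs a single add per interpolated plane.
    float stepX(const Plane& p, float g) { return g + p.cx; }  // G(x+1, y)
    float stepY(const Plane& p, float g) { return g + p.cy; }  // G(x, y+1)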
- the 8-bit color values are actually fractions between 0 and 1, inclusive. The missing representable number is $1 - 2^{-8}$; the value one is represented with all the bits set, taking the place of this missing representation.
- the 8-bit index value replaces the R value of the Diffuse and the Specular component of the color.
- the normal vector has to be re-normalized after the interpolation: $N' = N / \lVert N \rVert$.
- At half-rate (accumulative) we interpolate up to four texture coordinates. This is done either using the plane equations or barycentric coordinates.
- the r-texture coordinates are also interpolated for volume texture rendering but at one third of the full rate.
- for example, for texture coordinate set 1:

$$s[1] = L_0(x,y)\,s_0[1] + L_1(x,y)\,s_1[1] + L_2(x,y)\,s_2[1]$$
$$t[1] = L_0(x,y)\,t_0[1] + L_1(x,y)\,t_1[1] + L_2(x,y)\,t_2[1]$$
- the surface tangents also have to be normalized, like the normals, after interpolation.
- λ is called the Level of Detail (LOD) and ρ is called the scale factor that governs the magnification or minification of the texture image.
- n and m are the width and the height of a two dimensional texture map.
- the partial derivatives of u and v are obtained using the partials of s and t. For a one-dimensional texture map, t, v, and the partial derivatives ∂v/∂x and ∂v/∂y are set to zero. For a line the formula is:
- the DSGP pipeline supports up to four textures with two sets of texture coordinates.
- the Fragment block passes s, t, r, and λ to the Texture block for each active texture. Note that λ is not the final LOD.
- the Texture block applies additional rules, such as LOD clamping, to obtain the final value for λ.
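- a sketch of the scale factor and LOD computation (in C++; this uses the standard mipmap selection rule, with the final clamping left to the Texture block as described above):

    #include <algorithm>
    #include <cmath>

    // u = s * n and v = t * m are the texel-space coordinates; their screen-
    // space partials give the scale factor rho, and lambda = log2(rho).
    float textureLod(float dudx, float dudy, float dvdx, float dvdy) {
        float rho = std::sqrt(std::max(dudx * dudx + dvdx * dvdx,
                                       dudy * dudy + dvdy * dvdy));
        return std::log2(rho);
    }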
- the Fragment uses three caches to perform the needed operations.
- the primary cache is the Color cache. It holds the color data for the primitive (triangle, line, or point).
- the cache miss determination and replacement logic is actually located in the Mode Inject block.
- the Fragment block normally receives a "hit" packet with an index pointing to the entry that holds the associated Color data. If a miss is detected by the Mode Inject block, a "fill" packet is sent first to replace an entry in the cache with the new data before any "hit" packets are sent to use the new data. Therefore it is important not to change the order of packets sent by Mode Inject, since the cache replacement and use logic assumes that the incoming packets are processed in order.
- the Fragment block modifies some of the data before writing in the Color cache during cache fills. This is done to prepare the barycentric coefficients during miss time.
- the vertex window coordinates, the reciprocal of the clip-w coordinates at the vertices, and the texture q coordinates at the vertices are used and replaced by the $C_{x[1:0]}$, $C_{y[1:0]}$, $C_{k[1:0]}$, $D_x$, $D_y$, and $D_k$ barycentric coefficients.
- the S x , S y , T x , and T y values are evaluated during cache misses and stored along with the other data.
- the Color cache is currently organized as a 256 entry, four set associative cache.
- the microArchitecture of the Mode Inject and Fragment Units may change this organization provided that the performance goals are retained. It is assumed that at full rate the Color cache misses will be less than 15% of the average processed VSPs.
- the data needed at half rate is stored as two consecutive entries in the Color cache.
- the index provided in this case will always be an even number.
- TEXTURE_1D, TEXTURE_2D, and TEXTURE_3D are the enable bits for a given texture.
- TEXTURE_HEIGHT and TEXTURE_WIDTH define, respectively, the m and n values used in the u and v calculations.
- TEXTURE_COORD_SET_SOURCE identifies which texture coordinate is bound to a given texture.
- the texture mode caches are organized as a 32 entry fully associative cache.
- the assumed miss rate for texture mode cache 0 is less than 0.2% per VSP.
- Mode Cache is organized as a fully associative, eight-entry cache.
- the assumed miss rate is 0.001 % per VSP (negligible).
- the following information is cached in the Mode Cache:
- Texture mapping is a technique for simulating surface textures by coloring polygons with detailed images. Typically, a single texture map will cover an entire object that consists of many polygons.
- a texture map consists of one or more rectangular arrays of RGBA color (up to 2K by 2K in Apex). The user supplies coordinates, either manually or automatically in the Geometry Block, into the texture map at each vertex. These coordinates are interpolated for each fragment, the texture values are looked up in the texture map, and the resulting color is assigned to the fragment. Bump map coefficients are obtained similarly using the light_texture extension. See the Phong Block for details.
- texture maps must be scaled so that the texture pattern appears the same size relative to the object being textured.
- scaling and filtering a texture image for each fragment is an expensive proposition.
- Mipmapping allows the renderer to avoid some of this work at run-time.
- the user provides a series of texture arrays at successively lower resolutions, each array representing the texture at a specified level of detail (LOD, or λ).
- the Apex Board supports texture mapping with tri-linear mipmapping at 250M textured fragments/sec. Up to eight texture maps and eight sets of texture coordinates per fragment are supported at proportionally lower performance. Apex handles bump maps as textures, using either the SGI extensions or height gradient fields. It will perform 3-D texture mapping at a somewhat slower rate, because the texel cache will perform less efficiently due to less optimal texel reuse.
- Shadow: a simple extension to support multipass shadows.
- Texture Block caches texels to get local reuse. Texture maps are stored in texture memory in 2x2 texel blocks, as described below.
- the user can send some triangles to be textured with one map and then change the texture data associated with the same texture number to texture other triangles in the same frame.
- Our pipeline requires that all sets of texture data for a frame be available to the Texture Block.
- Texture Memory stores texture arrays that the Texture Block is currently using.
- Software manages the texture memory, copying texture arrays from host memory into Texture Memory. It also maintains a table of texture array addresses in Texture Memory.
- the Texture Block identifies texture arrays by virtual texture number and LOD.
- the arrays for the highest LODs are lumped into a single record. (In one embodiment, seven LODs each contain 21 kilobytes.)
- a texture array pointer table associates a texture array ID (virtual texture number concatenated with the LOD) with an address in Texture Memory. We need to support thousands of texture array pointers, so the texture array pointer table will have to be stored in Texture Memory. We need to map texture array IDs to addresses ~500M times per second.
- FIG. 38 gives an overview of texture array addressing.
- the Texture Block implements a double hashing algorithm to search the pointer table in memory.
- Software manages the texture array pointer table, using the hardware hashing algorithm to store table elements.
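A sketch of how such a double-hashed probe of the pointer table might look; the table size, hash functions, and entry layout are assumptions, since the text specifies only that the table lives in Texture Memory and is searched by double hashing:

```c
#include <stdint.h>

#define TABLE_SLOTS 4096u   /* power of two; the step below is odd, so
                               every slot is eventually probed */

typedef struct {
    uint32_t id;            /* virtual texture number + LOD */
    uint32_t addr;          /* array base address in Texture Memory */
    int      valid;
} PtrEntry;

static uint32_t lookup_array_addr(const PtrEntry *table, uint32_t id,
                                  int *missed)
{
    uint32_t h    = id % TABLE_SLOTS;                 /* first hash */
    uint32_t step = ((id / TABLE_SLOTS) << 1) | 1u;   /* second hash, odd */
    for (uint32_t i = 0; i < TABLE_SLOTS; i++) {
        const PtrEntry *e = &table[(h + i * step) % TABLE_SLOTS];
        if (!e->valid) break;            /* empty slot: id not resident */
        if (e->id == id) { *missed = 0; return e->addr; }
    }
    *missed = 1;    /* miss: interrupt the host to load the array */
    return 0;
}
```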
- the Texture Block sends an interrupt to the host when it needs a texture array that is not already in texture memory.
- the host copies the texture array from main memory to texture memory, and updates the texture array pointer table, as described above.
- the host controls which texture arrays are overwritten by new data.
- the host will need to rearrange texture memory to do garbage collection, etc.
- the hardware will support the following memory copies:
- Texture Memory: A texture array is divided into 2x2 texel blocks. Each texel block in an array is represented in Texture Memory by a 16- or 18-byte record containing RGBA, RGB, or height gradient data for four texels. Texturing a given fragment with tri-linear mip-mapping requires accessing 2 to 8 of these blocks, depending on where the fragment falls relative to the 2x2 blocks.
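A sketch of texel addressing under this 2x2-block layout (the 16/18-byte record packing is summarized, not modeled):

```c
typedef struct { unsigned char rgba[4]; } Texel;
typedef struct { Texel t[2][2]; } TexelBlock;    /* one 2x2 block record */

/* blocks: block-row-major storage of a (w x h)-texel mip level */
static Texel fetch_texel(const TexelBlock *blocks, unsigned w,
                         unsigned u, unsigned v)
{
    unsigned blocks_per_row = (w + 1u) / 2u;
    const TexelBlock *b = &blocks[(v >> 1) * blocks_per_row + (u >> 1)];
    return b->t[v & 1u][u & 1u];
}
```

A bilinear footprint that straddles block boundaries touches up to four such blocks per mip level, which is where the 2-to-8-block range for a tri-linear sample comes from.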
- Texture Memory: In addition to the normal path between Texture Memory and the Texture Block, there is a path from host memory to Texture Memory. The bandwidth should be about 500 MB/s. This "Back Door Bus" path connects the framebuffer and Texture Memory to the host. We also support memory-to-memory copies in Texture Memory under the control of software.
- Texture Formats: In hardware, we support the OpenGL internal formats RGBA8, RGB12 (signed), and LUMINANCE16_ALPHA16 (signed). Software will support the other formats that use a subset of the storage of these formats, e.g., RGB8. Some uses of Texture Memory, e.g., for bump map coefficients, may interpret the texel bits in other ways. We will support 16-bit interpolations for bump map textures. After the Texture Block, all colors are treated as 8-bit quantities except for light_texture quantities like normals, depth, and height fields.
- the Texture Block uses four sets of arithmetic units for the calculations: two with 16 bit precision, one with 12 bit precision, and one with 8 bit precision.
- Video feed will be in one of several YUV (or YIQ) formats.
- We will do the conversion to RGB and pack the values into texel format (2x2 blocks instead of scanline order) by using the 2D core to translate to RGB and using the Rambus masked writes to store the texels in 2x2 blocks.
- This data will be stored in Texture Memory and displayed as a normal texture.
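One common YUV-to-RGB conversion, shown here with illustrative ITU-R BT.601 coefficients for video-range data (the exact matrix used is not fixed by the text):

```c
/* Convert one YUV sample to 8-bit RGB; constants are the familiar
 * BT.601 video-range coefficients, assumed for illustration. */
static void yuv_to_rgb(unsigned char y, unsigned char u, unsigned char v,
                       unsigned char rgb[3])
{
    float c = 1.164f * ((float)y - 16.0f);
    float d = (float)u - 128.0f;     /* U (Cb) */
    float e = (float)v - 128.0f;     /* V (Cr) */
    float r = c + 1.596f * e;
    float g = c - 0.391f * d - 0.813f * e;
    float b = c + 2.018f * d;
    rgb[0] = (unsigned char)(r < 0 ? 0 : r > 255 ? 255 : r);
    rgb[1] = (unsigned char)(g < 0 ? 0 : g > 255 ? 255 : g);
    rgb[2] = (unsigned char)(b < 0 ? 0 : b > 255 ? 255 : b);
}
```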
- the Phong Block calculates the color of a fragment by combining the color, material, geometric, and lighting information from the Fragment Block with the texture information from the Texture Block. The result is a colored fragment that is forwarded to the Pixel Block, where it is blended with any color information already residing in the frame buffer.
- the Phong Block embodies a number of features for performing tangent space lighting in a deferred shading environment. These features include:
- Phong block 14000 does not interpolate partials or normals. Instead, these interpolations are done in the Fragment block 11000, which passes the interpolated results to Phong.
- the method by which Fragment 11000 performs these interpolations is described above; however, features of this method and its advantages are briefly recited herein:
- Fragment does not interpolate partials or normals of arbitrary magnitude;
- Instead, per-vertex partials and normals are provided to Fragment as unit vectors and associated magnitudes, which Fragment separately interpolates (see discussion above of barycentric interpolation for triangles and other inventive interpolation methods performed by Fragment);
- Fragment normalizes the interpolated partial and normal unit vectors and passes the results to Phong as the fragment unit normals and partials;
- Fragment passes the interpolated magnitudes to Phong as the magnitudes associated with the fragment unit normals and partials;
- Phong performs bump and lighting calculations using the interpolated unit vectors and associated magnitudes.
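A minimal C sketch of the recited split interpolation, assuming barycentric weights for a triangle; the names are ours:

```c
#include <math.h>

typedef struct { float x, y, z; } Vec3;

/* Unit vectors and their scalar magnitudes are interpolated separately;
 * the vector part is renormalized before being passed to Phong along
 * with its interpolated magnitude. */
static void interp_unit_and_mag(const float L[3],    /* barycentric weights */
                                const Vec3 unit[3],  /* per-vertex unit vectors */
                                const float mag[3],  /* per-vertex magnitudes */
                                Vec3 *unit_out, float *mag_out)
{
    Vec3 n = { 0.0f, 0.0f, 0.0f };
    float m = 0.0f;
    for (int i = 0; i < 3; i++) {
        n.x += L[i] * unit[i].x;
        n.y += L[i] * unit[i].y;
        n.z += L[i] * unit[i].z;
        m   += L[i] * mag[i];        /* magnitude interpolated on its own */
    }
    float len = sqrtf(n.x * n.x + n.y * n.y + n.z * n.z);
    unit_out->x = n.x / len;
    unit_out->y = n.y / len;
    unit_out->z = n.z / len;
    *mag_out = m;
}
```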
- Phong block 14000 does not interpolate L or H vectors. Instead, Phong receives from the Fragment block 11000 a unit light vector P1 and a unit fragment vector V, both defined in eye space coordinates. Phong derives the light vector L without interpolation by subtracting V from P1. Phong is then able to derive the half-angle vector H from the light vector and a known eye vector E.
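A sketch of this derivation; the Vec3 type and helper are ours:

```c
#include <math.h>

typedef struct { float x, y, z; } Vec3;

static Vec3 norm3(Vec3 a)
{
    float len = sqrtf(a.x * a.x + a.y * a.y + a.z * a.z);
    Vec3 r = { a.x / len, a.y / len, a.z / len };
    return r;
}

/* L is derived per fragment by subtraction rather than interpolation:
 * L = normalize(P1 - V); the half-angle is then H = normalize(L + E). */
static void derive_L_and_H(Vec3 P1, Vec3 V, Vec3 E, Vec3 *L, Vec3 *H)
{
    Vec3 d = { P1.x - V.x, P1.y - V.y, P1.z - V.z };
    *L = norm3(d);
    Vec3 s = { L->x + E.x, L->y + E.y, L->z + E.z };
    *H = norm3(s);
}
```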
- advantages of the inventive system for performing tangent space lighting in a deferred shading architecture include:
- Color Index Mode: Texture and fragment lighting operations do not take place in color index mode. In this mode the only calculations performed by the Phong Block are the fog calculations. In this case the mantissa of the R value of the incoming fragment color is interpreted as an 8-bit color index varying from 0 to 255, and is routed directly to the fog block for processing.
- Referring to FIG. 34, there is shown a block diagram illustrating Phong's position in the pipeline and its relationship to adjacent blocks.
- the Phong Block 14000 is located after Texture 12000 and before Pixel 15000. It receives data from both Texture and Fragment 11000. Fragment sends per-fragment data as well as cache fill data that are passed through from Mode Injection. Texture sends only texel data 120001a.
- the data from Fragment 11000 include: stamp x, y 14001a; RGBA diffuse data 14001b; RGBA specular data 14001c; surface normals 14001d; bump basis vectors 14001e; eye coordinates 14001f; light cache index 14001g; and material cache index 14001h.
- the Phong Block has two internal caches: the "light" cache 14154, which holds infrequently changing information such as scene lights and global rendering modes, and the "material" cache 14150, which holds per-object material information.
- the Phong procedure is composed of several sub-computations, or blocks, which are summarized here. Pseudo-code along with details of required data and state information are described later in this specification.
- Figure 36 shows a block diagram of Phong 14000, showing the various Phong computations.
- Texture computation 14114 accepts incoming texels 14102 from the Texture Block and texture mode information 14151a from the material cache 14150. This computation applies the texture-environment calculation and merges multiple textures if present. The result is forwarded to the Light-environment subunit 14142 in the case of the conventional use of textures, or to other subunits, such as Bump 14130, in case the texture is to be interpreted as modifying some parameter of the Phong calculation other than color.
- Material Computation/Selection: Material computation 14126 determines the source of the material values for the lighting computation. Inputs to Material computation 14126 include material texture values from Texture 14114, fragment material values 14108 from Fragment, and a primary color 14106 originating in the Gouraud calculation. Using current material mode bits from the material cache 14150, the Material computation may decide to replace the fragment material 14108 with the texture values 14114 or with the incoming primary color 14106.
- Bump computation 14130 determines the surface normal to be used in the lighting calculation. Inputs to Bump include bump texture information 14122 from Texture 14114 and the surface normal, tangent, and binormal 14110 from Fragment 11000. The Bump computation 14130 may simply pass through the normal as interpolated by Fragment, or may use a texel value 14122 in a calculation that involves a 3x3 matrix multiply, as sketched below.
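A sketch of that 3x3 multiply, assuming the texel encodes a tangent-space perturbed normal and that T, B, N are the interpolated tangent, binormal, and normal from Fragment:

```c
typedef struct { float x, y, z; } Vec3;

/* Take a tangent-space normal into eye space via the matrix whose
 * columns are the surface tangent T, binormal B, and normal N. */
static Vec3 bump_normal_to_eye(Vec3 nt, Vec3 T, Vec3 B, Vec3 N)
{
    Vec3 n = {
        T.x * nt.x + B.x * nt.y + N.x * nt.z,
        T.y * nt.x + B.y * nt.y + N.y * nt.z,
        T.z * nt.x + B.z * nt.y + N.z * nt.z
    };
    return n;   /* renormalized before the lighting calculation */
}
```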
- Light-Texture Computation: Inputs to Light-Texture computation 14134 include light texture information 14118 from the Texture computation 14114 and the fragment light information 14112 from Fragment. Light-Texture computation 14134 decides whether any of the components of the lights 14112 should be replaced by a texel 14118.
- Fragment lighting computation 14138 performs the actual lighting calculation for this fragment using an equation similar to that used for per-vertex lighting in the GEO block. This equation has been discussed in detail in the Background section.
- Inputs to Fragment Lighting include material data 14128 from Material selection 14126, surface normal from Bump 14130 and light data from 14136 from Light-Texture 14134.
- Light environment computation 14142 blends the result 14140 of the fragment lighting computation with the texture color 14118 forwarded from the Texture Block.
- Fog Computation: Fog computation 14146 applies "fog"; it modifies the fragment color 14144 using a computation that depends only on the distance from the viewer's eye to the fragment, as sketched below. The final result 14148 from Fog computation 14146 is forwarded to the Pixel Block.
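A sketch of a distance-only fog blend; an exponential fog factor is assumed here, since the text fixes only that fog depends on the eye-to-fragment distance:

```c
#include <math.h>

/* Blend the fragment color toward the fog color by a factor that falls
 * off with eye distance (exp mode assumed for illustration). */
static void apply_fog(float frag_rgb[3], const float fog_rgb[3],
                      float density, float eye_distance)
{
    float f = expf(-density * eye_distance);     /* fog factor, 0..1 */
    for (int i = 0; i < 3; i++)
        frag_rgb[i] = f * frag_rgb[i] + (1.0f - f) * fog_rgb[i];
}
```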
- the previous section has generally described the blocks composing the Phong computation and the data used and generated by those sub-blocks.
- the blocks can be implemented in hardware or software that meets the requirements of the preceding general description and subsequent detailed descriptions.
- data can be transferred between the Phong blocks and the external units (i.e., Texture, Fragment and Pixel) and among the Phong blocks using a variety of implementations capable of satisfying Phong I/O requirements. While all of these alternative embodiments are within the scope of the present invention, a description is now provided of one preferred embodiment where the Phong blocks are implemented in hardware and data is transferred between top-level units (i.e., Texture, Fragment, Phong and Pixel) using packets. The content of the I/O packets is described first.
- the packets include: a half-rate fragment packet 11902; a full-rate fragment packet 11904; a material cache miss packet 11906 (from MIJ, relayed by Fragment); a light cache miss packet 11908 (from MIJ, relayed by Fragment); texture packets, or texels, 12902; and a pixel output packet 14902.
- the Phong block 14000 receives packets 11902, 11904 from the Fragment block 11000 containing information that changes per-fragment that cannot be cached.
- a packet from the Fragment 11000 contains, for one fragment: pointers to cached information related to lighting and material associated with the fragment; one or more color values; fragment geometry data; and the fragment normal and, optionally, tangent and binormal.
- each full-rate packet 11904 includes a reduced set of fragment information that is used by Phong to perform a simplified lighting computation that can be performed at the full DSGP cycle rate in a "full performance mode".
- Each half-rate packet 11902 includes a full set of fragment information that is used by Phong to perform a full lighting computation at the half cycle rate.
- This distinction between full and half rate information is not an essential feature of the present invention but is useful in hardware and software implementations where it would not be possible to perform the full lighting computation at the half cycle rate. In such an implementation this distinction conserves bandwidth required for communications between the Phong and Fragment units.
- the only data that varies per fragment is the surface normal direction and the Gouraud colors produced by the geometry engine.
- to reduce bandwidth and input queue size, per-stamp information is shared among all the pixels of a visible stamp portion. This allows Fragment 11000 to send only one full-rate packet 11904 per VSP that applies to up to four fragments composing the VSP. In this case, Phong needs to be told how many fragments make up the stamp, but has no need to know the screen space coordinates of the fragment.
- the full-rate packet 11904 provides:
- the illustrated Phong embodiment can perform bump mapping and local viewer (i.e., variable eye position) operations.
- An additional difference from the full-rate operations is that the normal provided by the Fragment block for these operations is not required to be of unit magnitude.
- the half-rate packet 11902 provides, for each fragment in a stamp: the normal unit vector and associated magnitude 14001d (Figure 34); the surface tangent unit vector and associated magnitude, part of bump basis 14001e (Figure 34); and eye coordinates 14001f (Figure 34).
- Fragment 11000 can send one half-rate packet 11902 per VSP that also applies to up to four fragments composing the VSP.
- the Phong block 14000 includes a material cache 14150 (Figures 34, 35) that holds material information for one or more objects likely to be an active subject of the illumination computation. This information generally changes per object; thus, when the Phong/Bump computation is to be performed for a new object, it is unlikely that the material characteristics of the new object are resident in the material cache 14150.
- Fragment 11000 provides the material index 14001h (Figure 34) that identifies the particular material information associated with the fragment to be illuminated. In one embodiment this material index is transmitted as part of the half- and full-rate fragment packets 11902, 11904.
- Phong 14000 issues a cache miss message that causes Fragment 11000 to return a material cache miss packet 11906 from Mode Injection 10000.
- the material cache miss packet 11906 is used by Phong 14000 to fill in the material cache data for the new object.
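A sketch of this miss flow; the cache size, entry layout, and helper names are illustrative only:

```c
#define MAT_ENTRIES 32           /* assumed size, not specified here */

typedef struct {
    unsigned tag;                /* material index 14001h from Fragment */
    float    shininess;          /* plus emissive/ambient/diffuse/specular... */
    int      valid;
} MaterialEntry;

static MaterialEntry material_cache[MAT_ENTRIES];

/* Stand-in for the miss path: Phong's miss message causes Fragment to
 * relay a material cache miss packet (11906) that fills the entry. */
static void fill_from_miss_packet(MaterialEntry *e, unsigned index)
{
    e->tag = index;
    e->valid = 1;
}

static const MaterialEntry *lookup_material(unsigned index)
{
    MaterialEntry *e = &material_cache[index % MAT_ENTRIES];
    if (!e->valid || e->tag != index)
        fill_from_miss_packet(e, index);   /* miss: new object's material */
    return e;
}
```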
- the information provided in a material cache miss packet 11906 includes: a unique material cache index 14001h; and texture information for each texel associated with the object described by the material cache miss packet, describing how to use the texel, including:
- fragment material information, including emissive, ambient, diffuse, specular, and shininess characteristics for the object;
- the Phong block 14000 includes a light cache 14154 (Figures 34, 35) that holds light information for one or more lights used in the illumination computation. This information typically changes once per frame. Thus, in contrast to the material cache, light cache misses are unlikely. Accordingly, the bandwidth for light cache misses should be negligible.
- Fragment 11000 provides a light index 14001g (Figure 34) that identifies the particular light information to be used in the illumination computation associated with the fragment to be illuminated. In one embodiment this light index is transmitted as part of the half- and full-rate fragment packets 11902, 11904.
- Phong 14000 issues a message that causes Fragment 11000 to return a light cache miss packet 11908 from Mode Injection 10000 that is written into the light cache 14154.
- the light cache miss packet includes:
- Header field sHead (6 bits) and packet length in 16-bit units, packLength (8 bits). (The remainder of the packet field table is not recoverable here.)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US09/379,144 US6577317B1 (en) | 1998-08-20 | 1999-08-20 | Apparatus and method for geometry operations in a 3D-graphics pipeline |
AU55765/99A AU5576599A (en) | 1998-08-20 | 1999-08-20 | How to do tangent space lighting in a deferred shading architecture |
Applications Claiming Priority (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US9733698P | 1998-08-20 | 1998-08-20 | |
US60/097,336 | 1998-08-20 | ||
US09/213,990 US6771264B1 (en) | 1998-08-20 | 1998-12-17 | Method and apparatus for performing tangent space lighting and bump mapping in a deferred shading graphics processor |
US09/213,990 | 1998-12-17 |
Publications (3)
Publication Number | Publication Date |
---|---|
WO2000011614A2 true WO2000011614A2 (en) | 2000-03-02 |
WO2000011614A3 WO2000011614A3 (en) | 2000-06-15 |
WO2000011614B1 WO2000011614B1 (en) | 2000-07-27 |
Family
ID=26793137
Family Applications (6)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1999/019036 WO2000011614A2 (en) | 1998-08-20 | 1999-08-20 | Tangent space lighting in a deferred shading architecture |
PCT/US1999/018971 WO2000030040A1 (en) | 1998-08-20 | 1999-08-20 | Advanced deferred shading graphics pipeline processor |
PCT/US1999/019241 WO2000011604A2 (en) | 1998-08-20 | 1999-08-20 | Apparatus and method for geometry operations in a 3d-graphics pipeline |
PCT/US1999/019190 WO2000011613A2 (en) | 1998-08-20 | 1999-08-20 | Performing hidden surface removal in a graphics processor with deferred shading |
PCT/US1999/019254 WO2000019377A1 (en) | 1998-08-20 | 1999-08-20 | Graphics processor with deferred shading |
PCT/US1999/019363 WO2000011605A2 (en) | 1998-08-20 | 1999-08-20 | Fragment operations in a 3d-graphics pipeline |
Family Applications After (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/US1999/018971 WO2000030040A1 (en) | 1998-08-20 | 1999-08-20 | Advanced deferred shading graphics pipeline processor |
PCT/US1999/019241 WO2000011604A2 (en) | 1998-08-20 | 1999-08-20 | Apparatus and method for geometry operations in a 3d-graphics pipeline |
PCT/US1999/019190 WO2000011613A2 (en) | 1998-08-20 | 1999-08-20 | Performing hidden surface removal in a graphics processor with deferred shading |
PCT/US1999/019254 WO2000019377A1 (en) | 1998-08-20 | 1999-08-20 | Graphics processor with deferred shading |
PCT/US1999/019363 WO2000011605A2 (en) | 1998-08-20 | 1999-08-20 | Fragment operations in a 3d-graphics pipeline |
Country Status (6)
Country | Link |
---|---|
US (5) | US6771264B1 (en) |
EP (2) | EP1138023A4 (en) |
JP (3) | JP3657519B2 (en) |
KR (2) | KR100485241B1 (en) |
AU (6) | AU5687599A (en) |
WO (6) | WO2000011614A2 (en) |
Cited By (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6532013B1 (en) | 2000-05-31 | 2003-03-11 | Nvidia Corporation | System, method and article of manufacture for pixel shaders for programmable shading |
US6664963B1 (en) | 2000-05-31 | 2003-12-16 | Nvidia Corporation | System, method and computer program product for programmable shading using pixel shaders |
US6690372B2 (en) | 2000-05-31 | 2004-02-10 | Nvidia Corporation | System, method and article of manufacture for shadow mapping |
US6697064B1 (en) | 2001-06-08 | 2004-02-24 | Nvidia Corporation | System, method and computer program product for matrix tracking during vertex processing in a graphics pipeline |
US6704025B1 (en) | 2001-08-31 | 2004-03-09 | Nvidia Corporation | System and method for dual-depth shadow-mapping |
US6734861B1 (en) | 2000-05-31 | 2004-05-11 | Nvidia Corporation | System, method and article of manufacture for an interlock module in a computer graphics processing pipeline |
US6778181B1 (en) | 2000-12-07 | 2004-08-17 | Nvidia Corporation | Graphics processing system having a virtual texturing array |
US6844880B1 (en) | 1999-12-06 | 2005-01-18 | Nvidia Corporation | System, method and computer program product for an improved programmable vertex processing model with instruction set |
US6870540B1 (en) * | 1999-12-06 | 2005-03-22 | Nvidia Corporation | System, method and computer program product for a programmable pixel processing model with instruction set |
US7006101B1 (en) | 2001-06-08 | 2006-02-28 | Nvidia Corporation | Graphics API with branching capabilities |
US7009605B2 (en) | 2002-03-20 | 2006-03-07 | Nvidia Corporation | System, method and computer program product for generating a shader program |
US7009615B1 (en) | 2001-11-30 | 2006-03-07 | Nvidia Corporation | Floating point buffer system and method for use during programmable fragment processing in a graphics pipeline |
US7023437B1 (en) | 1998-07-22 | 2006-04-04 | Nvidia Corporation | System and method for accelerating graphics processing using a post-geometry data stream during multiple-pass rendering |
US7162716B2 (en) | 2001-06-08 | 2007-01-09 | Nvidia Corporation | Software emulator for optimizing application-programmable vertex processing |
US7170513B1 (en) | 1998-07-22 | 2007-01-30 | Nvidia Corporation | System and method for display list occlusion branching |
US7209140B1 (en) | 1999-12-06 | 2007-04-24 | Nvidia Corporation | System, method and article of manufacture for a programmable vertex processing model with instruction set |
US7286133B2 (en) | 2001-06-08 | 2007-10-23 | Nvidia Corporation | System, method and computer program product for programmable fragment processing |
US7456838B1 (en) | 2001-06-08 | 2008-11-25 | Nvidia Corporation | System and method for converting a vertex program to a binary format capable of being executed by a hardware graphics pipeline |
Families Citing this family (636)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8253729B1 (en) * | 1983-05-09 | 2012-08-28 | Geshwind David M | Trimming depth buffer during 2D to 3D conversion |
US6590996B1 (en) * | 2000-02-14 | 2003-07-08 | Digimarc Corporation | Color adaptive watermarking |
US7375727B1 (en) * | 1998-07-22 | 2008-05-20 | Nvidia Corporation | System, method and computer program product for geometrically transforming geometric objects |
US6480205B1 (en) | 1998-07-22 | 2002-11-12 | Nvidia Corporation | Method and apparatus for occlusion culling in graphics systems |
US6552723B1 (en) * | 1998-08-20 | 2003-04-22 | Apple Computer, Inc. | System, apparatus and method for spatially sorting image data in a three-dimensional graphics pipeline |
US6771264B1 (en) * | 1998-08-20 | 2004-08-03 | Apple Computer, Inc. | Method and apparatus for performing tangent space lighting and bump mapping in a deferred shading graphics processor |
US6978045B1 (en) * | 1998-10-02 | 2005-12-20 | Minolta Co., Ltd. | Image-processing apparatus |
GB2343601B (en) * | 1998-11-06 | 2002-11-27 | Videologic Ltd | Shading and texturing 3-dimensional computer generated images |
US6509905B2 (en) * | 1998-11-12 | 2003-01-21 | Hewlett-Packard Company | Method and apparatus for performing a perspective projection in a graphics device of a computer graphics display system |
JP3258286B2 (en) * | 1998-12-15 | 2002-02-18 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Drawing method and drawing apparatus for displaying image data of a plurality of objects in which translucent and opaque objects are mixed on a computer display screen |
US7224364B1 (en) * | 1999-02-03 | 2007-05-29 | Ati International Srl | Optimal initial rasterization starting point |
US6466223B1 (en) * | 1999-03-24 | 2002-10-15 | Microsoft Corporation | Method and apparatus for texture memory management |
US6791569B1 (en) * | 1999-07-01 | 2004-09-14 | Microsoft Corporation | Antialiasing method using barycentric coordinates applied to lines |
US6628836B1 (en) * | 1999-10-05 | 2003-09-30 | Hewlett-Packard Development Company, L.P. | Sort middle, screen space, graphics geometry compression through redundancy elimination |
JP3950926B2 (en) * | 1999-11-30 | 2007-08-01 | エーユー オプトロニクス コーポレイション | Image display method, host device, image display device, and display interface |
US6848029B2 (en) | 2000-01-03 | 2005-01-25 | Dirk Coldewey | Method and apparatus for prefetching recursive data structures |
US7058636B2 (en) * | 2000-01-03 | 2006-06-06 | Dirk Coldewey | Method for prefetching recursive data structure traversals |
US6731297B1 (en) * | 2000-01-11 | 2004-05-04 | Intel Corporation | Multiple texture compositing |
US7483042B1 (en) * | 2000-01-13 | 2009-01-27 | Ati International, Srl | Video graphics module capable of blending multiple image layers |
US6995761B1 (en) * | 2000-01-14 | 2006-02-07 | California Institute Of Technology | Compression of 3D surfaces using progressive geometry |
US7116334B2 (en) * | 2000-01-28 | 2006-10-03 | Namco Bandai Games Inc. | Game system and image creating method |
US20020009293A1 (en) * | 2000-02-03 | 2002-01-24 | Aldrich Kipp A. | HDTV video server |
JP3349490B2 (en) * | 2000-02-14 | 2002-11-25 | インターナショナル・ビジネス・マシーンズ・コーポレーション | Image display method, image display system, host device, image display device, and display interface |
US7159041B2 (en) * | 2000-03-07 | 2007-01-02 | Microsoft Corporation | Method and system for defining and controlling algorithmic elements in a graphics display system |
US7098925B1 (en) * | 2000-03-10 | 2006-08-29 | Intel Corporation | Shading of images using texture |
EP1269418A1 (en) * | 2000-03-31 | 2003-01-02 | Intel Corporation | Tiled graphics architecture |
US7038811B1 (en) * | 2000-03-31 | 2006-05-02 | Canon Kabushiki Kaisha | Standardized device characterization |
US6819321B1 (en) * | 2000-03-31 | 2004-11-16 | Intel Corporation | Method and apparatus for processing 2D operations in a tiled graphics architecture |
US7119813B1 (en) * | 2000-06-02 | 2006-10-10 | Nintendo Co., Ltd. | Variable bit field encoding |
US7032031B2 (en) * | 2000-06-23 | 2006-04-18 | Cloudshield Technologies, Inc. | Edge adapter apparatus and method |
US7405734B2 (en) * | 2000-07-18 | 2008-07-29 | Silicon Graphics, Inc. | Method and system for presenting three-dimensional computer graphics images using multiple graphics processing units |
US6963347B1 (en) * | 2000-08-04 | 2005-11-08 | Ati International, Srl | Vertex data processing with multiple threads of execution |
US6980218B1 (en) * | 2000-08-23 | 2005-12-27 | Nintendo Co., Ltd. | Method and apparatus for efficient generation of texture coordinate displacements for implementing emboss-style bump mapping in a graphics rendering system |
US6999100B1 (en) | 2000-08-23 | 2006-02-14 | Nintendo Co., Ltd. | Method and apparatus for anti-aliasing in a graphics system |
US6825851B1 (en) | 2000-08-23 | 2004-11-30 | Nintendo Co., Ltd. | Method and apparatus for environment-mapped bump-mapping in a graphics system |
US7061502B1 (en) * | 2000-08-23 | 2006-06-13 | Nintendo Co., Ltd. | Method and apparatus for providing logical combination of N alpha operations within a graphics system |
US7002591B1 (en) * | 2000-08-23 | 2006-02-21 | Nintendo Co., Ltd. | Method and apparatus for interleaved processing of direct and indirect texture coordinates in a graphics system |
US8692844B1 (en) * | 2000-09-28 | 2014-04-08 | Nvidia Corporation | Method and system for efficient antialiased rendering |
US6828980B1 (en) * | 2000-10-02 | 2004-12-07 | Nvidia Corporation | System, method and computer program product for z-texture mapping |
US6914618B2 (en) * | 2000-11-02 | 2005-07-05 | Sun Microsystems, Inc. | Methods and systems for producing A 3-D rotational image from A 2-D image |
US7079133B2 (en) * | 2000-11-16 | 2006-07-18 | S3 Graphics Co., Ltd. | Superscalar 3D graphics engine |
JP3705739B2 (en) * | 2000-12-11 | 2005-10-12 | 株式会社ナムコ | Information storage medium and game device |
US6975320B1 (en) | 2000-12-12 | 2005-12-13 | Micron Technology, Inc. | Method and apparatus for level-of-detail computations |
US6664961B2 (en) * | 2000-12-20 | 2003-12-16 | Rutgers, The State University Of Nj | Resample and composite engine for real-time volume rendering |
US20030063095A1 (en) * | 2000-12-29 | 2003-04-03 | Sun Microsystems, Inc. | Statistic logic for collecting a histogram of pixel exponent values |
JP2002252770A (en) * | 2001-02-22 | 2002-09-06 | Matsushita Graphic Communication Systems Inc | Classification method for image information, image coding method, and image coder |
US6791559B2 (en) * | 2001-02-28 | 2004-09-14 | 3Dlabs Inc., Ltd | Parameter circular buffers |
US6828975B2 (en) * | 2001-03-01 | 2004-12-07 | Microsoft Corporation | Method and system for managing graphics objects in a graphics display system |
FR2822274B1 (en) * | 2001-03-13 | 2003-11-21 | Stephane Clement Francoi Rehel | METHOD FOR DISPLAYING AND HANDLING AN OBJECT IN THREE DIMENSIONS AND CORRESPONDING APPLICATIONS |
EP1258837A1 (en) * | 2001-05-14 | 2002-11-20 | Thomson Licensing S.A. | Method to generate mutual photometric effects |
US6859209B2 (en) * | 2001-05-18 | 2005-02-22 | Sun Microsystems, Inc. | Graphics data accumulation for improved multi-layer texture performance |
GB2378108B (en) | 2001-07-24 | 2005-08-17 | Imagination Tech Ltd | Three dimensional graphics system |
US6778189B1 (en) * | 2001-08-24 | 2004-08-17 | Nvidia Corporation | Two-sided stencil testing system and method |
US6734853B2 (en) * | 2001-08-28 | 2004-05-11 | Intel Corporation | Method of using view frustrum culling for scaleable collision detection |
US7145577B2 (en) * | 2001-08-31 | 2006-12-05 | Micron Technology, Inc. | System and method for multi-sampling primitives to reduce aliasing |
US6924820B2 (en) * | 2001-09-25 | 2005-08-02 | Sun Microsystems, Inc. | Over-evaluating samples during rasterization for improved datapath utilization |
WO2003032253A2 (en) | 2001-10-10 | 2003-04-17 | Sony Computer Entertainment America Inc. | System and method for environment mapping |
US6999076B2 (en) * | 2001-10-29 | 2006-02-14 | Ati Technologies, Inc. | System, method, and apparatus for early culling |
JP3761085B2 (en) * | 2001-11-27 | 2006-03-29 | 株式会社ソニー・コンピュータエンタテインメント | Image processing apparatus, components thereof, and rendering processing method |
KR100450836B1 (en) * | 2001-12-11 | 2004-10-01 | 삼성전자주식회사 | Apparatus for generating 3-dimensional image from 2-dimensional image |
US7426534B2 (en) * | 2001-12-19 | 2008-09-16 | International Business Machines Corporation | Method and system for caching message fragments using an expansion attribute in a fragment link tag |
US6816161B2 (en) * | 2002-01-30 | 2004-11-09 | Sun Microsystems, Inc. | Vertex assembly buffer and primitive launch buffer |
AU2003238511A1 (en) * | 2002-02-01 | 2003-09-02 | Koninklijke Philips Electronics N.V. | Stepless 3d texture mapping in computer graphics |
US6774895B1 (en) | 2002-02-01 | 2004-08-10 | Nvidia Corporation | System and method for depth clamping in a hardware graphics pipeline |
US7310103B2 (en) * | 2002-03-05 | 2007-12-18 | Sun Microsystems, Inc. | Pipelined 2D viewport clip circuit |
US7535913B2 (en) * | 2002-03-06 | 2009-05-19 | Nvidia Corporation | Gigabit ethernet adapter supporting the iSCSI and IPSEC protocols |
US7159212B2 (en) * | 2002-03-08 | 2007-01-02 | Electronic Arts Inc. | Systems and methods for implementing shader-driven compilation of rendering assets |
US6975322B2 (en) * | 2002-03-12 | 2005-12-13 | Sun Microsystems, Inc. | Dynamically adjusting a number of rendering passes in a graphics system |
US7015909B1 (en) * | 2002-03-19 | 2006-03-21 | Aechelon Technology, Inc. | Efficient use of user-defined shaders to implement graphics operations |
US8284844B2 (en) | 2002-04-01 | 2012-10-09 | Broadcom Corporation | Video decoding system supporting multiple standards |
US7376743B1 (en) * | 2002-04-02 | 2008-05-20 | Cisco Technology, Inc. | Method and apparatus for load balancing in a virtual private network |
US7009608B2 (en) * | 2002-06-06 | 2006-03-07 | Nvidia Corporation | System and method of using multiple representations per object in computer graphics |
US6771271B2 (en) * | 2002-06-13 | 2004-08-03 | Analog Devices, Inc. | Apparatus and method of processing image data |
AUPS300502A0 (en) * | 2002-06-17 | 2002-07-11 | Canon Kabushiki Kaisha | Generating one or more linear blends |
US6812927B1 (en) * | 2002-06-18 | 2004-11-02 | Nvidia Corporation | System and method for avoiding depth clears using a stencil buffer |
KR20030097507A (en) * | 2002-06-21 | 2003-12-31 | 삼성전자주식회사 | Color calibrator for flat panel display and method thereof |
US6977658B2 (en) * | 2002-06-27 | 2005-12-20 | Broadcom Corporation | System for and method of performing an opacity calculation in a 3D graphics system |
US6954215B2 (en) * | 2002-06-28 | 2005-10-11 | Microsoft Corporation | System and method for employing non-alpha channel image data in an alpha-channel-aware environment |
JP3845045B2 (en) * | 2002-07-23 | 2006-11-15 | 株式会社リコー | Image processing apparatus, image processing method, image forming apparatus, printing apparatus, and host PC |
FR2842977A1 (en) * | 2002-07-24 | 2004-01-30 | Total Immersion | METHOD AND SYSTEM FOR ENABLING A USER TO MIX REAL-TIME SYNTHESIS IMAGES WITH VIDEO IMAGES |
US7002599B2 (en) * | 2002-07-26 | 2006-02-21 | Sun Microsystems, Inc. | Method and apparatus for hardware acceleration of clipping and graphical fill in display systems |
US6857108B2 (en) * | 2002-07-31 | 2005-02-15 | Lsi Logic Corporation | Interactive representation of structural dependencies in semiconductor design flows |
US7257519B2 (en) * | 2002-08-02 | 2007-08-14 | Evans & Sutherland Computer Corporation | System and method for weighted correction of an eyepoint position |
US7176917B1 (en) | 2002-08-09 | 2007-02-13 | Avid Technology, Inc. | Visual programming interface for a three-dimensional animation system for defining real time shaders using a real-time rendering engine application programming interface |
US7508398B1 (en) | 2002-08-27 | 2009-03-24 | Nvidia Corporation | Transparent antialiased memory access |
US20040088682A1 (en) * | 2002-11-05 | 2004-05-06 | Thompson Ryan C. | Method, program product, and apparatus for cache entry tracking, collision detection, and address reasignment in processor testcases |
US7242400B2 (en) * | 2002-11-13 | 2007-07-10 | Ati Technologies Ulc | Compression and decompression of data using plane equations |
US7633506B1 (en) * | 2002-11-27 | 2009-12-15 | Ati Technologies Ulc | Parallel pipeline graphics system |
US7656416B2 (en) * | 2002-11-27 | 2010-02-02 | Ati Technologies, Inc. | Apparatus for generating anti-aliased and stippled 3d lines, points and surfaces using multi-dimensional procedural texture coordinates |
JPWO2004055697A1 (en) * | 2002-12-13 | 2006-04-20 | 富士通株式会社 | Processing method, processing apparatus, and computer program |
US7928997B2 (en) * | 2003-02-06 | 2011-04-19 | Nvidia Corporation | Digital image compositing using a programmable graphics processor |
US8749561B1 (en) * | 2003-03-14 | 2014-06-10 | Nvidia Corporation | Method and system for coordinated data execution using a primary graphics processor and a secondary graphics processor |
CN100557593C (en) * | 2003-04-03 | 2009-11-04 | Nxp股份有限公司 | Multiple pipeline disposal system and the integrated circuit that is combined with this system |
US7259765B2 (en) | 2003-04-04 | 2007-08-21 | S3 Graphics Co., Ltd. | Head/data scheduling in 3D graphics |
US7148888B2 (en) * | 2003-04-04 | 2006-12-12 | Via Technologies, Inc. | Head/data request in 3D graphics |
US7714858B2 (en) * | 2003-04-18 | 2010-05-11 | Hewlett-Packard Development Company, L.P. | Distributed rendering of interactive soft shadows |
JP3966832B2 (en) | 2003-04-28 | 2007-08-29 | 株式会社東芝 | Drawing processing apparatus and drawing processing method |
US7218331B2 (en) * | 2003-05-13 | 2007-05-15 | Via Technologies, Inc. | Bounding box in 3D graphics |
US20050017969A1 (en) * | 2003-05-27 | 2005-01-27 | Pradeep Sen | Computer graphics rendering using boundary information |
US7681112B1 (en) | 2003-05-30 | 2010-03-16 | Adobe Systems Incorporated | Embedded reuse meta information |
US7852405B1 (en) * | 2003-06-27 | 2010-12-14 | Zoran Corporation | Method and apparatus for high definition capture |
US8275910B1 (en) * | 2003-07-02 | 2012-09-25 | Apple Inc. | Source packet bridge |
US7164420B2 (en) * | 2003-07-24 | 2007-01-16 | Autodesk, Inc. | Ray tracing hierarchy |
WO2005013066A2 (en) * | 2003-07-25 | 2005-02-10 | New York University | Logic arrangement, data structure, system and method for miltilinear representation of multimodal data ensembles for synthesis, rotation and compression |
US7139005B2 (en) * | 2003-09-13 | 2006-11-21 | Microsoft Corporation | Optimized fixed-point mathematical library and graphics functions for a software-implemented graphics rendering system and method using a normalized homogenous coordinate system |
US8775997B2 (en) | 2003-09-15 | 2014-07-08 | Nvidia Corporation | System and method for testing and configuring semiconductor functional circuits |
US8732644B1 (en) | 2003-09-15 | 2014-05-20 | Nvidia Corporation | Micro electro mechanical switch system and method for testing and configuring semiconductor functional circuits |
US8775112B2 (en) | 2003-09-15 | 2014-07-08 | Nvidia Corporation | System and method for increasing die yield |
US7528830B2 (en) * | 2003-09-17 | 2009-05-05 | Koninklijke Philips Electronics N.V. | System and method for rendering 3-D images on a 3-D image display screen |
US7593010B2 (en) * | 2003-09-18 | 2009-09-22 | Microsoft Corporation | Software-implemented transform and lighting module and pipeline for graphics rendering on embedded platforms using a fixed-point normalized homogenous coordinate system |
JP2005100176A (en) * | 2003-09-25 | 2005-04-14 | Sony Corp | Image processor and its method |
JP4183082B2 (en) * | 2003-09-26 | 2008-11-19 | シャープ株式会社 | 3D image drawing apparatus and 3D image drawing method |
KR100546383B1 (en) * | 2003-09-29 | 2006-01-26 | 삼성전자주식회사 | 3D graphics rendering engine for processing an invisible fragment and method thereof |
US8133115B2 (en) | 2003-10-22 | 2012-03-13 | Sony Computer Entertainment America Llc | System and method for recording and displaying a graphical path in a video game |
US7139003B1 (en) * | 2003-12-15 | 2006-11-21 | Nvidia Corporation | Methods of processing graphics data including reading and writing buffers |
US8174531B1 (en) | 2003-10-29 | 2012-05-08 | Nvidia Corporation | Programmable graphics processor for multithreaded execution of programs |
US8860737B2 (en) * | 2003-10-29 | 2014-10-14 | Nvidia Corporation | Programmable graphics processor for multithreaded execution of programs |
US7836276B2 (en) * | 2005-12-02 | 2010-11-16 | Nvidia Corporation | System and method for processing thread groups in a SIMD architecture |
US7978197B2 (en) * | 2003-11-14 | 2011-07-12 | Microsoft Corporation | Systems and methods for downloading algorithmic elements to a coprocessor and corresponding techniques |
KR20050047741A (en) * | 2003-11-18 | 2005-05-23 | 삼성전자주식회사 | Image processing device and method thereof |
US7015914B1 (en) * | 2003-12-10 | 2006-03-21 | Nvidia Corporation | Multiple data buffers for processing graphics data |
US7053893B1 (en) * | 2003-12-15 | 2006-05-30 | Nvidia Corporation | Position conflict detection and avoidance in a programmable graphics processor using tile coverage data |
US7053904B1 (en) * | 2003-12-15 | 2006-05-30 | Nvidia Corporation | Position conflict detection and avoidance in a programmable graphics processor |
US7102645B2 (en) * | 2003-12-15 | 2006-09-05 | Seiko Epson Corporation | Graphics display controller providing enhanced read/write efficiency for interfacing with a RAM-integrated graphics display device |
US7420568B1 (en) * | 2003-12-17 | 2008-09-02 | Nvidia Corporation | System and method for packing data in different formats in a tiled graphics memory |
US8711161B1 (en) | 2003-12-18 | 2014-04-29 | Nvidia Corporation | Functional component compensation reconfiguration system and method |
US7221368B1 (en) * | 2003-12-18 | 2007-05-22 | Nvidia Corporation | Stippled lines using direct distance evaluation |
US7450120B1 (en) | 2003-12-19 | 2008-11-11 | Nvidia Corporation | Apparatus, system, and method for Z-culling |
US7995056B1 (en) | 2003-12-22 | 2011-08-09 | Nvidia Corporation | Culling data selection system and method |
US8269769B1 (en) | 2003-12-22 | 2012-09-18 | Nvidia Corporation | Occlusion prediction compression system and method |
US8854364B1 (en) * | 2003-12-22 | 2014-10-07 | Nvidia Corporation | Tight depth range occlusion prediction system and method |
US8390619B1 (en) * | 2003-12-22 | 2013-03-05 | Nvidia Corporation | Occlusion prediction graphics processing system and method |
US7433364B2 (en) * | 2003-12-24 | 2008-10-07 | Intel Corporation | Method for optimizing queuing performance |
US9098943B1 (en) * | 2003-12-31 | 2015-08-04 | Ziilabs Inc., Ltd. | Multiple simultaneous bin sizes |
US8643659B1 (en) | 2003-12-31 | 2014-02-04 | 3Dlabs Inc., Ltd. | Shader with global and instruction caches |
US7281122B2 (en) * | 2004-01-14 | 2007-10-09 | Ati Technologies Inc. | Method and apparatus for nested control flow of instructions using context information and instructions having extra bits |
US20050195186A1 (en) * | 2004-03-02 | 2005-09-08 | Ati Technologies Inc. | Method and apparatus for object based visibility culling |
FI117655B (en) * | 2004-03-25 | 2006-12-29 | Cadfaster Oy | A method for processing a computer-aided polygon model, a device and a computer program |
US7609902B2 (en) * | 2004-04-13 | 2009-10-27 | Microsoft Corporation | Implementation of discrete cosine transformation and its inverse on programmable graphics processor |
US8704837B2 (en) * | 2004-04-16 | 2014-04-22 | Apple Inc. | High-level program interface for graphics operations |
US7847800B2 (en) * | 2004-04-16 | 2010-12-07 | Apple Inc. | System for emulating graphics operations |
US7248265B2 (en) * | 2004-04-16 | 2007-07-24 | Apple Inc. | System and method for processing graphics operations with graphics processing unit |
US8134561B2 (en) | 2004-04-16 | 2012-03-13 | Apple Inc. | System for optimizing graphics operations |
US7231632B2 (en) * | 2004-04-16 | 2007-06-12 | Apple Computer, Inc. | System for reducing the number of programs necessary to render an image |
US7636489B2 (en) * | 2004-04-16 | 2009-12-22 | Apple Inc. | Blur computation algorithm |
KR100601952B1 (en) * | 2004-04-20 | 2006-07-14 | 삼성전자주식회사 | Apparatus and method for reconstitution of three-dimensional graphic data |
US8432394B1 (en) | 2004-05-14 | 2013-04-30 | Nvidia Corporation | Method and system for implementing clamped z value interpolation in a raster stage of a graphics pipeline |
US7190366B2 (en) * | 2004-05-14 | 2007-03-13 | Nvidia Corporation | Method and system for a general instruction raster stage that generates programmable pixel packets |
US8736628B1 (en) | 2004-05-14 | 2014-05-27 | Nvidia Corporation | Single thread graphics processing system and method |
US7091982B2 (en) * | 2004-05-14 | 2006-08-15 | Nvidia Corporation | Low power programmable processor |
US8711155B2 (en) * | 2004-05-14 | 2014-04-29 | Nvidia Corporation | Early kill removal graphics processing system and method |
US8687010B1 (en) | 2004-05-14 | 2014-04-01 | Nvidia Corporation | Arbitrary size texture palettes for use in graphics systems |
US8736620B2 (en) * | 2004-05-14 | 2014-05-27 | Nvidia Corporation | Kill bit graphics processing system and method |
US7079156B1 (en) * | 2004-05-14 | 2006-07-18 | Nvidia Corporation | Method and system for implementing multiple high precision and low precision interpolators for a graphics pipeline |
US8860722B2 (en) * | 2004-05-14 | 2014-10-14 | Nvidia Corporation | Early Z scoreboard tracking system and method |
EP1759380B1 (en) * | 2004-05-14 | 2011-11-16 | NVIDIA Corporation | Low power programmable processor |
US8743142B1 (en) | 2004-05-14 | 2014-06-03 | Nvidia Corporation | Unified data fetch graphics processing system and method |
US20060007234A1 (en) * | 2004-05-14 | 2006-01-12 | Hutchins Edward A | Coincident graphics pixel scoreboard tracking system and method |
US7389006B2 (en) * | 2004-05-14 | 2008-06-17 | Nvidia Corporation | Auto software configurable register address space for low power programmable processor |
US8416242B1 (en) | 2004-05-14 | 2013-04-09 | Nvidia Corporation | Method and system for interpolating level-of-detail in graphics processors |
US8411105B1 (en) | 2004-05-14 | 2013-04-02 | Nvidia Corporation | Method and system for computing pixel parameters |
JP4451717B2 (en) | 2004-05-31 | 2010-04-14 | 株式会社ソニー・コンピュータエンタテインメント | Information processing apparatus and information processing method |
US20050275733A1 (en) * | 2004-06-10 | 2005-12-15 | Philip Chao | Method and apparatus of rendering a video image by polynomial evaluation |
US7382377B1 (en) * | 2004-06-17 | 2008-06-03 | Nvidia Corporation | Render to texture cull |
CN100476499C (en) | 2004-06-23 | 2009-04-08 | 艺术科学魁恩传媒公司 | Sculptural imaging with optical tiles |
US8130237B2 (en) * | 2004-06-24 | 2012-03-06 | Apple Inc. | Resolution independent user interface design |
US7397964B2 (en) * | 2004-06-24 | 2008-07-08 | Apple Inc. | Gaussian blur approximation suitable for GPU |
US8068103B2 (en) * | 2004-06-24 | 2011-11-29 | Apple Inc. | User-interface design |
US7490295B2 (en) | 2004-06-25 | 2009-02-10 | Apple Inc. | Layer for accessing user interface elements |
US20050285866A1 (en) * | 2004-06-25 | 2005-12-29 | Apple Computer, Inc. | Display-wide visual effects for a windowing system using a programmable graphics processing unit |
US8566732B2 (en) | 2004-06-25 | 2013-10-22 | Apple Inc. | Synchronization of widgets and dashboards |
US7761800B2 (en) | 2004-06-25 | 2010-07-20 | Apple Inc. | Unified interest layer for user interface |
US7652678B2 (en) * | 2004-06-25 | 2010-01-26 | Apple Inc. | Partial display updates in a windowing system using a programmable graphics processing unit |
US8239749B2 (en) | 2004-06-25 | 2012-08-07 | Apple Inc. | Procedurally expressing graphic objects for web pages |
US8302020B2 (en) | 2004-06-25 | 2012-10-30 | Apple Inc. | Widget authoring and editing environment |
US7546543B2 (en) | 2004-06-25 | 2009-06-09 | Apple Inc. | Widget authoring and editing environment |
US8453065B2 (en) | 2004-06-25 | 2013-05-28 | Apple Inc. | Preview and installation of user interface elements in a display environment |
US7755629B2 (en) * | 2004-06-30 | 2010-07-13 | Canon Kabushiki Kaisha | Method of rendering graphic objects |
US7518608B2 (en) * | 2004-07-30 | 2009-04-14 | Sony Corporation | Z-depth matting of particles in image rendering |
US7256796B1 (en) * | 2004-08-03 | 2007-08-14 | Nvidia Corporation | Per-fragment control for writing an output buffer |
US7400325B1 (en) * | 2004-08-06 | 2008-07-15 | Nvidia Corporation | Culling before setup in viewport and culling unit |
US20060033736A1 (en) * | 2004-08-10 | 2006-02-16 | Wang Andy W | Enhanced Color and Lighting Model for Computer Graphics Productions |
WO2006026265A2 (en) * | 2004-08-31 | 2006-03-09 | Silicon Optix | Method and apparatus for reading and writing pixel-aligned subframes in a frame buffer |
US7218291B2 (en) * | 2004-09-13 | 2007-05-15 | Nvidia Corporation | Increased scalability in the fragment shading pipeline |
US8723231B1 (en) | 2004-09-15 | 2014-05-13 | Nvidia Corporation | Semiconductor die micro electro-mechanical switch management system and method |
US7286139B2 (en) * | 2004-09-17 | 2007-10-23 | Via Technologies, Inc. | Partial guardband clipping |
US20060061577A1 (en) * | 2004-09-22 | 2006-03-23 | Vijay Subramaniam | Efficient interface and assembler for a graphics processor |
US8711156B1 (en) | 2004-09-30 | 2014-04-29 | Nvidia Corporation | Method and system for remapping processing elements in a pipeline of a graphics processing unit |
US20060071933A1 (en) | 2004-10-06 | 2006-04-06 | Sony Computer Entertainment Inc. | Application binary interface for multi-pass shaders |
US20060082577A1 (en) * | 2004-10-20 | 2006-04-20 | Ugs Corp. | System, method, and computer program product for dynamic shader generation |
US7385604B1 (en) * | 2004-11-04 | 2008-06-10 | Nvidia Corporation | Fragment scattering |
JP4692956B2 (en) * | 2004-11-22 | 2011-06-01 | 株式会社ソニー・コンピュータエンタテインメント | Drawing processing apparatus and drawing processing method |
US7227551B2 (en) * | 2004-12-23 | 2007-06-05 | Apple Inc. | Manipulating text and graphic appearance |
US8140975B2 (en) | 2005-01-07 | 2012-03-20 | Apple Inc. | Slide show navigation |
US7209139B1 (en) * | 2005-01-07 | 2007-04-24 | Electronic Arts | Efficient rendering of similar objects in a three-dimensional graphics engine |
JP4812073B2 (en) * | 2005-01-31 | 2011-11-09 | キヤノン株式会社 | Image capturing apparatus, image capturing method, program, and recording medium |
KR100612890B1 (en) * | 2005-02-17 | 2006-08-14 | 삼성전자주식회사 | Multi-effect expression method and apparatus in 3-dimension graphic image |
US7242169B2 (en) * | 2005-03-01 | 2007-07-10 | Apple Inc. | Method and apparatus for voltage compensation for parasitic impedance |
US8089486B2 (en) * | 2005-03-21 | 2012-01-03 | Qualcomm Incorporated | Tiled prefetched and cached depth buffer |
CA2597436C (en) * | 2005-03-24 | 2011-09-20 | Lg Electronics Inc. | Method of executing scanning in broadband wireless access system |
JP2006293553A (en) * | 2005-04-07 | 2006-10-26 | Aisin Aw Co Ltd | Rotation processor for font data and map display system |
US7479965B1 (en) * | 2005-04-12 | 2009-01-20 | Nvidia Corporation | Optimized alpha blend for anti-aliased render |
US9363481B2 (en) * | 2005-04-22 | 2016-06-07 | Microsoft Technology Licensing, Llc | Protected media pipeline |
US7499051B1 (en) | 2005-04-29 | 2009-03-03 | Adobe Systems Incorporated | GPU assisted 3D compositing |
US7463261B1 (en) * | 2005-04-29 | 2008-12-09 | Adobe Systems Incorporated | Three-dimensional image compositing on a GPU utilizing multiple transformations |
US7802028B2 (en) * | 2005-05-02 | 2010-09-21 | Broadcom Corporation | Total dynamic sharing of a transaction queue |
US7349066B2 (en) * | 2005-05-05 | 2008-03-25 | Asml Masktools B.V. | Apparatus, method and computer program product for performing a model based optical proximity correction factoring neighbor influence |
US8427496B1 (en) | 2005-05-13 | 2013-04-23 | Nvidia Corporation | Method and system for implementing compression across a graphics bus interconnect |
US8386628B1 (en) * | 2005-05-23 | 2013-02-26 | Glance Networks, Inc. | Method and apparatus for reducing the amount of information that must be transmitted to slower viewers over a remote viewing session |
US7894528B2 (en) * | 2005-05-25 | 2011-02-22 | Yissum Research Development Company Of The Hebrew University Of Jerusalem | Fast and robust motion computations using direct methods |
US8543931B2 (en) | 2005-06-07 | 2013-09-24 | Apple Inc. | Preview including theme based installation of user interface elements in a display environment |
US7636126B2 (en) | 2005-06-22 | 2009-12-22 | Sony Computer Entertainment Inc. | Delay matching in audio/video systems |
US9298311B2 (en) * | 2005-06-23 | 2016-03-29 | Apple Inc. | Trackpad sensitivity compensation |
US7432937B2 (en) * | 2005-06-30 | 2008-10-07 | Intel Corporation | System and method for concave polygon rasterization |
US7496416B2 (en) | 2005-08-01 | 2009-02-24 | Luxology, Llc | Input/output curve editor |
US20070035553A1 (en) * | 2005-08-12 | 2007-02-15 | Microsoft Corporation | General framework for aligning textures |
US7436412B2 (en) * | 2005-08-24 | 2008-10-14 | Qualcomm Incorporated | Graphics engine with efficient interpolation |
US7551177B2 (en) * | 2005-08-31 | 2009-06-23 | Ati Technologies, Inc. | Methods and apparatus for retrieving and combining samples of graphics information |
US8014615B2 (en) * | 2005-09-02 | 2011-09-06 | Adobe Systems Incorporated | System and method for decompressing video data and alpha channel data using a single stream |
US8189908B2 (en) * | 2005-09-02 | 2012-05-29 | Adobe Systems, Inc. | System and method for compressing video data and alpha channel data using a single stream |
US7433191B2 (en) * | 2005-09-30 | 2008-10-07 | Apple Inc. | Thermal contact arrangement |
US7441230B2 (en) | 2005-10-07 | 2008-10-21 | Lucasfilm Entertainment Company Ltd. | Method of utilizing product proxies with a dependency graph |
US8144149B2 (en) * | 2005-10-14 | 2012-03-27 | Via Technologies, Inc. | System and method for dynamically load balancing multiple shader stages in a shared pool of processing units |
US8266232B2 (en) * | 2005-10-15 | 2012-09-11 | International Business Machines Corporation | Hardware processing of commands within virtual client computing environment |
US7752556B2 (en) | 2005-10-27 | 2010-07-06 | Apple Inc. | Workflow widgets |
US7743336B2 (en) | 2005-10-27 | 2010-06-22 | Apple Inc. | Widget security |
US7954064B2 (en) | 2005-10-27 | 2011-05-31 | Apple Inc. | Multiple dashboards |
US8543824B2 (en) | 2005-10-27 | 2013-09-24 | Apple Inc. | Safe distribution and use of content |
US9104294B2 (en) | 2005-10-27 | 2015-08-11 | Apple Inc. | Linked widgets |
US7414624B2 (en) * | 2005-10-28 | 2008-08-19 | Intel Corporation | Apparatus and method for a frustum culling algorithm suitable for hardware implementation |
US20070097139A1 (en) * | 2005-11-02 | 2007-05-03 | Chao-Chin Chen | Method and apparatus of primitive filter in graphic process applications |
GB0524804D0 (en) | 2005-12-05 | 2006-01-11 | Falanx Microsystems As | Method of and apparatus for processing graphics |
US7934255B1 (en) * | 2005-11-08 | 2011-04-26 | Nvidia Corporation | Apparatus, system, and method for offloading packet classification |
US8294731B2 (en) * | 2005-11-15 | 2012-10-23 | Advanced Micro Devices, Inc. | Buffer management in vector graphics hardware |
US7707514B2 (en) | 2005-11-18 | 2010-04-27 | Apple Inc. | Management of user interface elements in a display environment |
US8624909B2 (en) * | 2005-11-21 | 2014-01-07 | Vixs Systems Inc. | Image processing system and method thereof |
US7598711B2 (en) * | 2005-11-23 | 2009-10-06 | Apple Inc. | Power source switchover apparatus and method |
WO2007063586A1 (en) * | 2005-11-30 | 2007-06-07 | Fujitsu Limited | Three-dimensional graphic apparatus, three-dimensional graphic method, three-dimensional program, and recording medium |
EP1960968A4 (en) * | 2005-12-01 | 2016-06-29 | Intel Corp | Computer graphics processor and method for rendering a three-dimensional image on a display screen |
US7616218B1 (en) | 2005-12-05 | 2009-11-10 | Nvidia Corporation | Apparatus, system, and method for clipping graphics primitives |
US7439988B1 (en) | 2005-12-05 | 2008-10-21 | Nvidia Corporation | Apparatus, system, and method for clipping graphics primitives with respect to a clipping plane |
US7434032B1 (en) | 2005-12-13 | 2008-10-07 | Nvidia Corporation | Tracking register usage during multithreaded processing using a scoreboard having separate memory regions and storing sequential register size indicators |
US7423642B2 (en) * | 2005-12-14 | 2008-09-09 | Winbond Electronics Corporation | Efficient video frame capturing |
US7593018B1 (en) * | 2005-12-14 | 2009-09-22 | Nvidia Corp. | Method and apparatus for providing explicit weights for texture filtering |
US8698811B1 (en) | 2005-12-15 | 2014-04-15 | Nvidia Corporation | Nested boustrophedonic patterns for rasterization |
US9123173B2 (en) * | 2005-12-15 | 2015-09-01 | Nvidia Corporation | Method for rasterizing non-rectangular tile groups in a raster stage of a graphics pipeline |
US8701091B1 (en) | 2005-12-15 | 2014-04-15 | Nvidia Corporation | Method and system for providing a generic console interface for a graphics application |
US9117309B1 (en) | 2005-12-19 | 2015-08-25 | Nvidia Corporation | Method and system for rendering polygons with a bounding box in a graphics processor unit |
US7420572B1 (en) * | 2005-12-19 | 2008-09-02 | Nvidia Corporation | Apparatus, system, and method for clipping graphics primitives with accelerated context switching |
US7791617B2 (en) * | 2005-12-19 | 2010-09-07 | Nvidia Corporation | Method and system for rendering polygons having abutting edges |
US8390645B1 (en) * | 2005-12-19 | 2013-03-05 | Nvidia Corporation | Method and system for rendering connecting antialiased line segments |
US7714877B1 (en) | 2005-12-19 | 2010-05-11 | Nvidia Corporation | Apparatus, system, and method for determining clipping distances |
US8300059B2 (en) * | 2006-02-03 | 2012-10-30 | Ati Technologies Ulc | Method and apparatus for selecting a mip map level based on a min-axis value for texture mapping |
JP4734137B2 (en) * | 2006-02-23 | 2011-07-27 | Bandai Namco Games Inc. | Program, information storage medium, and image generation system |
JP4734138B2 (en) * | 2006-02-23 | 2011-07-27 | Bandai Namco Games Inc. | Program, information storage medium, and image generation system |
JP4782583B2 (en) * | 2006-02-23 | 2011-09-28 | Bandai Namco Games Inc. | Program, information storage medium, and image generation system |
US8006236B1 (en) * | 2006-02-24 | 2011-08-23 | Nvidia Corporation | System and method for compiling high-level primitive programs into primitive program micro-code |
US8171461B1 (en) | 2006-02-24 | 2012-05-01 | Nvidia Corporation | Primitive program compilation for flat attributes with provoking vertex independence |
US7825933B1 (en) * | 2006-02-24 | 2010-11-02 | Nvidia Corporation | Managing primitive program vertex attributes as per-attribute arrays |
US7891012B1 (en) | 2006-03-01 | 2011-02-15 | Nvidia Corporation | Method and computer-usable medium for determining the authorization status of software |
US8452981B1 (en) | 2006-03-01 | 2013-05-28 | Nvidia Corporation | Method for author verification and software authorization |
TWI319166B (en) * | 2006-03-06 | 2010-01-01 | Via Tech Inc | Method and related apparatus for graphic processing |
JP2007287084A (en) * | 2006-04-20 | 2007-11-01 | Fuji Xerox Co Ltd | Image processor and program |
JP5085642B2 (en) * | 2006-04-20 | 2012-11-28 | Telefonaktiebolaget LM Ericsson (publ) | Method for compressing an image block, method for processing a compressed representation of an image block, block compressor and block decompressor |
JP2007287085A (en) * | 2006-04-20 | 2007-11-01 | Fuji Xerox Co Ltd | Program and device for processing images |
US8766995B2 (en) * | 2006-04-26 | 2014-07-01 | Qualcomm Incorporated | Graphics system with configurable caches |
WO2007130933A2 (en) * | 2006-05-01 | 2007-11-15 | Jeffrey W Bezanson | Apparatuses, methods and systems for vector operations and storage in matrix models |
US7965859B2 (en) | 2006-05-04 | 2011-06-21 | Sony Computer Entertainment Inc. | Lighting control of a user environment via a display device |
US7880746B2 (en) | 2006-05-04 | 2011-02-01 | Sony Computer Entertainment Inc. | Bandwidth management through lighting control of a user environment via a display device |
SG137754A1 (en) * | 2006-05-12 | 2007-12-28 | Nvidia Corp | Antialiasing using multiple display heads of a graphics processor |
US20070268289A1 (en) * | 2006-05-16 | 2007-11-22 | Chun Yu | Graphics system with dynamic reposition of depth engine |
US7395180B2 (en) * | 2006-05-17 | 2008-07-01 | Lockheed Martin Corporation | Efficient translation of data from a two-dimensional array to a wedge |
US8884972B2 (en) | 2006-05-25 | 2014-11-11 | Qualcomm Incorporated | Graphics processor with arithmetic and elementary function units |
US8869147B2 (en) * | 2006-05-31 | 2014-10-21 | Qualcomm Incorporated | Multi-threaded processor with deferred thread output control |
CA2652503C (en) * | 2006-06-09 | 2016-08-02 | Aisin Aw Co., Ltd. | Data updating system, terminal device, server, and method of data updating |
US8644643B2 (en) | 2006-06-14 | 2014-02-04 | Qualcomm Incorporated | Convolution filtering in a graphics processor |
US20070291031A1 (en) * | 2006-06-15 | 2007-12-20 | Right Hemisphere Limited | Three dimensional geometric data correction |
US7940262B2 (en) * | 2006-06-15 | 2011-05-10 | Right Hemisphere Limited | Unification and part hiding in three dimensional geometric data |
US8766996B2 (en) * | 2006-06-21 | 2014-07-01 | Qualcomm Incorporated | Unified virtual addressed register file |
US8928676B2 (en) * | 2006-06-23 | 2015-01-06 | Nvidia Corporation | Method for parallel fine rasterization in a raster stage of a graphics pipeline |
JP2008009696A (en) * | 2006-06-29 | 2008-01-17 | Fuji Xerox Co Ltd | Image processor and program |
JP4795138B2 (en) * | 2006-06-29 | 2011-10-19 | Fuji Xerox Co., Ltd. | Image processing apparatus and program |
US8477134B1 (en) | 2006-06-30 | 2013-07-02 | Nvidia Corporation | Conservative triage of polygon status using low precision edge evaluation and high precision edge evaluation |
US8284204B2 (en) * | 2006-06-30 | 2012-10-09 | Nokia Corporation | Apparatus, method and a computer program product for providing a unified graphics pipeline for stereoscopic rendering |
US8560495B1 (en) * | 2006-07-07 | 2013-10-15 | Sybase, Inc. | System and method for synchronizing message processing in a continuous processing system |
JP4979287B2 (en) * | 2006-07-14 | 2012-07-18 | Fuji Xerox Co., Ltd. | Image processing apparatus and program |
US8633927B2 (en) * | 2006-07-25 | 2014-01-21 | Nvidia Corporation | Re-render acceleration of frame with lighting change |
US9070213B2 (en) * | 2006-07-26 | 2015-06-30 | Nvidia Corporation | Tile based precision rasterization in a graphics pipeline |
US8085264B1 (en) | 2006-07-26 | 2011-12-27 | Nvidia Corporation | Tile output using multiple queue output buffering in a raster stage |
US8436864B2 (en) * | 2006-08-01 | 2013-05-07 | Nvidia Corporation | Method and user interface for enhanced graphical operation organization |
US8963932B1 (en) | 2006-08-01 | 2015-02-24 | Nvidia Corporation | Method and apparatus for visualizing component workloads in a unified shader GPU architecture |
US8607151B2 (en) * | 2006-08-01 | 2013-12-10 | Nvidia Corporation | Method and system for debugging a graphics pipeline subunit |
US8436870B1 (en) | 2006-08-01 | 2013-05-07 | Nvidia Corporation | User interface and method for graphical processing analysis |
US7778800B2 (en) * | 2006-08-01 | 2010-08-17 | Nvidia Corporation | Method and system for calculating performance parameters for a processor |
US7952588B2 (en) * | 2006-08-03 | 2011-05-31 | Qualcomm Incorporated | Graphics processing unit with extended vertex cache |
US8869027B2 (en) | 2006-08-04 | 2014-10-21 | Apple Inc. | Management and generation of dashboards |
US8493388B2 (en) * | 2006-08-09 | 2013-07-23 | Siemens Medical Solutions Usa, Inc. | Modular volume rendering using visual programming |
KR20080014402A (en) * | 2006-08-11 | 2008-02-14 | Samsung Electronics Co., Ltd. | Method and apparatus for processing computer graphics data |
US7852347B1 (en) * | 2006-08-24 | 2010-12-14 | Nvidia Corporation | Texture map pixel pairing optimization |
US7905610B1 (en) * | 2006-08-29 | 2011-03-15 | Nvidia Corporation | Graphics processor system and associated method for projecting an image onto a three-dimensional object |
KR100745768B1 (en) * | 2006-08-29 | 2007-08-02 | Samsung Electronics Co., Ltd. | Method for calculating LOD value for reducing power consumption and 3-dimensional rendering system using the same |
US8237739B2 (en) * | 2006-09-12 | 2012-08-07 | Qualcomm Incorporated | Method and device for performing user-defined clipping in object space |
US8730261B2 (en) * | 2006-09-13 | 2014-05-20 | Panasonic Corporation | Image processing device, image processing integrated circuit, image processing system, input assembler device, and input assembling integrated circuit |
JP4079378B2 (en) | 2006-09-21 | 2008-04-23 | Konami Digital Entertainment Co., Ltd. | Image processing apparatus, image processing apparatus control method, and program |
US8427487B1 (en) | 2006-11-02 | 2013-04-23 | Nvidia Corporation | Multiple tile output using interface compression in a raster stage |
US8537168B1 (en) | 2006-11-02 | 2013-09-17 | Nvidia Corporation | Method and system for deferred coverage mask generation in a raster stage |
US8237738B1 (en) | 2006-11-02 | 2012-08-07 | Nvidia Corporation | Smooth rasterization of polygonal graphics primitives |
US7701459B1 (en) * | 2006-11-03 | 2010-04-20 | Nvidia Corporation | Primitive oriented assembly for parallel vertex/geometry processing |
US8228328B1 (en) * | 2006-11-03 | 2012-07-24 | Nvidia Corporation | Early Z testing for multiple render targets |
US8482567B1 (en) | 2006-11-03 | 2013-07-09 | Nvidia Corporation | Line rasterization techniques |
US8059124B2 (en) | 2006-11-28 | 2011-11-15 | Adobe Systems Incorporated | Temporary non-tiled rendering of 3D objects |
US8300050B2 (en) * | 2006-11-28 | 2012-10-30 | Adobe Systems Incorporated | Temporary low resolution rendering of 3D objects |
US9965886B2 (en) | 2006-12-04 | 2018-05-08 | Arm Norway As | Method of and apparatus for processing graphics |
GB0710795D0 (en) * | 2007-06-05 | 2007-07-18 | Arm Norway As | Method of and apparatus for processing graphics |
US7974438B2 (en) | 2006-12-11 | 2011-07-05 | Koplar Interactive Systems International, Llc | Spatial data encoding and decoding |
US7891818B2 (en) | 2006-12-12 | 2011-02-22 | Evans & Sutherland Computer Corporation | System and method for aligning RGB light in a single modulator projector |
US8736627B2 (en) * | 2006-12-19 | 2014-05-27 | Via Technologies, Inc. | Systems and methods for providing a shared buffer in a multiple FIFO environment |
US7580035B2 (en) * | 2006-12-28 | 2009-08-25 | Intel Corporation | Real-time collision detection using clipping |
US7982733B2 (en) * | 2007-01-05 | 2011-07-19 | Qualcomm Incorporated | Rendering 3D video images on a stereo-enabled display |
EP2102823B8 (en) * | 2007-01-05 | 2016-06-29 | Landmark Graphics Corporation | Systems and methods for visualizing multiple volumetric data sets in real time |
ITMI20070038A1 (en) * | 2007-01-12 | 2008-07-13 | St Microelectronics Srl | RENDERING DEVICE FOR THREE-DIMENSIONAL GRAPHICS WITH SORT-MIDDLE TYPE ARCHITECTURE. |
US7746355B1 (en) * | 2007-01-24 | 2010-06-29 | Vivante Corporation | Method for distributed clipping outside of view volume |
WO2008091198A1 (en) * | 2007-01-24 | 2008-07-31 | Swiftfoot Graphics Ab | Method, display adapter and computer program product for improved graphics performance by using a replaceable culling program |
US8549500B2 (en) * | 2007-02-14 | 2013-10-01 | The Mathworks, Inc. | Saving and loading graphical processing unit (GPU) arrays providing high computational capabilities in a computing environment |
WO2008103775A2 (en) | 2007-02-20 | 2008-08-28 | Pixologic, Inc. | System and method for interactive masking and modifying of 3d objects |
US7473258B2 (en) * | 2007-03-08 | 2009-01-06 | Cardica, Inc. | Surgical stapler |
US8471862B2 (en) * | 2007-03-09 | 2013-06-25 | Ati Technologies Ulc | Offset tiles in vector graphics |
US7694193B2 (en) * | 2007-03-13 | 2010-04-06 | Hewlett-Packard Development Company, L.P. | Systems and methods for implementing a stride value for accessing memory |
JP4446201B2 (en) * | 2007-03-30 | 2010-04-07 | Aisin AW Co., Ltd. | Image recognition apparatus and image recognition method |
US8155826B2 (en) * | 2007-03-30 | 2012-04-10 | Aisin Aw Co., Ltd. | Vehicle behavior learning apparatuses, methods, and programs |
EP2132713A1 (en) * | 2007-04-04 | 2009-12-16 | Telefonaktiebolaget LM Ericsson (PUBL) | Vector-based image processing |
US10605610B2 (en) * | 2007-04-09 | 2020-03-31 | Ian Cummings | Apparatus and methods for reducing data transmission in wireless client-server navigation systems |
JP4588736B2 (en) * | 2007-04-12 | 2010-12-01 | Fujifilm Corporation | Image processing method, apparatus, and program |
WO2008130992A1 (en) * | 2007-04-16 | 2008-10-30 | Sunfish Studio, Llc | Single-pass and order-independent transparency in computer graphics using constant memory |
GB2448717B (en) * | 2007-04-25 | 2012-09-19 | David Hostettler Wain | Method and apparatus for the efficient animation of textures based on images and graphical components |
US8203560B2 (en) * | 2007-04-27 | 2012-06-19 | Sony Corporation | Method for predictively splitting procedurally generated particle data into screen-space boxes |
US20080273113A1 (en) * | 2007-05-02 | 2008-11-06 | Winbond Electronics Corporation | Integrated graphics and KVM system |
US7876677B2 (en) * | 2007-05-22 | 2011-01-25 | Apple Inc. | Transmission control protocol queue sorting |
FR2917211A1 (en) * | 2007-06-08 | 2008-12-12 | St Microelectronics Sa | METHOD AND DEVICE FOR GENERATING GRAPHICS |
US8558832B1 (en) * | 2007-06-19 | 2013-10-15 | Nvidia Corporation | System, method, and computer program product for generating a plurality of two-dimensional images and depth maps for a scene at a point in time |
KR101378372B1 (en) * | 2007-07-12 | 2014-03-27 | Samsung Electronics Co., Ltd. | Digital image processing apparatus, method for controlling the same, and recording medium storing program to implement the method |
US8954871B2 (en) | 2007-07-18 | 2015-02-10 | Apple Inc. | User-centric widgets and dashboards |
US7925100B2 (en) * | 2007-07-31 | 2011-04-12 | Microsoft Corporation | Tiled packaging of vector image data |
US7805579B2 (en) * | 2007-07-31 | 2010-09-28 | International Business Machines Corporation | Methods and arrangements for multi-buffering data |
US8667415B2 (en) | 2007-08-06 | 2014-03-04 | Apple Inc. | Web widgets |
US8441497B1 (en) | 2007-08-07 | 2013-05-14 | Nvidia Corporation | Interpolation of vertex attributes in a graphics processor |
US8296738B1 (en) | 2007-08-13 | 2012-10-23 | Nvidia Corporation | Methods and systems for in-place shader debugging and performance tuning |
US8314803B2 (en) * | 2007-08-15 | 2012-11-20 | Nvidia Corporation | Buffering deserialized pixel data in a graphics processor unit pipeline |
US20090046105A1 (en) * | 2007-08-15 | 2009-02-19 | Bergland Tyson J | Conditional execute bit in a graphics processor unit pipeline |
US9035957B1 (en) | 2007-08-15 | 2015-05-19 | Nvidia Corporation | Pipeline debug statistics system and method |
US8736624B1 (en) | 2007-08-15 | 2014-05-27 | Nvidia Corporation | Conditional execution flag in graphics applications |
US8599208B2 (en) * | 2007-08-15 | 2013-12-03 | Nvidia Corporation | Shared readable and writeable global values in a graphics processor unit pipeline |
US8775777B2 (en) * | 2007-08-15 | 2014-07-08 | Nvidia Corporation | Techniques for sourcing immediate values from a VLIW |
US8521800B1 (en) | 2007-08-15 | 2013-08-27 | Nvidia Corporation | Interconnected arithmetic logic units |
US9183607B1 (en) | 2007-08-15 | 2015-11-10 | Nvidia Corporation | Scoreboard cache coherence in a graphics pipeline |
US8249391B2 (en) * | 2007-08-24 | 2012-08-21 | Ancestry.com Operations, Inc. | User interface method for skew correction |
US8156467B2 (en) | 2007-08-27 | 2012-04-10 | Adobe Systems Incorporated | Reusing components in a running application |
KR100933366B1 (en) * | 2007-09-13 | 2009-12-22 | Electronics and Telecommunications Research Institute | Router device with black box function and network system including the device |
JP4501983B2 (en) * | 2007-09-28 | 2010-07-14 | Aisin AW Co., Ltd. | Parking support system, parking support method, parking support program |
US8176466B2 (en) | 2007-10-01 | 2012-05-08 | Adobe Systems Incorporated | System and method for generating an application fragment |
KR101407639B1 (en) * | 2007-10-22 | 2014-06-16 | Samsung Electronics Co., Ltd. | Apparatus and method for rendering 3D graphic object |
US8724483B2 (en) | 2007-10-22 | 2014-05-13 | Nvidia Corporation | Loopback configuration for bi-directional interfaces |
US8638341B2 (en) * | 2007-10-23 | 2014-01-28 | Qualcomm Incorporated | Antialiasing of two-dimensional vector images |
US8760450B2 (en) * | 2007-10-30 | 2014-06-24 | Advanced Micro Devices, Inc. | Real-time mesh simplification using the graphics processing unit |
US7765500B2 (en) * | 2007-11-08 | 2010-07-27 | Nvidia Corporation | Automated generation of theoretical performance analysis based upon workload and design configuration |
US8063903B2 (en) * | 2007-11-09 | 2011-11-22 | Nvidia Corporation | Edge evaluation techniques for graphics hardware |
US8035641B1 (en) | 2007-11-28 | 2011-10-11 | Adobe Systems Incorporated | Fast depth of field simulation |
US9153211B1 (en) * | 2007-12-03 | 2015-10-06 | Nvidia Corporation | Method and system for tracking accesses to virtual addresses in graphics contexts |
US8026912B1 (en) * | 2007-12-04 | 2011-09-27 | Nvidia Corporation | System and method for structuring an A-buffer |
US8040349B1 (en) | 2007-12-04 | 2011-10-18 | Nvidia Corporation | System and method for structuring an A-buffer |
US7940280B2 (en) * | 2007-12-06 | 2011-05-10 | Seiko Epson Corporation | System and method for color format conversion in a graphics environment |
US8102393B1 (en) | 2007-12-13 | 2012-01-24 | Nvidia Corporation | Cull streams for fine-grained rendering predication |
US9489767B1 (en) * | 2007-12-13 | 2016-11-08 | Nvidia Corporation | Cull streams for fine-grained rendering predication |
US8179394B1 (en) | 2007-12-13 | 2012-05-15 | Nvidia Corporation | Cull streams for fine-grained rendering predication |
US8878849B2 (en) * | 2007-12-14 | 2014-11-04 | Nvidia Corporation | Horizon split ambient occlusion |
US9064333B2 (en) | 2007-12-17 | 2015-06-23 | Nvidia Corporation | Interrupt handling techniques in the rasterizer of a GPU |
US8780123B2 (en) | 2007-12-17 | 2014-07-15 | Nvidia Corporation | Interrupt handling techniques in the rasterizer of a GPU |
CN101216944B (en) * | 2008-01-07 | 2011-08-03 | Peking University Founder Group Co., Ltd. | A method and device for morphing shading in the typesetting process |
US20090184972A1 (en) * | 2008-01-18 | 2009-07-23 | Qualcomm Incorporated | Multi-buffer support for off-screen surfaces in a graphics processing system |
US20090189896A1 (en) * | 2008-01-25 | 2009-07-30 | Via Technologies, Inc. | Graphics Processor having Unified Shader Unit |
US9214007B2 (en) * | 2008-01-25 | 2015-12-15 | Via Technologies, Inc. | Graphics processor having unified cache system |
EP2260472A2 (en) * | 2008-01-30 | 2010-12-15 | Ramot at Tel-Aviv University Ltd. | Method, system and computer program product for manipulating a graphic entity |
GB0801812D0 (en) * | 2008-01-31 | 2008-03-05 | Arm Norway As | Methods of and apparatus for processing computer graphics |
US9619304B2 (en) | 2008-02-05 | 2017-04-11 | Adobe Systems Incorporated | Automatic connections between application components |
US8098251B2 (en) * | 2008-02-22 | 2012-01-17 | Qualcomm Incorporated | System and method for instruction latency reduction in graphics processing |
KR100866573B1 (en) * | 2008-02-22 | 2008-11-03 | Inha University Industry-Academic Cooperation Foundation | A point-based rendering method using visibility map |
KR100914171B1 (en) | 2008-02-28 | 2009-08-28 | Electronics and Telecommunications Research Institute | Apparatus and method for depth based image rendering on mobile broadcasting |
US7675513B2 (en) * | 2008-03-14 | 2010-03-09 | Evans & Sutherland Computer Corp. | System and method for displaying stereo images |
GB2458488C (en) * | 2008-03-19 | 2018-09-12 | Imagination Tech Ltd | Untransformed display lists in a tile based rendering system |
US7984317B2 (en) | 2008-03-24 | 2011-07-19 | Apple Inc. | Hardware-based power management of functional blocks |
US8125494B2 (en) * | 2008-04-03 | 2012-02-28 | American Panel Corporation | Method for mapping optical properties for a display device |
US8448002B2 (en) * | 2008-04-10 | 2013-05-21 | Nvidia Corporation | Clock-gated series-coupled data processing modules |
US8681861B2 (en) | 2008-05-01 | 2014-03-25 | Nvidia Corporation | Multistandard hardware video encoder |
US8923385B2 (en) | 2008-05-01 | 2014-12-30 | Nvidia Corporation | Rewind-enabled hardware encoder |
US8358317B2 (en) | 2008-05-23 | 2013-01-22 | Evans & Sutherland Computer Corporation | System and method for displaying a planar image on a curved surface |
JP5491498B2 (en) * | 2008-05-30 | 2014-05-14 | Advanced Micro Devices, Inc. | Scalable and integrated computer system |
GB0810205D0 (en) * | 2008-06-04 | 2008-07-09 | Advanced Risc Mach Ltd | Graphics processing systems |
US8702248B1 (en) | 2008-06-11 | 2014-04-22 | Evans & Sutherland Computer Corporation | Projection method for reducing interpixel gaps on a viewing surface |
US8656293B1 (en) | 2008-07-29 | 2014-02-18 | Adobe Systems Incorporated | Configuring mobile devices |
US8427497B1 (en) | 2008-08-01 | 2013-04-23 | Marvell International Ltd. | Methods and apparatuses for processing cached image data |
US8654135B1 (en) * | 2008-09-10 | 2014-02-18 | Nvidia Corporation | A-Buffer compression for different compression formats |
US8130223B1 (en) | 2008-09-10 | 2012-03-06 | Nvidia Corporation | System and method for structuring an A-buffer to support multi-sample anti-aliasing |
US8553041B1 (en) | 2008-09-10 | 2013-10-08 | Nvidia Corporation | System and method for structuring an A-buffer to support multi-sample anti-aliasing |
US8370759B2 (en) | 2008-09-29 | 2013-02-05 | Ancestry.com Operations Inc | Visualizing, creating and editing blending modes methods and systems |
US9336624B2 (en) * | 2008-10-07 | 2016-05-10 | Mitsubishi Electric Research Laboratories, Inc. | Method and system for rendering 3D distance fields |
KR101496340B1 (en) | 2008-10-31 | 2015-03-04 | Samsung Electronics Co., Ltd. | Processor and method for controlling memory |
US8077378B1 (en) | 2008-11-12 | 2011-12-13 | Evans & Sutherland Computer Corporation | Calibration system and method for light modulation device |
US8355022B2 (en) * | 2008-11-25 | 2013-01-15 | Sony Computer Entertainment America Llc | Method and apparatus for aggregating light sources per-vertex in computer graphics |
US20100128038A1 (en) * | 2008-11-25 | 2010-05-27 | Sony Computer Entertainment America Inc. | Method and apparatus for interpolating color and direction as one entity in computer graphics |
WO2010062790A1 (en) * | 2008-11-25 | 2010-06-03 | Sony Computer Entertainment America Inc. | Computer graphics method for aggregating light sources per-vertex and interpolating color and direction as one entity |
EP2370968A4 (en) * | 2008-12-01 | 2013-01-16 | Life Image Inc | Medical imaging viewer |
KR101511273B1 (en) * | 2008-12-29 | 2015-04-10 | Samsung Electronics Co., Ltd. | System and method for 3D graphic rendering based on multi-core processor |
GB0900700D0 (en) | 2009-01-15 | 2009-03-04 | Advanced Risc Mach Ltd | Methods of and apparatus for processing graphics |
CN104301705B (en) * | 2009-02-01 | 2016-09-07 | LG Electronics Inc. | Broadcasting receiver and three dimensional video data processing method |
US8384740B1 (en) * | 2009-02-24 | 2013-02-26 | A9.Com, Inc. | Method and system for virtually placing a tangible item on an appendage |
US8854379B2 (en) * | 2009-02-25 | 2014-10-07 | Empire Technology Development Llc | Routing across multicore networks using real world or modeled data |
US8095560B2 (en) * | 2009-02-26 | 2012-01-10 | Yahoo! Inc. | Edge attribute aggregation in a directed graph |
US20100241638A1 (en) * | 2009-03-18 | 2010-09-23 | O'sullivan Patrick Joseph | Sorting contacts |
US8330767B2 (en) * | 2009-03-24 | 2012-12-11 | Advanced Micro Devices, Inc. | Method and apparatus for angular invariant texture level of detail generation |
CN101859330B (en) * | 2009-04-09 | 2012-11-21 | Nvidia Corporation | Method for verifying integrated circuit effectiveness models |
KR100927128B1 (en) * | 2009-04-30 | 2009-11-18 | Nexuschips Co., Ltd. | Device and method of processing 3-dimensional graphics using tile dirty table |
JP5304443B2 (en) * | 2009-05-28 | 2013-10-02 | Fujitsu Semiconductor Ltd. | Drawing data processing method, drawing system, and drawing data creation program |
US8294714B1 (en) * | 2009-06-26 | 2012-10-23 | Nvidia Corporation | Accelerated rendering with temporally interleaved details |
KR101649098B1 (en) * | 2009-06-30 | 2016-08-19 | Samsung Electronics Co., Ltd. | Apparatus and method for rendering using sensor in portable terminal |
US7973705B2 (en) * | 2009-07-17 | 2011-07-05 | Garmin Switzerland Gmbh | Marine bump map display |
US9142057B2 (en) * | 2009-09-03 | 2015-09-22 | Advanced Micro Devices, Inc. | Processing unit with a plurality of shader engines |
US9300969B2 (en) | 2009-09-09 | 2016-03-29 | Apple Inc. | Video storage |
GB2473682B (en) * | 2009-09-14 | 2011-11-16 | Sony Comp Entertainment Europe | A method of determining the state of a tile based deferred rendering processor and apparatus thereof |
US20110063304A1 (en) * | 2009-09-16 | 2011-03-17 | Nvidia Corporation | Co-processing synchronizing techniques on heterogeneous graphics processing units |
US8692829B2 (en) * | 2009-10-05 | 2014-04-08 | Nvidia Corporation | Calculation of plane equations after determination of Z-buffer visibility |
US9438861B2 (en) * | 2009-10-06 | 2016-09-06 | Microsoft Technology Licensing, Llc | Integrating continuous and sparse streaming data |
US9058672B2 (en) * | 2009-10-06 | 2015-06-16 | Nvidia Corporation | Using a pixel offset for evaluating a plane equation |
EP2509527A1 (en) * | 2009-12-08 | 2012-10-17 | Koninklijke Philips Electronics N.V. | Ablation treatment planning and device |
CN102087752B (en) * | 2009-12-08 | 2013-11-20 | Hon Hai Precision Industry (Shenzhen) Co., Ltd. | Illumination environment simulation system and method thereof |
US9530189B2 (en) | 2009-12-31 | 2016-12-27 | Nvidia Corporation | Alternate reduction ratios and threshold mechanisms for framebuffer compression |
TWI482998B (en) * | 2010-01-11 | 2015-05-01 | Hon Hai Prec Ind Co Ltd | Illumination environment simulation system and method |
JP5571977B2 (en) * | 2010-03-01 | 2014-08-13 | Canon Inc. | Image processing device |
US9331869B2 (en) | 2010-03-04 | 2016-05-03 | Nvidia Corporation | Input/output request packet handling techniques by a device specific kernel mode driver |
US9058685B2 (en) * | 2010-03-11 | 2015-06-16 | Broadcom Corporation | Method and system for controlling a 3D processor using a control list in memory |
US8320622B2 (en) * | 2010-03-29 | 2012-11-27 | Sharp Laboratories Of America, Inc. | Color gradient object tracking |
US10786736B2 (en) | 2010-05-11 | 2020-09-29 | Sony Interactive Entertainment LLC | Placement of user information in a game space |
US20110285736A1 (en) | 2010-05-21 | 2011-11-24 | Kilgard Mark J | Decomposing cubic bèzier segments for tessellation-free stencil filling |
KR101016075B1 (en) * | 2010-06-04 | 2011-02-17 | Kim Si-yong | Wiper blade |
US8593466B2 (en) * | 2010-06-08 | 2013-11-26 | Intel Corporation | Tile rendering for image processing |
US9053562B1 (en) | 2010-06-24 | 2015-06-09 | Gregory S. Rabin | Two dimensional to three dimensional moving image converter |
US10109103B2 (en) | 2010-06-30 | 2018-10-23 | Barry L. Jenkins | Method of determining occluded ingress and egress routes using nav-cell to nav-cell visibility pre-computation |
US9489762B2 (en) | 2010-06-30 | 2016-11-08 | Primal Space Systems, Inc. | Delivering and controlling streaming interactive media comprising rendered geometric, texture and lighting data |
US8493404B2 (en) | 2010-08-24 | 2013-07-23 | Qualcomm Incorporated | Pixel rendering on display |
KR101064178B1 (en) * | 2010-08-24 | 2011-09-14 | Korea Advanced Institute of Science and Technology | System and method for managing buffer cache |
KR101719485B1 (en) | 2010-09-20 | 2017-03-27 | Samsung Electronics Co., Ltd. | Apparatus and method for early fragment discarding in graphic processing unit |
US8811699B2 (en) * | 2010-09-22 | 2014-08-19 | Siemens Aktiengesellschaft | Detection of landmarks and key-frames in cardiac perfusion MRI using a joint spatial-temporal context model |
US9171350B2 (en) | 2010-10-28 | 2015-10-27 | Nvidia Corporation | Adaptive resolution DGPU rendering to provide constant framerate with free IGPU scale up |
US9430036B1 (en) * | 2010-12-10 | 2016-08-30 | Wyse Technology L.L.C. | Methods and systems for facilitating accessing and controlling a remote desktop of a remote machine in real time by a windows web browser utilizing HTTP |
US8949726B2 (en) | 2010-12-10 | 2015-02-03 | Wyse Technology L.L.C. | Methods and systems for conducting a remote desktop session via HTML that supports a 2D canvas and dynamic drawing |
US9535560B1 (en) | 2010-12-10 | 2017-01-03 | Wyse Technology L.L.C. | Methods and systems for facilitating a remote desktop session for a web browser and a remote desktop server |
US9395885B1 (en) | 2010-12-10 | 2016-07-19 | Wyse Technology L.L.C. | Methods and systems for a remote desktop session utilizing HTTP header |
US9245047B2 (en) | 2010-12-10 | 2016-01-26 | Wyse Technology L.L.C. | Methods and systems for facilitating a remote desktop session utilizing a remote desktop client common interface |
US9244912B1 (en) | 2010-12-10 | 2016-01-26 | Wyse Technology L.L.C. | Methods and systems for facilitating a remote desktop redrawing session utilizing HTML |
KR20120065589A (en) * | 2010-12-13 | 2012-06-21 | Samsung Electronics Co., Ltd. | Apparatus and method for tile binning for low power |
US9477597B2 (en) | 2011-03-25 | 2016-10-25 | Nvidia Corporation | Techniques for different memory depths on different partitions |
US8422770B2 (en) * | 2011-03-30 | 2013-04-16 | Mckesson Financial Holdings | Method, apparatus and computer program product for displaying normalized medical images |
US8701057B2 (en) | 2011-04-11 | 2014-04-15 | Nvidia Corporation | Design, layout, and manufacturing techniques for multivariant integrated circuits |
CN102739998B (en) * | 2011-05-11 | 2017-03-01 | Xin'aote (Beijing) Video Technology Co., Ltd. | Implementation method of space transformation in three-dimensional space |
GB2491156B (en) | 2011-05-25 | 2019-08-07 | Advanced Risc Mach Ltd | Processing pipeline control |
US9311433B2 (en) * | 2011-05-27 | 2016-04-12 | Airbus Operations S.L. | Systems and methods for improving the execution of computational algorithms |
AU2011202508B2 (en) | 2011-05-27 | 2013-05-16 | Canon Kabushiki Kaisha | Method, apparatus and system for rendering an object on a page |
US9342817B2 (en) | 2011-07-07 | 2016-05-17 | Sony Interactive Entertainment LLC | Auto-creating groups for sharing photos |
US9652560B1 (en) | 2011-07-18 | 2017-05-16 | Apple Inc. | Non-blocking memory management unit |
US9529712B2 (en) | 2011-07-26 | 2016-12-27 | Nvidia Corporation | Techniques for balancing accesses to memory having different memory types |
US9342322B2 (en) | 2011-09-12 | 2016-05-17 | Microsoft Technology Licensing, Llc | System and method for layering using tile-based renderers |
US9641826B1 (en) | 2011-10-06 | 2017-05-02 | Evans & Sutherland Computer Corporation | System and method for displaying distant 3-D stereo on a dome surface |
US20130106887A1 (en) * | 2011-10-31 | 2013-05-02 | Christopher Tremblay | Texture generation using a transformation matrix |
CN103108197A (en) | 2011-11-14 | 2013-05-15 | Nvidia Corporation | Priority level compression method and priority level compression system for three-dimensional (3D) video wireless display |
US9829715B2 (en) | 2012-01-23 | 2017-11-28 | Nvidia Corporation | Eyewear device for transmitting signal and communication method thereof |
US9633458B2 (en) * | 2012-01-23 | 2017-04-25 | Nvidia Corporation | Method and system for reducing a polygon bounding box |
US9087409B2 (en) | 2012-03-01 | 2015-07-21 | Qualcomm Incorporated | Techniques for reducing memory access bandwidth in a graphics processing system based on destination alpha values |
US20130235154A1 (en) * | 2012-03-09 | 2013-09-12 | Guy Salton-Morgenstern | Method and apparatus to minimize computations in real time photo realistic rendering |
US8959494B2 (en) * | 2012-03-20 | 2015-02-17 | Massively Parallel Technologies Inc. | Parallelism from functional decomposition |
US9411595B2 (en) | 2012-05-31 | 2016-08-09 | Nvidia Corporation | Multi-threaded transactional memory coherence |
US9148699B2 (en) * | 2012-06-01 | 2015-09-29 | Texas Instruments Incorporated | Optimized algorithm for construction of composite video from a set of discrete video sources |
US9251555B2 (en) | 2012-06-08 | 2016-02-02 | 2236008 Ontario, Inc. | Tiled viewport composition |
JP2014006674A (en) * | 2012-06-22 | 2014-01-16 | Canon Inc | Image processing device, control method of the same and program |
US20140010479A1 (en) * | 2012-07-09 | 2014-01-09 | Samsung Electro-Mechanics Co., Ltd. | Bilinear interpolation circuit for image and method thereof |
US9105250B2 (en) * | 2012-08-03 | 2015-08-11 | Nvidia Corporation | Coverage compaction |
US9323315B2 (en) | 2012-08-15 | 2016-04-26 | Nvidia Corporation | Method and system for automatic clock-gating of a clock grid at a clock source |
US8786889B2 (en) * | 2012-08-29 | 2014-07-22 | Eastman Kodak Company | Method for computing scale for tag insertion |
US8928929B2 (en) * | 2012-08-29 | 2015-01-06 | Eastman Kodak Company | System for generating tag layouts |
US9578224B2 (en) | 2012-09-10 | 2017-02-21 | Nvidia Corporation | System and method for enhanced monoimaging |
US8850371B2 (en) | 2012-09-14 | 2014-09-30 | Nvidia Corporation | Enhanced clock gating in retimed modules |
US9002125B2 (en) | 2012-10-15 | 2015-04-07 | Nvidia Corporation | Z-plane compression with z-plane predictors |
US8941676B2 (en) * | 2012-10-26 | 2015-01-27 | Nvidia Corporation | On-chip anti-alias resolve in a cache tiling architecture |
US9317948B2 (en) | 2012-11-16 | 2016-04-19 | Arm Limited | Method of and apparatus for processing graphics |
GB201223089D0 (en) | 2012-12-20 | 2013-02-06 | Imagination Tech Ltd | Hidden culling in tile based computer generated graphics |
US9082212B2 (en) * | 2012-12-21 | 2015-07-14 | Nvidia Corporation | Programmable blending via multiple pixel shader dispatches |
US9824009B2 (en) | 2012-12-21 | 2017-11-21 | Nvidia Corporation | Information coherency maintenance systems and methods |
US9251554B2 (en) * | 2012-12-26 | 2016-02-02 | Analog Devices, Inc. | Block-based signal processing |
US10102142B2 (en) | 2012-12-26 | 2018-10-16 | Nvidia Corporation | Virtual address based memory reordering |
US9591309B2 (en) | 2012-12-31 | 2017-03-07 | Nvidia Corporation | Progressive lossy memory compression |
US9317251B2 (en) | 2012-12-31 | 2016-04-19 | Nvidia Corporation | Efficient correction of normalizer shift amount errors in fused multiply add operations |
US9607407B2 (en) | 2012-12-31 | 2017-03-28 | Nvidia Corporation | Variable-width differential memory compression |
DE102013201377A1 (en) * | 2013-01-29 | 2014-07-31 | Bayerische Motoren Werke Aktiengesellschaft | Method and apparatus for processing 3d image data |
US20140225902A1 (en) * | 2013-02-11 | 2014-08-14 | Nvidia Corporation | Image pyramid processor and method of multi-resolution image processing |
KR101529942B1 (en) | 2013-02-18 | 2015-06-18 | Seokyeong University Industry-Academic Cooperation Foundation | Parallel processing rasterizer and parallel processing method for rasterizing |
US9992021B1 (en) | 2013-03-14 | 2018-06-05 | GoTenna, Inc. | System and method for private and point-to-point communication between computing devices |
US9229688B2 (en) | 2013-03-14 | 2016-01-05 | Massively Parallel Technologies, Inc. | Automated latency management and cross-communication exchange conversion |
GB2511817A (en) | 2013-03-14 | 2014-09-17 | Imagination Tech Ltd | Rendering in computer graphics systems |
US10169906B2 (en) | 2013-03-29 | 2019-01-01 | Advanced Micro Devices, Inc. | Hybrid render with deferred primitive batch binning |
US10957094B2 (en) | 2013-03-29 | 2021-03-23 | Advanced Micro Devices, Inc. | Hybrid render with preferred primitive batch binning and sorting |
GB2506706B (en) | 2013-04-02 | 2014-09-03 | Imagination Tech Ltd | Tile-based graphics |
US10008029B2 (en) | 2013-05-31 | 2018-06-26 | Nvidia Corporation | Updating depth related graphics data |
US9710894B2 (en) | 2013-06-04 | 2017-07-18 | Nvidia Corporation | System and method for enhanced multi-sample anti-aliasing |
US10204391B2 (en) | 2013-06-04 | 2019-02-12 | Arm Limited | Method of and apparatus for processing graphics |
KR20140142863A (en) * | 2013-06-05 | 2014-12-15 | Electronics and Telecommunications Research Institute | Apparatus and method for providing graphic editors |
KR101451966B1 (en) * | 2013-06-17 | 2014-10-22 | Gabia Inc. | System and method for providing mobile movie rendering |
US9418400B2 (en) | 2013-06-18 | 2016-08-16 | Nvidia Corporation | Method and system for rendering simulated depth-of-field visual effect |
US9177413B2 (en) * | 2013-06-26 | 2015-11-03 | Nvidia Corporation | Unique primitive identifier generation |
US9607574B2 (en) | 2013-08-09 | 2017-03-28 | Apple Inc. | Video data compression format |
US9569385B2 (en) | 2013-09-09 | 2017-02-14 | Nvidia Corporation | Memory transaction ordering |
US9230362B2 (en) | 2013-09-11 | 2016-01-05 | Nvidia Corporation | System, method, and computer program product for using compression with programmable sample locations |
US9230363B2 (en) | 2013-09-11 | 2016-01-05 | Nvidia Corporation | System, method, and computer program product for using compression with programmable sample locations |
US9437040B2 (en) | 2013-11-15 | 2016-09-06 | Nvidia Corporation | System, method, and computer program product for implementing anti-aliasing operations using a programmable sample pattern table |
US10935788B2 (en) | 2014-01-24 | 2021-03-02 | Nvidia Corporation | Hybrid virtual 3D rendering approach to stereovision |
US9276610B2 (en) * | 2014-01-27 | 2016-03-01 | Tensorcom, Inc. | Method and apparatus of a fully-pipelined layered LDPC decoder |
US20150228106A1 (en) * | 2014-02-13 | 2015-08-13 | Vixs Systems Inc. | Low latency video texture mapping via tight integration of codec engine with 3d graphics engine |
US9710957B2 (en) * | 2014-04-05 | 2017-07-18 | Sony Interactive Entertainment America Llc | Graphics processing enhancement by tracking object and/or primitive identifiers |
CN105100862B (en) * | 2014-04-18 | 2018-04-24 | Alibaba Group Holding Limited | Display processing method and system of Grid Mobile |
GB2526598B (en) | 2014-05-29 | 2018-11-28 | Imagination Tech Ltd | Allocation of primitives to primitive blocks |
US9547918B2 (en) * | 2014-05-30 | 2017-01-17 | Intel Corporation | Techniques for deferred decoupled shading |
GB2524121B (en) * | 2014-06-17 | 2016-03-02 | Imagination Tech Ltd | Assigning primitives to tiles in a graphics processing system |
GB2524120B (en) * | 2014-06-17 | 2016-03-02 | Imagination Tech Ltd | Assigning primitives to tiles in a graphics processing system |
US9307249B2 (en) * | 2014-06-20 | 2016-04-05 | Freescale Semiconductor, Inc. | Processing device and method of compressing images |
US9721376B2 (en) * | 2014-06-27 | 2017-08-01 | Samsung Electronics Co., Ltd. | Elimination of minimal use threads via quad merging |
CN104217461B (en) * | 2014-07-10 | 2017-05-10 | Wuxi Fantian Information Technology Co., Ltd. | A parallax mapping method based on a depth map to simulate a real-time bump effect |
US9832388B2 (en) | 2014-08-04 | 2017-11-28 | Nvidia Corporation | Deinterleaving interleaved high dynamic range image by using YUV interpolation |
US9569862B2 (en) * | 2014-08-15 | 2017-02-14 | Qualcomm Incorporated | Bandwidth reduction using texture lookup by adaptive shading |
US9665370B2 (en) * | 2014-08-19 | 2017-05-30 | Qualcomm Incorporated | Skipping of data storage |
US10019834B2 (en) | 2014-09-26 | 2018-07-10 | Microsoft Technology Licensing, Llc | Real-time rendering of volumetric models with occlusive and emissive particles |
KR102281180B1 (en) | 2014-11-21 | 2021-07-23 | Samsung Electronics Co., Ltd. | Image processing apparatus and method |
US9720769B2 (en) * | 2014-12-03 | 2017-08-01 | Sandisk Technologies Llc | Storage parameters for a data storage device |
US10249079B2 (en) * | 2014-12-11 | 2019-04-02 | Intel Corporation | Relaxed sorting in a position-only pipeline |
US9607414B2 (en) | 2015-01-27 | 2017-03-28 | Splunk Inc. | Three-dimensional point-in-polygon operation to facilitate displaying three-dimensional structures |
US9916326B2 (en) | 2015-01-27 | 2018-03-13 | Splunk, Inc. | Efficient point-in-polygon indexing technique for facilitating geofencing operations |
US9836874B2 (en) * | 2015-01-27 | 2017-12-05 | Splunk Inc. | Efficient polygon-clipping technique to reduce data transfer requirements for a viewport |
US10026204B2 (en) | 2015-01-27 | 2018-07-17 | Splunk Inc. | Efficient point-in-polygon indexing technique for processing queries over geographic data sets |
US9530237B2 (en) * | 2015-04-02 | 2016-12-27 | Apple Inc. | Interpolation circuitry and techniques for graphics processing |
US10255651B2 (en) | 2015-04-15 | 2019-04-09 | Channel One Holdings Inc. | Methods and systems for generating shaders to emulate a fixed-function graphics pipeline |
US9922449B2 (en) | 2015-06-01 | 2018-03-20 | Intel Corporation | Apparatus and method for dynamic polygon or primitive sorting for improved culling |
US9959665B2 (en) | 2015-07-21 | 2018-05-01 | Qualcomm Incorporated | Zero pixel culling for graphics processing |
KR20170034727A (en) | 2015-09-21 | 2017-03-29 | Samsung Electronics Co., Ltd. | Shadow information storing method and apparatus, 3D rendering method and apparatus |
US10269154B2 (en) * | 2015-12-21 | 2019-04-23 | Intel Corporation | Rasterization based on partial spans |
KR102521654B1 (en) * | 2016-01-25 | 2023-04-13 | Samsung Electronics Co., Ltd. | Computing system and method for performing graphics pipeline of tile-based rendering thereof |
US9818051B2 (en) * | 2016-01-29 | 2017-11-14 | Ricoh Company, Ltd. | Rotation and clipping mechanism |
US9906981B2 (en) | 2016-02-25 | 2018-02-27 | Nvidia Corporation | Method and system for dynamic regulation and control of Wi-Fi scans |
CN107180441B (en) | 2016-03-10 | 2019-04-09 | Tencent Technology (Shenzhen) Co., Ltd. | Method and apparatus for generating eye image |
US11847040B2 (en) | 2016-03-16 | 2023-12-19 | Asg Technologies Group, Inc. | Systems and methods for detecting data alteration from source to target |
US10332290B2 (en) * | 2016-03-21 | 2019-06-25 | Adobe Inc. | Fast, coverage-optimized, resolution-independent and anti-aliased graphics processing |
KR101821124B1 (en) | 2016-04-05 | 2018-01-23 | Hanwha Techwin Co., Ltd. | Method and apparatus for playing media stream on web-browser |
US10412130B2 (en) | 2016-04-04 | 2019-09-10 | Hanwha Techwin Co., Ltd. | Method and apparatus for playing media stream on web browser |
US9798672B1 (en) | 2016-04-14 | 2017-10-24 | Macom Connectivity Solutions, Llc | Data management for cache memory |
EP3249612B1 (en) * | 2016-04-29 | 2023-02-08 | Imagination Technologies Limited | Generation of a control stream for a tile |
GB2553744B (en) | 2016-04-29 | 2018-09-05 | Advanced Risc Mach Ltd | Graphics processing systems |
JP7100624B2 (en) * | 2016-08-29 | 2022-07-13 | Advanced Micro Devices, Inc. | Hybrid rendering with binning and sorting of preferred primitive batches |
US10756785B2 (en) * | 2016-09-29 | 2020-08-25 | Nokia Technologies Oy | Flexible reference signal design |
US10417134B2 (en) * | 2016-11-10 | 2019-09-17 | Oracle International Corporation | Cache memory architecture and policies for accelerating graph algorithms |
US10282889B2 (en) * | 2016-11-29 | 2019-05-07 | Samsung Electronics Co., Ltd. | Vertex attribute compression and decompression in hardware |
KR20180070314A (en) | 2016-12-16 | 2018-06-26 | Samsung Electronics Co., Ltd. | Graphics processing apparatus and method for processing graphics pipeline thereof |
KR102637736B1 (en) | 2017-01-04 | 2024-02-19 | Samsung Electronics Co., Ltd. | Graphics processing method and system |
JP7168578B2 (en) * | 2017-03-30 | 2022-11-09 | Magic Leap, Inc. | Intensive rendering |
US10977858B2 (en) | 2017-03-30 | 2021-04-13 | Magic Leap, Inc. | Centralized rendering |
US10157493B2 (en) * | 2017-04-01 | 2018-12-18 | Intel Corporation | Adaptive multisampling based on vertex attributes |
GB2562041B (en) * | 2017-04-28 | 2020-11-25 | Imagination Tech Ltd | Multi-output decoder for texture decompression |
US10521877B2 (en) | 2017-05-23 | 2019-12-31 | Samsung Electronics Co., Ltd | Apparatus and method for speculative buffer reservations with cancellation mechanism |
US10510181B2 (en) * | 2017-06-27 | 2019-12-17 | Samsung Electronics Co., Ltd. | System and method for cache management using a cache status table |
US10969740B2 (en) | 2017-06-27 | 2021-04-06 | Nvidia Corporation | System and method for near-eye light field rendering for wide field of view interactive three-dimensional computer graphics |
CN107463398B (en) * | 2017-07-21 | 2018-08-17 | Tencent Technology (Shenzhen) Co., Ltd. | Game rendering method, device, storage device and terminal |
GB2569775B (en) | 2017-10-20 | 2020-02-26 | Graphcore Ltd | Synchronization in a multi-tile, multi-chip processing arrangement |
GB2569271B (en) | 2017-10-20 | 2020-05-13 | Graphcore Ltd | Synchronization with a host processor |
GB2569844B (en) | 2017-10-20 | 2021-01-06 | Graphcore Ltd | Sending data off-chip |
US10600142B2 (en) * | 2017-11-02 | 2020-03-24 | Advanced Micro Devices, Inc. | Compression and decompression of indices in a graphics pipeline |
US11057500B2 (en) | 2017-11-20 | 2021-07-06 | Asg Technologies Group, Inc. | Publication of applications using server-side virtual screen change capture |
US10699374B2 (en) | 2017-12-05 | 2020-06-30 | Microsoft Technology Licensing, Llc | Lens contribution-based virtual reality display rendering |
GB2569546B (en) * | 2017-12-19 | 2020-10-14 | Sony Interactive Entertainment Inc | Determining pixel values using reference images |
US11611633B2 (en) | 2017-12-29 | 2023-03-21 | Asg Technologies Group, Inc. | Systems and methods for platform-independent application publishing to a front-end interface |
US10812611B2 (en) | 2017-12-29 | 2020-10-20 | Asg Technologies Group, Inc. | Platform-independent application publishing to a personalized front-end interface by encapsulating published content into a container |
US10877740B2 (en) | 2017-12-29 | 2020-12-29 | Asg Technologies Group, Inc. | Dynamically deploying a component in an application |
GB2572617B (en) | 2018-04-05 | 2021-06-16 | Imagination Tech Ltd | Blending hardware |
US10672182B2 (en) * | 2018-04-19 | 2020-06-02 | Microsoft Technology Licensing, Llc | Compact visibility state for GPUs compatible with hardware instancing |
WO2019225734A1 (en) | 2018-05-24 | 2019-11-28 | Preferred Networks, Inc. | Rendering device, learning device, rendering method, and program |
GB2575294B8 (en) | 2018-07-04 | 2022-07-20 | Graphcore Ltd | Host Proxy On Gateway |
US10861230B2 (en) * | 2018-08-01 | 2020-12-08 | Nvidia Corporation | System-generated stable barycentric coordinates and direct plane equation access |
KR102622452B1 (en) * | 2018-09-13 | 2024-01-09 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Affine linear weighted intra prediction |
US11138747B1 (en) * | 2018-11-02 | 2021-10-05 | Facebook Technologies, Llc | Interpolation optimizations for a display engine for post-rendering processing |
GB2579412B (en) | 2018-11-30 | 2020-12-23 | Graphcore Ltd | Gateway pull model |
US10909659B2 (en) | 2018-12-12 | 2021-02-02 | Apical Limited | Super-resolution image processing using a machine learning system |
US11715262B2 (en) * | 2018-12-17 | 2023-08-01 | Advanced Micro Devices, Inc. | Optimizing primitive shaders |
KR102216749B1 (en) * | 2019-03-05 | 2021-02-17 | Naver Webtoon Ltd. | Method, apparatus and computer program for coloring of a target image |
US10866280B2 (en) | 2019-04-01 | 2020-12-15 | Texas Instruments Incorporated | Scan chain self-testing of lockstep cores on reset |
US11640649B2 (en) * | 2019-06-19 | 2023-05-02 | Samsung Electronics Co., Ltd. | Methods and apparatus for efficient range calculation |
US11762634B2 (en) | 2019-06-28 | 2023-09-19 | Asg Technologies Group, Inc. | Systems and methods for seamlessly integrating multiple products by using a common visual modeler |
US11488349B2 (en) | 2019-06-28 | 2022-11-01 | Ati Technologies Ulc | Method and apparatus for alpha blending images from different color formats |
US10981059B2 (en) * | 2019-07-03 | 2021-04-20 | Sony Interactive Entertainment LLC | Asset aware computing architecture for graphics processing |
JP7245954B2 (en) * | 2019-07-30 | 2023-03-24 | Falkonry Inc. | Smooth, resolution-friendly views of large amounts of time-series data |
US11755760B2 (en) | 2019-10-18 | 2023-09-12 | Asg Technologies Group, Inc. | Systems and methods for secure policies-based information governance |
US11941137B2 (en) | 2019-10-18 | 2024-03-26 | Asg Technologies Group, Inc. | Use of multi-faceted trust scores for decision making, action triggering, and data analysis and interpretation |
US11886397B2 (en) | 2019-10-18 | 2024-01-30 | Asg Technologies Group, Inc. | Multi-faceted trust system |
US11055067B2 (en) | 2019-10-18 | 2021-07-06 | Asg Technologies Group, Inc. | Unified digital automation platform |
US11269660B2 (en) | 2019-10-18 | 2022-03-08 | Asg Technologies Group, Inc. | Methods and systems for integrated development environment editor support with a single code base |
US11216993B2 (en) * | 2019-11-27 | 2022-01-04 | Arm Limited | Graphics processing systems |
US11210847B2 (en) | 2019-11-27 | 2021-12-28 | Arm Limited | Graphics processing systems |
US11210821B2 (en) * | 2019-11-27 | 2021-12-28 | Arm Limited | Graphics processing systems |
US11170555B2 (en) | 2019-11-27 | 2021-11-09 | Arm Limited | Graphics processing systems |
US11514549B2 (en) * | 2020-02-03 | 2022-11-29 | Sony Interactive Entertainment Inc. | System and method for efficient multi-GPU rendering of geometry by generating information in one rendering phase for use in another rendering phase |
US11508110B2 (en) | 2020-02-03 | 2022-11-22 | Sony Interactive Entertainment Inc. | System and method for efficient multi-GPU rendering of geometry by performing geometry analysis before rendering |
US11113858B2 (en) * | 2020-02-04 | 2021-09-07 | Inventive Software, LLC | System and method for deep compositing of images in web browsers |
US11321259B2 (en) * | 2020-02-14 | 2022-05-03 | Sony Interactive Entertainment Inc. | Network architecture providing high speed storage access through a PCI express fabric between a compute node and a storage server |
US11132831B1 (en) | 2020-03-02 | 2021-09-28 | Qualcomm Incorporated | Methods and apparatus for efficient multi-view rasterization |
US11243882B2 (en) * | 2020-04-15 | 2022-02-08 | International Business Machines Corporation | In-array linked list identifier pool scheme |
US11250627B2 (en) * | 2020-06-29 | 2022-02-15 | Intel Corporation | Tile sequencing mechanism |
US11277658B1 (en) | 2020-08-21 | 2022-03-15 | Beam, Inc. | Integrating overlaid digital content into displayed data via graphics processing circuitry |
WO2022081476A1 (en) | 2020-10-13 | 2022-04-21 | ASG Technologies Group, Inc. dba ASG Technologies | Geolocation-based policy rules |
CN116670723A (en) * | 2020-10-22 | 2023-08-29 | Zazzle Inc. | System and method for high quality rendering of composite views of customized products |
US11232628B1 (en) * | 2020-11-10 | 2022-01-25 | Weta Digital Limited | Method for processing image data to provide for soft shadow effects using shadow depth information |
US11481933B1 (en) | 2021-04-08 | 2022-10-25 | Mobeus Industries, Inc. | Determining a change in position of displayed digital content in subsequent frames via graphics processing circuitry |
US11601276B2 (en) * | 2021-04-30 | 2023-03-07 | Mobeus Industries, Inc. | Integrating and detecting visual data security token in displayed data via graphics processing circuitry using a frame buffer |
US11477020B1 (en) | 2021-04-30 | 2022-10-18 | Mobeus Industries, Inc. | Generating a secure random number by determining a change in parameters of digital content in subsequent frames via graphics processing circuitry |
US11586835B2 (en) | 2021-04-30 | 2023-02-21 | Mobeus Industries, Inc. | Integrating overlaid textual digital content into displayed data via graphics processing circuitry using a frame buffer |
US11483156B1 (en) | 2021-04-30 | 2022-10-25 | Mobeus Industries, Inc. | Integrating digital content into displayed data on an application layer via processing circuitry of a server |
US11682101B2 (en) | 2021-04-30 | 2023-06-20 | Mobeus Industries, Inc. | Overlaying displayed digital content transmitted over a communication network via graphics processing circuitry using a frame buffer |
US11475610B1 (en) | 2021-04-30 | 2022-10-18 | Mobeus Industries, Inc. | Controlling interactivity of digital content overlaid onto displayed data via graphics processing circuitry using a frame buffer |
CN113256485B (en) * | 2021-05-21 | 2024-01-30 | Baiguoyuan Technology (Singapore) Co., Ltd. | Image stretching method, device, electronic equipment and storage medium |
US20220410002A1 (en) * | 2021-06-29 | 2022-12-29 | Bidstack Group PLC | Mesh processing for viewability testing |
US11562153B1 (en) | 2021-07-16 | 2023-01-24 | Mobeus Industries, Inc. | Systems and methods for recognizability of objects in a multi-layer display |
US20230334736A1 (en) * | 2022-04-15 | 2023-10-19 | Meta Platforms Technologies, Llc | Rasterization Optimization for Analytic Anti-Aliasing |
US11882295B2 (en) | 2022-04-15 | 2024-01-23 | Meta Platforms Technologies, Llc | Low-power high throughput hardware decoder with random block access |
US20230334728A1 (en) * | 2022-04-15 | 2023-10-19 | Meta Platforms Technologies, Llc | Destination Update for Blending Modes in a Graphics Pipeline |
CN114529705B (en) * | 2022-04-22 | 2022-07-19 | Shandong Jerei Digital Technology Co., Ltd. | Interface layout processing method of three-dimensional engine editor |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5880736A (en) * | 1997-02-28 | 1999-03-09 | Silicon Graphics, Inc. | Method system and computer program product for shading |
Family Cites Families (131)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
FR2353185A1 (en) | 1976-04-09 | 1977-12-23 | Thomson Csf | RAPID CORRELATOR DEVICE, AND SYSTEM FOR PROCESSING THE SIGNALS OF A RECEIVER INCLUDING SUCH A DEVICE |
FR2481489A1 (en) | 1980-04-25 | 1981-10-30 | Thomson Csf | BIDIMENSIONAL CORRELATOR DEVICE |
US4484346A (en) | 1980-08-15 | 1984-11-20 | Sternberg Stanley R | Neighborhood transformation logic circuitry for an image analyzer system |
US4559618A (en) | 1982-09-13 | 1985-12-17 | Data General Corp. | Content-addressable memory module with associative clear |
US4783829A (en) | 1983-02-23 | 1988-11-08 | Hitachi, Ltd. | Pattern recognition apparatus |
US4581760A (en) | 1983-04-27 | 1986-04-08 | Fingermatrix, Inc. | Fingerprint verification method |
US4670858A (en) | 1983-06-07 | 1987-06-02 | Tektronix, Inc. | High storage capacity associative memory |
US4594673A (en) | 1983-06-28 | 1986-06-10 | Gti Corporation | Hidden surface processor |
US4532606A (en) | 1983-07-14 | 1985-07-30 | Burroughs Corporation | Content addressable memory cell with shift capability |
US4564952A (en) | 1983-12-08 | 1986-01-14 | At&T Bell Laboratories | Compensation of filter symbol interference by adaptive estimation of received symbol sequences |
US4694404A (en) | 1984-01-12 | 1987-09-15 | Key Bank N.A. | High-speed image generation of complex solid objects using octree encoding |
US4794559A (en) | 1984-07-05 | 1988-12-27 | American Telephone And Telegraph Company, At&T Bell Laboratories | Content addressable semiconductor memory arrays |
US4622653A (en) | 1984-10-29 | 1986-11-11 | Texas Instruments Incorporated | Block associative memory |
US4669054A (en) | 1985-05-03 | 1987-05-26 | General Dynamics, Pomona Division | Device and method for optically correlating a pair of images |
SE445154B (en) | 1985-07-08 | 1986-06-02 | Ibm Svenska Ab | METHOD OF REMOVING HIDDEN LINES |
US4695973A (en) | 1985-10-22 | 1987-09-22 | The United States Of America As Represented By The Secretary Of The Air Force | Real-time programmable optical correlator |
US4758982A (en) | 1986-01-08 | 1988-07-19 | Advanced Micro Devices, Inc. | Quasi content addressable memory |
US4890242A (en) | 1986-06-05 | 1989-12-26 | Xox Corporation | Solid-modeling system using topology directed subdivision for determination of surface intersections |
US5067162A (en) | 1986-06-30 | 1991-11-19 | Identix Incorporated | Method and apparatus for verifying identity using image correlation |
US4998286A (en) | 1987-02-13 | 1991-03-05 | Olympus Optical Co., Ltd. | Correlation operational apparatus for multi-dimensional images |
US4825391A (en) | 1987-07-20 | 1989-04-25 | General Electric Company | Depth buffer priority processing for real time computer image generating systems |
US5129060A (en) | 1987-09-14 | 1992-07-07 | Visual Information Technologies, Inc. | High speed image processing computer |
US5146592A (en) | 1987-09-14 | 1992-09-08 | Visual Information Technologies, Inc. | High speed image processing computer with overlapping windows-div |
US4841467A (en) | 1987-10-05 | 1989-06-20 | General Electric Company | Architecture to implement floating point multiply/accumulate operations |
GB2215623B (en) | 1987-10-23 | 1991-07-31 | Rotation Limited | Apparatus for playing a game for one or more players and to games played with the apparatus |
US4888712A (en) | 1987-11-04 | 1989-12-19 | Schlumberger Systems, Inc. | Guardband clipping method and apparatus for 3-D graphics display system |
US4945500A (en) | 1987-11-04 | 1990-07-31 | Schlumberger Technologies, Inc. | Triangle processor for 3-D graphics display system |
FR2625345A1 (en) | 1987-12-24 | 1989-06-30 | Thomson Cgr | THREE-DIMENSIONAL VIEWING METHOD OF NUMERICALLY ENCODED OBJECTS IN TREE FORM AND DEVICE FOR IMPLEMENTING THE SAME |
DE68918724T2 (en) | 1988-02-17 | 1995-05-24 | Nippon Denso Co | Fingerprint verification process using multiple correlation decision levels and successive decision levels. |
US4888583A (en) | 1988-03-14 | 1989-12-19 | Ligocki Terry J | Method and apparatus for rendering an image from data arranged in a constructive solid geometry format |
US5083287A (en) | 1988-07-14 | 1992-01-21 | Daikin Industries, Inc. | Method and apparatus for applying a shadowing operation to figures to be drawn for displaying on crt-display |
US5133052A (en) | 1988-08-04 | 1992-07-21 | Xerox Corporation | Interactive graphical search and replace utility for computer-resident synthetic graphic image editors |
US4996666A (en) | 1988-08-12 | 1991-02-26 | Duluk Jr Jerome F | Content-addressable memory system capable of fully parallel magnitude comparisons |
GB8828342D0 (en) | 1988-12-05 | 1989-01-05 | Rediffusion Simulation Ltd | Image generator |
US4970636A (en) | 1989-01-23 | 1990-11-13 | Honeywell Inc. | Memory interface controller |
FR2646046B1 (en) | 1989-04-18 | 1995-08-25 | France Etat | METHOD AND DEVICE FOR COMPRESSING IMAGE DATA BY MATHEMATICAL TRANSFORMATION WITH REDUCED COST OF IMPLEMENTATION, IN PARTICULAR FOR TRANSMISSION AT REDUCED THROUGHPUT OF IMAGE SEQUENCES |
JPH0776991B2 (en) | 1989-10-24 | 1995-08-16 | インターナショナル・ビジネス・マシーンズ・コーポレーション | NURBS data conversion method and apparatus |
US5245700A (en) | 1989-11-21 | 1993-09-14 | International Business Machines Corporation | Adjustment of z-buffer values for lines on the surface of a polygon |
JPH03166601A (en) | 1989-11-27 | 1991-07-18 | Hitachi Ltd | Symbolizing device and process controller and control supporting device using the symbolizing device |
US5129051A (en) | 1990-03-16 | 1992-07-07 | Hewlett-Packard Company | Decomposition of arbitrary polygons into trapezoids |
US5123085A (en) | 1990-03-19 | 1992-06-16 | Sun Microsystems, Inc. | Method and apparatus for rendering anti-aliased polygons |
US5128888A (en) | 1990-04-02 | 1992-07-07 | Advanced Micro Devices, Inc. | Arithmetic unit having multiple accumulators |
GB9009127D0 (en) | 1990-04-24 | 1990-06-20 | Rediffusion Simulation Ltd | Image generator |
US5369734A (en) | 1990-05-18 | 1994-11-29 | Kabushiki Kaisha Toshiba | Method for processing and displaying hidden-line graphic images |
DE69122557T2 (en) | 1990-06-29 | 1997-04-24 | Philips Electronics Nv | Imaging |
JPH0475183A (en) | 1990-07-17 | 1992-03-10 | Mitsubishi Electric Corp | Correlativity detector for image |
US5054090A (en) | 1990-07-20 | 1991-10-01 | Knight Arnold W | Fingerprint correlation system with parallel FIFO processor |
US5050220A (en) | 1990-07-24 | 1991-09-17 | The United States Of America As Represented By The Secretary Of The Navy | Optical fingerprint correlator |
JPH07120435B2 (en) | 1990-12-06 | 1995-12-20 | インターナショナル・ビジネス・マシーンズ・コーポレイション | Method and system for initializing and updating high-speed Z buffer |
FR2670923A1 (en) | 1990-12-21 | 1992-06-26 | Philips Lab Electronique | CORRELATION DEVICE. |
JPH07122908B2 (en) | 1991-03-12 | 1995-12-25 | インターナショナル・ビジネス・マシーンズ・コーポレイション | Apparatus and method for generating displayable information representing a three-dimensional solid object |
US5289567A (en) | 1991-04-01 | 1994-02-22 | Digital Equipment Corporation | Computer apparatus and method for finite element identification in interactive modeling |
US5293467A (en) | 1991-04-03 | 1994-03-08 | Buchner Gregory C | Method for resolving priority between a calligraphically-displayed point feature and both raster-displayed faces and other calligraphically-displayed point features in a CIG system |
US5315537A (en) | 1991-04-08 | 1994-05-24 | Blacker Teddy D | Automated quadrilateral surface discretization method and apparatus usable to generate mesh in a finite element analysis system |
US5263136A (en) | 1991-04-30 | 1993-11-16 | Optigraphics Corporation | System for managing tiled images using multiple resolutions |
US5347619A (en) | 1991-04-30 | 1994-09-13 | International Business Machines Corporation | Nonconvex polygon identifier |
US5299139A (en) | 1991-06-21 | 1994-03-29 | Cadence Design Systems, Inc. | Short locator method |
US5493644A (en) | 1991-07-11 | 1996-02-20 | Hewlett-Packard Company | Polygon span interpolator with main memory Z buffer |
US5295235A (en) | 1992-02-14 | 1994-03-15 | Steve Newman | Polygon engine for updating computer graphic display employing compressed bit map data |
US5319743A (en) | 1992-04-02 | 1994-06-07 | Digital Equipment Corporation | Intelligent and compact bucketing method for region queries in two-dimensional space |
US5669010A (en) | 1992-05-18 | 1997-09-16 | Silicon Engines | Cascaded two-stage computational SIMD engine having multi-port memory and multiple arithmetic units |
WO1993023816A1 (en) | 1992-05-18 | 1993-11-25 | Silicon Engines Inc. | System and method for cross correlation with application to video motion vector estimation |
US5621866A (en) | 1992-07-24 | 1997-04-15 | Fujitsu Limited | Image processing apparatus having improved frame buffer with Z buffer and SAM port |
US5455900A (en) | 1992-10-20 | 1995-10-03 | Ricoh Company, Ltd. | Image processing apparatus |
US5388206A (en) | 1992-11-13 | 1995-02-07 | The University Of North Carolina | Architecture and apparatus for image generation |
TW241196B (en) | 1993-01-15 | 1995-02-21 | Du Pont | |
JP3240447B2 (en) | 1993-02-19 | 2001-12-17 | 株式会社リコー | Image processing device |
US5574835A (en) | 1993-04-06 | 1996-11-12 | Silicon Engines, Inc. | Bounding box and projections detection of hidden polygons in three-dimensional spatial databases |
US5509110A (en) | 1993-04-26 | 1996-04-16 | Loral Aerospace Corporation | Method for tree-structured hierarchical occlusion in image generators |
US6167143A (en) * | 1993-05-03 | 2000-12-26 | U.S. Philips Corporation | Monitoring system |
US5684939A (en) | 1993-07-09 | 1997-11-04 | Silicon Graphics, Inc. | Antialiased imaging with improved pixel supersampling |
US5579455A (en) | 1993-07-30 | 1996-11-26 | Apple Computer, Inc. | Rendering of 3D scenes on a display using hierarchical z-buffer visibility |
GB9316214D0 (en) * | 1993-08-05 | 1993-09-22 | Philips Electronics Uk Ltd | Image processing |
JPH07182537A (en) | 1993-12-21 | 1995-07-21 | Toshiba Corp | Device and method for plotting graphic |
US5699497A (en) | 1994-02-17 | 1997-12-16 | Evans & Sutherland Computer Corporation | Rendering global macro texture, for producing a dynamic image, as on computer generated terrain, seen from a moving viewpoint |
US5778245A (en) | 1994-03-01 | 1998-07-07 | Intel Corporation | Method and apparatus for dynamic allocation of multiple buffers in a processor |
US5623628A (en) | 1994-03-02 | 1997-04-22 | Intel Corporation | Computer system and method for maintaining memory consistency in a pipelined, non-blocking caching bus request queue |
US5546194A (en) * | 1994-03-23 | 1996-08-13 | Videofaxx, Inc. | Method and apparatus for converting a video image format to a group III fax format |
US5596686A (en) | 1994-04-21 | 1997-01-21 | Silicon Engines, Inc. | Method and apparatus for simultaneous parallel query graphics rendering Z-coordinate buffer |
US5544306A (en) | 1994-05-03 | 1996-08-06 | Sun Microsystems, Inc. | Flexible dram access in a frame buffer memory and system |
JPH0855239A (en) | 1994-07-21 | 1996-02-27 | Internatl Business Mach Corp <Ibm> | Method and apparatus for judgment of visibility of graphical object |
US5572634A (en) | 1994-10-26 | 1996-11-05 | Silicon Engines, Inc. | Method and apparatus for spatial simulation acceleration |
JPH08127167A (en) * | 1994-11-01 | 1996-05-21 | Arutetsuku Kk | Apparatus and method for detecting end of rolled sheet |
US5594854A (en) * | 1995-03-24 | 1997-01-14 | 3Dlabs Inc. Ltd. | Graphics subsystem with coarse subpixel correction |
US5798770A (en) | 1995-03-24 | 1998-08-25 | 3Dlabs Inc. Ltd. | Graphics rendering system with reconfigurable pipeline sequence |
US5710876A (en) | 1995-05-25 | 1998-01-20 | Silicon Graphics, Inc. | Computer graphics system for rendering images using full spectral illumination data |
JPH08329276A (en) | 1995-06-01 | 1996-12-13 | Ricoh Co Ltd | Three-dimensional graphic processor |
JPH11515121A (en) * | 1995-07-26 | 1999-12-21 | レイカー,インコーポレイティド | Method and apparatus for span and subspan sorting rendering system |
US5841447A (en) | 1995-08-02 | 1998-11-24 | Evans & Sutherland Computer Corporation | System and method for improving pixel update performance |
US5977977A (en) | 1995-08-04 | 1999-11-02 | Microsoft Corporation | Method and system for multi-pass rendering |
US5949428A (en) | 1995-08-04 | 1999-09-07 | Microsoft Corporation | Method and apparatus for resolving pixel data in a graphics rendering system |
US5990904A (en) | 1995-08-04 | 1999-11-23 | Microsoft Corporation | Method and system for merging pixel fragments in a graphics rendering system |
US5864342A (en) | 1995-08-04 | 1999-01-26 | Microsoft Corporation | Method and system for rendering graphical objects to image chunks |
DE69636599T2 (en) * | 1995-08-04 | 2007-08-23 | Microsoft Corp., Redmond | METHOD AND SYSTEM FOR RENDERING GRAPHIC OBJECTS BY DIVIDING THEM INTO IMAGES AND COMPOSITING THE IMAGES INTO A DISPLAY IMAGE |
US5767859A (en) | 1995-09-28 | 1998-06-16 | Hewlett-Packard Company | Method and apparatus for clipping non-planar polygons |
US5854631A (en) * | 1995-11-22 | 1998-12-29 | Silicon Graphics, Inc. | System and method for merging pixel fragments based on depth range values |
JP2882465B2 (en) * | 1995-12-25 | 1999-04-12 | 日本電気株式会社 | Image generation method and apparatus |
US5574836A (en) | 1996-01-22 | 1996-11-12 | Broemmelsiek; Raymond M. | Interactive display apparatus and method with viewer position compensation |
US5850225A (en) | 1996-01-24 | 1998-12-15 | Evans & Sutherland Computer Corp. | Image mapping system and process using panel shear transforms |
US6046746A (en) * | 1996-07-01 | 2000-04-04 | Sun Microsystems, Inc. | Method and apparatus implementing high resolution rendition of Z-buffered primitives |
US5751291A (en) * | 1996-07-26 | 1998-05-12 | Hewlett-Packard Company | System and method for accelerated occlusion culling |
US5767589A (en) | 1996-09-03 | 1998-06-16 | Maximum Products Inc. | Lighting control circuit for vehicle brake light/tail light/indicator light assembly |
US5860158A (en) | 1996-11-15 | 1999-01-12 | Samsung Electronics Company, Ltd. | Cache control unit with a cache request transaction-oriented protocol |
US6167486A (en) | 1996-11-18 | 2000-12-26 | Nec Electronics, Inc. | Parallel access virtual channel memory system with cacheable channels |
US5936629A (en) | 1996-11-20 | 1999-08-10 | International Business Machines Corporation | Accelerated single source 3D lighting mechanism |
US6111582A (en) * | 1996-12-20 | 2000-08-29 | Jenkins; Barry L. | System and method of image generation and encoding using primitive reprojection |
US6697063B1 (en) * | 1997-01-03 | 2004-02-24 | Nvidia U.S. Investment Company | Rendering pipeline |
US5852451A (en) | 1997-01-09 | 1998-12-22 | S3 Incorporated | Pixel reordering for improved texture mapping |
US5949424A (en) * | 1997-02-28 | 1999-09-07 | Silicon Graphics, Inc. | Method, system, and computer program product for bump mapping in tangent space |
US6259452B1 (en) | 1997-04-14 | 2001-07-10 | Massachusetts Institute Of Technology | Image drawing system and method with real-time occlusion culling |
US6084591A (en) * | 1997-04-29 | 2000-07-04 | Ati Technologies, Inc. | Method and apparatus for deferred video rendering |
US5920326A (en) | 1997-05-30 | 1999-07-06 | Hewlett Packard Company | Caching and coherency control of multiple geometry accelerators in a computer graphics system |
US5889997A (en) | 1997-05-30 | 1999-03-30 | Hewlett-Packard Company | Assembler system and method for a geometry accelerator |
US6002412A (en) | 1997-05-30 | 1999-12-14 | Hewlett-Packard Co. | Increased performance of graphics memory using page sorting fifos |
US6118452A (en) | 1997-08-05 | 2000-09-12 | Hewlett-Packard Company | Fragment visibility pretest system and methodology for improved performance of a graphics system |
US6002410A (en) | 1997-08-25 | 1999-12-14 | Chromatic Research, Inc. | Reconfigurable texture cache |
US6128000A (en) | 1997-10-15 | 2000-10-03 | Compaq Computer Corporation | Full-scene antialiasing using improved supersampling techniques |
US6204859B1 (en) | 1997-10-15 | 2001-03-20 | Digital Equipment Corporation | Method and apparatus for compositing colors of images with memory constraints for storing pixel data |
JPH11161819A (en) * | 1997-11-27 | 1999-06-18 | Sega Enterp Ltd | Image processor, its method and recording medium recording image processing program |
US6201540B1 (en) * | 1998-01-07 | 2001-03-13 | Microsoft Corporation | Graphical interface components for in-dash automotive accessories |
US6259460B1 (en) | 1998-03-26 | 2001-07-10 | Silicon Graphics, Inc. | Method for efficient handling of texture cache misses by recirculation |
US6246415B1 (en) | 1998-04-30 | 2001-06-12 | Silicon Graphics, Inc. | Method and apparatus for culling polygons |
US6243744B1 (en) * | 1998-05-26 | 2001-06-05 | Compaq Computer Corporation | Computer network cluster generation indicator |
US6650327B1 (en) * | 1998-06-16 | 2003-11-18 | Silicon Graphics, Inc. | Display system having floating point rasterization and floating point framebuffering |
US6216004B1 (en) * | 1998-06-23 | 2001-04-10 | Qualcomm Incorporated | Cellular communication system with common channel soft handoff and associated method |
US6263493B1 (en) * | 1998-07-08 | 2001-07-17 | International Business Machines Corporation | Method and system for controlling the generation of program statements |
US6771264B1 (en) * | 1998-08-20 | 2004-08-03 | Apple Computer, Inc. | Method and apparatus for performing tangent space lighting and bump mapping in a deferred shading graphics processor |
US6577317B1 (en) * | 1998-08-20 | 2003-06-10 | Apple Computer, Inc. | Apparatus and method for geometry operations in a 3D-graphics pipeline |
US6552723B1 (en) | 1998-08-20 | 2003-04-22 | Apple Computer, Inc. | System, apparatus and method for spatially sorting image data in a three-dimensional graphics pipeline |
US6275235B1 (en) * | 1998-12-21 | 2001-08-14 | Silicon Graphics, Inc. | High precision texture wrapping method and device |
US6228730B1 (en) * | 1999-04-28 | 2001-05-08 | United Microelectronics Corp. | Method of fabricating field effect transistor |
- 1998
- 1998-12-17 US US09/213,990 patent/US6771264B1/en not_active Expired - Lifetime
- 1999
- 1999-08-20 WO PCT/US1999/019036 patent/WO2000011614A2/en active Application Filing
- 1999-08-20 US US09/372,137 patent/US6614444B1/en not_active Expired - Lifetime
- 1999-08-20 WO PCT/US1999/018971 patent/WO2000030040A1/en active IP Right Grant
- 1999-08-20 AU AU56875/99A patent/AU5687599A/en not_active Abandoned
- 1999-08-20 EP EP99945112A patent/EP1138023A4/en not_active Withdrawn
- 1999-08-20 AU AU57825/99A patent/AU5782599A/en not_active Abandoned
- 1999-08-20 AU AU57797/99A patent/AU5779799A/en not_active Abandoned
- 1999-08-20 WO PCT/US1999/019241 patent/WO2000011604A2/en active Application Filing
- 1999-08-20 US US09/377,503 patent/US6717576B1/en not_active Expired - Lifetime
- 1999-08-20 WO PCT/US1999/019190 patent/WO2000011613A2/en active Application Filing
- 1999-08-20 WO PCT/US1999/019254 patent/WO2000019377A1/en active IP Right Grant
- 1999-08-20 JP JP2000582972A patent/JP3657519B2/en not_active Expired - Lifetime
- 1999-08-20 KR KR10-2001-7002201A patent/KR100485241B1/en not_active IP Right Cessation
- 1999-08-20 US US09/378,637 patent/US6597363B1/en not_active Expired - Lifetime
- 1999-08-20 KR KR10-2001-7002171A patent/KR100478767B1/en not_active IP Right Cessation
- 1999-08-20 AU AU55765/99A patent/AU5576599A/en not_active Abandoned
- 1999-08-20 AU AU56904/99A patent/AU5690499A/en not_active Abandoned
- 1999-08-20 AU AU56878/99A patent/AU5687899A/en not_active Abandoned
- 1999-08-20 JP JP2000572802A patent/JP3657518B2/en not_active Expired - Lifetime
- 1999-08-20 EP EP99943867A patent/EP1105844A1/en not_active Withdrawn
- 1999-08-20 WO PCT/US1999/019363 patent/WO2000011605A2/en active Application Filing
- 2003
- 2003-06-09 US US10/458,493 patent/US7167181B2/en not_active Expired - Lifetime
- 2004
- 2004-03-16 JP JP2004136902A patent/JP4516350B2/en not_active Expired - Fee Related
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5880736A (en) * | 1997-02-28 | 1999-03-09 | Silicon Graphics, Inc. | Method, system, and computer program product for shading |
Cited By (22)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7170513B1 (en) | 1998-07-22 | 2007-01-30 | Nvidia Corporation | System and method for display list occlusion branching |
US7023437B1 (en) | 1998-07-22 | 2006-04-04 | Nvidia Corporation | System and method for accelerating graphics processing using a post-geometry data stream during multiple-pass rendering |
US6844880B1 (en) | 1999-12-06 | 2005-01-18 | Nvidia Corporation | System, method and computer program product for an improved programmable vertex processing model with instruction set |
US7209140B1 (en) | 1999-12-06 | 2007-04-24 | Nvidia Corporation | System, method and article of manufacture for a programmable vertex processing model with instruction set |
US7002588B1 (en) | 1999-12-06 | 2006-02-21 | Nvidia Corporation | System, method and computer program product for branching during programmable vertex processing |
US6870540B1 (en) * | 1999-12-06 | 2005-03-22 | Nvidia Corporation | System, method and computer program product for a programmable pixel processing model with instruction set |
US6734861B1 (en) | 2000-05-31 | 2004-05-11 | Nvidia Corporation | System, method and article of manufacture for an interlock module in a computer graphics processing pipeline |
US6664963B1 (en) | 2000-05-31 | 2003-12-16 | Nvidia Corporation | System, method and computer program product for programmable shading using pixel shaders |
US6532013B1 (en) | 2000-05-31 | 2003-03-11 | Nvidia Corporation | System, method and article of manufacture for pixel shaders for programmable shading |
US7068272B1 (en) | 2000-05-31 | 2006-06-27 | Nvidia Corporation | System, method and article of manufacture for Z-value and stencil culling prior to rendering in a computer graphics processing pipeline |
US6690372B2 (en) | 2000-05-31 | 2004-02-10 | Nvidia Corporation | System, method and article of manufacture for shadow mapping |
US6778181B1 (en) | 2000-12-07 | 2004-08-17 | Nvidia Corporation | Graphics processing system having a virtual texturing array |
US6982718B2 (en) | 2001-06-08 | 2006-01-03 | Nvidia Corporation | System, method and computer program product for programmable fragment processing in a graphics pipeline |
US7006101B1 (en) | 2001-06-08 | 2006-02-28 | Nvidia Corporation | Graphics API with branching capabilities |
US7456838B1 (en) | 2001-06-08 | 2008-11-25 | Nvidia Corporation | System and method for converting a vertex program to a binary format capable of being executed by a hardware graphics pipeline |
US7286133B2 (en) | 2001-06-08 | 2007-10-23 | Nvidia Corporation | System, method and computer program product for programmable fragment processing |
US6697064B1 (en) | 2001-06-08 | 2004-02-24 | Nvidia Corporation | System, method and computer program product for matrix tracking during vertex processing in a graphics pipeline |
US7162716B2 (en) | 2001-06-08 | 2007-01-09 | Nvidia Corporation | Software emulator for optimizing application-programmable vertex processing |
US6704025B1 (en) | 2001-08-31 | 2004-03-09 | Nvidia Corporation | System and method for dual-depth shadow-mapping |
US7009615B1 (en) | 2001-11-30 | 2006-03-07 | Nvidia Corporation | Floating point buffer system and method for use during programmable fragment processing in a graphics pipeline |
US7009605B2 (en) | 2002-03-20 | 2006-03-07 | Nvidia Corporation | System, method and computer program product for generating a shader program |
US8106904B2 (en) | 2002-03-20 | 2012-01-31 | Nvidia Corporation | Shader program generation system and method |
Similar Documents
Publication | Title |
---|---|
US6771264B1 (en) | Method and apparatus for performing tangent space lighting and bump mapping in a deferred shading graphics processor |
US7808503B2 (en) | Deferred shading graphics pipeline processor having advanced features |
US5990904A (en) | Method and system for merging pixel fragments in a graphics rendering system |
US6160557A (en) | Method and apparatus providing efficient rasterization with data dependent adaptations |
US5949428A (en) | Method and apparatus for resolving pixel data in a graphics rendering system |
US7570266B1 (en) | Multiple data buffers for processing graphics data |
US10055883B2 (en) | Frustum tests for sub-pixel shadows |
JP2001357410A (en) | Graphic system for composing three-dimensional images generated separately |
WO1997005576A9 (en) | Method and apparatus for span and subspan sorting rendering system |
EP0870282A1 (en) | Method and apparatus for span and subspan sorting rendering system |
US7116333B1 (en) | Data retrieval method and system |
US5926183A (en) | Efficient rendering utilizing user defined rooms and windows |
US7256796B1 (en) | Per-fragment control for writing an output buffer |
KR20210117988A (en) | Methods and apparatus for decoupled shading texture rendering |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| AK | Designated states | Kind code of ref document: A2. Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW |
| AL | Designated countries for regional patents | Kind code of ref document: A2. Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application | |
| AK | Designated states | Kind code of ref document: A3. Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW |
| AL | Designated countries for regional patents | Kind code of ref document: A3. Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
| DFPE | Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101) | |
| AK | Designated states | Kind code of ref document: B1. Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG UZ VN YU ZA ZW |
| AL | Designated countries for regional patents | Kind code of ref document: B1. Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG |
| B | Later publication of amended claims | |
| REG | Reference to national code | Ref country code: DE. Ref legal event code: 8642 |
| 122 | Ep: pct application non-entry in european phase | |