US 7002571 B2
A technique for spatial partitioning and a data structure for storing references to objects in a scene. A grid-based loose octree (GLOtree) is a data structure that combines advantages of a uniform grid-based subdivision method and an octree-based subdivision method to provide a general purpose spatial partitioning method that works well with both static and dynamic scenes. In a GLOtree, objects are located at lower levels of the tree than in the prior art octree. This allows traversals to search for specific objects to be accomplished more quickly when a starting search level of the GLOtree is predicted. The GLOtree uses loose octree nodes that adapt the sizes of octants to the scene.
1. A method of spatial partitioning of a scene represented by a scene graph comprising:
collecting statistics of objects in the scene graph;
determining a looseness value and a world size for the scene;
building a loose octree using objects from the scene graph and the looseness value;
determining a minimum depth level based on the collected statistics; and
subdividing the scene to the minimum depth level using a three-dimensional grid superimposed on the loose octree to form a single data structure called a grid-based loose octree to represent spatial partitioning of the scene, wherein a size of each cell in the three dimensional grid is determined by multiplying the looseness value by the world size and dividing by two to the power of the minimum depth level.
2. The method of
3. The method of
4. The method of
5. The method of
6. The method of
7. An article comprising: a storage medium having a plurality of machine readable instructions, wherein when the instructions are executed by a processor, the instructions provide for spatial partitioning of a scene represented by a scene graph, the instructions for collecting statistics of objects in the scene graph, determining a looseness value and a world size for the scene, building a loose octree using objects from the scene graph and the looseness value, determining a minimum depth level based on the collected statistics, and subdividing the scene to the minimum depth level using a three-dimensional grid superimposed on the loose octree to form a single data structure called a grid-based loose octree to represent spatial partitioning of the scene, wherein a size of each cell in the three dimensional grid is determined by multiplying the looseness value by the world size and dividing by two to the power of the minimum depth level.
8. The article of
9. The article of
10. The article of
11. The article of
12. The article of
13. An article comprising a computer-readable medium encoded with a data structure defining
a gnd-based loose octree having a root node and a plurality of child nodes, each node representing an octant of a scene, wherein a size of an octant is adjusted by a looseness value, at least one node of the grid-based loose octree is referenced by a cell of a three-dimensional grid uniformly subdividing the scene and superimposed on the grid-based loose octree, a size of each cell in the three superimposed on the grid-based loose octree, wherein a size of each cell in the three dimensional grid is determined by multiplying the looseness value by the world size and dividing by two to the power of a minimum depth level.
14. The article of
15. The article of
A portion of the disclosure of this patent document contains material that is subject to copyright protection. The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent disclosure, as it appears in the Patent and Trademark Office patent file or records, but otherwise reserves all copyright rights whatsoever.
The present invention relates generally to computer graphics and, more specifically, to spatial partitioning of a two-dimensional (2D) or three-dimensional (3D) scene.
In 2D and 3D computer graphics, spatial partitioning typically includes the action of dividing up a space into smaller portions. The space may represent a scene in a world. The portions are sometimes called octants (for 3D), quadrants (for 2D), or gridlets (for 2D or 3D). Subdividing space is usually done during graphics pipeline processing to minimize future computation and to minimize the number of objects sent down the graphics pipeline. For example, if some objects in the scene are not currently visible based on the current camera position, then there is no need to further process these objects. Instead of examining each object submitted into the pipeline, when using spatial partitioning, graphics processing examines only the octants actually containing objects of the scene. For example, entire octants may be removed from further processing consideration if they are outside of the currently visible region of the scene, thus saving the time of transforming, rendering, and rasterizing the polygons of the objects contained in the non-visible octants.
Once a scene is partitioned and all objects in the scene are identified for a particular portion of the space, the results are typically stored in a data structure for subsequent use by graphics processing components (such as a video game engine or an animation generator, for example). This data structure is usually generated after scene creation, but before scene visualization and user interaction with the scene. During scene visualization, it may be necessary to find an object in the scene corresponding to a selected point. Given a point in 2D coordinates (e.g., x, y) or 3D coordinates (e.g., x, y, z), the data structure is typically traversed to find information about the object containing the point.
There are at least several existing techniques and corresponding data structures for spatial partitioning. These include uniform grids (also called regular grids), binary space partitioning (BSP) trees, octrees, and k-d trees. Each technique has its own advantages and disadvantages.
A uniform grid is the simplest spatial partitioning scheme. A grid is uniformly subdivided and superimposed onto the scene. For example, if the space is 10 units by 10 units by 10 units, the space could be partitioned into 1 unit by 1 unit by 1 unit cubes (e.g., octants). Advantages of this technique include predictable and relatively fast speed for traversal of the data structure when there is a reasonably uniform distribution of objects in the scene, and fast speed to insert an object into the scene. One drawback is that dense and sparse areas of the scene are presented with identical subdivision. The consequence is that grid cells in densely populated areas contain a high amount of objects while grid cells in sparse areas are nearly empty. As a result, a uniform grid is efficient only if the scene contents are uniformly distributed. Hence, a uniform grid solution partitions the space poorly and doesn't take into account how objects are placed in the scene. Uniform grids are most suited for dynamic scenes.
A BSP tree is a data structure used when recursively dividing space into pairs of subspaces, each separated by a plane of arbitrary orientation and position. Partitions usually focus on polygons of static objects in the scene. Non-uniform partitioning is accomplished by positioning partition planes depending on where objects are located. The BSP tree is typically not axis-aligned and objects may be split by the partitioning planes. Splitting creates a considerable amount of sub-polygons. This technique is suitable for static scenes. However, for a dynamic scene, the data structure must be reconstructed, which is very time consuming and is inadequate for real-time applications such as a video game or an interactive 3D website.
An octree is a data structure similar to a binary tree, but having a greater number of child nodes per parent node. Whereas a binary tree node typically has two child nodes (e.g., left and right children), an octree node has a specified number of child nodes (such as eight). Each node of the octree represents an octant of the scene. Each octant is sub-divided into a number of sub-octants (e.g., eight). Each node of the data structure contains pointers to the octants contained inside (e.g., a parent node points to child nodes). For 2D scenes, the tree is called a quadtree. For 3D scenes, the tree is a called an octree. Finer subdivision is performed in densely populated areas. If no objects are in an octant, then the octant is not subdivided further. An octree may be uniformly or non-uniformly subdivided, depending on the spatial partitioning algorithm used. The octree hierarchically partitions space up to a specified depth. Octrees provide better overall results than the uniform grid based scheme, but partitioning can be poor if objects lie on partitioning planes. Scene traversal for octrees is more expensive than with uniform grids. The octree adapts slowly to irregular scene structure. Another disadvantage is that partitioning with octrees tends to cluster objects at higher levels of the tree. When traversing the octree in a dynamic scene, tree traversal may be slow due to this high level clustering effect.
A k-d tree is a generalization of a binary tree where k is the dimension of the tree and d stands for dimension. For spatial partitioning, a k-d tree may be used to divide space. For example, each division of space denotes which side of a line (in 2D) or a plane (in 3D) an object may be on. Each time the left side of a line (or plane) or below a line (or plane) is considered, a left node of the k-d tree is examined. Each time the right side (or plane) or above a line (or plane) is considered, a right node of the k-d tree is examined. This process may be continued until a predetermined depth of the k-d tree is reached. One disadvantage of k-d trees is there is no predicatability. If objects move, the partitioning will not be effective.
Many game engines use a BSP tree or a k-d tree for spatial partitioning. Both of these algorithms construct a spatial hierarchical data structure in a computationally expensive preprocessing stage. If any objects move in the scene, the spatial hierarchical data structure becomes outdated, possibly resulting in incorrect results for queries. One solution to this problem is to update and reinitialize the data structure. However, reconstructing the BSP tree or k-d tree is a very time consuming process that renders these techniques unsuitable for use in processing a dynamic scene.
Although various techniques exist for spatial partitioning, none of the above-described techniques perform optimally in a variety of settings. Techniques that work well for static scenes perform poorly for dynamic scenes. What is needed is a technique and corresponding data structure that performs well for both static and dynamic scenes.
The features and advantages of the present invention will become apparent from the following detailed description of the present invention in which:
An embodiment of the present invention is a technique for spatial partitioning and a data structure for storing references to objects in a scene. According to embodiments of the present invention, a grid-based loose octree (GLOtree) is a data structure that combines advantages of the uniform grid-based subdivision scheme and the octree-based subdivision scheme to provide a general purpose solution that works well with both static and dynamic scenes. In a GLOtree, objects may be located at lower levels of the tree than in the prior art octree. This allows traversals to search for specific objects to be accomplished more quickly when a starting search level of the GLOtree is predicted. The GLOtree uses the concept of loose octree nodes that can adapt the sizes of octants to the scene.
The use of the GLOtree is faster and more predictable than prior art methods. The GLOtree is more predictable than octrees in that the GLOtree does not incur potential latencies when fetching data (as can occur when using an octree). Octrees incur this latency because, as objects are fetched, the entire tree needs to be traversed starting from the root node of the octree down to the level of the octree with the node referencing the object of interest. In a GLOtree, the cell of a 3D grid superimposed on the scene containing a selected object may be predicted. This cell corresponds to a node of the GLOtree. The GLOtree may then be traversed starting from this node instead of the root node. In addition, the GLOtree achieves a combination of speed, good adaptable partitioning, and the ability to be modified quickly.
The GLOtree may be used for spatial partitioning in a scene management system. The present invention works well for dynamic scene interaction operations such as collision detection, picking, and ray intersection.
Reference in the specification to “one embodiment” or “an embodiment” of the present invention means that a particular feature, structure or characteristic described in connection with the embodiment is included in at least one embodiment of the present invention. Thus, the appearances of the phrase “in one embodiment” appearing in various places throughout the specification are not necessarily all referring to the same embodiment.
The GLOtree is based on the octree. An octree is an acyclic directed graph with a maximum of eight children at every node. This structure works well for representing a 3D world subdivided into cubes. The root node of the octree contains a cube that encloses all of the geometry in the world (in other words, the cube corresponding to the root node circumscribes the space of interest). The children at each node are the eight cubes of equal size that subdivide the parent into octants. Subdivision stops when some user-defined heuristic is met, typically either the bounding cubes are of a certain size, or some minimum number of objects are contained within each node. The bounding cubes (e.g., octants) at each node are useful for using an octree for spatial subdivision. Each node contains pointers to all of the polygons that lie within that octant's volume. For visibility determination, the axis-aligned bounding cube of a node of the octree may be tested against the view frustrum. If a node's objects are fully visible, then all of the node's geometry (including interior octants) will be rendered. If the node's objects are partially visible, traversal of the octree continues down to the children nodes. If the node's objects are not visible (e.g., outside of the view frustrum), traversal can stop because all objects of the node are not visible.
The GLOtree of embodiments of the present invention improves upon the octree by placing a uniform 3D grid structure on top of an octree for a predetermined number of levels (e.g., two or three levels) of the GLOtree. The octree may be constructed and then the uniform grid may be applied, once the position of objects in the scene are known. This transforms the octree into a grid-based octree. To determine the uniform grid, the grid-based octree may be divided recursively into overlapping partitions (that is, in embodiments of the present invention, octants may be non-uniform and overlap). The extent of the overlap may be controlled by a looseness constant denoted “k”, thereby creating the grid-based loose octree (GLOtree). In one embodiment, a value for k may be between 1.0 and 2.0, although other values may also be used. The looseness of the GLOtree, as defined by the looseness constant, allows the size of one or more octants to be modified to encompass one or more objects without splitting the object. In one embodiment, looseness of an octant may be a factor of the objects straddling partition planes of the octant.
One advantage of overlapping partitions (i.e., the loose part of the GLOtree) is that it forces small objects that straddle octree partition planes lower down into the GLOtree. This results in a better spatial partitioning of the scene. In other words, an object doesn't have to be split with the present invention. Instead, the octant size will accommodate the object at a particular level of the octree.
A comparison of the present invention to an octree may be useful for understanding the advantages of the GLOtree.
When using an octree, a spatial partitioning mechanism will subdivide the space 10 into a first level of octants. In the 2D slice of
Table II presents the allocation of the objects (e.g., models) of the scene of
Note in comparing the data in Tables I and II, that application of the GLOtree results in objects being accessible at lower levels of the tree structure. This is a more desirable partition because an examination of the tree for collision detection must examine nodes at the current level of the tree, all of the levels above the current level, and a small subset of the nodes lower in the tree hierarchy. The only nodes below the current node of the GLOtree that have to be considered are the ones in which octants are occupied by objects. Thus, the number of tests during collision detection may be reduced for the GLOtree as compared to the octree.
The GLOtree data structure may be comprised of a tree containing one root node and a plurality of child nodes.
Table III shows one embodiment of a class definition of the GLOtree data structure. The example shown in Table III does not limit the scope of the invention in any way and is included merely for illustrative purposes. Other implementations of a GLOtree definition within the scope of the invention may be made.
Table IV shows one embodiment of a class definition of a data structure for a node of the GLOtree. The example shown in Table IV does not limit the scope of the invention in any way and is included merely for illustrative purposes. Other implementations of a GLOtree node definition within the scope of the invention may be made.
A first step to building a GLOtree is to obtain a list of objects from the scene graph. Next the world bounds for each object may be obtained to compute the overall world bounds. Once the object list has been created (and the looseness constant has been determined), the root node of the GLOtree may be constructed as defined (in one embodiment) in
At block 204, the GLOtree may be built using the objects from the scene graph, the looseness constant and the world size. For each object in the scene, the object may be added to a node of the GLOtree (with new nodes being created as needed). If an object straddles a boundary region between octants, the size of the relevant octant may be increased such that it can contain the object (within the constraint of the looseness constant). At block 206, the minimum depth level of the GLOtree may be determined based on the statistics gathered in block 200. The minimum depth level is the starting level in the GLOtree where tree traversal (using the 3D grid) begins. The minimum depth level may be a function of one or more of the least depth at which objects are contained, the least depth at which non-static objects are contained, and the gathered statistics.
The scene may then be recursively and uniformly subdivided at block 208 to the minimum depth level, handling all objects in the scene and creating nodes in the GLOtree, until no more unprocessed objects exist. At block 210, a grid lookup table into the GLOtree may be created. The grid lookup table may be used to directly access a level of the GLOtree specified by the minimum depth level when traversing the GLOtree. In one embodiment, the grid lookup table comprises a tabular array of n×n×n pointers. Each pointer references a node in the GLOtree. The index into the grid lookup table may be computed using an object's center position and quantizing the position into the cells of the grid (e.g., object center x, y, z/cell size).
Once the looseness constant, the world size, and the minimum depth level are known, the size of a cell in the 3D grid may be determined by multiplying the looseness constant by the world size and dividing by (2 ^^ minimum depth level).
Table V illustrates one embodiment of pseudo code for building a GLOtree. The example pseudo code shown in Table V does not limit the scope of the invention in any way and is included merely for illustrative purposes. Other implementations of building a GLOtree within the scope of the invention may be made.
Once the GLOtree has been built, the GLOtree may be traversed for various purposes during use. In one technique, the GLOtree may be traversed hierarchically from the root node (similar to existing techniques for traversing an octree). In another technique according to an embodiment of the present invention, the GLOtree may be traversed using the grid to select a “sub-tree” of the GLOtree containing the desired object or point in space.
Table VI illustrates one embodiment of pseudo code for one method of traversing a GLOtree. The example pseudo code shown in Table V does not limit the scope of the invention in any way and is included merely for illustrative purposes. Other implementations of traversing a GLOtree within the scope of the invention may be made.
In another embodiment, this data structure may be adapted to each octant on demand. This is different from other embodiments where all the octants were uniform. Here, if and only if an object straddles the partition plane, the octant may be loosened to accommodate the straddling object, but by a maximum factor of K (i.e., the looseness constant of the GLOtree). In this case, the individual looseness of each octant varies from 1.0 to K. One advantage of this embodiment is that the number of octants to be tested is reduced, because not all of the octants are loose and overlap.
Embodiments of the present invention are faster, more predictable, have adaptable partitioning for irregular datasets, and faster adaptation to dynamic datasets than other known data structures and partitioning schemes. The present invention is more predictable because the GLOtree reduces potential latencies when fetching data associated with data structures such as octrees. In addition, the present invention is faster than other methods by making use of uniform partitions higher up in the data structure, which is computed by obtaining statistical information about the scene. In at least one prior art method, the computational complexity of traversing an octree is O(n log n). Different types of traversals of the GLOtree may be performed, each traversal algorithm exhibiting a complexity. The computational complexity of simple tree traversal is O(log n). With a GLOtree, the computational complexity of a simple traversal of the GLOtree with a minimum depth of n/2 is O(log(n−minimum depth level))=O(log(n/2)). Since the GLOtree is hierarchical, the GLOtree is able to adapt to irregular data sets, and since the GLOtree is loose, the partitioning is more effective. The GLOtree may work with other scene management algorithms operating on uniform grids, octrees, and BSP trees.
By using overlapping partitions (e.g., the loose part of the GLOtree), embodiments of the present invention force small objects that straddle the octree partition plane lower down the tree, thereby achieving a better partition. This differs from a uniform octree that either forces a split to the object at the octant edge boundary or would duplicate objects in two or more octants, thereby increasing the computational cost of computing the octree and during traversal due to maintenance of additional data structures. The computation includes the clipping against the octree boundary, which tends to be expensive, particularly at run time. In contrast, a GLOtee will simply increase in size to accept the entire object that straddles the boundary with simple comparison tests.
Another advantage of using the GLOtree is the large reduction in any object-object interactions. The most expensive of these interactions are model-light associations and model-model collision detection. With octants, one may specify only lights within neighboring octants and the selected octant for lighting. Similarly, the potential number of colliding objects may be reduced, resulting in performance increases proportional to the number of models in the scene. Uniform subdivisions at higher levels of the GLOtree can produce information on near objects and far objects, which enables the use of image-based rendering techniques for geometry simplification. Thus, the GLOtree also enables an effective pipeline to combine acceleration techniques from image-based rendering for geometric simplification, and fast visibility culling of large scenes.
The techniques described herein are not limited to any particular hardware or software configuration; they may find applicability in any computing or processing environment. The techniques may be implemented in hardware, software, or a combination of the two. The techniques may be implemented in programs executing on programmable machines such as mobile or stationary computers, personal digital assistants, set top boxes, cellular telephones and pagers, and other electronic devices, that each include a processor, a storage medium readable by the processor (including volatile and non-volatile memory and/or storage elements), at least one input device, and one or more output devices. Program code is applied to the data entered using the input device to perform the functions described and to generate output information. The output information may be applied to one or more output devices. One of ordinary skill in the art may appreciate that the invention can be practiced with various computer system configurations, including multiprocessor systems, minicomputers, mainframe computers, and the like. The invention can also be practiced in distributed computing environments where tasks may be performed by remote processing devices that are linked through a communications network.
Each program may be implemented in a high level procedural or object oriented programming language to communicate with a processing system. However, programs may be implemented in assembly or machine language, if desired. In any case, the language may be compiled or interpreted.
Program instructions may be used to cause a general-purpose or special-purpose processing system that is programmed with the instructions to perform the operations described herein. Alternatively, the operations may be performed by specific hardware components that contain hardwired logic for performing the operations, or by any combination of programmed computer components and custom hardware components. The methods described herein may be provided as a computer program product that may include a machine readable medium having stored thereon instructions that may be used to program a processing system or other electronic device to perform the methods. The term “machine readable medium” used herein shall include any medium that is capable of storing or encoding a sequence of instructions for execution by the machine and that cause the machine to perform any one of the methods described herein. The term “machine readable medium” shall accordingly include, but not be limited to, solid-state memories, optical and magnetic disks, and a carrier wave that encodes a data signal. Furthermore, it is common in the art to speak of software, in one form or another (e.g., program, procedure, process, application, module, logic, and so on) as taking an action or causing a result. Such expressions are merely a shorthand way of stating the execution of the software by a processing system cause the processor to perform an action of produce a result.
While this invention has been described with reference to illustrative embodiments, this description is not intended to be construed in a limiting sense. Various modifications of the illustrative embodiments, as well as other embodiments of the invention, which are apparent to persons skilled in the art to which the invention pertains are deemed to lie within the spirit and scope of the invention.