US 7375728 B2

Abstract

A virtual mirror is rendered by a computer graphics display or screen. The display substantially replicates the optical properties of a physical mirror. The display is provided on a host vehicle.
Claims (68)

1. A method of generating a display on a mobile body, comprising:
obtaining position information including coordinates that are indicative of a position of the mobile body on the surface of the Earth;
obtaining display data indicative of items in proximity to the mobile body based on the position information;
generating a virtual mirror display of the items that would be seen in a mirror with an unobstructed view based on the display data.
2. The method of
performing a mirror transformation on the display data, the mirror transformation transforming the display data into reflected data indicative of a reflection of the items.
3. The method of
performing a stencil transformation on the reflected data to obtain stencil data indicative of conformance of the reflected data to a simulated physical mirror boundary.
4. The method of
performing a forward transformation on the stencil data to obtain perspective data indicative of the stencil data in perspective on the virtual mirror display.
5. The method of
rendering the virtual mirror display based on the perspective data.
6. The method of
prior to performing the mirror transformation, determining whether the display data is indicative of an item that will be displayed.
7. The method of
if the display data is indicative of an item that will not be displayed, selecting other display data prior to performing the mirror transformation.
8. The method of
representing lines or object surface boundaries in the virtual mirror display by coordinate vectors; and
mathematically transforming the coordinate vectors to create visual representations of objects.
9. The method of
obtaining mobile body orientation data indicative of an orientation of the mobile body.
10. The method of
accessing a geospatial database based on the position information.
11. The method of
sensing objects proximate to the mobile body and not found in the geospatial database.
12. The method of
generating the virtual mirror display as a rearview mirror on the motor vehicle displaying items obstructed from view by the motor vehicle.
13. The method of
obtaining object information indicative of location of an object proximate the mobile body; and
generating the virtual mirror display based on the object information.
14. The method of
obtaining virtual mirror data indicative of spatial characteristics of the virtual mirror; and
generating the virtual mirror display based on the virtual mirror data.
15. The method of
obtaining location, orientation, shape and size data indicative of a desired location, orientation, shape and size of the virtual mirror.
16. The method of
obtaining virtual mirror data indicative of optical characteristics of the virtual mirror.
17. The method of
obtaining user information indicative of an orientation of a user viewing the virtual mirror display; and
modifying the virtual mirror display based on changes in the user information.
18. The method of
tracking user eye orientation.
19. The method of
tracking user head orientation.
20. The method of
displaying the items differently based on a position of the items relative to the mobile body.
21. The method of
changing a visual characteristic of the items based on a distance the items are from the mobile body.
22. The method of
changing an attribute of the virtual mirror.
23. The method of
dynamically changing a computed radius of curvature of the virtual mirror in generating the virtual mirror display.
24. The method of
changing the virtual mirror between flat, concave and/or convex.
25. The method of
changing the computed radius of curvature of different portions of the mirror such that the different portions have different radii of curvature.
26. The method of
generating a rendering of the items.
27. The method of
generating a plurality of virtual mirror displays.
28. The method of
generating a display element representative of the items.
29. The method of
selectively suppressing light reflection that would otherwise be seen in the virtual mirror display.
30. The method of
oscillating display of the items in the virtual mirror display to create depth.
31. The method of
obtaining item information, indicative of physical characteristics of the items in proximity to the mobile body, by wireless transmission from the items.
32. The method of
33. The method of
34. The method of
accessing a database based on the portion of the VIN, and returning data indicative of the shape of the vehicle.
35. The method of
displaying the shape of the vehicle based on the data indicative of the shape.
36. The method of
37. The method of
generating an alarm based on a position of the items relative to the mobile body.
38. The method of
rendering the display on a flat panel display.
39. The method of
providing a stimulation output to stimulate a sense modality of a user based on a position of an item relative to the mobile body.
40. The method of
invoking exposed three dimensional graphics programming interfaces.
41. The method of
defining a stencil to be drawn.
42. The method of
defining a mirror coordinate system using a model view matrix.
43. The method of
rendering the stencil; and
activating portions of the stencil.
44. The method of
defining the model view matrix as a mirror transformation; and
rendering a mirror view based on the model view matrix.
45. The method of
defining the model view matrix as a forward-looking view;
activating different portions of the stencil; and
rendering the forward-looking view behind the virtual mirror.
46. A display system for generating a display on a mobile body, the display system comprising:
a location system generating location data including coordinates that are indicative of a location of the mobile body on the surface of the Earth;
a geospatial database storing display item information indicative of items to be displayed;
a display device; and
a controller coupled to the location system, the display device, and the geospatial database and configured to access the geospatial database based on the location data and output display data to the display device indicative of a virtual mirror display of the items.
47. The display system of
48. The display system of
49. The display system of
a projector receiving the display data and projecting the virtual mirror display on the display device.
50. The display system of
a ranging system, coupled to the controller, generating object data indicative of objects proximate the mobile body.
51. The display system of
52. The display system of
an imaging system, coupled to the controller, generating object data indicative of physical characteristics of objects proximate the mobile body.
53. The display system of
54. The display system of
55. The display system of
56. The display system of
57. The display system of
58. The display system of
59. The display system of
60. The display system of
61. A computer generated virtual mirror display on a mobile body, comprising:
a vehicle location system configured to generate a location signal including coordinates of a position of the mobile body on the surface of the Earth;
a geospatial database including road data, the road data including road boundary information corresponding to a boundary of a road on which the mobile body is traveling;
a conformal virtual mirror display configured to display a virtual image representing the boundary of the road based on the road boundary information and the location signal;
wherein the virtual image is displayed in a manner that generates a visual effect of the virtual image substantially overlying the corresponding boundary of the road that would be seen if the display were a mirror having an unobstructed view of the road.
62. The computer generated display of
a conformal mirror display of items that would be seen in a mirror mounted on the vehicle if the view were unobstructed.
63. The computer generated display of
road lane boundaries.
64. The computer generated display of
sensed objects proximate to the mobile body.
65. The computer generated display of
other mobile bodies.
66. The computer generated display of
at least one of traffic signs, guard rails, other road furniture, and landmarks.
67. The computer generated display of
68. The computer generated display of
Description

The present invention hereby incorporates by reference and claims priority from co-pending U.S. patent application Ser. No. 09/618,613, filed Jul. 18, 2000.

The present invention deals with mirrors. More particularly, the present invention deals with a vision enhancement device in the form of a computer generated virtual replica of a mirror.

Mirrors associated with motor vehicles assist the driver by providing spatial awareness of traffic conditions behind (and to the sides of) the driver's vantage point. Conventional mirrors have several disadvantages. A first of these disadvantages is the blind zones associated with a conventional mirror. Mirrors of conventional widths, on automobiles, provide a viewing angle of approximately 15 degrees to 30 degrees. Typically, drivers set the mirrors such that there is a great degree of overlap in these already narrow fields of view. Obstructions and limitations to the fields of view are due to the location and size of real mirrors. The problems associated with large blind spots, or blind zones, are aggravated for elderly people who may have less acute peripheral vision and/or may encounter difficulty in turning their heads.

Another problem with conventional mirrors is that the view from the mirror to the rear or side of the vehicle is often obstructed. For example, a passenger's head or vehicle pillars may obstruct the rearward or sideward looking views from the driver or from the mirror. Similarly, on large vehicles, such as trucks and sport utility vehicles, the views from the mirrors can be obstructed by portions of the vehicle body itself.

Another disadvantage presented by conventional mirrors is vibration. While this disadvantage is not as pronounced on small passenger cars, it presents a great deal of difficulty on large, commercial vehicles. As such a vehicle encounters uneven road surfaces, induced torques are transmitted to the mirror mounting hardware, which causes the mirror to vibrate.
These vibrations can be at frequencies which significantly disrupt the ability of the driver to distinguish meaningful information from the mirror. Also, the shaky reflection from a vibrating mirror is caused by the changing mirror orientation: a change of θ in mirror orientation creates a change of 2θ in image position. Therefore, as the mirror oscillates back and forth, the image also oscillates back and forth with twice the magnitude of the mirror oscillation, leading to a blurred image.

Another disadvantage of conventional mirrors is that they must be quite large in order to provide a significant rearward or sideward looking view. However, any mirror mounted forward of the operator obstructs the forward view. Therefore, large conventional mirrors can undesirably obstruct the operator's forward looking view.

Another disadvantage arises from the fact that many commercial vehicles use convex mirrors in addition to flat mirrors. The convex mirrors show a wider area than flat mirrors. However, objects reflected in convex mirrors appear smaller than the same objects viewed in a flat mirror. Due to this effect, an object appears to be further away from the host vehicle than it actually is. This requires drivers to allow for distance overestimation when maneuvering the vehicle.

One prior art technique for addressing some of the deficiencies of conventional mirrors involves the use of periscopes. Such a periscope consists of the usual inside mirror coupled to an ensemble of mirrors located on the roof of the vehicle. These mirrors are presumed to have an unrestricted field of view. However, such periscopes have been found to be very large and bulky, and they adversely increase the air resistance and drag of the vehicle.

Other prior art approaches to addressing the problems associated with conventional mirrors involve a TV camera, fitted with wide angle optics, installed on the rear of the vehicle.
This technique also requires a monitor in the instrument panel of the vehicle to provide a wide, unobstructed image. Such a setup requires a one meter wide display to cover a space at a distance of 70 meters away from the camera, at a one-to-one magnification. This translates to approximately a 90 degree field of view. In addition, such a TV monitor hooked up to a rearward looking camera does not operate satisfactorily under poor visibility conditions. The display can also lead to confusion because of the likely absence of the right-left lateral inversion associated with a reflected image.

A virtual mirror is rendered by a computer graphics display. The display substantially replicates the optical properties of a physical mirror. The display is provided on a host vehicle. The present invention can be used with substantially any mobile body. However, the present description proceeds with respect to an illustrative embodiment in which the invention is implemented on a motor vehicle as a driver vision assist device (or virtual mirror).

In one embodiment, controller Operator interface As is described in greater detail later in the specification, vehicle location system Geospatial database The digital map stored in the geospatial database In one embodiment, each road point of the digital map in database

It should be noted that there are many possible coordinate systems that can be used to express a location on the surface of the earth, but the most common coordinate frames include the longitudinal and latitudinal angles, the state coordinate system, and the county coordinate system. Because the earth is approximately spherical in shape, it is convenient to determine a location on the surface of the earth if the location values are expressed in terms of an angle from a reference point. Longitude and latitude are the most commonly used angles to express a location on the earth's surface or in orbits around the earth.
Latitude is a measurement on a globe of location north or south of the equator, and longitude is a measurement of the location east or west of the prime meridian at Greenwich, the specifically designated imaginary north-south line that passes through both geographic poles of the earth and Greenwich, England. The combinations of meridians of longitude and parallels of latitude establish a framework or grid by means of which exact positions can be determined in reference to the prime meridian and the equator. Many of the currently available GPS systems provide latitude and longitude values as location data.

Even though the actual landscape on the earth is a curved surface, it is recognized that land is utilized as if it were a flat surface. A Cartesian coordinate system whose axes are defined as three perpendicular vectors is usually used. Each state has its own standard coordinate system to locate points within its state boundaries. All construction and measurements are done using distance dimensions (such as meters or feet). Therefore, a curved surface on the earth needs to be converted into a flat surface, and this conversion is referred to as a projection. There are many projection methods used as standards for various local areas on the earth's surface. Every projection involves some degree of distortion, due to the fact that the surface of a sphere is constrained to be mapped onto a plane.

One standard projection method is the Lambert Conformal Conic Projection Method. This projection method is extensively used in an ellipsoidal form for large scale mapping of regions of predominantly east-west extent, including topographic quadrangles for many of the U.S. state plane coordinate system zones, maps in the International Map of the World series, and the U.S. State Base maps. The method uses well known, and publicly available, conversion equations to calculate state coordinate values from GPS receiver longitude and latitude angle data.
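The forward conversion can be sketched in the simpler spherical form of the Lambert Conformal Conic projection (real state plane conversions use the ellipsoidal form with published zone constants; the function name, parameter names, and the example standard parallels below are illustrative assumptions, not values from the specification):

```python
import math

def lcc_forward(lat, lon, lat0, lon0, lat1, lat2, R=6378137.0):
    """Project geodetic (lat, lon) to planar (x, y) in meters using the
    spherical Lambert Conformal Conic projection with standard parallels
    lat1 and lat2 and projection origin (lat0, lon0). Angles in radians."""
    def t(phi):  # tan(pi/4 + phi/2), the colatitude half-angle tangent
        return math.tan(math.pi / 4 + phi / 2)
    # Cone constant n and scale factor F from the two standard parallels
    n = (math.log(math.cos(lat1) / math.cos(lat2))
         / math.log(t(lat2) / t(lat1)))
    F = math.cos(lat1) * t(lat1) ** n / n
    rho = R * F / t(lat) ** n     # radius to the parallel being projected
    rho0 = R * F / t(lat0) ** n   # radius to the origin parallel
    theta = n * (lon - lon0)      # convergence angle
    return rho * math.sin(theta), rho0 - rho * math.cos(theta)
```

By construction the projection origin maps to (0, 0), and longitudes east of the origin meridian map to positive x.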
Ranging system Inter-vehicle communication can also be used to detect other vehicles and the positions of those vehicles. Vehicle location, speed, size, weight and shape can be communicated to all other vehicles. Vehicle characteristics are encoded in the vehicle identification number (VIN) of each vehicle, while vehicle dynamics and braking characteristics can be determined or inferred from the VIN as well, so communicating the VIN from vehicle-to-vehicle is one way of communicating this information. This can be done using a suitable wireless communication device.

Display Alternatively, display It should also be noted, however, that combiners or other similar devices or flat screen displays can be placed about the driver to cover fields of view directed to both rear view and side view mirrors. This can illustratively be implemented using a plurality of projectors, or a single projector with appropriate optics to scan the projected image across the appropriate fields of view. It can also be accomplished by simply providing the controller signal to multiple flat screen or flat panel displays.

Before discussing the operation of system In a specific illustrative embodiment, differential GPS receiver and correcting system Optional driver's eye (or camera) position tracking system Alternatively, optional driver's eye position tracking system Projector It should be noted that combiners Each combiner

Another embodiment is a helmet supported visor (or eyeglass device) on which images are projected, through which the driver can still see. Such displays might include technologies such as those available from Kaiser Electro-Optics, Inc. of Carlsbad, Calif., The MicroOptical Corporation of Westwood, Mass., Universal Display Corporation of Ewing, N.J., Microvision, Inc. of Bothell, Wash., IODisplay System LLC of Menlo Park, Calif.

The heading angle of the vehicle can be estimated from the past history of the GPS location data.
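Heading estimation by differencing successive GPS fixes can be sketched as follows (a minimal sketch; the function name, the planar-coordinate input, and the noise-rejection threshold are illustrative assumptions, not from the specification):

```python
import math

def heading_from_fixes(fixes, min_dist=0.5):
    """Estimate heading (radians, measured counterclockwise from east,
    i.e. the +X axis) from the two most recent planar GPS fixes.
    Fixes closer together than min_dist meters are skipped, since
    differencing nearly coincident points amplifies GPS noise.
    Returns None until enough motion has been observed."""
    x1, y1 = fixes[-1]
    for (x0, y0) in reversed(fixes[:-1]):
        if math.hypot(x1 - x0, y1 - y0) >= min_dist:
            return math.atan2(y1 - y0, x1 - x0)
    return None
```

Successive differentiation like this is noisy at low speed, which is consistent with the text's note that a magnetometer or inertial unit can substitute for an absolute heading measurement.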
Alternatively, a rate gyroscope can be used to determine vehicle heading as well. As noted initially, though heading angle estimation by successive differentiation of GPS data can be used, any other suitable method to measure an absolute heading angle can be used as well, such as a magnetometer (electronic compass) or an inertial measurement unit.

Once controller A number of transformation matrices are constructed, as described in greater detail below, from the location and heading angle of the moving vehicle, and from the optional driver's head or eye data and vehicle orientation data, where that data is sensed. The location data is converted into a local coordinate measurement using the transformation matrices, and is then fed into the perspective projection routines to calculate and draw the road shape and sensed object icons in the computer's graphic memory. The road shape and sensed object icons are then projected as a virtual view in the driver's visual field, as illustrated in

In any case, after the vehicle position data is received, the ranging information from ranging system Based on the position signal, controller Controller However, where the geospatial database does not indicate an expected sensed object in the present location, the sensed object information is determined to correspond to an unexpected sensed object, such as a moving vehicle behind or to the side of the host vehicle, a stalled car or a pedestrian on the side of the road, or some other transitory sensed object that has not been mapped to the geospatial database as a permanent, or expected, sensed object.

It has been found that if all expected sensed objects are brought to the operator's attention, this substantially amounts to noise, so that when real sensed objects are brought to the operator's attention, they are not as readily perceived by the operator.
Therefore, optional filtering of sensed objects not posing a threat to the driver can be performed as is illustrated by block

Once such sensed objects have been filtered, the transformation is performed using the transformation matrices. The result of the transformation provides the road boundary data, as well as the sensed object data, seen from the driver's eye perspective through the mirror. The road boundary and sensed object data is output, as illustrated by block

It should also be noted that the actual image projected is clipped such that it only includes that part of the road which would be visible by the operator with an unobstructed mirror view. Clipping is described in greater detail below.

In order to generate the virtual mirror display, controller If the line is trivially rejected, then processing returns to block The lines which will be visible on the mirror display after the mirror transformation is performed are output and controller

Once the stencil transformation is performed, the points in the virtual mirror display can be treated like ordinary points in the forward field of view. That is, they can undergo the entire sequence of forward transformation and be mapped for viewing. The forward transformation enables perspective projection so that a three-dimensional world can be represented on a two-dimensional space, while retaining the feeling of "depth" in the scene. Thus, parallel lines seem to meet at a point called the vanishing point. Objects appear larger if they are closer to the point of view than if they are further away. This is referred to herein as "perspective foreshortening". Performance of the forward transformation is indicated by block

After the forward transformation has been performed, the line is rendered on the virtual mirror display based on the transformed data.
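The mirror transformation discussed above reflects scene geometry about the simulated mirror plane before the forward pipeline runs. A minimal sketch (the plane parameterization by a unit normal n through a point p, and the function name, are assumptions made for illustration):

```python
import numpy as np

def mirror_matrix(n, p):
    """4x4 homogeneous reflection about the plane through point p with
    normal n.  Applying it to scene vertices yields reflected data of
    the kind that is subsequently stenciled and projected."""
    n = np.asarray(n, dtype=float)
    n = n / np.linalg.norm(n)          # guard: force a unit normal
    p = np.asarray(p, dtype=float)
    M = np.eye(4)
    M[:3, :3] = np.eye(3) - 2.0 * np.outer(n, n)  # Householder reflection
    M[:3, 3] = 2.0 * n * np.dot(n, p)             # restore the plane offset
    return M
```

Because a reflection is an involution, applying the matrix twice returns every point to its original position, which is a convenient self-check.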
Rendering the line is illustrated by block Blocks

As briefly discussed above, the aim of the forward transformation is to achieve perspective projection so that the 3D world can be projected onto a 2D space. The procedure to accomplish this involves projecting all vertices in the world coordinate system onto an infinite 2D plane, referred to as the projection plane, by straight lines referred to as projectors. For perspective projection, all projectors meet at a single point referred to as the projection reference point (PRP). The view reference coordinate (VRC) system is an arbitrary coordinate system defined so that the projection plane becomes the XY plane of the view reference coordinate system, and the projection reference point (PRP) is a positive Z coordinate in this coordinate system. The view reference coordinate system is given by the homogeneous transformation matrix X

The term window denotes a rectangular area on the projection plane. The projection plane and the window lie in the Z=0 plane of the view reference coordinate system. The window is not to be confused with desktop windows or computer screen windows launched by window managers. The window limits the view seen from the camera position (PRP). The edges of the window are parallel to the X and Y axes of the view reference coordinate system. Therefore, it is possible to define a window by the minimum and maximum limits of the X and Y coordinates of the VRC:

WINDOW = [WNDLEFT  WNDRIGHT  WNDBOTTOM  WNDTOP]′    (Eq. 1)
where the WINDOW is represented by a four element vector; WNDLEFT = the minimum x limit; WNDRIGHT = the maximum x limit; WNDBOTTOM = the minimum y limit; and WNDTOP = the maximum y limit.

Four projectors By the very manner in which the frustum was generated, it is evident that the window The viewport

All coordinate systems are defined in terms of 4×4 homogeneous transformation matrices. The first, second and third columns are unit vectors that give the directions of the X, Y and Z axes of the coordinate system with respect to the axes of the World Coordinate System. The fourth column gives the position of the origin of the coordinate system with respect to the World Coordinate System. The fourth row of the homogeneous transformation matrices is always [0 0 0 1]. The notation used here to denote transformation matrices and their inverses is described as follows: homogeneous transformation matrices are 4×4 matrices that describe 3 dimensional rotation, translation and scaling. These matrices transform one coordinate system to another coordinate system as follows.

Translation to point {x, y, z}; x, y and z in meters:

Trans(x, y, z) =
[ 1  0  0  x ]
[ 0  1  0  y ]
[ 0  0  1  z ]
[ 0  0  0  1 ]
Rotation about the X axis through an angle of φ radians:

Rotx(φ) =
[ 1     0       0     0 ]
[ 0   cos φ  −sin φ   0 ]
[ 0   sin φ   cos φ   0 ]
[ 0     0       0     1 ]
Rotation about the Y axis through an angle of φ radians:

Roty(φ) =
[  cos φ   0   sin φ   0 ]
[    0     1     0     0 ]
[ −sin φ   0   cos φ   0 ]
[    0     0     0     1 ]
Rotation about the Z axis through an angle of φ radians:

Rotz(φ) =
[ cos φ  −sin φ   0   0 ]
[ sin φ   cos φ   0   0 ]
[   0       0     1   0 ]
[   0       0     0   1 ]
Scaling about the X, Y and Z axes by a factor a, b and c respectively:

Scale(a, b, c) =
[ a  0  0  0 ]
[ 0  b  0  0 ]
[ 0  0  c  0 ]
[ 0  0  0  1 ]
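The five matrices above can be sketched as numpy constructors (a minimal sketch; the function names are illustrative, not from the specification):

```python
import numpy as np

def trans(x, y, z):
    """Translation to point {x, y, z} in meters."""
    M = np.eye(4)
    M[:3, 3] = [x, y, z]
    return M

def rotx(phi):
    """Rotation about the X axis through phi radians."""
    c, s = np.cos(phi), np.sin(phi)
    return np.array([[1, 0, 0, 0], [0, c, -s, 0], [0, s, c, 0], [0, 0, 0, 1.0]])

def roty(phi):
    """Rotation about the Y axis through phi radians."""
    c, s = np.cos(phi), np.sin(phi)
    return np.array([[c, 0, s, 0], [0, 1, 0, 0], [-s, 0, c, 0], [0, 0, 0, 1.0]])

def rotz(phi):
    """Rotation about the Z axis through phi radians."""
    c, s = np.cos(phi), np.sin(phi)
    return np.array([[c, -s, 0, 0], [s, c, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1.0]])

def scale(a, b, c):
    """Scaling about the X, Y and Z axes by a, b and c."""
    return np.diag([a, b, c, 1.0])
```

Multiplying any number of these together yields a composite whose fourth row remains [0 0 0 1], matching the property stated in the text.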
Any number of translation, rotation and scaling matrices can be multiplied together to form a composite or aggregate matrix. The fourth row is always [0 0 0 1] for the composite matrix. Given that a homogeneous transformation matrix transforms coordinate system A to coordinate system B, the matrix inverse transforms coordinate system B to coordinate system A.

Inverse of translation matrix:

Trans(x, y, z)⁻¹ = Trans(−x, −y, −z)
Pure rotational matrices are orthonormal, and hence their inverses are just their transposes.
Since the scaling matrix is a diagonal matrix, its inverse is a diagonal matrix with the elements of the diagonal being reciprocals of the elements in the original matrix.
Composite matrices formed by multiplying any number of translating, rotational and scaling matrices have a simple inverse.
The 3×3 principal sub-matrix of the inverse is the transpose of the 3×3 principal sub-matrix of the original matrix. The first three elements of the fourth column of the inverse are the negatives of the dot products of the fourth column of the original matrix with the first three columns of the original matrix. This avoids the necessity to use generic but computationally expensive methods for computing the matrix inverse, like the known Gauss-Jordan elimination.

For purposes of this discussion, the World Coordinate System (WCS) is a fixed reference coordinate system with respect to which all other coordinate systems are defined. The host coordinate system is given by the homogeneous transformation matrix X_GPS. Given the following:
1. {^WCS x, ^WCS y} are the coordinates of the host vehicle GPS antenna 28 in the WCS (in meters).
2. The GPS antenna 28 is at a height of GPSHeight meters above the ground (z=0 plane of the WCS). In one example embodiment, GPSHeight = 3.4036 m (134″).
3. θ_2 is the angle (in radians) between the longitudinal axis 125 of the host vehicle 122 and the east direction (the X axis of the WCS).

This leads to:

X_GPS = Trans(^WCS x, ^WCS y, GPSHeight) · Rotz(θ_2 − π/2)    (Eq. 12)
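The fast-inverse shortcut described above can be sketched for the rigid (rotation plus translation) case; note this sketch assumes no scaling component, since a plain transpose only inverts an orthonormal sub-matrix (the function name is illustrative):

```python
import numpy as np

def fast_rigid_inverse(M):
    """Invert a 4x4 rigid homogeneous transform using the shortcut in
    the text: the 3x3 principal sub-matrix of the inverse is the
    transpose of the original's, and each of the first three elements
    of the inverse's fourth column is the negated dot product of the
    original fourth column with the corresponding original column."""
    R = M[:3, :3]
    t = M[:3, 3]
    inv = np.eye(4)
    inv[:3, :3] = R.T
    inv[:3, 3] = -R.T @ t   # equivalently: -dot(t, column_i) for i = 0, 1, 2
    return inv
```

This avoids general-purpose (e.g. Gauss-Jordan) inversion, as the text notes.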
Measurements of the eye position (or camera position) and field of view need to be taken with respect to some coordinate system located inside the host vehicle cabin, because the driver's eye (or camera) is located there. Therefore, we introduce another coordinate system called the windshield coordinate system. The windshield coordinate system is given by the homogeneous transformation matrix X_SCS. This means that if X Hence, given the following:
1. ^SCS gps_x = X coordinate of GPS antenna in X_SCS coordinates = 0
2. ^SCS gps_y = Y coordinate of GPS antenna in X_SCS coordinates = 1.3716 m (54″)
3. ^SCS gps_z = Z coordinate of GPS antenna in X_SCS coordinates = 1.143 m (45″)

this leads to:

X_SCS = X_GPS · Rotx(π/2) · Trans(−^SCS gps_x, −^SCS gps_y, −^SCS gps_z)    (Eq. 13)
The following parameters are required to calculate the forward transformation. The location of the camera is conveniently measured in the X
1. PRPx = X coordinate of camera lens in X_SCS coordinates
2. PRPy = Y coordinate of camera lens in X_SCS coordinates
3. PRPz = Z coordinate of camera lens in X_SCS coordinates

The camera position in X_SCS is:

^SCS PRP = [PRPx  PRPy  PRPz  1]′    (Eq. 14)
The window
1. WNDLEFT = minimum X coordinate on the windshield corresponding to the window;
2. WNDRIGHT = maximum X coordinate on the windshield corresponding to the window;
3. WNDBOTTOM = minimum Y coordinate on the windshield corresponding to the window;
4. WNDTOP = maximum Y coordinate on the windshield corresponding to the window;

where all coordinates are X_SCS coordinates, the window is given by Eq. 1.
The near clipping plane The viewing volume Assuming that the eye (or camera) is in a fixed position and does not change orientation, X
The above transformation can readily be recognized as a similarity transformation. The yaw and roll take place in a temporary coordinate system defined at the camera lens such that the axes of this coordinate system are parallel to X
The camera yaw does not affect the position of the camera relative to the new projection plane Again, note that for zero roll,
With the parameters set in this way, the forward transformation can now be computed. The entire forward transformation sequence is illustrated by the flow diagram of Once the view reference coordinate system is computed, the next step is to compute the normalizing viewing transformation N Once the lines are clipped, controller The process of projection (illustrated by block An example may be helpful. The MnRoad Low Volume Road (LVR) is located at the Minnesota Department of Transportation (MnDOT) facility at Albertville, Minn., and will be used as the basis of this example. The center line stripe CL consists of the following six segments: - 1. Two large arcs CD and EF with a radius of curvature 275′ which subtend an angle of 262 deg 03 min 26.4 sec.
2. Two small arcs BC and FA with a radius of curvature 275′ which subtend an angle of 82 deg 03 min 26.5 sec.
3. Two straight lines AB and DE that are inclined at an angle θ_1 = +142.6° from the east.
The CL can be described by the ten points that define its geometry. Four of the points, viz. se, nw, SE and NW, represent the centers of the four curved sections, while the other six, viz. A, B, C, D, E and F, represent the starting points for the six segments. These ten points were made available in Wright County coordinates. They were converted to Minnesota South State Plane coordinates using the publicly available MnCon software provided by the Minnesota Department of Transportation. This software converts coordinates between any two of 232 map projections, including the three Minnesota State Plane zones in both NAD27 and NAD83 coordinates. In addition to the ten points which describe the geometry, there are two more points cc The coordinates of these twelve points in the Wright County, Minnesota South State Plane and World Coordinate systems are given in Table 1.
The inner lane has an 8′ wide shoulder and the outer lane has a 12′ wide shoulder. Thus the entirety of the LVR can be thought to be composed of five polylines—the center lane stripe, the outer lane boundary, the inner lane boundary, the outer shoulder and the inner shoulder. The inner shoulder boundary (IS), the inner lane boundary (ILB), the outer lane boundary (OLB) and the outer shoulder boundary (OS) are offset from the CL at lateral distances of −20′, −12′, +12′ and +24′ respectively. Negative lateral distance for a loop implies that the loop lies within the area enclosed by the CL and positive offset distances imply otherwise. The road database that serves the virtual mirror application is made up of four linked lists of lines—the first contains the inner shoulder and the inner lane boundary, the second contains the CL, the third contains the outer lane boundary and the outer shoulder and the fourth contains the two calibration crosses. The host vehicle position and orientation are given by the following equations and these values are used in the subsequent calculations as an example of but one position only. As the vehicle moves, these values will, of course, be updated:
where,
θ denotes the orientation of the host vehicle. The camera position coordinates imply that the camera lens is 0.127 m (5″) to the right of the windshield origin (i.e. on the passenger side), 0.2921 m (11½″) above the windshield origin and 0.7112 m (28″) away from the plane of the windshield. The view seen through the camera (or from the eye perspective), after eliminating the portion that will occupy the top 80 rows on the computer screen (which may be reserved for the GUI), is the window. The near and far clipping planes are then specified, and the normalizing viewing transformation N is computed. The canonical perspective viewing volume is still a truncated frustum of a four-sided pyramid with its apex located at the origin of the World Coordinate System. Its six faces are given by: - 1. The near clipping plane 118: z=z_{min}, where −1<z_{min}<0 (z_{min} is given by Eq. 35 below)
- 2. The far clipping plane 120: z=−1
- 3. The plane x=z (the lower dotted line in FIG. 13A)
- 4. The plane x=−z (the upper dotted line in FIG. 13A)
- 5. The plane y=z
- 6. The plane y=−z
By reducing the arbitrary viewing volume to a standard shape, it becomes straightforward to apply a transformation-independent clipping algorithm to the standard shape. The normalizing viewing transformation N is defined in terms of the following quantities:
- PRP(n), n=1, 2, 3 are the n^{th} elements of the 4×1 vector PRP (given by Eq. 19)
- VRP′(3)=3^{rd} element of the 4×1 vector VRP′
- B=far clipping plane distance from the projection plane=−1000 (meters)
- WINDOW(n), n=1, 2, 3, 4 are the n^{th} elements of the 4×1 vector WINDOW
- SH is the shearing matrix

Each line is assigned a pair of 4×1 column vector endpoints at inception. The lines are now clipped against the six faces of the canonical viewing volume using the Liang-Barsky clipping algorithm, as indicated by the corresponding block. The function call for clipping is:
The parameters passed to the Liang-Barsky clipping function Clip3D( ) are the (transformed) line endpoints and VRP′(3), the third element of the 4×1 vector VRP′ (given by Eq. 31). If the line lies partly or completely inside the viewing volume, then the Clip3D function sets a visible variable to 1 and updates the endpoints to the clipped values. After applying the projection transformation and homogeneous division (as discussed below), the frustum-shaped viewing volume is transformed into a parallel-projection volume. The parallel-projection volume is given by its six planes z=0, z=−1, x=−1, x=1, y=−1 and y=1. The projection transformation matrix M is given by:
Each visible vertex from the previous (clipping) stage is multiplied by this projection matrix.
As can be seen, the last row of the matrix M is not [0 0 0 1]. Hence this transformation converts the vertices into what are called homogeneous coordinates [x y z w]′, where w is not equal to 1. To convert homogeneous coordinates back to 3D coordinates, each element of the 4×1 column vector is divided by its fourth element w. Hence,
After homogeneous division, the Z coordinate denotes the depth of the point. Most graphic device contexts define their origin (0,0) at the upper left corner, with the X coordinate increasing from left to right and the Y coordinate increasing from top to bottom. Therefore, the origin of the projected coordinate system is first translated from (0,0) to the upper left corner (−1, +1). The X axis is then scaled up by VIEWPORT_W/2 and the Y axis by VIEWPORT_H/2, and the Y coordinates are reflected about the X axis. This is accomplished by the following transformation:
For the odd graphic device context which has its origin at the lower left corner, with X coordinates increasing from left to right and the Y coordinate increasing from bottom to top:
Thus, after applying the viewport transformation, the coordinates are expressed in screen (pixel) coordinates.
A camera (or eye) view corresponding to the present example is shown in the accompanying figure. The mirror transformation sequence is shown in greater detail as well; each of these items is now discussed in turn. This section also includes an example of a mirror transformation that includes the mapping-to-viewport stage and shows how the virtual mirror display will not match the camera image if this stage is performed. This shows that a link between the mirror transformation and the forward transformation (the stencil transformation) is needed.

The basic reflection transformation is now discussed. A Mirror Coordinate System (MCS) is defined so that the mirror plane lies in its XY plane. If a vertex has coordinates w={x, y, z} with respect to the MCS, then the reflected image of the vertex has coordinates w′={x, y, −z} with respect to the MCS. Therefore, reflection is achieved by simply flipping the sign of the Z coordinate. This is achieved by premultiplying the vertex w by the scaling matrix Z.

However, points are not defined in the MCS. They are usually defined with respect to the World Coordinate System (WCS). The coordinates of the reflection v′ are therefore derived in world coordinates. The vertex is first transformed into the MCS, where the reflected vertex has coordinates w′; as already discussed, w′ is obtained from w by flipping the sign of the Z coordinate of w, i.e. premultiplying w by the scaling matrix Z. Finally, the result is transformed back into world coordinates. Hence the net transformation R_{Mirror} (Eq. 51) is the composition of these steps. The mirror coordinate system X_{Mirror} is defined next.
Hence the vectors OC′ and OA form the X and Y axes of the mirror coordinate system, respectively. The X, Y and Z coordinates of the origin of this mirror coordinate system are expressed in world coordinates.

The various parameters required to calculate the mirror transformation are now discussed. The camera position is given above; if the driver's eye were located at the camera origin, then similar measures would be calculated for the eyes. The mirror dimensions are converted using I2M=0.0254, a factor which converts inches to meters. Points behind the mirror plane are clipped; otherwise, points behind the mirror would incorrectly appear in the reflected view. With these parameters, the mirror transformation can be computed. With the exception of a different normalizing viewing transformation and the absence of the mapping-to-viewport stage, the mirror transformation is the same as the forward transformation; therefore, it is not discussed in great detail here. Each vertex is first reflected about the mirror plane using the reflection matrix R where, as per Equation 29,
As discussed with respect to the forward transformation, clipping is accomplished using the Liang-Barsky clipping algorithm. The Z coordinates of the lines are then ignored, so a 2D projected view is obtained. The mirror transformation stops with the projection; the last stage, viz. mapping to the viewport, is not carried out. As noted above, directly mapping the projected view to the viewport does not give the desired result. The 2D rectangle of the projected view needs to be mapped to the 2D rectangle of the mirror. To this effect, the projected view area bounded by −1<x<1 and −1<y<1 is first scaled up to the mirror dimensions. Given that:
where, MIRROR(n), n=1, 2, 3 and 4 are the 4 elements of the 4×1 vector MIRROR as given by Equation 57, the scaling matrix is:
Next, the origin (0,0) of this scaled view is translated to the center of the mirror, where CM(n), n=1, 2, 3 are the nth elements of the 4×1 vector CM given by Equation 67. The above scaling and translation are with respect to the mirror coordinate axes. To summarize, the stencil transformation is a simple 2D-space to 2D-space transformation. It transforms the 2D projected view of the mirror transformation onto the mirror rectangle.

The scene displayed by the virtual mirror consists of lines. Each line has to be transformed, clipped and projected. Transformation consists of multiplying each of the two 4×1 endpoints by a 4×4 transformation matrix. The line is then clipped against the six faces of the perspective canonical volume by the Liang-Barsky algorithm. Next it is projected, which involves multiplying the endpoints by the 4×4 projection matrix followed by homogeneous division. These computations are expensive to perform. If the line lies far away from the host vehicle, so that it may not even be inside the original viewing volume, or if it is inside the viewing volume but too far away to substantially affect the final rendered view, then the line can optionally be rejected before the computations rather than going through the computations and then being rejected.

To accomplish this, a 2D rectangular clipping region is introduced around the host vehicle. The elevation data (Z coordinate) of the line endpoints, if any, is ignored. All the lines are examined to see if they can be "trivially rejected" against the rectangular clipping region by the known Cohen-Sutherland algorithm. The Cohen-Sutherland algorithm has a trivial rejection portion which can be used for line clipping. Only lines which can be trivially rejected are precluded from the transformation, clipping and projection stages. The four sides of the rectangular clipping region divide the plane into nine areas. Each area is identified by a four-bit "outcode".
A line endpoint lying in a particular area is assigned the same outcode as the area. If the logical AND of the outcodes of the line endpoints is not zero, then the line is trivially rejected. To compute the endpoint outcode, the four bits comprising the outcode are set as per Table 2.
Using Table 2, the endpoint outcodes for the lines AB, AC, AD and AE shown in the accompanying figure can be computed.
Computing an outcode consists of testing four floating-point differences. Examining for trivial rejection consists of testing the result of the binary AND operation performed on the two outcodes. Hence the trivial rejection algorithm can be performed very quickly.

In one embodiment, to reduce rendering time, a matrix optimization is performed. The line endpoints (represented by two 4×1 coordinate vectors) are multiplied by 4×4 matrices using C code constructs. This makes the code highly repetitive and very fast. Another optimization is to avoid multiplying by the last row of the homogeneous transformation matrices, since the last row is always [0 0 0 1]. This reduces the multiplication of a 4×4 homogeneous transformation matrix with a 4×1 coordinate vector to 12 multiplications and 9 additions instead of 16 multiplications and 12 additions. This brings the time taken to transform vertices down to about 10 ms per frame and the time required to draw one frame to between 33 and 65 ms, averaging 40 ms per frame.

An alternative embodiment of the present invention will now be discussed in which graphics APIs are used. One illustrative embodiment uses OpenGL, although others could be used as well. OpenGL is a low-level graphics API and a standard for 3D graphics; a mirror implementation using it is described in T. Hall, A How To For Using OpenGL to Render Mirrors, posting to comp.graphics.api.opengl, August 1996. The 3D scene in OpenGL is manipulated via two matrices—the projection matrix and the modelview matrix. The projection matrix allows the user to apply a perspective or orthographic projection. If a perspective projection is used, the user specifies a frustum-shaped viewing volume by specifying the left, right, bottom, top, near and far parameters.
The modelview matrix, so called because a transformation applied to a model in the scene is equivalent to applying the inverse of the transformation to the point of view, is used to place models and the viewpoint in the scene. The modelview matrix defines the current coordinate system. All drawing primitives such as lines and polygons are then drawn with respect to this coordinate system. The coordinate system can be translated, rotated or scaled to another coordinate system by means of the function calls glTranslate, glRotate and glScale, respectively. These function calls are highly optimized and hardware accelerated because they need to be called for millions of polygon vertices. Hardware acceleration means that the CPU no longer needs to perform the repetitive matrix multiplication operations on the vertices; this is done by the graphics accelerator.

In addition, if the graphics card is of the AGP variety, then the graphics card is mounted on a port on the motherboard, with which it communicates directly. Hence the standard PCI bus is spared the high-bandwidth communication between the CPU and the graphics card and can perform other I/O tasks more efficiently. AGP is twice as fast as the PCI bus, and the current AGP 4× is 8 times as fast as the PCI bus.

Graphics cards also have a double buffer, which means that the screen need not wait for the graphics card to compute the pixel data. Rather, the graphics card performs the calculations and updates a second buffer that is not visible to the user. When the buffer update is complete, the graphics card instantly swaps this buffer with the one that the user is currently seeing. This gives the feeling of smooth, connected flow between the frames. Complex models that do not distort internally can be stored in precompiled byte code called display lists.
The computations performed on the vertices within the model are stored while compiling the list, and when the list is subsequently redrawn, only the results of the computations are accessed and rendered. This feature increases the frame refresh rates.

Each 3D line or polygon vertex has a depth, usually a 32-bit value. OpenGL performs a sort on depth to correctly occlude the vertex of greater depth (lying farther away from the viewpoint) whenever two vertices compete for the same pixel. It helps to have this sorting hardware accelerated. Other features in OpenGL include an 8-bit α channel per pixel in addition to the 24-bit red, green, blue (RGB) value. This α channel determines the transparency of each pixel and allows blending a destination pixel with a transparent source pixel to give a "see-through" effect. OpenGL also includes texture mapping, which allows a bitmap loaded from a file to be mapped onto an arbitrarily shaped polygon. This adds realistic, life-like quality to the scene. Graphics cards can have textures resident in their own memory instead of in conventional RAM, which significantly speeds up the drawing of textures.

The most important OpenGL feature from the point of view of the virtual mirror application is the stencil buffer. A stencil is like a mask, and one can specify whether to draw to the stencil or around it. The number of stencils that can be used is controlled by the number of stencil bits in the stencil buffer. For 8 stencil bits, 2^8=256 stencil values are available. The entire environment is set up in the Windows® environment via the following structure:
The actual OpenGL code is quite simple. The basic procedure is as follows: - 1. Define a stencil, instructing that when it is drawn, the stencil bits be set to 1.
- 2. Set up the modelview matrix to equal the mirror coordinate system (X_{Mirror} of Equation 52).
- 3. Draw the stencil, which is a quadrilateral, representing the virtual mirror.
- 4. Next, only the stenciled portion (where the stencil bit is set to 1) is marked active, so that all the drawing commands that follow are restricted to the inside of the stencil and the lines and polygons that extend beyond the stencil edges will get clipped.
- 5. The modelview matrix is set up to equal the mirror transformation (R_{Mirror}, Eq. 51).
- 6. Render the entire scene. This creates the view in the mirror.
- 7. Set up the modelview matrix for the forward-looking view.
- 8. Only the non-stenciled portion (where the stencil bit is not set to 1) is marked active, so that all the drawing commands that follow are restricted to the outside of the stencil and the lines and polygons that extend inside the stencil edges will get clipped.
- 9. Render the entire scene. This creates the forward-looking view.
The following code snippet gives this entire algorithm in almost as many lines of code. All lines following a “//” or enclosed between a “/*” and a “*/” are comments.
In addition to the above basic algorithm, there are two refinements. First, the vertex normals need to be reversed when drawing polygons in the mirror view. This is because instead of viewing the front surfaces of the polygons as in the forward view, the rear surfaces of the polygons are now visible in the mirror view. This means that the surface that has to be culled (eliminated because it is not visible from the viewpoint) is the front surface. This automatic computation of the reversed normals and culling of the front face of the polygons instead of the rear face is achieved by:
Vector mathematics reveals that if the normal to the plane is outward (points away from the origin of the World Coordinate System), then d is the negative of the dot product of the unit normal to the mirror plane and the origin of the Mirror Coordinate System. Hence, if the Mirror Coordinate System is given by the homogeneous transformation matrix X_{Mirror}, both the unit normal and the origin can be read from its columns, and d follows directly. The clipping plane is now specified by the following code:
One representation of the virtual mirror, along with the forward-looking view, is shown in the accompanying figure. It can thus be seen that the present invention provides a vision assist device in the form of a virtual mirror. The virtual mirror addresses many of the disadvantages associated with conventional mirrors without the drawbacks of prior art techniques for addressing those disadvantages.

It should also be noted that the mirror representation can be manipulated as desired. For instance, any desired mirror optics (such as concave or convex) can be produced. Similarly, different parts of the mirror representation can emulate different mirror optics. Also, the mirror optics can change dynamically based on substantially any desired parameter. Thus, for example, the mirror can be flat when objects are closer, but convex when they are further away. Further, the present virtual mirror can be rendered over the top of a mirror with moving portions darkened and positioned to dull the glare from headlights or from the sun. Such an embodiment can be formed using an LCD film that selectively blackens areas based on sensed light or based on an estimate of glare from the sun or other vehicles, given the position and orientation of the host vehicle relative to the sun or other vehicle. Similarly, the present virtual mirror display can be controlled to oscillate slightly, at a desired frequency, to give an indication of depth, and the position of the display can likewise be adjusted.

Also, though the present mirror display can be used to present a mirror display of items which would otherwise be obstructed by the vehicle body itself, the vehicle body is not itself displayed. However, the vehicle body can be rendered in the display in outlined form so the driver can see the position of the vehicle body relative to the items in the mirror display. This may be particularly helpful when moving the vehicle through tight spaces or when backing up.
Further, the present invention can be enhanced with audio or haptic feedback. For example, if the driver moves the host vehicle too close to a detected object or lane boundary, a voice, scraping sound, or other sound can be generated. Similarly, haptic feedback can be provided, such as by (1) a vibrating seat which allows for control of the amplitude, frequency and position of the vibration or motion across the seat surface, or (2) a steering wheel which has a torque function applied to it to provide a position-dependent signal to the driver's hands. This signal can be a vibrational wave that directs the driver to turn right or left. The source of sound or vibration can be positioned to indicate the direction of the detected object or lane boundary. In addition, the mirror display can be enhanced as well. For instance, as detected objects draw near, they can be displayed differently, such as flashing or in a different color.

Although the present invention has been described with reference to preferred embodiments, workers skilled in the art will recognize that changes may be made in form and detail without departing from the spirit and scope of the invention.