US 7826637 B2 Abstract A digital watermark embedding apparatus for embedding watermark information into a digital content, obtains a predetermined topological invariant corresponding to the watermark information to be embedded into the digital content, and sets the predetermined topological invariant to the digital content by modifying the digital content into which the watermark information is to be embedded.
Claims(29) 1. A watermark embedding apparatus, comprising:
a processor comprising:
a homotopy class calculating unit configured to calculate a homotopy class of an input image to be watermarked;
a watermark information class transforming unit configured to calculate a homotopy class of watermark information; and
a homotopy class changing unit configured to change at least a portion of the input image based on a topological invariant, wherein the topological invariant is maintained when a local distortion which is a homeomorphism is applied to a topological space formed of a set of points of the input image, the local distortion including a Stirmark attack, the topological invariant corresponding to watermark information to be embedded into the input image, and the local distortion being expressed by functions, each of the functions depending on a position of the points;
wherein the homotopy class changing unit is further configured to change said at least said portion of the input image so that the homotopy class of the input image corresponds to the homotopy class of the watermark information.
2. The apparatus according to
the topological invariant is a homotopy invariant.
3. The apparatus according to
said changing unit is further configured to change a predetermined part of a bit string corresponding to a spatial or time-basis position of the input image to a target value determined by the position of the bit string and the topological invariant or a target restorable value.
4. The apparatus according to
said changing unit is further configured to generate a mapping to a target topological space defined by the predetermined part of said bit string from a base topological space defined by the spatial or time-basis position, and obtain the target value corresponding to the position based on a function for applying the topological invariant.
5. The apparatus according to
the function has the topological invariant as a parameter.
6. The apparatus according to
the target restorable value is a value obtained by scaling the target value, and
the changing unit is further configured to select a scale power for scaling the target value so as to decrease the influence on the input image for the target value corresponding to the position.
7. The apparatus according to
the changing unit is further configured to evaluate an influence on the input image if the topological invariant is set to the input image using a plurality of different functions, and to use a function evaluated to minimize the influence on the input image as the function for applying the topological invariant.
8. The apparatus according to
said changing unit is further configured to set a value, allowing a representative value representing a divided region which is obtained by dividing the input image in a spatial manner or a time-basis manner and obtained from a predetermined part of one or a plurality of bit strings corresponding to a spatial or time-basis position of the divided region to become a target value determined by the spatial or time-basis position of the divided region and the topological invariant or a target restorable value, to the predetermined part of the one or the plurality of bit strings.
9. The apparatus according to
the bit string corresponding to the plurality of positions enables configuring a content of one divided region, and
the representative value is an average value of values of the predetermined parts of the plurality of bit strings.
10. The apparatus according to
the input image into which the watermark information is to be embedded is a content dividable into a plurality sub-contents, and the topological invariant corresponding to the watermark information is set to the plurality of sub-contents.
11. The apparatus according to
the input image into which the watermark information is to be embedded is a content dividable into a plurality sub-contents, and a value according to a pseudo-random number generated based on the watermark information is set, as the topological invariant, to the plurality of sub-contents.
12. The apparatus according to
13. A watermark detecting apparatus, comprising:
a processor comprising:
a homotopy class calculating unit configured to calculate a homotopy class of a watermarked image; and
a detector configured to detect watermark information by obtaining a topological invariant, wherein the watermarked image comprises points, the topological invariant is maintained when a local distortion which is a homeomorphism is applied to a topological space formed of a set of points of the watermarked image, the local distortion including a Stirmark attack, the local distortion being expressed by functions, each of the functions depending on a position of the points;
wherein the detector is further configured to obtain watermark information corresponding to the calculated homotopy class.
14. The apparatus according to
the topological invariant is a homotopy invariant.
15. The apparatus according to
said detector is further configured to obtain a representative value representing a divided region obtained by dividing the watermarked image in a spatial manner or a time-basis manner based on a value read from a predetermined part of one or a plurality of bit strings corresponding to a spatial or time-basis position of the divided region, and obtain the topological invariant based on the representative value representing the divided region.
16. The apparatus according to
a plurality of positions are provided, a watermarked image of one divided region is constituted by the bit string corresponding to the plurality of positions, and the representative value is an average value of values of the predetermined part of the plurality of bit strings.
17. The apparatus according to
said detector is further configured to obtain a target value normalizing the representative value of the divided region, and perform a predetermined summing arithmetic operation for the target value and a difference in target values, thereby obtaining the topological invariant.
18. The apparatus according to
the watermarked image into which the watermark information is to be embedded is a content dividable into a plurality of sub-contents, topological invariants are detected from the plurality of sub-contents, respectively, and the watermark information is obtained based on the obtained topological invariants.
19. The apparatus according to
the watermarked image is image data of a still image or moving image frames, and
the part is a pixel value vector of the image data.
20. The apparatus according to
the watermarked image is moving image data, and
the part is a pixel value vector of the moving image data.
21. The apparatus according to
the watermarked image is voice data, and
the part is an amplitude value vector of the voice data.
22. The apparatus according to
23. A non-transitory computer readable medium having computer executable instructions, which when executed by a computer, cause the computer to perform a method comprising:
calculating a homotopy class of a watermarked image
detecting watermark information of the watermarked image by obtaining a topological invariant, wherein the watermarked image comprises points, the topological invariant is maintained when a local distortion which is a homeomorphism is applied to a topological space formed of a set of points of the watermarked image, the local distortion including a Stirmark attack, the local distortion being expressed by functions, each of the functions depending on a position of the points; and
obtaining watermark information corresponding to the calculated homotopy class.
24. The medium according to
25. The medium according to
26. A watermark embedding apparatus, comprising:
a processor comprising:
a watermark information class transforming unit configured to calculate a homotopy class of watermark information; and
a homotopy class changing unit configured to generate an embedding function corresponding to an element of the homotopy class of the watermark information and to change at least a portion of an input image based on a topological invariant, wherein the topological invariant is maintained when a local distortion which is a homeomorphism is applied to a topological space formed of a set of points of the input image, the local distortion including a Stirmark attack, the topological invariant corresponding to the embedding function and the local distortion being expressed by functions, each of the functions depending on a position of the points;
wherein the homotopy class changing unit is further configured to change said at least said portion of the input image so that the homotopy class of the input image corresponds to the homotopy class of the watermark information.
27. A watermark embedding apparatus, comprising:
a processor comprising:
a frame dividing unit configured to divide a moving image into frames;
a watermark information class transforming unit configured to calculate a homotopy class of watermark information;
a homotopy class changing unit configured to change at least a portion of each of the frames based on a topological invariant, wherein the topological invariant is maintained when a local distortion which is a homeomorphism is applied to a topological space formed of a set of points of a frame, the local distortion including a Stirmark attack, the topological invariant corresponding to watermark information to be embedded into moving image data, and the local distortion being expressed by functions, each of the functions depending on a position of the points; and
a frame composing unit configured to compose the frames having a portion which is changed into the moving image data;
wherein the homotopy class changing unit is further configured to change said at least said portion of each of the frames so that the homotopy class of the frames corresponds to the homotopy class of the watermark information.
28. A watermark detecting apparatus, comprising:
a processor comprising:
a frame dividing unit configured to divide a moving image into frames; and
a detector configured to detect watermark information by obtaining a topological invariant, wherein the frame comprises points, the topological invariant is maintained when a local distortion which is a homeomorphism is applied to a topological space formed of a set of points of the frame, the local distortion including a Stirmark attack, the local distortion being expressed by functions, each of the functions depending on a position of the points;
wherein the detector is further configured to obtain watermark information corresponding to a calculated homotopy class.
29. A non-transitory computer readable medium having computer executable instructions, which when executed by a computer, cause the computer to perform a method comprising:
dividing a watermarked moving image into frames;
detecting watermark information of the watermarked image by obtaining a topological invariant, wherein the frame comprises points, the topological invariant is maintained when a local distortion which is a homeomorphism is applied to a topological space formed of a set of points of the frame, the local distortion including a Stirmark attack, the local distortion being expressed by functions, each of the functions depending on a position of the points; and
obtaining watermark information corresponding to a calculated homotopy class.
Description This application is based upon and claims the benefit of priority from the prior Japanese Patent Application No. 2000-333725, filed Oct. 31, 2000, the entire contents of which are incorporated herein by reference. 1. Field of the Invention The present invention relates to a digital watermark embedding apparatus and a digital watermark embedding method for embedding watermark information into a content such as digital still image data, moving image data, voice data or music data, and a digital watermark detecting apparatus and a digital watermark detecting method for detecting watermark information from a content. 2. Description of the Related Art Digital watermarking is a technique for embedding information such as discrimination information on the copyright owner or user of a content such as a digital still image, moving image, voice or music, ownership information on the copyright owner, content using conditions, secret information necessary to use the content and/or copy control information (which information is referred to as watermark information) into the content so as to make it difficult to recognize the information, detecting the information later at need from the content, thereby protecting the copyright of the content including the controlling the use of the content and the controlling the copying of the content and promoting the secondary use of the content. If the digital watermarking is intended to prevent the illegal use of a digital book, it is necessary for the digital watermarking to have robustness for preventing watermark information from being lost or falsified by various operations or intentional attacks normally considered to be applied to the digital book. Depending on a digital watermarking applied field, even if a content which has been converted into an analog signal is digitalized again, it is desirable not to lose the watermark information. This is exemplified by embedding, as watermark information, copy control information into a content. As another example, if ownership information is embedded into a still image and the still image is printed out (or sometimes copied) and digitalized by a scanner (D-A-D conversion), then it is desirable not to lose the watermark information. D-A-D conversion is considered to cause a geometrical distortion to an image as shown in In case of digital watermarking in a frequency region, frequency components do not greatly change as long as a geometrical distortion to an image is small and robustness against the StirMark attack is considered to be relatively high. However, as indicated by the change of a dotted line to a solid line shown in On the other hand, in case of digital watermarking in a space region, the multiplication of pseudo-random numbers and the superimposition of mask patterns are performed for each pixel. Therefore, if the positions of pixels are moved by a StirMark attack, it is necessary to strictly synchronize the pixel positions (those during embedding with those during detection) by some method; otherwise, it is difficult to ensure accurate detection. The geometrical distortion is roughly divided into two distortions, a global distortion and a local distortion. Here, the global distortion means a distortion expressed by parameters independent of positions and the local distortion means a distortion expressed by the parameters which are locally different. The local distortion does not mean that a distortion is limited to a local region. Namely, a global distortion corresponds to a special case of the local distortion. The global distortion is the scaling, rotation and translation of an entire image and can be expressed by an Affine transformation. The Affine transformation is expressed using six parameters as shown in the next formula:
On the other hand, the local distortion is ordinary two-dimensional coordinate transformation as shown in the following formula, where f and g are arbitrary functions:
Conventionally, some methods for synchronizing pixel positions for a global distortion employs an original image and others do not employ the original image. If an original image is employed, a global transformation is manually performed so as to coincide a detection target image with the original image or a transformation having a highest correlation value is searched by the correlation of pixel values between the images. There have been proposed many synchronization methods for the digital watermarking system while an original image is not used during detection. It is, therefore, necessary to restore the synchronization of the original image with the detection target image without using the original image. Such synchronization methods are roughly divided into the following three methods (see, for example, Document (1) Kutter Martin, “Towards Affine Invariant Image Watermarking Schemes,” Watermarking Workshop, 1999.) (1) Template Based Watermarking A method for embedding a signal (template) for recognizing an image distortion in a frequency region or a space region is referred to as “template watermarking” (see, for example, Document (2) Bender, W., D. Gruhl, N. Morimoto and A. Lu, “Techniques for data hiding,” IBM Systems Journal, 35, 3&4, 313-336, 1996. Document (3) Fleet, David J. and David J. Heeger, “Embedding Invisible Information in Color Images,” ICCP'97, 532-535, 1997. Document (4) Rhoads, Geoffrey B. “Steganography methods employing embedded calibration data,” U.S. Pat. No. 5,636,292, 1997. Document (5) Pereira, Shelby and Thierry Pun, “Fast Robust Template Matching for Affine Resistant Image Watermarking,” The Third Workshop on Information Hiding, 207-218, 1999). The disadvantage of this method is that it is difficult to maintain robustness while suppressing image degradation caused by the template. (2) Invariant Watermarking A method for embedding digital watermark information into a region invariant to a geometrical distortion is referred to as “invariant watermarking” (see, for example, Document (6) Ó Ruanaidh, Joseph J. K. and Thierry Pun, “Rotation, scale and transformation invariant digital image watermarking,” Proceedings of ICIP'97, 536-539, 1997. Document (7) Ó Ruanaidh, Joseph J. K. and Thierry Pun, “Rotation, scale and transformation invariant spread spectrum digital image watermarking,” Signal Processing, 66, 303-317, 1998.). This method is effective only to a uniform scale transformation and rotation and not invariant to a transformation for transforming an aspect ratio. (3) Self-Reference Watermarking A method for employing digital watermark information itself as a template is referred to as “self-reference watermarking” (see, for example, Document (8) Kutter, M., F. Jordan and F. Bossen, “Digital signature of color images using amplitude modulation,” Proc. of SPIE storage and retrieval for image and video database, 3022-5, 518-526, February 1997; Journal of Electronic Imaging, 7, 2, 326-332, April 1998). According to this method, a reference point is defined in an image, digital watermark information having a preset value is embedded into the reference point, and the reference point is obtained by full retrieval and then other watermark information is extracted during detection. With this method, if only rotation, translation or scaling is performed, detection can be completed in a relatively short time. However, if a retrieval space becomes wide, calculation cost is excessively pushed up. It is, therefore, difficult to apply this method to a local distortion requiring a wider retrieval space. As a development of this self-reference watermarking method, there is known a method for embedding the same watermark information into horizontally shifted positions and vertically shifted positions a plurality of times and employing the watermark information as a correction signal (see, for example, Document (9) Kutter, M, “Watermarking resisting to translation, rotation, and scaling,” Proceeding of SPIE International Symposium on Voice, Video, and Data Communications, 1998). Further, there is known another self-reference watermarking method for limiting a geometrical distortion to translation and cut, obtaining a translation quantity from the phase of a Fourier transform coefficient for a digital watermark pattern to thereby reduce retrieval cost (see, for example, Document (10) Takao Nakamura, Hiroshi Ogawa, Atsuki Tomioka and Yohichi Takashima, “A scheme for improving digital watermarking translation and cut resistance,” SCIS'99, 193-198, 1999.). Additionally, there is known yet another self-reference watermarking method for dividing an image into small blocks and approximating an image distortion to a block translation (see, for example, Document (1)). The above-stated three improved methods relate to a global distortion. If applied to a local distortion, retrieval cost is disadvantageously pushed up and these improvements are not effective to the local distortion. As stated above, according to the conventional watermarking, watermark information is lost or falsified by a geometrical distortion such as D-A-D conversion or StirMark attack, with the result that the embedded watermark information such as copy control information and/or ownership information may not be possibly able to be detected. As for robustness against a global distortion out of a geometrical distortion, there have been conventionally proposed some techniques. However, these techniques are not effective for a local distortion such as a StirMark attack or D-A-D conversion. Further, if measures against a global distortion are forced to be applied to a local distortion, it is feared that considerably long processing time is required. Besides, if a local distortion becomes large, the measures against the global distortion becomes ineffective. It is an object of the present invention to provide a digital watermark embedding apparatus, a digital watermark detecting apparatus, a digital watermark embedding method and a digital watermark detecting method having robustness against a local distortion such as a StirMark attack and D-A-D conversion. According to one embodiment of the present invention, there is provided a digital watermark embedding apparatus for embedding watermark information into a digital content, comprising: a unit configured to obtain a predetermined topological invariant corresponding to the watermark information to be embedded into the digital content; and a setting unit configured to set the predetermined topological invariant to the digital content by modifying the digital content into which the watermark information is to be embedded. According to another embodiment of the present invention, there is provided a digital watermark detecting apparatus for detecting watermark information from a digital content, comprising: a detector configured to detect a predetermined topological invariant set to the digital content based a content into which the watermark information is embedded; and an output unit configured to output watermark information corresponding to the detected predetermined topological invariant. An embodiment of the present invention will now be described with reference to the accompanying drawings. The present invention is applicable to a case of embedding and detecting various watermark information (e.g., discrimination information on the copyright owner or user of the content, ownership information on the copyright owner, content use conditions, secret information necessary to use the content, copy control information, etc. as well as combinations thereof) into and from the content (e.g., a digital still image, moving image, voice or music) for various purposes (e.g., for controlling the use of the content, copyright protection including copy controlling or the promotion of secondary use). A digital watermark embedding apparatus The digital watermark embedded content obtained by the digital watermark embedding apparatus A digital watermark detecting apparatus As will be described later in detail, the digital watermark embedding apparatus It is noted that description will be given while assuming that a digital content is input into the digital watermark embedding apparatus The digital watermark embedding apparatus If the content provider employs the digital watermark embedding apparatus If the digital watermark detecting apparatus The block diagram shown below functions as both an apparatus functional block diagram and a software (program) functional module diagram or procedure diagram. Description will be given to a case where still image data is mainly used as a digital content by way of example. Needless to say, other media data can be used as a digital content. It is noted that a local distortion such as a StirMark attack or D-A-D conversion is a geometrical distortion to still image data if the target digital content is a still image and is a distortion to voice data in a time axis direction if the target digital content is a voice. Also, if a moving image is processed in units of frames, a local distortion is a geometrical distortion to the data as in the case of the still image. If the moving image is processed in light of time-basis positions over a plurality of frames, a local distortion is a geometrical distortion and a time-axis direction distortion (time space distortion) to the data. Meanwhile, according to one aspect of the present invention, a topological invariant is embedded, as watermark information, into a content (for example, the pixel value or the like of a target content is changed so that the topological invariant of by the target content corresponds to the watermark information), thereby providing a robustness against a local distortion such as a StirMark attack or D-A-D conversion. Description will be given to an example of a configuration wherein a homotopy invariant is employed as an example of the topological invariant. The concepts of a topological invariant, a homotopy invariant, a homotopy set, a homotopy class will be described. Mathematical Preparations: First, mathematical concepts related to the present invention will be described. If a certain set S is a topological space, it means that the set S can provide a group U of subsets satisfying the following three conditions: (1) An empty set and the set S are included in the group U. (2) Common parts of the elements of the group U are also the elements of group U. (3) A sum-set of an arbitrary number of elements in the group U are the elements of the group U. The elements of the group U are referred to as an open set and the group U is referred to as a topology on the set S. In a topological space X, if a set U(x) of subsets of X are neighborhood systems of a point x which is each point in the topological space X, the following four conditions are established: (1) If UεU(x), xεU. (2) If U (3) If UεU(x) and U⊂V, VεU(x). (4) WεU(x) satisfying UεU(y)′ for all yεW exists for UεU(x). A subset U which belongs to U(x) is referred to as a neighborhood of x. If a topological space X is a domain and a mapping f the range of which is included in a topological space Y is continuous at a point If the mapping f is continuous at each point in the topological space X, the mapping f is said to be continuous in the topological space X or referred to as a continuous mapping. If a continuous mapping between two topological spaces is a bijective mapping and the inverse mapping thereof is also continuous, then this mapping is referred to as a homeomorphism. If a homeomorphism exists between two topological spaces, these spaces are said to be homeomorphic. A property common to the spaces homeomorphic to each other is referred to as a topological property and the quantity of the property is called a topological invariant. Closed intervals [0, 1] are expressed by I. Two topological spaces are expressed by X and Y and f Also, the mapping F is referred to as homotopy from f A quantity having an equal value for two arbitrary homotopic mappings is referred to as a homotopy invariant. A homotopic relation is an equivalence relation on the entire continuous mapping of X to Y. [f] having a homotopic relation in the mapping f of X→Y are referred as a homotopy class of f. It is noted that π A mapping f*: π(Y; Z)→n(X; Z) can be defined by f*[g]=[g·f] for a composition g·f of the mapping f from X to Y and a mapping g from Y to Z. Also, the following relation is established:
A point in a topological space is set and this point is referred to as a base point. A continuous mapping only applies to a case of mapping one base point to another base point. I π Alternatively, the above-stated homotopy group is defined by an n-order spherical surface and one point on the spherical surface instead of I and ∂I If mappings f: X→X′ and g: X′→X exist between the two topological spaces X and X′ and compositions g·f and f·g are homotopic to equivalent mappings, respectively, then X and X′ are said to be homotopy equivalent. If a certain quantity corresponds to the topological space X and the same quantity corresponds to topological paces which are homotopy equivalent, then this quantity is referred to as a homotopy type invariant. The homotopy type invariant is a topological invariant. Please note that the homotopy invariant is a quantity corresponding to the mapping and the homotopy type invariant is a quantity corresponding to the topological space. A homotopy set from a topological space X to a topological space Y is homotopy type invariants for the topological space X. If f is a mapping from X to X and homotopic to an equivalent mapping, in particular, then a mapping f* induced from the mapping f is an equivalent mapping in the homotopy set and the homotopy class is invariant by using the mapping f. Mathematical Fact: The present invention utilizes this mathematical fact for digital watermarking. That is, two topological spaces are assumed as X and Y and a mapping between X and Y is assumed as g. A mapping from X to X is assumed as f and f is assumed to be homotopic to an equivalent mapping. If so, a mapping between homotopy sets induced by the mapping f is an equivalent mapping. Accordingly, a homotopy class has no change. According to the present invention, the above-stated mathematical concepts are associated with an actual target as follows. For example, if a still image as a digital content is a target into and from which watermark information is embedded and detected, the single image comprises a group of a plurality of pixels. Since a moving image comprises a plurality of images (=a group of a plurality of pixels), the moving image is also a group of a plurality of pixels. Although an image is constituted only out of summing finite pixels, the image is idealized and regarded as a topological space herein. An image has always ends. In case of a still image, for example, the still image is normally a two-dimensional rectangular region and the still image has, therefore, boundaries of four edges, an upper edge, a lower edge, a left edge and a right edge. Many methods may be considered to handle the ends. For brevity, these boundary points are regarded as an identical point in this embodiment. Therefore, a still image is a topological space homeomorphic to a two-dimensional spherical surface S Each of the pixels constituting a still image has a space for expressing the color of the pixel. The space is, for example, a subspace having coordinates of brightness and chrominance in a three-dimensional space. While both the brightness and the chrominance have discrete values, this color space is also idealized as a topological space herein. By associating the mathematical concepts to an actual target, one image which becomes a target into and from which watermark information is embedded and detected can be regarded as a mapping from a topological space X constituted of a set of pixels to a topological space Y which is a subspace in a color space. Since the topological space X is, in particular, homeomorphic to the two-dimensional spherical surface, the image can be regarded as a mapping from the two-dimensional spherical surface to a topological space Y. The homotopy set of such mappings forms a homotopy group π A geometrical distortion to an image is regarded as a mapping from a topological space X to a topological space X′. Further, since the topological space X′ is also homeomorphic to a two-dimensional spherical surface, the geometrical distortion can be regarded as a mapping from a two-dimensional spherical surface to a two-dimensional spherical surface. Empirical Facts: The empirical facts assumed in the present invention are as follows. A geometrical distortion including a global distortion such as scaling, rotation or translation and a local distortion such as a StirMark attack with respect to an image is regarded as a mapping homotopic to an equivalent mapping. Providing that a geometrical distortion which is not homotopic to an equivalent mapping is added, an image is greatly influenced by the geometrical distortion and the economic value of the image is lowered. For that reason, such a geometrical distortion is not often added. Accordingly, if the above-stated empirical fact is assumed, it is expected that such a geometrical distortion (mapping homotopic to an equivalent mapping) holds the homotopy class more invariant than because of the mathematical fact described previously. A topological invariant (e.g., homotopy class) to be embedded into a content (watermark information embedding target content), is input (step S A watermark information embedding target content (e.g., image data) is input (step S The content of the target content (e.g., pixel value if the content is image data) is modified so that the content has a topological invariant, i.e., the digital watermark detecting apparatus can detect the topological invariant of the content such as the image data (step S The content (e.g., image data) into which the watermark information has been embedded is output (step S A target content (e.g., image data) from which watermark information is detected is input (step S The topological invariant of the content (e.g., image data) is calculated from the content (e.g., a pixel value if the content is the image data) (step S The calculated topological invariant (e.g. homotopy class) is output as watermark information (step S The examples shown in As shown in A watermark information embedding target content (e.g., moving data) is input (step S The target content is divided into a plurality of regions (e.g., frames) (step S The content of the target content (e.g., a pixel value if the respective regions are the frames of the moving image data) is modified so that the respective regions have topological invariants, i.e., the digital watermark detecting apparatus can detect the topological invariant of the target content from the target content such as the moving image data or the like for the respective regions (e.g., frames) (step S The content (e.g., moving data) including a plurality of regions (e.g., frames) into which the watermark information has been embedded is output (step S A target content (e.g., moving image data) from which watermark information is to be detected is input (step S The target content is divided into a plurality of regions (e.g., frames) (step S The topological invariants of the respective regions (e.g., frames) are calculated from the contents thereof (e.g., pixel values if the respective regions are the frames of moving image data), respectively (step S The value of the maximum likelihood topological invariant which is estimated to be the most accurate from those calculated for the respective regions (e.g., frames) (step S The maximum likelihood topological invariant (e.g., homotopy class) thus calculated is output as watermark information (step S The above description is related to the principles of the present invention. As can be seen, the digital watermark embedding apparatus In the first embodiment, a homotopy class which is inherently held by a target content is detected, and the difference (=homotopy class b−homotopy class a) of the homotopy class a from a homotopy class b corresponding to watermark information to be embedded into the target content is input to this target content. Please note if the topological space X is a cube (spherical space) of an n-order, a homotopy set is grouped so that the homotopy set is called a homotopy group. Thus, the homotopy class is the same as the element of the homotopy group. One example of processing procedures in the first embodiment is basically the same as those shown in The watermark information/homotopy class transforming section While the homotopy class has a value of a positive integer, 0 or a negative integer, adjacent homotopy classes having close values which can be used for watermark information may be set distant from each other by a constant value (e.g., 2) or all usable homotopy classes may be used. Further, if watermark information given as a numerical value is used as a homotopy class, the watermark information/homotopy class transforming section As will be described later in detail, the homotopy class calculating section As will be described later in detail, the homotopy class changing section The region dividing section If a divided region comprises one pixel, the representative coordinate values of the divided region is the coordinate values of the pixel. If a divided region comprises a plurality of pixels, the representative coordinate values of the divided region are, for example, the coordinate values of the center of gravity of the divided region. Actually, it is preferable that not the entire bits of a pixel value but a preset, specific intermediate bit plane region which does not have a great influence on the quality of an image (which region have normally a plurality of continuous bits) is employed for embedding watermark information. If a block is one pixel, the representative pixel value of the block is a value indicated by a specific bit string obtained by extracting the above-stated preset, specific intermediate bit plane region from all bits of the pixel value. If a block comprises a plurality of pixels, the representative pixel value of the block is the average value of the values indicated by the above-stated specific bit string for the plural pixels. The number of vertical and horizontal pixels constituting one block or the content of a representative pixel value may be appropriately set or made selectable. Alternatively, they may be fixed to a preset one method. The target value calculating section First, description will be given to a case where a representative pixel value is located in the target space. Next, description will be given to a case where a representative pixel value is not located in the target space but transformed into a target value in the target space by a transformation processing performed by the target value calculating section A target image into which watermark information is embedded is, for example, a finite region in a two-dimensional space. If the peripheral part of the image is regarded as one point, the image can be identified with a two-dimensional spherical surface S On the other hand, the color space is a finite region in a three-dimensional space expressed by three coordinates of R, G and B. The original pixels of an image is, for example, a space having eight gradations and gradations fewer than the eight gradations are employed for watermark information. In any case, the color space to be handled herein has finite gradations but the color space is approximately regarded as a continuous space. By doing so, the color space becomes a finite region in a three-dimensional Euclidean space (R To be specific, the target space can be set by selecting an intermediate bit plane region which is less influenced by noise and which does not have a great influence on the quality of an image among pixel values and embedding a two-dimensional spherical surface into the intermediate bit plane region. Based on the above, a target space is attached for each point in the base space and an image is allocated one point in the target space at each point in the base space. As can be understood from the above, one image is defined as a mapping from S Since the element of the homotopy group is invariant even by a local distortion to an image or a voice, it is possible to realize a digital watermark strongly resisting a local distortion. For example, if watermark information 0 is embedded, the pixel value of each divided region (e.g., block) is changed so as to express an element expressed by 1 in the homotopy group. If watermark information 1 is embedded, the pixel value of each divided region is changed so as to express an element represented by −1 in the homotopy group. To create a pixel value in the target space to express an arbitrary element in the homotopy group, there is, for example, a method employing the following function Ψ. In the function, H is the height of an image, W is the width of the image, x and y are the coordinates of the image, For example, if the upper left of the image shown in The above-stated functions are only one example and various other methods are applicable. Meanwhile, the target space which is a two-dimensional spherical surface is embedded into the color space. It does not, however, necessarily mean that the representative pixel value exists in the target space. It is rather normal that the image before watermark information is embedded, a content after a compression operation, an expansion operation or the like or a content provided trough the distribution path has a value in a region other than the target space. Also, as described above, if watermark information is embedded, a method for allocating a value (e.g., a value obtained by scaling a target value vector) with which the target space can be regenerated instead of allocating a value corresponding to the target space itself to the base space may be considered. In this case, at the time of embedding watermark information, the representative pixel value has already been located in a region other than the target space. Further, as will be described later, the element of the homotopy group of an image is obtained from a representative value in the representative space (from an average pixel value of the divided region). Therefore, as a preprocess thereof, it is necessary to carry out a processing for associating a representative pixel value with one point in the target space. Considering this, in this embodiment, the target value calculating section The integrating section
Actually, the target value is not in the continuous two-dimensional spherical surface but in a discrete space represented by a divided region (e.g., block). Therefore, a differential in an integral function is replaced by a difference and integration is replaced by sum. That is to say, an integral result is obtained by the following formula.
It is expected that the integral result is an integer. However, since there are various reasons for causing an error, an integral closest to the integral result so that the integral represents the element of the homotopy group may be obtained. The embedding function generating section The image changing section To accurately update the element of the homotopy class, therefore, the following processing may be considered as an example of a processing performed by the image changing section Next, with respect to the other half region (e.g., right half or left half), the function Ψ Thereafter, the functions respectively defined in the half regions are regarded as one function and a function defined in the entire region is composed therefrom. If the function thus composed is expressed by f*g, it follows that I(f*g)=I(f)+I(g) and, therefore, I(f*Ψ In the setting of these half regions, the above-stated operation can be variously performed. Also, the areas or the number of blocks of two types of regions may not be set to have a ratio of 1:1 but the other ratio. The embedded image is expressed by a function (e.g., the above-stated composed function) for obtaining a target value. As shown in It is assumed that the representative pixel value of each divided region is obtained by, for example, the representative coordinate values of the divided region. If a representative pixel value to be embedded into a certain divided region (e.g., block) is obtained and one block comprises one pixel, then the designated representative pixel value is written into a predetermined bit string portion used for watermark information on the respective pixel values of R, G and B. Alternatively, the designated representative pixel value (vector) may be scaled for each pixel so that the distance of the designated representative pixel value (vector) to the representative pixel value (vector) indicated by the value of the element of the predetermined bit string portion of the pixel value of the pixel and then the scaled value may be written into the predetermined bit string portion. In addition, a certain representative pixel value is designated for a certain block and one block comprises a plurality of pixels, then the designated representative pixel value is written so that, for example, the average value of the predetermined bit string portions of these pixels becomes equal to the designated representative pixel value. For example, the same value (i.e., the designated representative pixel value) may be written into a plurality of pixels constituting one block or different values (which may include a case where not less than two pixels having the same value exist) may be written into a plurality of pixels constituting one block by obtaining discrete values from the designated representative pixel value by an appropriate method. Further, if discrete values are obtained from the representative pixel value, the discrete values may be optimized so that the values are less changed from the element values of the respective pixels based on the element values of the respective pixels and then the optimized discrete values may be written into the respective pixels. Alternatively, the designated representative pixel value (vector) may be scaled so that the representative pixel value is less changed from the values of the elements of the respective pixels based on the values of the elements of the respective pixels and written for each block. The homotopy class calculating section As already described above, even if the representative pixel value is located in the target space in the image into which the digital watermark has been just embedded, there possibly occur cases where the representative pixel value does not exist in the target space due to noise, compression or an attack which occurs after embedding when the digital watermark is detected. Also, if the representative pixel value does not exist in the target space right after the embedding, it is considered that the representative pixel value does not exist in the target space at the time of detection. In this embodiment, however, the target value calculating section ( The homotopy class/watermark information transforming section As described above, it is assumed that the embedded homotopy class has a predetermined value and the embedded watermark has a predetermined value. The detecting section In addition, the homotopy class detected by the homotopy class calculating section Furthermore, if the value of the homotopy class detected by the homotopy class calculating section If one apparatus serving both the digital watermark embedding apparatus and the digital watermark detecting apparatus is used, one homotopy class calculating section can be used as both the homotopy class calculating section In the above description, the homotopy class is embedded into the entire image data. Alternatively, the homotopy class may be embedded into a specific part of the image data. In the latter case, the embedding function generating section It is noted that image data often includes important information in the central portion of an image. Therefore, it is also possible to embed a homotopy class not into the peripheral portion of the image but only into the central portion thereof. In that case, it suffices that the value of the function becomes constant in the peripheral portion. As described above, in this embodiment, the value of the topological invariant (e.g., homotopy class) of a target content is set at a value corresponding to watermark information to be embedded. Therefore, even if a local distortion such as a StirMark attack or D-A-D conversion to the target data occurs, it is possible to maintain and store the topological invariant, to detect an accurate topological invariant from the target content and to obtain accurate watermark information corresponding to the topological invariant thus detected. Other embodiments of the present invention will be described. The same portions as those of the first embodiment will be indicated in the same reference numerals and their detailed description will be omitted. In the first embodiment, a composed function of the representative pixel value of an original image and an embedding function is created and the composed function is used to replace the representative pixel value of the original image, thereby generating a watermark information embedded image. In the second embodiment, by contrast, the representative pixel value of an original image is not employed and the representative value is replaced by an embedding function corresponding to a class to be embedded. That is to say, a homotopy class b corresponding to watermark information to be embedded is input to a target content without detecting a homotopy class a which the target content inherently holds. Now, description will be given while centering around the differences between the first embodiment and the second embodiment. In the first embodiment, the embedding function generating section The embedding function generating section The image changing section Next, description will be given to the variation of the homotopy class changing section The embedding function generating section As shown in, for example, Besides the parameter for uniformly scaling the embedding function, a parameter for uniform rotation or, more generally, a parameter expressing non-uniform coordinate transformation can be employed. If a certain representative pixel value to be embedded is designated for a certain divided region (block) and one block comprises one pixel, the image changing section Also, if a certain representative pixel value is designated for a certain divided region (block) and one block comprises a plurality of pixels, then the image changing section It is noted that the above-stated variations are also applicable to the first embodiment. Further, the digital watermark detecting device of the second embodiment is the same as that of the first embodiment. As described above, in the second embodiment, a homotopy class corresponding to watermark information to be embedded is set to a target content. Therefore, even if a local distortion such as a StirMark attack or D-A-D conversion is applied to the target content, it is possible to maintain and store the homotopy class, to detect an accurate homotopy class from the target content and to obtain accurate watermark information corresponding to the homotopy class. In the first embodiment, a composed function of the representative pixel value of the original image and the embedding function is created and the composed function replaces the representative pixel value of the original image, thereby creating a watermark information embedded image. In the second embodiment, an embedding function corresponding to a homotopy class to be embedded replaces the representative value without employing the representative pixel value of the original image. The first and second embodiments are combined. Namely, as first watermark information, the function f is embedded according to the second embodiment and then, as the second watermark information, the function g is embedded according to the first embodiment. By doing so, it is possible to multiplex the functions f and g and to embed the multiplexed function. In this case, as shown in, for example, Now, description will be given while centering around the differences of the third embodiment from the first and second embodiments. In this case, first, if a target content and the first watermark information (corresponding to a homotopy class [f]) are input into the first digital watermark detecting section Next, if the content into which this homotopy class [f] has been embedded and the second watermark information (corresponding to a homotopy class [g]) are input into the second digital watermark embedding section During detection, if the content (the content output from the second watermark embedding section Next, if the content (the content output from the second digital watermark embedding section It is noted that the first digital watermark detecting section Further, the second digital watermark detecting section Methods other than those stated above are applicable, as well as follows. For example, a target content and the first watermark information (corresponding to a homotopy class Next, the content into which this homotopy class During detection, if the content (the content output from the second digital watermark embedding section Next, if the content (the content output from the second digital watermark embedding section It is noted that the first digital watermark detecting section Further, the second digital watermark detecting section In the third embodiment, as in the case of the preceding embodiments, all of or a part of the overlapped constituent elements of the first digital watermark embedding section Further, description has been given above to a case of duplexing the homotopy classes. It is also possible to obtain multiplexed (triplex or more) homotopy classes. Description has been given so far while taking a case where a still image is a target content as an example. Now, description will be given to embodiments in a case where a moving image is a target content. In the fourth embodiment, a digital watermark is embedded into a moving image instead of detecting a homotopy class inherently held by the target content as seen in the first embodiment. One example of processing procedures in this embodiment is basically the same as those shown in The watermark information/homotopy class transforming section The frame dividing section The digital watermark embedding section As a result, as shown in In this case, the element of the homotopy group of the obtained image may sometimes differ from the element to be embedded (e.g., if r= The frame composing section The frame dividing section The digital watermark detecting section The maximum likelihood class calculating section The homotopy class/watermark information transforming section It is described above that the fourth embodiment is applied to a case of embedding a digital watermark into a moving image in the first embodiment without detecting the homotopy class inherently held by a target content. Alternatively, the fourth embodiment may be applied to the second embodiment. If a digital watermark is embedded into a moving image without detecting the homotopy class inherently held by the target content in the second embodiment, it suffices that the digital watermark embedding section In the fourth embodiment, the same homotopy class is embedded to all the frames in the first, second or third embodiment. In the fifth embodiment, different homotopy classes (which may include a case where not less than two pixels having the same value exist) are embedded into respective frames in the first, second or third embodiment. The frame dividing section The randomizing section This operation is intended to make it more difficult to falsify watermark information by randomly embedding different classes into the respective frames. Namely, even if a person intending to falsify watermark information wants to embed a class corresponding to certain watermark information into each frame, the series of the classes cannot be accurately generated unless the person is aware of the key information and a random number generating method, thereby making falsification difficult. The frame dividing section The inverse transforming section Various method for the processing performed by the inverse transforming section In the examples shown in Further, the random numbers generated by the randomizing section To avoid an attack such as the sampling of frames, if a sequence subjected to error correction coding to provide an error correcting capability if the disappearance of frames occurs is embedded as a sequence indicating classes embedded into respective frames, the resistance of the system can be improved. The sixth embodiment related to a modification of the fifth embodiment will be described hereinafter. The frame dividing section The watermark information/random number seed transforming section The random number generating section The frame dividing section The random number generating section The random number seed generating section The random number seed/watermark information transforming section In the fourth embodiment shown in In the fifth embodiment shown in First, the random number seed generating section The random number generating section The correlation determining section If applied with the determination result indicating inconsistence from the correlation determining section After repeating a series of processings stated above, the correlation determining section If applied with the determination result indicating consistence from the correlation determining section The random number seed/watermark information transforming section If a correlation determination result indicating consistence is not eventually obtained, it indicates a state in which no watermark information is embedded into the target content. The correlation calculation of the correlation determining section To be specific, it is assumed that copy approval or copy prohibition is embedded, as watermark information, into a content. It is also assumed that the random number seed corresponding to copy approval is a binary number x of a predetermined bits and that corresponding to copy prohibition is a binary number y (≠x) of a predetermined bit. It is further assumed that the copying of a detection target content is approved, and that homotopy classes 0, 1, 0, 0, 1, 1, 0, 1, 0, 1, 1, 1, 0, . . . generated with the binary number x of the predetermined bit set as a random number seed are set to the respective frames of the content. When detecting watermark information, the random number seed generating section The random number generating section 1, 1, 0, 1, 0, 1, 1, 1, 0, 0, 1, 1, 0, . . . . On the other hand, the digital watermark detecting section 0, 1, 0, 0, 1, 1, 0, 1, 0, 1, 1, 1, 1, 0, . . . . The correlation determining section Next, the random number seed generating section The random number generating section 0, 1, 0, 0, 1, 1, 0, 1, 0, 1, 1, 1, 0, . . . . On the other hand, the digital watermark detecting section 0, 1, 0, 0, 1, 1, 0, 1, 0, 1, 1, 1, 0, . . . . The correlation determining section If applied with the determination result indicating consistence from the correlation determining section The random number seed/watermark information transforming section It is noted that the random number sequence generated by the random number seed generating section Various methods with which the random number seed generating section Meanwhile, in the above description, the random number seed generating section In this case, instead of the random number seed generating section and the random number seed/the watermark information transforming section, the watermark information generating section and the watermark information/random number seed transforming section are provided. The watermark information generating section outputs certain watermark information. The watermark information/random number seed transforming section outputs a random number seed corresponding to the watermark information applied from the watermark information generating section. The random number generating section The correlation determining section If the determination result indicating inconsistence is input to the watermark information generating section from the correlation determining section After a series of the above-stated processings, the correlation determining section If applied with the determination result indicating consistence from the correlation determining section Various methods with which the watermark information generating section generates watermark information are considered. Now, a variation common to all of or some of the embodiments stated above will be described in detail. In the embodiments stated so far, description has been given while assuming that the base space is S For example, the upper end and the lower end of an image are identified with each other and the right end and the left end thereof are identified with each other, making it possible to set a base space as a torus T Next, variations of the function representing an embedded image will be described. A function representing an embedded image is set so that the change of the function in the vicinity of the periphery of the image is gentler and that the function has values gradually closer to a certain value at positions closer to the periphery. By doing so, if a part of the image in the vicinity of the center of the image is cut, the lost peripheral portion has a smaller influence on an integral value. Accordingly, if an image or a content has an important content concentrated in the vicinity of the center thereof (it is considered that there are many contents of this type), it is possible to realize resistance against the cutting to some extent. Normally, even if only the important part is cut, it is possible to leave the digital watermark by setting the function is set so as to have large change on the important part of the image. Next, variations to the processing for a moving image will be described. In the embodiments stated so far, a processing is basically performed for each frame of a moving image. There are also other modified, expanded embodiments for the moving image. For example, there is an embodiment of embedding a watermark employing a homotopy group represented by π That is to say, each of the frames or fields (rectangles shown in It is assumed here that each of the two-dimensional spherical surfaces having a start point and an end point, respectively, is contracted into one point ( By setting the three-dimensional spherical surface thus constituted as a base space, the elements of the homotopy group π In this case, as in the case of the above, if a certain moving image has a representative value expressed by a function f in a predetermined number of continuous frames ( Needless to say, as in the case of the still image (second embodiment), the representative value of an original image may be replaced by the value of the embedding function corresponding to the element of a desired homotopy group. Next, variations to the processing for voice data will be described. Description has been given so far while taking a case of using a still image or a moving image as a target content, as an example. It goes without saying that the present invention is also applicable to the other media and digital contents. As one example of the modifications, description will be given to a case of using a digital content such as a voice or music as a target content. As shown in Now, a certain time period is set as a base space. In this period, by identifying a start point with an end point, the base space can be identified with a one-dimensional spherical surface S In this two-dimensional space, a one-dimensional spherical surface (circle) is set as a target space. By do setting, the element of a homotopy class π Also, as another example of selecting a target space, three frequency bands are selected as a target space, the LSB (least significant bit) of each frequency band is selected and a two-dimensional torus T Furthermore, another method for setting a target space will be described. Voice data include data formed of multichannel (e.g., stereo voice data formed of two channels, i.e., right and left channels). In case of such data, a bit region is set for each channel and a target region can be constituted of the bit regions. In these examples, it is possible to realize a robustness against an operation for extending or contracting a time axis, as shown in Next, variations to the topological invariant will be described. In the embodiments stated so far, description has been given to a case where the homotopy class is employed as a topological invariant. It is also possible to employ topological invariants other than the homotopy class. Many topological invariants other than the element of a homotopy group, which are exemplified by a homology group, cohomology group, characteristic class such as a Stiefel-Whitney class, a Chern class and a Pontryagin class for a vector bundle, Euler numbers for a polyhedron, an index, a signature, the Alexander topological invariant relating to a knot and the Milnor topological invariant relating to a tangled thread (see, for example, Iwanami Sugaku Jiten (Iwanami's Mathematical Dictionary), 3rd edn. The Mathematical Society of Japan, Iwanami Shoten, Publishers). These other topological invariants can be utilized. While the integral quantity given by, for example, the Gauss-Bonnet theorem is utilized for the homotopy group. In case of characteristic classes such as the Chern class, integral quantities given by, for example, the Atiyah-Singer exponential theorem can be utilized. In the latter case, the integral quantities exemplified for the homotopy group may be replaced by integral quantities deriving these topological invariants. Now, a hardware configuration and a software configuration in this embodiment will be described. The digital watermark embedding apparatus in this embodiment can be realized as either a hardware or a software, i.e., program for allowing a computer to execute predetermined means, to function as predetermined means or to realize predetermined functions. If the digital watermark embedding apparatus is realized by a software, the program can be transmitted and received to and from a recording medium or a communication medium. The same thing is true for the digital watermark detecting apparatus. If the digital watermark embedding apparatus and the digital watermark detecting apparatus are realized as hardware, they can be constituted as semiconductor devices, respectively. If the digital watermark embedding apparatus to which the present invention is applied is constituted or a digital watermark embedding program is created, blocks or modules having the same configuration can be individually created. It is also possible to prepare only one or an appropriate number of blocks or modules having the same configuration and to share the blocks or modules among the respective parts of the algorithm. The same thing is true if the digital watermark detecting apparatus is constituted or a digital watermark detecting program is created. Further, if a system including the digital watermark embedding apparatus and a digital watermark detecting apparatus is constituted or a system including the digital watermark embedding program and the digital watermark detecting program is created, it is possible to prepare only one or an appropriate number of blocks or modules having the same configuration over the digital watermark embedding apparatus or program and the digital watermark detecting apparatus or program and to share the blocks or modules among the respective parts of the algorithm. Furthermore, if the digital watermark embedding apparatus and the digital watermark detecting apparatus comprise software, it is possible to perform parallel processings using a multiprocessor and to thereby accelerate processings. The configurations exemplified in this embodiment are provided only for illustrative purposes and not intended to exclude the other configurations. A part of the exemplified configurations can be replaced by the others or can be omitted, other functions can be added to the exemplified configurations, and other configurations obtained by a combination thereof can be utilized. In addition, other configurations logically equivalent to the exemplified configurations, other configurations including a part logically equivalent to the exemplified configurations, other configurations logically equivalent to the important parts of the exemplified configurations and the like can be utilized. Also, other configurations for attaining the same or similar objects as those of the exemplified configurations, other configurations having the same or similar advantages as those of the exemplified configurations and the like can be utilized. The diverse variations to the respective constituent elements can be executed by appropriately combining them. Moreover, the embodiments of the present invention contain and include inventions relating to various viewpoints, stages, concepts or categories such as an invention as an individual apparatus, an invention as an entire system, an invention relating to the internal constituent elements of the individual apparatus and method inventions corresponding thereto. Accordingly, the invention can be extracted from the content disclosed in the embodiments of the present invention without limiting the invention to the exemplified configurations. The present invention should not be limited to the above-stated embodiments and various changes and modifications can be made within the technical scope of the invention. According to the present invention, the topological invariant corresponding to the watermark information to be embedded is set to the target content. Therefore, even if the target content receives a local distortion such as a StirMark attack or D-A-D conversion on the distribution path, the topological invariant set to the target content is maintained and held. Further, it is possible to detect a topological invariant from the content and to obtain watermark information corresponding to the detected topological invariant irrespectively of whether or not the content is locally distorted. Moreover, in the embodiments for writing data expressing a topological invariant so that the change of the bit string of the content is as small as possible, it is possible to prevent the content from being influenced by the topological invariant as much as possible. Patent Citations
Non-Patent Citations
Referenced by
Classifications
Rotate |