Publication number | US20030165263 A1 |

Publication type | Application |

Application number | US 10/274,358 |

Publication date | Sep 4, 2003 |

Filing date | Oct 21, 2002 |

Priority date | Feb 19, 2002 |

Also published as | EP1529207A2, US7079675, US20050276457, WO2004017052A2, WO2004017052A3 |

Publication number | 10274358, 274358, US 2003/0165263 A1, US 2003/165263 A1, US 20030165263 A1, US 20030165263A1, US 2003165263 A1, US 2003165263A1, US-A1-20030165263, US-A1-2003165263, US2003/0165263A1, US2003/165263A1, US20030165263 A1, US20030165263A1, US2003165263 A1, US2003165263A1 |

Inventors | Michael Hamer, Maria Petrou, Anastaslos Kesidis |

Original Assignee | Hamer Michael J., Maria Petrou, Anastaslos Kesidis |

Export Citation | BiBTeX, EndNote, RefMan |

Patent Citations (14), Referenced by (77), Classifications (13), Legal Events (1) | |

External Links: USPTO, USPTO Assignment, Espacenet | |

US 20030165263 A1

Abstract

A method of measuring oestrogen or progesterone receptor (ER or PR) comprises identifying in histopathological specimen image data pixel groups indicating cell nuclei, and deriving image hue and saturation. The image is thresholded using hue and saturation and preferentially stained cells identified. ER or PR status is determined from normalised average saturation and proportion of preferentially stained cells. A method of measuring C-erb-2 comprises correlating window functions with pixel sub-groups to identify cell boundaries, computing measures of cell boundary brightness and sharpness and brightness extent around cell boundaries, and comparing the measures with comparison images associated with different values of C-erb-2. A C-erb-2 value associated with a comparison image having similar brightness-related measures is assigned. A method of measuring vascularity comprises deriving image hue and saturation, producing a segmented image by hue and saturation thresholding and identifying contiguous pixels. Vascularity is determined from contiguous pixel area corresponding to vascularity expressed as a proportion of total image area.

Claims(105)

a) obtaining histopathological specimen image data; and

b) identifying in the image data groups of contiguous pixels corresponding to respective cell nuclei;

characterised in that the method also includes the steps of:

c) deriving hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

d) thresholding the image data on the basis of hue and saturation and identifying pixels corresponding to cells which are preferentially stained relative to surrounding specimen tissue; and

e) determining ER or PR status from proportion of pixels corresponding to preferentially stained cells.

a) obtaining histopathological specimen image data; and

b) identifying in the image data groups of contiguous pixels corresponding to respective cell nuclei;

characterised in that the method also includes the steps of:

c) deriving hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

d) thresholding the image data on the basis of hue and saturation and identifying pixels corresponding to cells which are preferentially stained relative to surrounding specimen tissue; and

e) determining ER or PR status from normalised average saturation.

a) obtaining histopathological specimen image data; and

b) identifying in the image data groups of contiguous pixels corresponding to respective cell nuclei;

characterised in that the method also includes the steps of:

c) deriving hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

d) thresholding the image data on the basis of hue and saturation and identifying pixels corresponding to cells which are preferentially stained relative to surrounding specimen tissue; and

e) determining ER or PR status from normalised average saturation and fraction of pixels corresponding to preferentially stained cells.

and saturation from an expression

where (x, y) and ({tilde over (x)}, {tilde over (y)}) are respectively image pixel coordinates and reference colour coordinates in the chromaticity space.

a) correlating window functions of different lengths with pixel sub-groups within the identified contiguous pixels groups to identify pixels associated with cell boundaries,

b) computing brightness-related measures of cell boundary brightness and sharpness and brightness extent around cell boundaries from pixels corresponding to cell boundaries,

c) comparing the brightness-related measures with predetermined equivalents obtained from comparison images associated with different values of C-erb-2, and

d) assigning to the image data a C-erb-2 value which is that associated with the comparison image having brightness-related measures closest to those determined for the image data.

a) deriving hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

b) producing a segmented image by thresholding the image data on the basis of hue and saturation;

c) identifying in the segmented image groups of contiguous pixels; and

d) determining vascularity from the total area of the groups of contiguous pixels which are sufficiently large to correspond to vascularity, such area being expressed as a proportion of the image data's total area.

a) obtaining histopathological specimen image data; and

b) identifying in the image data contiguous pixel groups corresponding to respective cell nuclei associated with surrounding cell boundary staining;

characterised in that the method also includes the steps of:

c) correlating window functions of different lengths with pixel sub-groups within the identified contiguous pixels groups to identify pixels associated with cell boundaries,

d) computing brightness-related measures of cell boundary brightness and sharpness and brightness extent around cell boundaries from pixels corresponding to cell boundaries,

e) comparing the brightness-related measures with predetermined equivalents obtained from comparison images associated with different values of C-erb-2, and

f) assigning to the image data a C-erb-2 value which is that associated with the comparison image having brightness-related measures closest to those determined for the image data.

a) generating a mean value μ_{R }and a standard deviation σ_{R }for pixels in the red image plane,

b) generating a cyan image plane from the image data and calculating a mean value μ_{C }for its pixels,

c) calculating a product CMMμ_{C }where CMM is a predetermined multiplier,

d) calculating a quantity R_{B }equal to the number of adjacent linear groups of pixels of predetermined length and including at least one cyan pixel which is less than CMMμ_{C},

e) for each red pixel calculating a threshold equal to {RMMμ_{R}−σ_{R}(R(4−R_{B})} and RMM is a predetermined multiplier,

f) forming a thresholded red image by discarding each red pixel that is greater than or equal to the threshold,

g) determining the number of contiguous pixel groups in the thresholded red image,

h) changing the values of RMM and CMM and iterating steps c) to g),

i) changing the values of RMM and CMM once more and iterating steps c) to g),

j) comparing the numbers of contiguous pixel groups determined in steps g) to i), treating the three pairs of values of RMM and CMM as points in a two dimensional space, selecting the pair of values of RMM and CMM associated with the lowest number of contiguous pixel groups, obtaining its reflection in the line joining the other two pairs of values of RMM and CMM, using this reflection as a new pair of values of RMM and CMM and iterating steps c) to g) and this step j).

a) obtaining histopathological specimen image data;

characterised in that the method also includes the steps of:

b) deriving hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

c) producing a segmented image by thresholding the image data on the basis of hue and saturation; and

d) identifying in the segmented image groups of contiguous pixels; and

e) determining vascularity from the total area of the groups of contiguous pixels which are sufficiently large to correspond to vascularity, such area being expressed as a proportion of the image data's total area.

i) defining M and m for each pixel as respectively the maximum and minimum of R, G and B; and

ii) setting S to zero if m equals zero and setting S to (M−m)/M otherwise.

a) defining new values newr, newg and newb for each pixel given by newr=(M−R)/(M−m), newg=(M−G)/(M−m) and newb=(M−B)/(M−m) in order to convert each pixel value into the difference between its magnitude and that of the maximum of the three colour magnitudes of that pixel, this difference being divided by the difference between the maximum and minimum of R, G and B, and

b) calculating H as tabulated immediately below:

a) processing histopathological specimen image data to identify in the image data groups of contiguous pixels corresponding to respective cell nuclei;

characterised in that the program is also arranged to implement the steps of:

b) deriving hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

c) thresholding the image data on the basis of hue and saturation and identifying pixels corresponding to cells which are preferentially stained relative to surrounding specimen tissue; and

d) determining ER or PR status from proportion of pixels corresponding to preferentially stained cells.

a) processing histopathological specimen image data to identify in the image data groups of contiguous pixels corresponding to respective cell nuclei;

characterised in that the program is also arranged to implement the steps of:

b) deriving hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

c) thresholding the image data on the basis of hue and saturation and identifying pixels corresponding to cells which are preferentially stained relative to surrounding specimen tissue; and

d) determining ER or PR status from normalised average saturation.

a) processing histopathological specimen image data to identify in the image data groups of contiguous pixels corresponding to respective cell nuclei;

characterised in that the program is also arranged to implement the steps of:

c) thresholding the image data on the basis of hue and saturation and identifying pixels corresponding to cells which are preferentially stained relative to surrounding specimen tissue; and

d) determining ER or PR status from normalised average saturation and fraction of pixels corresponding to preferentially stained cells.

and saturation from an expression

where (x, y) and ({tilde over (x)}, {tilde over (y)}) are respectively image pixel coordinates and reference colour coordinates in the chromaticity space.

a) correlating window functions of different lengths with pixel sub-groups within the identified contiguous pixels groups to identify pixels associated with cell boundaries,

b) computing brightness-related measures of cell boundary brightness and sharpness and brightness extent around cell boundaries from pixels corresponding to cell boundaries,

c) comparing the brightness-related measures with predetermined equivalents obtained from comparison images associated with different values of C-erb-2, and

d) assigning to the image data a C-erb-2 value which is that associated with the comparison image having brightness-related measures closest to those determined for the image data.

a) deriving hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

b) producing a segmented image by thresholding the image data on the basis of hue and saturation; and

c) identifying in the segmented image groups of contiguous pixels; and

d) determining vascularity from the total area of the groups of contiguous pixels which are sufficiently large to correspond to vascularity, such area being expressed as a proportion of the image data's total area.

a) processing histopathological specimen image data to identify contiguous pixel groups corresponding to respective cell nuclei associated with surrounding cell boundary staining;

characterised in that the computer program is also arranged to implement the steps of:

b) correlating window functions of different lengths with pixel sub-groups within the identified contiguous pixels groups to identify pixels associated with cell boundaries,

c) computing brightness-related measures of cell boundary brightness and sharpness and brightness extent around cell boundaries from pixels corresponding to cell boundaries,

d) comparing the brightness-related measures with predetermined equivalents obtained from comparison images associated with different values of C-erb-2, and

e) assigning to the image data a C-erb-2 value which is that associated with the comparison image having brightness-related measures closest to those determined for the image data.

a) generating a mean value PR and a standard deviation σ_{R }for pixels in the red image plane,

b) generating a cyan image plane from the image data and calculating a mean value μ_{C }for its pixels,

c) calculating a product CMMμ_{C }where CMM is a predetermined multiplier,

d) calculating a quantity R_{B }equal to the number of adjacent linear groups of pixels of predetermined length and including at least one cyan pixel which is less than CMMμ_{C},

e) for each red pixel calculating a threshold equal to {RMMμ_{R}−σ_{R}(4−R_{B})} and RMM is a predetermined multiplier,

f) forming a thresholded red image by discarding each red pixel that is greater than or equal to the threshold,

g) determining the number of contiguous pixel groups in the thresholded red image,

h) changing the values of RMM and CMM and iterating steps c) to g),

i) changing the values of RMM and CMM once more and iterating steps c) to g),

j) comparing the numbers of contiguous pixel groups determined in steps g) to i), treating the three pairs of values of RMM and CMM as points in a two dimensional space, selecting the pair of values of RMM and CMM associated with the lowest number of contiguous pixel groups, obtaining its reflection in the line joining the other two pairs of values of RMM and CMM, using this reflection as a new pair of values of RMM and CMM and iterating steps c) to g) and this step j).

a) using histopathological specimen image data to derive hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

b) producing a segmented image by thresholding the image data on the basis of hue and saturation; and

c) identifying in the segmented image groups of contiguous pixels; and

d) determining vascularity from the total area of the groups of contiguous pixels which are sufficiently large to correspond to vascularity, such area being expressed as a proportion of the image data's total area.

i) defining M and m for each pixel as respectively the maximum and minimum of R, G and B; and

ii) setting S to zero if m equals zero and setting S to (M−m)/M otherwise.

a) defining new values newr, newg and newb for each pixel given by newr=(M−R)/(M−m), newg=(M−G)/(M−m) and newb=(M−B)/(M−m) in order to convert each pixel value into the difference between its magnitude and that of the maximum of the three colour magnitudes of that pixel, this difference being divided by the difference between the maximum and minimum of R, G and B, and

b) calculating H as tabulated immediately below:

a) deriving hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

b) thresholding the image data on the basis of hue and saturation and identifying pixels corresponding to cells which are preferentially stained relative to surrounding specimen tissue; and

c) determining ER or PR status from proportion of pixels corresponding to preferentially stained cells.

b) thresholding the image data on the basis of hue and saturation and identifying pixels corresponding to cells which are preferentially stained relative to surrounding specimen tissue; and

c) determining ER or PR status from normalised average saturation.

b) thresholding the image data on the basis of hue and saturation and identifying pixels corresponding to cells which are preferentially stained relative to surrounding specimen tissue; and

c) determining ER or PR status from normalised average saturation and fraction of pixels corresponding to preferentially stained cells.

and saturation from an expression

where (x, y) and ({tilde over (x)}, {tilde over (y)}) are respectively image pixel coordinates and reference colour coordinates in the chromaticity space.

a) correlate window functions of different lengths with pixel sub-groups within the identified contiguous pixels groups to identify pixels associated with cell boundaries,

b) compute brightness-related measures of cell boundary brightness and sharpness and brightness extent around cell boundaries from pixels corresponding to cell boundaries,

c) compare the brightness-related measures with predetermined equivalents obtained from comparison images associated with different values of C-erb-2, and

d) assign to the image data a C-erb-2 value which is that associated with the comparison image having brightness-related measures closest to those determined for the image data.

a) derive hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;

b) produce a segmented image by thresholding the image data on the basis of hue and saturation;

c) identify in the segmented image groups of contiguous pixels; and

d) determine vascularity from the total area of the groups of contiguous pixels which are sufficiently large to correspond to vascularity, such area being expressed as a proportion of the image data's total area.

a) correlating window functions of different lengths with pixel sub-groups within the identified contiguous pixels groups to identify pixels associated with cell boundaries,

b) computing brightness-related measures of cell boundary brightness and sharpness and brightness extent around cell boundaries from pixels corresponding to cell boundaries,

c) comparing the brightness-related measures with predetermined equivalents obtained from comparison images associated with different values of C-erb-2, and

d) assigning to the image data a C-erb-2 value which is that associated with the comparison image having brightness-related measures closest to those determined for the image data.

a) generating a mean value μ_{R }and a standard deviation σ_{R }for pixels in the red image plane,

b) generating a cyan image plane from the image data and calculating a mean value μ_{C }for its pixels,

c) calculating a product CMMμ_{C }where CMM is a predetermined multiplier,

d) calculating a quantity R_{B }equal to the number of adjacent linear groups of pixels of predetermined length and including at least one cyan pixel which is less than CMMμ_{C},

e) for each red pixel calculating a threshold equal to {RMMμ_{R}−σ_{R}(4−R_{B})} and RMM is a predetermined multiplier,

f) forming a thresholded red image by discarding each red pixel that is greater than or equal to the threshold,

g) determining the number of contiguous pixel groups in the thresholded red image,

h) changing the values of RMM and CMM and iterating steps c) to g),

i) changing the values of RMM and CMM once more and iterating steps c) to g),

j) comparing the numbers of contiguous pixel groups determined in steps g) to i), treating the three pairs of values of RMM and CMM as points in a two dimensional space, selecting the pair of values of RMM and CMM associated with the lowest number of contiguous pixel groups, obtaining its reflection in the line joining the other two pairs of values of RMM and CMM, using this reflection as a new pair of values of RMM and CMM and iterating steps c) to g) and this step j).

b) producing a segmented image by thresholding the image data on the basis of hue and saturation; and

c) identifying in the segmented image groups of contiguous pixels; and

i) defining M and m for each pixel as respectively the maximum and minimum of R, G and B; and

ii) setting S to zero if m equals zero and setting S to (M−m)/M otherwise.

a) defining new values newr, newg and newb for each pixel given by newr=(M−R)/(M−m), newg=(M−G)/(M−m) and newb=(M−B)/(M−m) in order to convert each pixel value into the difference between its magnitude and that of the maximum of the three colour magnitudes of that pixel, this difference being divided by the difference between the maximum and minimum of R, G and B, and

b) calculating H as tabulated immediately below:

Description

- [0001]This invention relates to a method, a computer program and an apparatus for histological assessment, and more particularly for making measurements upon histological imagery to provide clinical information on potentially cancerous tissue such as for example (but not exclusively) breast cancer tissue.
- [0002]Breast cancer is a common form of female cancer: once a lesion indicative of breast cancer has been detected, tissue samples are taken and examined by a histopathologist to establish a diagnosis, prognosis and treatment plan. However, pathological analysis of tissue samples is a time consuming and inaccurate process. It entails interpretation of colour images by human eye, which is highly subjective: it is characterised by considerable inaccuracies in observations of the same samples by different observers and even by the same observer at different times. For example, two different observers assessing the same ten tissue samples may easily give different opinions for three of the slides −30% error. The problem is exacerbated by heterogeneity, i.e. complexity of some tissue sample features. Moreover, there is a shortage of pathology staff.
- [0003]Oestrogen and progesterone receptor (ER and PR) status, C-erb-2 and vascularity are parameters which are data of interest for assisting a clinician to formulate a diagnosis, prognosis and treatment plan for a patient. C-erb-2 is also known as Cerb-B2, her-2, her-2/neu and erb-2.
- [0004]It is an object of the invention to provide a technique for objective measurement of at least one of ER status, PR status, C-erb-2 and vascularity.
- [0005]In a first aspect, the present invention provides a method of measuring oestrogen or progesterone receptor (ER or PR) status having the steps of:
- [0006]a) obtaining histopathological specimen image data; and
- [0007]
- [0008]characterised in that the method also includes the steps of:
- [0009]
- [0010]
- [0011]e) determining ER or PR status from proportion of pixels corresponding to preferentially stained cells.
- [0012]The invention provides the advantage that it is computer-implementable, and hence is carried out in a way which avoids the subjectivity of a manual inspection process.
- [0013]In an alternative first aspect, the invention may provide a method of measuring ER or PR status having the steps of:
- [0014]a) obtaining histopathological specimen image data; and
- [0015]
- [0016]characterised in that the method also includes the steps of:
- [0017]
- [0018]
- [0019]e) determining ER or PR status from normalised average saturation.
- [0020]In a further alternative first aspect, the invention may provide a method of measuring ER or PR status having the steps of:
- [0021]a) obtaining histopathological specimen image data; and
- [0022]
- [0023]characterised in that the method also includes the steps of:
- [0024]
- [0025]
- [0026]e) determining ER or PR status from normalised average saturation and fraction of pixels corresponding to preferentially stained cells.
- [0027]Step b) may implemented using a K-means clustering algorithm employing a Mahalanobis distance metric.
- [0028]Step c) may be implemented by transforming the image data into a chromaticity space, and deriving hue and saturation from image pixels and a reference colour. Hue may be obtained from an angle φ equal to
${\mathrm{sin}}^{-1}\ue89e\frac{\uf603\stackrel{~}{x}\ue89e\text{\hspace{1em}}\ue89ey-x\ue89e\text{\hspace{1em}}\ue89e\stackrel{~}{y}\uf604}{\sqrt{{\stackrel{~}{x}}^{2}+{\stackrel{~}{y}}^{2}}\ue89e\sqrt{{x}^{2}+{y}^{2}}}$ - [0029]
- [0030]where (x, y) and ({tilde over (x)}, {tilde over (y)}) are respectively image pixel coordinates and reference colour coordinates in the chromaticity space. It may be adapted to lie in the range 0 to 90 degrees and a hue threshold of 80 degrees may be set in step d). A saturation threshold S
_{o }may be set in step d), S_{o }being 0.9 for saturation in the range 0.1 to 1.9 and 0 for saturation outside this range. - [0031]The fraction of pixels corresponding to preferentially stained cells may be determined by counting the number of pixels having both saturation greater than a saturation threshold and hue modulus less than a hue threshold and expressing such number as a fraction of a total number of pixels in the image: it may be awarded a score 0, 1, 2, 3, 4 or 5 according respectively to whether it is (i) 0, (ii) >0 and <0.01, (iii) ≧0.01 and ≦0.10, (iv) ≧0.11 and ≦0.33, (v) ≧0.34 and ≦0.66 or (vi) ≧0.67 and ≦1.0.
- [0032]Normalised average saturation may be accorded a score 0, 1, 2 or 3 according respectively to whether it is (i) ≦25%, (ii) >25% and ≦50%, (iii) >50% and ≦75% or (iv) >75% and ≦100%.
- [0033]Scores for normalised average saturation and fraction of pixels corresponding to preferentially stained cells may be added together to provide a measurement of ER or PR.
- [0034]The method of the invention may include measuring C-erb-2 status by the following steps:
- [0035]
- [0036]
- [0037]
- [0038]
- [0039]The method of the invention may include measuring vascularity by the following steps:
- [0040]
- [0041]b) producing a segmented image by thresholding the image data on the basis of hue and saturation;
- [0042]c) identifying in the segmented image groups of contiguous pixels; and
- [0043]
- [0044]In a second aspect, the invention provides a method of measuring C-erb-2 status having the steps of:
- [0045]a) obtaining histopathological specimen image data; and
- [0046]b) identifying in the image data contiguous pixel groups corresponding to respective cell nuclei associated with surrounding cell boundary staining;
- [0047]c) characterised in that the method also includes the steps of:
- [0048]d) correlating window functions of different lengths with pixel sub-groups within the identified contiguous pixels groups to identify pixels associated with cell boundaries,
- [0049]e) computing brightness-related measures of cell boundary brightness and sharpness and brightness extent around cell boundaries from pixels corresponding to cell boundaries,
- [0050]f) comparing the brightness-related measures with predetermined equivalents obtained from comparison images associated with different values of C-erb-2, and
- [0051]g) assigning to the image data a C-erb-2 value which is that associated with the comparison image having brightness-related measures closest to those determined for the image data.
- [0052]In this aspect, at least some of the window functions may have non-zero values of 6, 12, 24 and 48 pixels respectively and zero values elsewhere. Pixels associated with a cell boundary are identified from a maximum correlation with a window function, the window function having a length which provides an estimate of cell boundary width.
- [0053]The brightness-related measure of cell boundary brightness and sharpness may be computed in step d) using a calculation including dividing cell boundaries by their respective widths to provide normalised boundary magnitudes, selecting a fraction of the normalised boundary magnitudes each greater than unselected equivalents and summing the normalised boundary magnitudes of the selected fraction.
- [0054]In step d) a brightness-related measure of brightness extent around cell boundaries may be computed using a calculation including dividing normalised boundary magnitudes into different magnitude groups each associated with a respective range of magnitudes, providing a respective magnitude sum of normalised boundary magnitudes for each magnitude group, and subtracting a smaller magnitude sum from a larger magnitude sum.
- [0055]The comparison image having brightness-related measures closest to those determined for the image data may be determined from a Euclidean distance between the brightness-related measures of the comparison image and the image data.
- [0056]In step b) identifying in the image data contiguous pixel groups corresponding to respective cell nuclei is carried out by an adaptive thresholding technique arranged to maximise the number of contiguous pixel groups identified. For image data including red, green and blue image planes the adaptive thresholding technique may include:
- [0057]a) generating a mean value μ
_{R }and a standard deviation σ_{R }for pixels in the red image plane, - [0058]
_{C }for its pixels, - [0059]c) calculating a product CMMμ
_{C }where CMM is a predetermined multiplier, - [0060]
_{B }equal to the number of adjacent linear groups of pixels of predetermined length and including at least one cyan pixel which is less than CMMμ_{C}, - [0061]e) for each red pixel calculating a threshold equal to {RMMμ
_{R}−σ_{R}(4−R_{B})} and RMM is a predetermined multiplier, - [0062]
- [0063]g) determining the number of contiguous pixel groups in the thresholded red image,
- [0064]h) changing the values of RMM and CMM and iterating steps c) to g),
- [0065]i) changing the values of RMM and CMM once more and iterating steps c) to g),
- [0066]
- [0067]The first three pairs of RMM and CMM values may be 0.802 and 1.24, 0.903 and 0.903, and 1.24 and 0.802 respectively.
- [0068]Brown pixels may be removed from the thresholded red image if like-located pixels in the cyan image are less than CMMμ
_{C}; edge pixels may be removed likewise if like-located pixels in a Sobel-filtered cyan image having a standard deviation σ_{C }are greater than (μ_{C}+1.5σ_{C}). Pixels corresponding to lipids may also be removed if their red green and blue pixel values are all greater than the sum of the relevant colour's minimum value and 98% of its range of pixel values in each case. - [0069]The thresholded red image may be subjected to a morphological closing operation.
- [0070]In a third aspect, the present invention provides a method of measuring vascularity having the steps of:
- [0071]a) obtaining histopathological specimen image data; characterised in that the method also includes the steps of:
- [0072]
- [0073]c) producing a segmented image by thresholding the image data on the basis of hue and saturation; and
- [0074]d) identifying in the segmented image groups of contiguous pixels; and
- [0075]e) determining vascularity from the total area of the groups of contiguous pixels which are sufficiently large to correspond to vascularity, such area being expressed as a proportion of the image data's total area.
- [0076]In this aspect the image data may comprise pixels with red, green and blue values designated R, G and B respectively, characterised in that a respective saturation value S is derived in step b) for each pixel by:
- [0077]a) defining M and m for each pixel as respectively the maximum and minimum of R, G and B; and
- [0078]b) setting S to zero if m equals zero and setting S to (M−m)/M otherwise.
- [0079]Hue values designated H may be derived by:
- [0080]
- [0081]b) calculating H as tabulated immediately below:
M H 0 180 R 60(newb − newg)* G 60(2 + newr − newb)* B 60(4 + newg − newr)* - [0082]provided that if H proves to be >360, then 360 is subtracted from it, and if H proves to be <0, 360 is added to it.
- [0083]The step of producing a segmented image may be implemented by designating for further processing only those pixels having both a hue H in the range 282-356 and a saturation S in the range 0.2 to 0.24. The step of identifying in the segmented image groups of contiguous pixels may include the step of spatially filtering such groups to remove groups having insufficient pixels to contribute to vascularity. The step of determining vascularity may include treating vascularity as having a high or a low value according to whether or not it is at least 31%.
- [0084]In a fourth aspect, the present invention provides a computer program for measuring ER or PR status, the program being arranged to control computer apparatus to execute the steps of:
- [0085]
- [0086]characterised in that the program is also arranged to implement the steps of:
- [0087]
- [0088]
- [0089]d) determining ER or PR status from proportion of pixels corresponding to preferentially stained cells.
- [0090]In an alternative fourth aspect, the present invention provides a computer program for measuring ER or PR status, the program being arranged to control computer apparatus to execute the steps of:
- [0091]
- [0092]b) characterised in that the program is also arranged to implement the steps of:
- [0093]
- [0094]
- [0095]e) determining ER or PR status from normalised average saturation.
- [0096]In a further alternative fourth aspect, the present invention provides a computer program for measuring ER or PR status, the program being arranged to control computer apparatus to execute the steps of:
- [0097]
- [0098]characterised in that the program is also arranged to implement the steps of:
- [0099]
- [0100]
- [0101]d) determining ER or PR status from normalised average saturation and fraction of pixels corresponding to preferentially stained cells.
- [0102]In a fifth aspect, the present invention provides a computer program for use in measuring C-erb-2 status arranged to control computer apparatus to execute the steps of:
- [0103]a) processing histopathological specimen image data to identify contiguous pixel groups corresponding to respective cell nuclei associated with surrounding cell boundary staining;
- [0104]characterised in that the computer program is also arranged to implement the steps of:
- [0105]b) correlating window functions of different lengths with pixel sub-groups within the identified contiguous pixels groups to identify pixels associated with cell boundaries,
- [0106]c) computing brightness-related measures of cell boundary brightness and sharpness and brightness extent around cell boundaries from pixels corresponding to cell boundaries,
- [0107]d) comparing the brightness-related measures with predetermined equivalents obtained from comparison images associated with different values of C-erb-2, and
- [0108]e) assigning to the image data a C-erb-2 value which is that associated with the comparison image having brightness-related measures closest to those determined for the image data.
- [0109]In a sixth aspect, the present invention provides a computer program for use in measuring vascularity arranged to control computer apparatus to execute the steps of:
- [0110]a) using histopathological specimen image data to derive hue and saturation for the image data in a colour space having a hue coordinate and a saturation coordinate;
- [0111]
- [0112]c) identifying in the segmented image groups of contiguous pixels; and
- [0113]f) determining vascularity from the total area of the groups of contiguous pixels which are sufficiently large to correspond to vascularity, such area being expressed as a proportion of the image data's total area.
- [0114]In a seventh aspect, the present invention provides an apparatus for measuring ER or PR status including means for photographing histopathological specimens to provide image data and computer apparatus to process the image data, the computer apparatus being programmed to identify in the image data groups of contiguous pixels corresponding to respective cell nuclei, characterised in that the computer apparatus is also programmed to execute the steps of:
- [0115]
- [0116]
- [0117]c) determining ER or PR status from proportion of pixels corresponding to preferentially stained cells.
- [0118]In an alternative seventh aspect, the present invention provides an apparatus for measuring ER or PR status including means for photographing histopathological specimens to provide image data and computer apparatus to process the image data, the computer apparatus being programmed to identify in the image data groups of contiguous pixels corresponding to respective cell nuclei, characterised in that the computer apparatus is also programmed to execute the steps of:
- [0119]
- [0120]
- [0121]c) determining ER or PR status from normalised average saturation.
- [0122]In a further alternative seventh aspect, the present invention provides an apparatus for measuring ER or PR status including means for photographing histopathological specimens to provide image data and computer apparatus to process the image data, the computer apparatus being programmed to identify in the image data groups of contiguous pixels corresponding to respective cell nuclei, characterised in that the computer apparatus is also programmed to execute the steps of:
- [0123]
- [0124]
- [0125]c) determining ER or PR status from normalised average saturation and fraction of pixels corresponding to preferentially stained cells.
- [0126]In an eighth aspect, the present invention provides an apparatus for measuring C-erb-2 status including means for photographing histopathological specimens to provide image data and computer apparatus to process the image data, the computer apparatus being programmed to identify in the image data groups of contiguous pixels corresponding to respective cell nuclei, characterised in that the computer apparatus is also programmed to execute the steps of:
- [0127]
- [0128]
- [0129]
- [0130]
- [0131]In a ninth aspect, the present invention provides an apparatus for measuring vascularity including means for photographing histopathological specimens to provide image data and computer apparatus to process the image data, characterised in that the computer apparatus is also programmed to execute the steps of:
- [0132]
- [0133]
- [0134]c) identifying in the segmented image groups of contiguous pixels; and
- [0135]
- [0136]The computer program and apparatus aspects of the invention may have preferred features corresponding to those of respective method aspects.
- [0137]In order that the invention might be more fully understood, embodiments thereof will now be described, by way of example only, with reference to the accompanying drawings, in which:—
- [0138][0138]FIG. 1 is a block diagram of a procedure for measuring indications of cancer to assist in formulating diagnosis and treatment;
- [0139][0139]FIG. 2 is a block diagram of a process for measuring ER and PR receptor status in the procedure of FIG. 1;
- [0140][0140]FIG. 3 is a pseudo three dimensional view of a red, green and blue colour space (colour cube) plotted on respective orthogonal axes;
- [0141][0141]FIG. 4 is a transformation of FIG. 3 to form a chromaticity space;
- [0142][0142]FIG. 5 is a drawing of a chromaticity space reference system;
- [0143][0143]FIG. 6 illustrates use of polar co-ordinates;
- [0144][0144]FIG. 7 is a block diagram of a process for measuring C-erb-2 in the procedure of FIG. 1; and
- [0145][0145]FIG. 8 is a block diagram of a process for measuring vascularity in the procedure of FIG. 1.
- [0146]The examples to be described herein are three different inventions which can be implemented separately or together, because they are all measurements which individually or collectively assist a clinician to diagnose cancer and to formulate a treatment programme. In descending order of importance, the procedures are determination of oestrogen and progesterone receptor status, determination of C-erb-2 and determination of vascularity.
- [0147]A procedure 10 for the assessment of tissue samples in the form of histopathological slides of potential carcinomas of the breast is shown in FIG. 1. This drawing illustrates processes which generate measurements of specialised kinds for use by a pathologist as the basis for assessing patient diagnosis, prognosis and treatment plan.
- [0148]The procedure 10 employs a database which maintains digitised image data obtained from histological slides as will be described later. Sections are taken (cut) from breast tissue samples (biopsies) and placed on respective slides. Slides are stained using a staining agent selected from the following depending on which parameter is to be determined:
- [0149]a) Immunohistochemical staining for C-erb-2 with diaminobenzidine (DAB) as substrate (chemical staining agent)—collectively “Cerb-DAB”—this is for assessing C-erb-2 gene amplification status;
- [0150]b) Oestrogen receptor (ER) with DAB as substrate (collectively “ER-DAB”) for assessing the expression (the amount expressed or emitted) of the oestrogen receptors. Progesterone receptor (PR) status is investigated using chemical treatment giving the same colouration as in ER.
- [0151]c) Immunohistochemical staining for CD31 with fuchsin (F) as substrate for assessing vascularity (angiogenesis).
- [0152]In a prior art manual procedure, a clinician places a slide under a microscope and examines a region of it (referred to as a tile) at magnification of ×40 for indications of C-erb-2, ER and PR status and at ×20 for vascularity.
- [0153]The present invention requires data from histological slides in a suitable form. In the present example, image data were obtained by a pathologist using Zeiss Axioskop microscope with a Jenoptiks Progres 3012 digital camera. Image data from each slide is a set of digital images obtained at a linear magnification of 40 (i.e. 40×), each image being an electronic equivalent of a tile.
- [0154]To select images, a pathologist scans the microscope over a slide, and at 40× magnification selects regions (tiles) of the slide which appear to be most promising in terms of an analysis to be performed. Each of these regions is then photographed using the microscope and digital camera referred to above, which produces for each region a respective digitised image in three colours, i.e. red, green and blue (R, G & B). Three intensity values are obtained for each pixel in a pixel array to provide an image as a combination of R, G and B image planes. This image data is stored temporarily at 12 for later use.
- [0155]Three tiles are required for vascularity measurement at 14, and one tile for each of oestrogen and progesterone receptor measurement at 16 and C-erb-2 measurement at 18. These measurements provide input to a diagnostic report at 20.
- [0156]The prior art manual procedure for scoring C-erb-2 involves a pathologist subjectively and separately estimating stain intensity, stain location and relative number of cells associated with a feature of interest in a tissue sample. The values obtained in this way are combined by a pathologist to give a single measurement for use in diagnosis, prognosis and reaching a decision on treatment. The process hereinafter described in this example replaces the prior art manual procedure with an objective procedure.
- [0157]Referring now to FIG. 2, processing
**16**to determine ER status will be outlined and then described in more detail later. It begins with a pre-processing stage**30**in which a K-means clustering algorithm is applied to a colour image using a Mahalanobis metric. This determines or cues image regions of interest for further processing by associating pixels into clusters on the basis of their having similar values of the Mahalanobis metric. At**32**the colour image is transformed into a chromaticity space which includes a location of a reference colour. Hue and saturation are calculated at**34**for pixels in clusters cued by K-means clustering. The number of brown stained pixels is computed at**36**by thresholding on the basis of hue and saturation. An ER status measurement is then derived at**38**from a combination of the fraction of stained pixels and average colour saturation. - [0158]The input for the ER preprocessing stage
**30**consists of raw digital data files of a single histopathological colour image or tile. A triplet of image band values for each pixel represents the colour of that pixel in its red, green, and blue spectral components or image bands. These values in each of the three image bands are in the range [0 . . . 255], where [0,0,0] corresponds to black and [255,255,255] corresponds to white. The K-means clustering algorithm**30**is applied to the digital colour image using clusters and the Mahalanobis distance metric. A cluster is a natural grouping of data having similar values of the relevant metric, and the Mahalanobis distance metric is a measurement that gives an indication of degree of closeness of data items to a cluster centre. It is necessary to have some means for locating cell nuclei as pixel groups but it is not essential to use four clusters or the Mahalanobis distance metric: these have been found to work well in identifying groups of contiguous pixels which correspond to respective cell nuclei. The K-means algorithm is described by J. A. Hartigan and M. A. Wong, in a paper entitled ‘A K-means clustering algorithm’, Algorithm AS 136, Applied Statistics Journal, 1979. The Mahalanobis distance metric is described by F. Heijden, in ‘Image Based Measurement Systems—object recognition and parameter estimation’, John Wiley & Sons, 1994 and by R. Schalkoff, in ‘Pattern Recognition—Statistical, Structural and Neural approaches’, John Wiley & Sons Inc., 1992. The process comprises an initialisation step a) followed by computation of a covariance matrix at step b). This leads to a likelihood calculation at step c), which effectively provides the distance of a pixel from a cluster centre. The procedure is as follows: - [0159]a) Initially, cluster centres are set using 30+(cluster number +1)×10 subtracted from the mean of the red, green and blue image bands respectively. For example the first cluster values would be set at mean_red −30+(0 +1)×10 (hence mean_red −20), similarly for mean_green and mean_blue. The second cluster would be mean_red −10, mean_green −10, and mean_blue −10, and similarly for other clusters. Pixels are then assigned to clusters for later readjustment.
- [0160]b) For each cluster the following computations are carried out:
- [0161]i) Compute elements of the kind σ
_{ij}^{k }of a covariance matrix of the image bands indicating the degree of variation between intensities of different colours in pixels of each cluster from Equation (1):$\begin{array}{cc}{\sigma}_{i\ue89e\text{\hspace{1em}}\ue89ej}^{k}=\frac{1}{{N}^{k}}\ue89e\sum _{l=1}^{{N}^{k}}\ue89e\text{\hspace{1em}}\ue89e\left({c}_{\mathrm{li}}-{\mu}_{i}^{k}\right)\ue89e\left({c}_{\mathrm{lj}}-{\mu}_{j}^{k}\right)& \left(1\right)\end{array}$ - [0162]where:
- [0163]σ
_{ij}^{k }is the ij^{th }element of the covariance matrix, - [0164]N
_{k }is the number of pixels in cluster k, - [0165]C
_{li}, and C_{lj}, are the values of pixel l in image bands i and j, - [0166]i, j take values 1, 2, 3, which represent the red, green and blue image bands respectively,
- [0167]μ
_{i}^{k }is the mean of all pixels in image band i belonging to cluster k, and - [0168]μ
_{j}^{k }is the mean of all pixels in image band j belonging to cluster k. - [0169]
- [0170]
- [0171]c). With index i denoting pixel number, each pixel {right arrow over (x)}
_{i }is now treated as a vector having three elements x_{i,1}, x_{i,2}, x_{i,3 }which are the red (x_{i,1}), green (x_{i,2}) and blue (x_{i,3}) pixel values: the red, green and blue image bands are therefore represented by second subscript indices 1, 2 and 3 respectively. With i ranging over all pixels in a cluster k, the likelihood d^{k}({right arrow over (x)}_{i}) of a pixel vector {right arrow over (x)}_{i }not belonging to that cluster is computed from Equation (2) below:$\begin{array}{cc}{d}^{k}\ue8a0\left({\overrightarrow{x}}_{i}\right)=\mathrm{ln}\ue8a0\left(\sqrt{\sum _{d\ue89e\text{\hspace{1em}}\ue89e\mathrm{et}}^{k}}\right)+1/2\ue8a0\left[{\left({\overrightarrow{x}}_{i}-{\overrightarrow{\mu}}^{k}\right)}^{t}\ue89e\sum _{i\ue89e\text{\hspace{1em}}\ue89e\mathrm{nv}}^{k}\ue89e\left({\overrightarrow{x}}_{i}-{\overrightarrow{\mu}}^{k}\right)\right]& \left(2\right)\end{array}$ - [0172]
- [0173]are as defined above,
- [0174]μ
_{i}^{k }is the mean of all pixel vectors {right arrow over (x)}_{i }in cluster k, and - [0175]t indicates the transpose of the difference vector ({right arrow over (x)}
_{i}−{right arrow over (μ)}^{k}). - [0176]Equation (2) is re-evaluated for the same pixel vector {right arrow over (x)}
_{i }in all other clusters also. Pixel vector {right arrow over (x)}_{1 }has the highest likelihood of belonging to a cluster (denoted k_{m}) for which d^{k}({right arrow over (x)}_{i})has a minimum value i.e. {d^{k}^{ m }({right arrow over (x)}_{i})}; cluster k_{m }is then the most suitable to receive pixel {right arrow over (x)}_{i}; i.e. find:— - d
^{k}^{ m }({right arrow over (x)}_{i})≦d^{k ({right arrow over (x)}}_{i}) for all k≠k_{m}(3) - [0177]Assign pixel {right arrow over (x)}
_{i}to cluster k_{m } - [0178]d). For each cluster k:
- [0179]Store a record of which pixels belong to cluster k as an array X
^{k}, update it with each pixel vector assigned to that cluster and update the number N^{k }of pixels in that cluster. - [0180]
- [0181]Iterate steps b) to d) until convergence, i.e. when no more pixels change clusters or the number of iterations reaches a total of 20.
- [0182]The first cluster (k=1) now corresponds to cell nuclei and the corresponding pixel vectors are those which are cued as of interest for output and further processing.
- [0183]Transformation of the image at
**32**from red/green/blue (RGB) to chromaticity space. In the present example, as will be described, a reference colour is used: if necessary, this can be avoided using e.g. the approach of the Cerb B2 example described later. The chemical staining used in the present example results in brown colouration and the approach used here is arranged to detect that preferentially; a different staining could however be used, in which case the technique would be adapted to detect a different pixel colour. - [0184]In practice brightness is liable to vary due to variation in degree of chemical staining and sample thickness across a slide, as well as possible vignetting by a camera lens used to produce the images. In consequence in this example emphasis is placed on computing a measurement of hue (or colour) and saturation as described later.
- [0185](a) Referring now also to FIGS.
**3**to**6**, each RGB image is transformed into a chromaticity space. FIG. 3 shows an RGB cube**40**in which red, green and blue pixel values (expressed as R, G and B respectively) are normalised and represented as values in the range 0 to 1. These pixel values are represented on red, green and blue axes**52**,**54**and**56**respectively. The chromaticity space is a plane**58**for which R+G+B=1: it is triangular within the RGB cube**50**and passes through the points (1,0,0), (0,1,0) and (0,0,1). - [0186](b) FIG. 4 shows the axes
**52**,**54**and**56**and chromaticity space**58**looking broadly speaking along a diagonal of the RGB cube**50**from the point (1,1,1) (not shown) to the origin (0,0,0) now referenced O for convenience. The points (0,0,1), (0,1,0) and (1,0,0) in FIG. 3 are now referenced J, K and L respectively. D is a midpoint of a straight line between J and L. Image pixel values from the input RGB image are projected on to the chromaticity space**108**and the resulting projections become data points for further processing. - [0187]The projection calculation is as follows:
- [0188]
- [0189]Perpendiculars from a point P in the chromaticity space
**108**to the lines JK and LD meet the latter at E and G respectively. Perpendiculars from P and G to the plane JOK meet the latter at F and H respectively. Using Equations (5), the point P in the triangular chromaticity space**58**may then be defined by x and y co-ordinates shown in FIG. 4 and given by:$\begin{array}{cc}x=\mathrm{DE}=\mathrm{HF}=\frac{g-r}{\sqrt{2}}\ue89e\text{\hspace{1em}}\ue89e\mathrm{and}\ue89e\text{\hspace{1em}}\ue89ey=\mathrm{PE}=\mathrm{GD}=b\ue89e\sqrt{\frac{3}{2}}& \left(6\right)\end{array}$ - [0190](c) In FIG. 5, the chromaticity space
**58**is shown with x and y co-ordinate axes extending from an origin Q. A reference colour denoted by a point S in the drawing is now defined as that specified for this purpose by a clinician: it is the colour of that part of the image which is most positively stained (the most intense colour on the part of the original slide from which the image was taken). The reference colour's RGB components are taken from the image and its x and y co-ordinates are computed using Equations (5) and (6): these co-ordinates are denoted as ({tilde over (x)}, {tilde over (y)}). - [0191](d) In FIG. 6, a polar co-ordinate system (r,θ) is now defined on the (R+G+B=1) plane or chromaticity space
**58**. The co-ordinate system origin is the centre of gravity G of the triangle**58**. A reference direction for θ=0 is defined as the direction QS of the radius vector to the reference colour S in FIG. 5. For any point such as P on the triangle defined as having co-ordinates (x, y) in the HSV colour space, hue H is defined as the angle φ between the radius vector (e.g. QP) to itself and the radius vector QS to the reference colour. This is computed at**34**from the following expressions for φ:$\begin{array}{cc}\mathrm{sin}\ue89e\text{\hspace{1em}}\ue89e\phi =\frac{\stackrel{~}{x}\ue89ey-x\ue89e\stackrel{~}{y}}{\sqrt{{\stackrel{~}{x}}^{2}+{\stackrel{~}{y}}^{2}}\ue89e\sqrt{{x}^{2}+{y}^{2}}}& \left(7\right)\\ \mathrm{cos}\ue89e\text{\hspace{1em}}\ue89e\phi =\frac{x\ue89e\stackrel{~}{x}+y\ue89e\stackrel{~}{y}}{\sqrt{{\stackrel{~}{x}}^{2}+{\stackrel{~}{y}}^{2}}\ue89e\sqrt{{x}^{2}+{y}^{2}}}& \left(8\right)\end{array}$ - [0192]
- [0193]For convenience the definition of hue H is now altered somewhat to render all values positive and in the range 0 to π/2: the transformation of earlier values φ into a new version ψ is shown in Table 1 below:
TABLE 1 Condition Magnitude of ψ (New Hue H) sin φ > 0 and cos φ > 0 φ sin φ > 0 and cos φ < 0 π − φ sin φ < 0 and cos φ > 0 −φ sin φ < 0 and cos φ < 0 φ − π - [0194]A hue (H) threshold ψ
_{0 }is set at**36**by a user or programmer of the procedure as being not more than π/2, a typical value which might be chosen being 80 degrees. Saturation S is defined to be$\begin{array}{cc}\mathrm{saturation}=\frac{x\ue89e\stackrel{~}{x}+y\ue89e\stackrel{~}{y}}{{\stackrel{~}{x}}^{2}+{\stackrel{~}{y}}^{2}}& \left(10\right)\end{array}$ - [0195]Two values of saturation threshold S
_{0 }are set according to whether or not image pixel saturation value S lies in the range 0.1 to 1.9: this is set out in Table 2 below:TABLE 2 Saturation S S _{0}Either S < 0.1 or S > 1.9 0 0.1 ≦ S ≦ 1.9 0.9 - [0196]At
**36**, the thresholds are used to count selectively the number N_{b }of pixels which are sufficiently brown (having a large enough value of saturation) having regard to the reference colour. All H and S pixel values in the image are assessed. The conditions to be satisfied by a pixel's hue and saturation values for it to be counted in the brown pixel number N_{b }are set out in Table 3 below.TABLE 3 Condition Action For each pixel with both hue modulus Treat as a “saturated” pixel; |ψ| < ψ _{0 }and saturation S > S_{0}increase count N _{b }of brownpixels by 1 For each pixel with |ψ| ≧ ψ _{0 }and/orTreat as an “unsaturated” saturation S ≦ S _{0}pixel; leave N _{b }unchanged - [0197]The average saturation of the N
_{b }saturated pixels determined in Table 3 is computed by adding all their saturation values S together and dividing the resulting sum by N_{b}. The maximum saturation value of the saturated pixels is then determined, and the average saturation is normalised by expressing it as a percentage of this maximum: this approach is used to counteract errors due to variation in colour staining between different images. The normalised average saturation is then accorded a score at 38 of 0, 1, 2 or 3 according respectively to whether this percentage is (a) ≦25%, (b) >25% and ≦50%, (c) >50% and ≦75% or (d) >75% and ≦100%. - [0198]The fraction of saturated pixels—those corresponding to cells stained sufficiently brown relative to surrounding tissue—is computed at 38 from the ratio N
_{b}/N where N is the total number of pixels in the image. This fraction is then quantised to a score in the range 0 to 5 as set out in Table 5 below.TABLE 5 N _{b}/N:Fraction of imagepixels that are stained Score 0.00 0 <0.01 1 0.01-0.10 2 0.11-0.33 3 0.34-0.66 4 0.67-1.00 5 - [0199]The two scores determined above, i.e. for normalised average saturation and fraction of sufficiently brown pixels are now added together to give a measure in the range 0 to 8. The higher this number is, the more oestrogen (ER) positive the sample is, as shown in Table 6 below.
TABLE 6 Description of ER status (ER Score) Range Strongly positive 7-8 Positive 4-6 Weakly positive 2-3 Negative 0-1 - [0200]Women with an ER score of 7 or 8 will respond favourably to hormonal treatment such as Tamoxifen; women with an ER score in the range 4 to 6 will have 50% of chance of responding to this treatment. Women scoring 2 or 3 will not respond very well, and those scoring 0 or 1 will not respond to hormonal treatment at all.
- [0201]Images for ER and PR are indistinguishable visually and they are distinguished by the fact that they are produced using different stains. A PR score is therefore produced from stained slides in the same way as an ER score described above. The significance of progesterone receptor (PR) positivity in a breast carcinoma is less well understood than the equivalent for ER. In general, cancers that are ER positive will also be PR positive. However, carcinomas that are PR positive, but not ER positive, may have a worse prognosis.
- [0202]Turning now to C-erb-2 the conventional manual technique involves processing a histopathological slide with chemicals to stain it appropriately, after which it is viewed by a clinician. Breast cells on the slide will have stained nuclei with a range of areas which allows discrimination between tissue cells of interest and unwanted cell types which are not important to cancer assessment. Cancerous cells will usually have a larger range of sizes of nuclei which must be allowed for in the discrimination process. A clinician needs to ignore unwanted cell types and to make a measurement by subjectively grading cells of interest as follows:
Score Staining Pattern 0 membrane staining in less than 10% of cells 1 just perceptible membrane staining in more than 10% of cells but membranes incompletely stained 2 weak to moderate complete membrane staining of more than 10% of cells 3 strong complete membrane staining of more than 10% cells - [0203]Scores 0 and 1 are negative (not justifying treatment), whereas scores 2 and 3 are called positive (justifying treatment).
- [0204]Unfortunately, there are artefacts which make measurement more complicated, as follows:
- [0205]Retraction (shrinking) artefact: less sharply defined than true membrane staining;
- [0206]Thermal artefact: if a electrocautery instrument is used, rather ill-defined staining occurs;
- [0207]Crushing artefact: the tissue is inadvertently mechanically deformed allowing more ill-defined staining.
- [0208]Thermal and crushing artefacts are normally confined to boundaries of a tissue specimen and would hopefully be excluded to some extent by a clinician photographing tiles from a slide. However, it is still important to guard against ill-defined staining not attached to a cell membrane.
- [0209]The technique of this invention attempts to measure the parameters mentioned above namely:
- [0210]Completeness of cell membrane staining;
- [0211]Intensity and thinness of cell membrane staining; and
- [0212]Ratio of cell membrane staining.
- [0213]There are two main stages in the present invention, and these may optionally be preceded by pre-processing if images are poor. The main stages are:
- [0214]finding cell nuclei which satisfy area and location limitations associated with tumours; and
- [0215]determining a score which characterises the membranes of the cell nuclei found in the preceding stage.
- [0216]Referring now to FIG. 7, the C-erb-2 technique of the invention will firstly be outlined and later described in more detail. An optional preprocessing step
**70**is carried out if images of tiles are poor due to camera vignetting or colour errors across the image. - [0217]Image segmentation is carried out in steps
**71**to**78**, i.e. automated separation of objects from a background in a digital image. The original digital image of a tile has red, green and blue image planes: from the green and blue image planes a cyan image plane is derived at**71**and a Sobel-filtered cyan image plane at**72**. There are now five image planes: of these only the red and blue image planes are essential with conventional colour staining, the other image planes are used for desirable but not essential filtering operations upon the red image planes. Statistical measures of the five image planes are computed at**74**and**76**, and then a segmented image is optimised and generated at**78**which has been filtered to remove unwanted pixels and spatial noise. The segmented image identifies cell nuclei. Step**78**is an adaptive thresholding technique using information from regions around pixels: it is shown in more detail within chain lines**80**with arrows**82**indicating iterations. It is an alternative to the K-means clustering algorithm previously described, which could also be used. - [0218]If at
**84**the number of cells found is less than 16, the image is rejected at**86**: if it is 16 or greater, then having found the cell nuclei, and hence the cells, the strength, thinness and completeness of each cell's surrounding membrane staining are measured and the membrane stainings are then ranked. - [0219]For each cell, at
**88**a sequence of cross-correlation windows of varying widths is passed along four radii from the cell centroid to determine the cell boundary brightness value, membrane width and distance from the centroid of the most intense staining. Cell boundary brightness value is normalised by dividing by membrane width, and nuclear area and sum of normalised boundary brightness values are then obtained. Statistical measures characterising membrane-staining strength, specificity and completeness are then deduced: these measures are compared with equivalents obtained from four reference images. The measured image is then graded by assigning it a score which is that of the closest reference, with the metric of Euclidean-distance. Other metrics may also be used. Alternatively, the scores of a moderately large sample may be used as references. - [0220]The C-erb-2 process will now be described in more detail. The process
**18**is applied to one image or tile obtained by magnifying by a factor of**40**an area of a histological slide. Referring to FIG. 7 once more, The optional preprocessing step**70**is carried out by either: - [0221](a) dividing the image into a suitable number of tiles (with less individual variability) and processing them separately—this should be considered an option in general, though it is not necessary if there is reasonable uniformity across individual images; or
- [0222](b) preferably, if.sufficient images are available from the same camera objective lens, computing its deficiency and correcting it, rather than processing sub-images with more part-cells split across boundaries.
- [0223]The digital image of a slide is a three colour or red green and blue (RGB) image as defined above, i.e. there is a respective image plane for each colour. For the purposes of the following analysis, the letters R, G and B for each pixel are treated as the red green and blue intensities at that pixel. The RGB image is used at
**71**to compute a cyan image derived from the blue and green image planes: i.e. for each pixel a cyan intensity C is computed from C=(2×B+G)/3, the respective pixel's green (G) intensity being added to twice its blue (B) intensity and the resulting sum being divided by three. When repeated for all pixels this yields a cyan image or image plane. Cyan is used because it is a complementary colour to brown, which is the cell boundary colour produced by conventional chemical staining of a specimen. The blue image plane could be used instead but does not normally produce results as good as the cyan image. If a different colour staining were to be use, the associated complementary colour image would be selected. This process step is not essential but it greatly assists filtering out unwanted pixels and it does so without a reference colour (see the ER/PR example which uses an alternative approach). - [0224]At
**72**, a Sobel edge filter is applied to the cyan image plane: this is a standard image processing technique published in Klette R., & Zamperoni P., ‘Handbook of image processing operators’, John Wiley & Sons, 1995. A Sobel edge filter consists of two 3×3 arrays of numbers S_{P }and S_{Q}, each of which is convolved with successive 3×3 arrays of pixels in an image. Here$\begin{array}{cc}{S}_{P}=\left[\begin{array}{ccc}1& 2& 1\\ 0& 0& 0\\ -1& -2& -1\end{array}\right]\ue89e\text{\hspace{1em}}\ue89e\text{and}\ue89e\text{\hspace{1em}}\ue89e{S}_{Q}=\left[\begin{array}{ccc}1& 0& -1\\ 2& 0& -2\\ 1& 0& -1\end{array}\right]& \left(11\right)\end{array}$ - [0225]The step
**72**initially selects a first cyan 3×3 array of pixels in the top left hand corner of the cyan image: designating as C_{ij }a general cyan pixel in row i and column j, the top left hand corner of the image consists of pixels C_{11}, to C_{13}, C_{21 }to C_{23 }and C_{31 }to C_{33}. C_{ij }is then multiplied by the respective digit of S_{P }located in the S_{P }array as C_{ij }is in the 3×3 cyan pixel array: i.e. C_{11 }to C_{13 }are multiplied by 1, 2 and 1 respectively, C_{21 }to C_{23 }by zeroes and C_{31 }to C_{33 }by −1, −2 and −1 respectively. The products so formed are added algebraically and provide a value p. - [0226]The value of p will be relatively low for pixel values changing slowly between the first and third rows either side of the row of C
_{22}, and relatively high for pixel values changing rapidly between those rows: in consequence p provides an indication of image edge sharpness across rows. This procedure is repeated using the same pixel array but with S_{Q }replacing S_{P}, and a value q is obtained: q is relatively low for pixel values changing slowly between the first and third columns either side of the column of C_{22}, and relatively high for pixel values changing rapidly between those columns: and q therefore provides an indication of image edge sharpness across columns. The square root of the sum of the squares of p and q are then computed i.e. {square root}{square root over (p^{2}+q^{2})}, which is defined as an “edge magnitude” and becomes T_{22 }(replacing pixel C_{22 }at the centre of the 3×3 array) in the transformed cyan image. It is also possible to derive an edge “phase angle”as tan^{−1}p/q, but that is not required in the present example. - [0227]A general pixel T
_{ij }(row i, column j) in the transformed image is derived from C_{i−1,j−1 }to C_{i−1,j+1}, C_{i,j−1 }to C_{i,j+1 }and C_{i+1,j−1 }to C_{i+1,j+1 }of the cyan image. Because the central row and column of the Sobel filters in Equation (11) respectively are zeros, and other coefficients are 1s and 2s, p and q for T_{ij }can be calculated as follows: -
*p={C*_{i−1,,j−1}+2*C*_{i−1,j}*+C*_{i−1,,j+1}*}−{C*_{i+1,,j−1}+2*C*_{i+1,+1}} (12) -
*q={C*_{i−1,,j−1}+2*C*_{i,j−1}*+C*_{i+1,,j−1}*}−{C*_{i−1,,j+1}+2*C*_{i,j+1}*+C*_{i+1,j+1}} (13) - [0228]Beginning with i=j=2, p and q are calculated for successive 3×3 pixel arrays by incrementing j by 1 and evaluating Equations (2) and (3) for each such array until the end of a row is reached; j is then incremented by 1 and the procedure is repeated for a second row and so on until the whole image has been transformed. This transformed image is referred to below as the “Sobel of Cyan” image or image plane.
- [0229]The Sobel filter cannot calculate values for pixels at image edges having no adjacent pixels on one or other of its sides: i.e. in a pixel array having N rows and M columns, edge pixels are the top and bottom rows and the first and last columns, or in the transformed image pixels T
_{11 }to T_{1M}, T_{N1 }to T_{NM}, T_{11 }to T_{1M }and T_{1M }to T_{NM}. By convention in Sobel filtering these edge pixels are set to zero. - [0230]A major problem with measurements on histopathological images is that the staining of different slides can vary enormously, e.g. from blue with dark spots to off-white with brown outlines. The situation can be improved by sifting the slides and using only those that conform to a predetermined colouration. However, it has been found that it is possible to cope with variation in staining to a reasonable extent by using statistical techniques to normalise images: in this connection steps
**74**and**76**derive a variety of statistical parameters for use in image segmentation in step**78**. - [0231]In Step
**74**is computed the mean and standard deviation of the transformed pixel values T_{ij}. For convenience a change of nomenclature is implemented: index k is substituted for i and j, i.e. k=1 to NM for i, j=1, 1 to N, M: this treats a two dimensional image as a single composite line composed of successive rows of the image. Also x is substituted for T in each pixel value, so T_{ij }becomes X_{k}. The following Equations (14) and (15) respectively are used for computing the mean μ and standard deviation σ of the transformed pixels x_{k}.$\begin{array}{cc}\mu =\frac{1}{\mathrm{NM}}\ue89e\sum _{k=1}^{\mathrm{NM}}\ue89e\text{\hspace{1em}}\ue89e{x}_{k}& \left(14\right)\\ \sigma =\sqrt{\frac{1}{\mathrm{NM}-1}\ue89e\sum _{k=1}^{\mathrm{NM}}\ue89e\text{\hspace{1em}}\ue89e{\left({x}_{k}-\mu \right)}^{2}}& \left(15\right)\end{array}$ - [0232]At
**76**, various statistical parameters are computed for the Red, Green, Blue and Cyan image planes using Equations (14) and (15) above. - [0233]For the Red image plane the statistical parameters are the mean μ
_{R }and standard deviation σ_{R }of its pixel values: in Equations (14) and (15), x_{k }represents a general pixel value in the Red image plane. In addition, the Red image plane's pixels are compared with one another to obtain their maximum, minimum and range (maximum-minimum). Similarly, pixels in each of the Green and Blue image planes are compared with one another to obtain a respective maximum, minimum and range for each plane. Finally, for the Cyan image, pixels' mean and standard deviation are computed using Equations (14) and (15), in which x_{k }represents a general pixel value in the Cyan image plane. - [0234]In step
**78**, the image is segmented to identify and locate cell nuclei. a pixel is counted as part of a cell nucleus if and only if it survives a combination of thresholding operations on the Red, Green, Blue, Cyan and Sobel of Cyan image planes followed by closure of image gaps left after thresholding operations. It is necessary to determine threshold values in a way which allows for variation in chemical staining between different images. The technique employed in this example is to perform a multidimensional optimisation of some thresholds with nuclei-number as the objective-function to be maximised: i.e. for a given image, threshold values are altered intelligently until a near maximum number of nuclei is obtained. Starting values are computed for the optimisation routines by choosing those suitable for provision of threshold levels. In this example, two dimensional optimisation is used requiring three starting values indicated by suffixes 1, 2 and 3 and each with two components: the starting values represent vertices of a triangle in a two dimensional plane. The starting values are RMM1/CMM1, MMM2/CMM2 and RMM3/CMM3, RMM indicating a “Red Mean Multiplier” and CMM indicating a “Cyan Mean Multiplier”. Tests using a substantial number of images have shown that suitable starting values are RMM1=0.802, CMM1=1.24, RMM2=CMM2=0.903, RMM3=1.24 and CMM3=0.802. - [0235]For images counterstained with Haemotoxylin and Eosin (H&E) cell nuclei are strongly stained blue—i.e. they have very low values in the complementary red plane. Hence the red plane is the primary plane used in thresholding as follows:
- [0236](a) Produce a thresholded image for the Red image plane (approximately complimentary to Blue) as follows: for every Red pixel value that is less than an adaptive threshold, set the corresponding pixel location in the thresholded Red image to 1, otherwise set the latter to 0. A respective adaptive threshold is computed separately for every pixel location as follows. At a) in step
**78**, the Red image threshold value is dependent on the presence of enclosing brown stain in the neighbourhood of each pixel, i.e. it is a function of Cyan mean μ_{C }and Red mean μ_{R}. A check for enclosing brown is performed by searching radially outwards from a pixel under consideration. The procedure is in the Cyan image plane to select the same pixel location as in the Red image plane and from it to search in four directions—north, south, east and west directions—for a distance of seventy pixels (or as many as are available up to seventy). Here north, south, east and west have the following meanings: north: upward from the pixel in the same column; south: downward from the pixel in the same column; east: rightward from the pixel in the same row; and west: leftward from the pixel in the same row. More directions (e.g. diagonals north-east, north-west, south-east and south-west) could be used to improve accuracy but four have been found to be adequate for the present example. In any of these directions or radii either a cyan pixel will fall below a threshold (indicating a brown pixel) or a radius of 70 pixels will be reached without a cyan pixel doing so. The number R_{B }of “brown” radii (radii intersecting at least one brown pixel) is then used to change the red threshold adaptively in the following way: There is calculated a new Red image plane threshold RTN=RMM1μ_{R}−σ_{R}(4−R_{B}), where RMM1μ_{R }is the product of RMM1 and μ_{R }and σ_{R }is the standard deviation of the Red image plane. A limit is placed on RTN giving it a maximum possible value of 255. If the Red image plane pixel under consideration is less than the Red image plane threshold calculated for it, the corresponding pixel at the same location in the thresholded Red image is set to one, otherwise it is set to zero. - [0237](b) Using the Cyan image plane, and with the Cyan mean μ
_{C }from step**74**, for every Cyan pixel value that is less than the product of CMM1 and μ_{C}, set the pixel in the corresponding location in the thresholded Red image to 0, otherwise do not change the pixel. This has the effect of removing excess brown pixels. - [0238](c) Using the Sobel of Cyan image plane, and with the Cyan mean μ
_{C }and standard deviation σ_{C }from step**74**: i.e. for every Cyan pixel value that is greater than (μ_{C}+1.5σ_{C}) set the corresponding pixel in the thresholded Red image to 0, otherwise do not change the pixel. This has the effect of removing brown edge pixels. - [0239](d) Pixels corresponding to lipids are now removed as follows: using the pixel minimum and range values computed at step
**76**, a thresholded Red image is produced using data obtained from the Red, Green and Blue image planes: for each Red, Green and Blue pixel group at a respective pixel location that satisfies all three criteria at (i) to (iii) below, set the pixel at the corresponding location in the thresholded Red image to 0, otherwise do not change the pixel; this has the effect of removing lipid image regions (regions of fat which appear as highly saturated white areas). Removal of these regions is not essential but is desirable to improve processing. The criteria for each set of Red, Green and Blue values at a respective pixel are: - [0240](i) Red value>Red minimum+0.98×(red range), AND
- [0241](ii) Green pixel>Green minimum+0.98×(green range), AND
- [0242](iii) Blue pixel>Blue minimum+0.98×(blue range)
- [0243]Steps (c) and (d) could be moved outside the recursion loop defined within chain lines
**80**if desired, with consequent changes to the procedure. - [0244](e) The next step is to apply to the binary image obtained at step (d) of
**78**above a morphological closing operation, which consist of a dilation operation followed by an erosion operation. These morphological operations fuse narrow gaps and eliminate small holes in individual groups of contiguous pixels appearing as blobs in an image. They are not essential but they improve processing. They can be thought of as removal of irregularities or spatial “noise”, and they are standard image processing procedures published in Umbaugh S. C., ‘Colour vision and image processing’, Prentice Hall, 1998. - [0245](f) A connected component labelling process is now applied to the binary image produced at step (e). This is a known image processing technique (sometimes referred to as ‘blob colouring’) published by R Klette and P Zamperoniu, ‘Handbook of Image Processing Operators’, John Wiley & Sons, 1996, and A Rosenfeld and A C Kak, ‘Digital Picture Processing’, Vols. 1 & 2, Academic Press, New York, 1982. It gives numerical labels to “blobs” in the binary image, blobs being regions or groups of like-valued contiguous or connected pixels in an image: i.e. each group or blob consists of connected pixels which are all
**1**s, and each is assigned a number different to those of other groups. This enables individual blobs to be distinguished from others by means of their labels. The number of labelled image regions or blobs in the image is computed from the labels and output. Connected component labelling also determines each labelled image region's centroid (pixel location of region centre), height, width and area. Image regions are now removed from the binary image if they are not of interest because they are too small or too large in area or they have sufficiently dissimilar height and width indicating they are flattened. The remaining regions in the binary image pass to the next stage of processing at (g). - [0246]Steps (a) to (f) are carried out for all three starting points or triangle vertices RMM1/CMM1, RMM2/CMM2 and RMM3/CMM3: this yields three values for the number of regions remaining in the binary image in each case.
- [0247](g) This step is referred to as the Downhill Simplex method: it is a standard iterative statistical technique for multidimensional optimisation published in Nelder J. A., Mead R., 1965, Computer Journal, vol. 7, pp 308-313, 1965. It takes as input the three numbers of regions remaining after step (f). It is possible to use other optimisation techniques such as that referred to as Powell which uses gradients. The starting point/vertex yielding the lowest number of regions remaining is then selected. A new starting point is then generated as the reflection of the selected vertex in the line joining to the two other vertices: i.e. if the three vertices were to have been at 1,1, 1,2 and 2,1, and 1,1 was the selected vertex, then the new starting point is 2,2. The selected vertex is then discarded and the other two retained. The new starting point or vertex becomes RMM4/CMM4 and steps (a) to (f) are repeated using it to generate a new number of regions remaining for comparison with those associated with the two retained vertices. Again a vertex yielding the lowest number of regions remaining is selected, and the process of new RMM/CMM values and steps (a) to (f) is iterated as indicated by arrows
**82**. Iterations continue until the rate of change of remaining number of image regions (cell nuclei number) slows down, i.e. when successive iterations show a change of less than 10% in this number: at that point optimisation is terminated and the binary image remaining after step (f) selected for further processing is that generated using the RMM/CMM values giving the highest nuclei number. - [0248]The procedure
**18**is now concerned with determining quantities referred to as “grand_mean” and “mean_range” to be defined later. If the Downhill Simplex method (g) has determined that there are less than a user specified number of image regions or cell nuclei, sixteen in the present example, then at**84**processing is switched to**86**indicating a problem image which is to be rejected. - [0249]If the Downhill Simplex method has determined that there are at least sixteen image regions, then at
**84**processing is switched to**88**where a search to characterise these regions' boundaries is carried out. The search uses each region's area and centroid pixel location as obtained in connected component labelling at**78**(*f*), and each region is assumed to be a cell with a centroid which is the centre of the cell's nucleus. This assumption is justified for most cells, but there may be misshapen cells for which it does not hold: it is possible to discard misshapen cells by eliminating those with concave boundary regions for example, but this is not implemented in the present example. - [0250]The search to characterise the regions' boundaries is carried out along the respective north, south, east and west directions (as defined earlier) from the centroid (more directions may be used to improve accuracy): it is carried out in each of these directions for a distance δ which is either 140 pixels or 2{square root}{square root over (region area)}, whichever is the lesser. It employs the original (2B+G)/3 cyan image because experience shows that this image gives the best defined cell boundaries with the slide staining previously described. Designating C
_{ij }as the intensity of a region's centroid pixel in the cyan image at row i and column j, then pixels to be searched north, south, east and west of this centroid will have intensities in the cyan image of C_{i+1,j }to C_{i+δ,j}, C_{i−1,j }to C_{i−δ,j}, C_{i,j+1 }to C_{i,j+δ }and C_{i,j−1 }to C_{i,j−δ }respectively. The cyan intensity of each of the pixels to be searched is subtracted from the centroid pixel's cyan intensity C_{ij }to produce a difference value, which may be positive or negative. In a cyan image, a cell nucleus is normally blue whereas a boundary is brown (with staining as described earlier). - [0251]Each pixel is then treated as being part of four linear groups or “windows” of six, twelve, twenty-four and forty-eight pixels each including the pixel and extending from it in a continuous line north, south, east or west (as defined earlier) according respectively to whether the pixel is north, south, east or west of the centroid. In effect pixels in each of the chosen directions have mathematical window functions applied to them, the function having the value 1 at pixels within a group and the value 0 outside it. In the linear groups in the present example, C
_{i+1,j }is for example grouped with C_{i+2,j }to C_{i+6,j}, C_{i+2,j }to C_{i+12,j}, C_{i+2,j }to C_{i+24,j}, and C_{i+2,j }to C_{i+48,j }(inclusive in each case). This provides a total of 16δ groups from 4δ groups in each of four directions. For each group the difference between each of its pixels' cyan intensities and that of the centroid is calculated: the differences are summed over the group algebraically (positive and negative differences cancelling one another). This sum is divided by the number of pixels in the group to provide a net difference per pixel between the cyan intensities of the group's pixels and that of the centroid. - [0252]For each direction, i.e. north, south, east and west, there is now a respective set of 4δ net differences per pixel: in each set the net differences per pixel are compared and their maximum value is identified. This produces a respective maximum net difference per pixel for each of the sets, i.e. for each of the north, south, east and west-directions, and size of window (number of pixels in group) in which the respective maximum occurred. The four maxima so obtained (one for each direction) and the respective window size in each case are stored. Each maximum is a measure of the region boundary (cell membrane) magnitude in the relevant direction, because in a cyan image the maximum difference as compared to a blue cell nucleus occurs at a brown cell boundary. The window size associated with each maximum indicates the region boundary width, because a boundary width will give a higher maximum in this technique with a window size which it more nearly matches as compared to one it matches less well. Greater accuracy is obtainable by using more window sizes and windows matched to cell boundary shape, i.e. multiplying pixels in each linear group by respective values collectively forming a boundary shape function. The process is in fact mathematically a correlation operation in which a window shape is correlated with a linear group of pixels. A further option is to record the position of the maximum or boundary (cell radius) as being that of one of the two pixels at the centre of the window in which the maximum occurs: this was not done in the present example, although it would enable misshapen cells to be detected and discarded as being indicated by significant differences in the positions of maxima in the four directions, and it would improve width measure by accounting for oblique intersections of windows and cell boundaries.
- [0253]Each maximum or region boundary magnitude is then divided by the associated window size (region boundary width) used to derive it: this forms what is called for the purposes of this specification a normalised boundary magnitude—it is a measure of both brightness and sharpness: It enables discrimination against ill-defined staining not attached to a cell membrane.
- [0254]The next step
**90**is to apply what is referred to as a “quicksort” to the four normalised boundary magnitudes to sort them into descending order of magnitude. Quicksort is a known technique published by Klette R., Zamperoniu P., ‘Handbook of Image Processing - [0255]Operators’, John Wiley & Sons, 1996, and will not be described. It is not essential but convenient. For each image region, measurements made as described above are now recorded in a respective 1-dimensional vector as set out in Table 7 below: in this table the directions North, East etc are lost in the quicksort ordering into largest, second largest, third largest and smallest.
TABLE 7 Item number Parameter 1 Largest normalised boundary magnitude 2 Second Largest normalised boundary magnitude 3 Third Largest normalised boundary magnitude 4 Smallest normalised boundary magnitude 5 Sum of Largest, Second Largest, Third Largest and Smallest normalised boundary magnitudes - [0256]A further quicksort is now applied (also at
**90**) to the image regions to sort them into descending order of item 5 values in Table 7 above, i.e. sum of Largest, Second Largest, Third Largest and Smallest normalised boundary magnitudes. A subset of the image regions is now selected as being those having large values of item 5: these are the most significant image regions and they are the best one eighth of the total number of image regions in terms of item 5 magnitude. From this subset of image regions the following parameters are computed at**92**, “grand_mean”, “mean_range” and “relative_range” as defined below: - octile=one eighth of the total number of image regions or cell nuclei (16)
- boundaries=normalised boundary magnitudes (17)
- Σ=sum of . . . (over all boundaries in the subset or best octile) (18)
- item 1=Largest normalised boundary magnitude (19)
- item 3=Third Largest normalised boundary magnitude (20)
- grand_mean=6×[(ΣLargest boundaries)+(ΣSecond Largest boundaries) +(ΣThird Largest boundaries)+(ΣSmallest boundaries)]/4octile (21)
- mean_range=[(Σitem 1)−(Σitem 3)]/octile (22)
- relative_range=10×mean_range/grand_mean (23)
- [0257]Grand_mean is indicative of the degree to which an image exhibits good cell boundary sharpness and brightness. Relative_range indicates the degree to which an image exhibits brightness extending around cell boundaries—the smallest boundaries (item 4) are omitted from this computation to provide some robustness against incomplete cells. A cell boundary that exhibits a large value of relative_range will have brightness varying appreciably around the boundary corresponding to non-uniformity of staining or possibly even absence of a boundary.
- [0258]At
**94**an overall distance measure is computed: this measure provides an estimate of how far the current cyan image (generated at**71**) is from each member of a predetermined standard set of images, four images in the present example. In this example the distance measure is computed against a set of four predetermined standard images: the standard images were obtained by dividing a large test dataset of images into four different image types corresponding respectively to four different C-erb-2 status indicators (as will be described later in more detail). The images of each image type were analysed to determine grand mean and relative range for each image using the process**18**. A respective average grand mean M_{i }(i=0, 1, 2 and 3) and a respective average relative range RR_{i }were determined for the images of each of the four image types. As an alternative, it is also possible to select four good quality images of the relevant types by inspection from many images, and to determine M_{i }and RR_{i }from them. The values M_{i }and RR_{i }become the components of respective four-element vectors M and RR, and are used in the following expression: - C-erb-2 indicator=min
_{i}{(M_{i}−grand mean)^{2}+(RR_{i}−relative range)^{2}} (24) - [0259]where min
_{i }is the value of i (i=0, 1, 2 or 3) for which the expression within curved brackets { } on the right of Equation (24) is a minimum. For the vector M, from the dataset the following elements were determined: M_{O}=12.32, M_{1}=23.16, M_{2}=42.34 and M_{3}=87.35; elements determined likewise for the vector RR were RR_{0}=2.501, RR_{1}=1.85, RR_{2}=1.111 and RR_{3}=0.5394. The value of the index i is returned as the indicator for the C-erb-2 measurement process. - [0260]If a value of i=3 is obtained in the C-erb-2 measurement process, this is regarded as a strongly positive result: the patient from whom the original tissue samples were taken is regarded as highly suitable for treatment, currently with herceptin. A value of i=2 is weakly positive indicating doubtful suitability for treatment, and i=1 or 0 is a negative result indicating unsuitability. This is tabulated below in Table 8.
TABLE 8 C-erb-2 status i Value Strongly positive 3 Weakly positive 2 Negative 0, 1 - [0261]Referring now to FIG. 8, there is shown a flow diagram of the process
**14**(see FIG. 1) for measurement of vascularity. The process**14**is applied to three images each of ×20 magnification compared to the histopathological slide from which they were taken. At**100**each image is transformed from red/green/blue (RGB) to a different image space hue/saturation/value (HSV). The RGB to HSV transformation is described by K. Jack in ‘Video Demystified’, 2^{nd }ed., HighText Publications, San Diego, 1996. In practice value V (or brightness) is liable to vary due to staining and thickness variations across a slide, as well as possible vignetting by a camera lens used to produce the images. In consequence in this example the V component is ignored: it is not calculated, and emphasis is placed on the hue (or colour) and saturation values H and S. H and S are calculated for each pixel of the two RGB images as follows: - Let M=maximum of (R,G,B) (25)
- Let m=minimum of (R,G,B) (26)
- Then newr=(M−R)/(M−m) (27)
- newg=(M−G)/(M−m) and (28)
- newb=(M−B)/(M−m) (29)
- [0262]This converts each colour of a pixel into the difference between its magnitude and that of the maximum of the three colour magnitudes of that pixel, this difference being divided by the difference between the maximum and minimum of (R,G,B).
- [0263]Saturation (S) is set as follows:
- if M equals zero, then S=0 (30)
- if M does not equal zero, then S=(M−m)/M (31)
- [0264]The calculation for Hue (H) is as follows: from Equation (25) M must be equal to at least one of R, G and B:
- if M equals zero, then H=180 (32)
- If M equals R then H=60(newb−newg) (33)
- If M equals G then H=60(2+newr−newb) (34)
- If M equals B then H=60(4+newg−newr) (35)
- If H is greater than or equal 360 then H=H −360 (36)
- If H is less than 0 then H=H+360 (37)
- [0265]The Value V is not used in this example, but were it to be used it would be set to the maximum of (R,G,B).
- [0266]The next step
**102**is to apply colour segmentation to obtain a binary image. This segmentation is based on thresholding using the Hue and Saturation from the HSV colour space, and is shown in Table 9 below.TABLE 9 Binary Image Threshold Criterion Pixel Value Pixel with both Hue H in the range 282-356 degrees Set pixel to 1 (scale 0 to 360), and Saturation S in the range 0.2 to 0.24 (scale 0 to 1) Pixel with either Hue outside the range 282-356 Set pixel to 0 degrees, and/or Saturation outside the range 0.2-0.24 - [0267]This produces a segmented binary image in which pixels set to 1 are processed further and those set to 0 are discarded.
- [0268]The next stage
**104**is to apply connected component labelling (as defined previously) to the segmented binary image: this provides a binary image with regions of contiguous pixels equal to 1, the regions being uniquely labelled for further processing and their areas being determined. The labelled binary image is then spatially filtered to remove small connected components (image regions with less than 10 pixels) which have insufficient pixels to contribute to vascularity: this provides a reduced binary image. - [0269]The sum of the area of the remaining image regions in the reduced binary image is then determined at
**106**from the results of connected component labelling, and this sum is then expressed as a percentage of the area of the whole image. This procedure is carried out for each of the original RGB images separately to provide three such percentage area values: the average of the three percentage area values is computed, and it represents an estimate of the percentage of the area of a tissue sample occupied by blood vessels—i.e. the sample vascularity. - [0270]As set out in Table 10 below, vascularity is determined to be high or low depending on whether or not it is equal to at least 31%.
TABLE 10 Description of vascularity Range High 31%-100% Low 0%-30% - [0271]High vascularity corresponds to relatively fast tumour growth because tumour blood supply has been facilitated, and early treatment is indicated. Low vascularity corresponds to relatively slow tumour growth, and early treatment is less important.
- [0272]The procedures given in the foregoing description for calculating quantities and results can clearly be evaluated by an appropriate computer program recorded on a carrier medium and running on a conventional computer system. Such a program is straightforward for a skilled programmer to implement without requiring invention, because the mathematical expressions used are well known computational procedures. Such a program and system will therefore not be described.
- [0273]The process steps described in the examples of all three inventions described herein are not all essential and alternatives may be provided. It is for example possible to omit a step of ignoring unsuitably small areas in selecting areas for later processing, if the consequent increase in processing burden is acceptable. The above examples are intended to provide an enabling disclosure, not to limit the invention.

Patent Citations

Cited Patent | Filing date | Publication date | Applicant | Title |
---|---|---|---|---|

US4724543 * | Sep 10, 1985 | Feb 9, 1988 | Beckman Research Institute, City Of Hope | Method and apparatus for automatic digital image analysis |

US5134662 * | Mar 4, 1991 | Jul 28, 1992 | Cell Analysis Systems, Inc. | Dual color camera microscope and methodology for cell staining and analysis |

US5202931 * | Jun 23, 1992 | Apr 13, 1993 | Cell Analysis Systems, Inc. | Methods and apparatus for the quantitation of nuclear protein |

US5235522 * | Oct 10, 1990 | Aug 10, 1993 | Cell Analysis Systems, Inc. | Method and apparatus for automated analysis of biological specimens |

US5548661 * | Sep 7, 1994 | Aug 20, 1996 | Price; Jeffrey H. | Operator independent image cytometer |

US5577131 * | May 3, 1994 | Nov 19, 1996 | U.S. Philips Corporation | Device for segmenting textured images and image segmentation system comprising such a device |

US5598481 * | Sep 29, 1995 | Jan 28, 1997 | Arch Development Corporation | Computer-aided method for image feature analysis and diagnosis in mammography |

US5787208 * | Jun 7, 1995 | Jul 28, 1998 | Neopath, Inc. | Image enhancement method and apparatus |

US5933519 * | Jun 3, 1997 | Aug 3, 1999 | Neo Path, Inc. | Cytological slide scoring apparatus |

US6546123 * | Jul 12, 2000 | Apr 8, 2003 | Chromavision Medical Systems, Inc. | Automated detection of objects in a biological sample |

US20030231791 * | Jun 12, 2003 | Dec 18, 2003 | Torre-Bueno Jose De La | Automated system for combining bright field and fluorescent microscopy |

US20040071327 * | Oct 2, 2003 | Apr 15, 2004 | Chromavision Medical Systems, Inc., A California Corporation | Histological reconstruction and automated image analysis |

US20040120562 * | Dec 5, 2003 | Jun 24, 2004 | Presley Hays | Automated method for image analysis of residual protein |

US20050276457 * | Aug 8, 2003 | Dec 15, 2005 | Qinetiq Limited | Histological assessment |

Referenced by

Citing Patent | Filing date | Publication date | Applicant | Title |
---|---|---|---|---|

US7337398 * | Feb 28, 2003 | Feb 26, 2008 | Adobe Systems Incorporated | Reconstitute tag-delimited tables in a graphics editing application |

US7646905 * | Dec 16, 2003 | Jan 12, 2010 | Qinetiq Limited | Scoring estrogen and progesterone receptors expression based on image analysis |

US7761139 | Jan 26, 2004 | Jul 20, 2010 | The General Hospital Corporation | System and method for identifying tissue using low-coherence interferometry |

US7796270 | Jan 10, 2007 | Sep 14, 2010 | The General Hospital Corporation | Systems and methods for generating data based on one or more spectrally-encoded endoscopy techniques |

US7797119 | Dec 13, 2007 | Sep 14, 2010 | The General Hospital Corporation | Apparatus and method for rangings and noise reduction of low coherence interferometry LCI and optical coherence tomography OCT signals by parallel detection of spectral bands |

US7809225 | Sep 5, 2008 | Oct 5, 2010 | The General Hospital Corporation | Imaging system and related techniques |

US7809226 | Sep 5, 2008 | Oct 5, 2010 | The General Hospital Corporation | Imaging system and related techniques |

US7864822 | Feb 11, 2009 | Jan 4, 2011 | The General Hospital Corporation | Process and apparatus for a wavelength tuning source |

US7889348 | Oct 13, 2006 | Feb 15, 2011 | The General Hospital Corporation | Arrangements and methods for facilitating photoluminescence imaging |

US7898656 | Apr 30, 2008 | Mar 1, 2011 | The General Hospital Corporation | Apparatus and method for cross axis parallel spectroscopy |

US7903257 | Dec 12, 2007 | Mar 8, 2011 | The General Hospital Corporation | Apparatus and method for ranging and noise reduction of low coherence interferometry (LCI) and optical coherence tomography (OCT) signals by parallel detection of spectral bands |

US7920271 | Aug 24, 2007 | Apr 5, 2011 | The General Hospital Corporation | Apparatus and methods for enhancing optical coherence tomography imaging using volumetric filtering techniques |

US7925133 | Sep 5, 2008 | Apr 12, 2011 | The General Hospital Corporation | Imaging system and related techniques |

US7933021 | Oct 30, 2008 | Apr 26, 2011 | The General Hospital Corporation | System and method for cladding mode detection |

US7941275 | Mar 27, 2005 | May 10, 2011 | Ventana Medical Systems, Inc. | Method and system for automated detection of immunohistochemical (IHC) patterns |

US7949019 | Jan 17, 2008 | May 24, 2011 | The General Hospital | Wavelength tuning source based on a rotatable reflector |

US7969578 | Jun 13, 2008 | Jun 28, 2011 | The General Hospital Corporation | Method and apparatus for performing optical imaging using frequency-domain interferometry |

US7979212 | Jan 31, 2005 | Jul 12, 2011 | Ventana Medical Systems, Inc. | Method and system for morphology based mitosis identification and classification of digital images |

US7982879 | Feb 21, 2007 | Jul 19, 2011 | The General Hospital Corporation | Methods and systems for performing angle-resolved fourier-domain optical coherence tomography |

US7995210 | Nov 21, 2005 | Aug 9, 2011 | The General Hospital Corporation | Devices and arrangements for performing coherence range imaging using a common path interferometer |

US7995627 | Nov 30, 2009 | Aug 9, 2011 | The General Hospital Corporation | Process and apparatus for a wavelength tuning source |

US8018598 | Jul 23, 2004 | Sep 13, 2011 | The General Hospital Corporation | Process, system and software arrangement for a chromatic dispersion compensation using reflective layers in optical coherence tomography (OCT) imaging |

US8045177 | Apr 17, 2008 | Oct 25, 2011 | The General Hospital Corporation | Apparatus and methods for measuring vibrations using spectrally-encoded endoscopy |

US8050747 | Aug 8, 2008 | Nov 1, 2011 | The General Hospital Corporation | Method and apparatus for determination of atherosclerotic plaque type by measurement of tissue optical properties |

US8054468 | Dec 13, 2007 | Nov 8, 2011 | The General Hospital Corporation | Apparatus and method for ranging and noise reduction of low coherence interferometry LCI and optical coherence tomography OCT signals by parallel detection of spectral bands |

US8078964 | Jan 8, 2008 | Dec 13, 2011 | Adobe Systems Incorporated | Reconstitute tag-delimited tables in a graphics editing application |

US8081316 | Aug 8, 2005 | Dec 20, 2011 | The General Hospital Corporation | Process, system and software arrangement for determining at least one location in a sample using an optical coherence tomography |

US8097864 | Jan 26, 2010 | Jan 17, 2012 | The General Hospital Corporation | System, method and computer-accessible medium for providing wide-field superresolution microscopy |

US8145018 | Jan 17, 2007 | Mar 27, 2012 | The General Hospital Corporation | Apparatus for obtaining information for a structure using spectrally-encoded endoscopy techniques and methods for producing one or more optical arrangements |

US8149418 | Oct 22, 2010 | Apr 3, 2012 | The General Hospital Corporation | Method and apparatus for optical imaging via spectral encoding |

US8150496 | Aug 8, 2008 | Apr 3, 2012 | The General Hospital Corporation | Method and apparatus for determination of atherosclerotic plaque type by measurement of tissue optical properties |

US8174702 | Jul 27, 2009 | May 8, 2012 | The General Hospital Corporation | Speckle reduction in optical coherence tomography by path length encoded angular compounding |

US8175685 | May 4, 2007 | May 8, 2012 | The General Hospital Corporation | Process, arrangements and systems for providing frequency domain imaging of a sample |

US8208995 | Aug 24, 2005 | Jun 26, 2012 | The General Hospital Corporation | Method and apparatus for imaging of vessel segments |

US8289522 | Oct 16, 2012 | The General Hospital Corporation | Arrangements and methods for providing multimodality microscopic imaging of one or more biological structures | |

US8351665 * | Apr 28, 2006 | Jan 8, 2013 | The General Hospital Corporation | Systems, processes and software arrangements for evaluating information associated with an anatomical structure by an optical coherence ranging technique |

US8369669 | Apr 11, 2011 | Feb 5, 2013 | The General Hospital Corporation | Imaging system and related techniques |

US8416818 | Jun 14, 2011 | Apr 9, 2013 | The General Hospital Corporation | Process and apparatus for a wavelength tuning source |

US8428887 | Nov 9, 2011 | Apr 23, 2013 | Ventana Medical Systems, Inc. | Method for automated processing of digital images of tissue micro-arrays (TMA) |

US8515683 | Apr 4, 2011 | Aug 20, 2013 | Ventana Medical Systems, Inc. | Method and system for automated detection of immunohistochemical (IHC) patterns |

US8559012 | May 7, 2012 | Oct 15, 2013 | The General Hospital Corporation | Speckle reduction in optical coherence tomography by path length encoded angular compounding |

US8593619 | May 7, 2009 | Nov 26, 2013 | The General Hospital Corporation | System, method and computer-accessible medium for tracking vessel motion during three-dimensional coronary artery microscopy |

US8611619 * | Aug 21, 2008 | Dec 17, 2013 | Phadia Ab | Read-out method and apparatus |

US8649581 | Oct 16, 2009 | Feb 11, 2014 | Koninklijke Philips N.V. | Colour management for biological samples |

US8676013 | Feb 4, 2013 | Mar 18, 2014 | The General Hospital Corporation | Imaging system using and related techniques |

US8705046 | Dec 19, 2012 | Apr 22, 2014 | The General Hospital Corporation | Method and apparatus for performing optical imaging using frequency-domain interferometry |

US8760663 | Apr 2, 2012 | Jun 24, 2014 | The General Hospital Corporation | Method and apparatus for optical imaging via spectral encoding |

US8804126 | Mar 7, 2011 | Aug 12, 2014 | The General Hospital Corporation | Systems, methods and computer-accessible medium which provide microscopic images of at least one anatomical structure at a particular resolution |

US8838213 | Oct 19, 2007 | Sep 16, 2014 | The General Hospital Corporation | Apparatus and method for obtaining and providing imaging information associated with at least one portion of a sample, and effecting such portion(s) |

US8861910 | Jun 19, 2009 | Oct 14, 2014 | The General Hospital Corporation | Fused fiber optic coupler arrangement and method for use thereof |

US8922781 | Nov 29, 2005 | Dec 30, 2014 | The General Hospital Corporation | Arrangements, devices, endoscopes, catheters and methods for performing optical imaging by simultaneously illuminating and detecting multiple points on a sample |

US8928889 | Oct 8, 2012 | Jan 6, 2015 | The General Hospital Corporation | Arrangements and methods for providing multimodality microscopic imaging of one or more biological structures |

US8937724 | Dec 10, 2009 | Jan 20, 2015 | The General Hospital Corporation | Systems and methods for extending imaging depth range of optical coherence tomography through optical sub-sampling |

US8965487 | Aug 24, 2005 | Feb 24, 2015 | The General Hospital Corporation | Process, system and software arrangement for measuring a mechanical strain and elastic properties of a sample |

US9020221 | Oct 24, 2012 | Apr 28, 2015 | Sony Corporation | Method and apparatus for automatic cancer diagnosis scoring of tissue samples |

US9060689 | Jun 1, 2006 | Jun 23, 2015 | The General Hospital Corporation | Apparatus, method and system for performing phase-resolved optical frequency domain imaging |

US9069130 | May 3, 2010 | Jun 30, 2015 | The General Hospital Corporation | Apparatus, method and system for generating optical radiation from biological gain media |

US9087368 | Jan 19, 2007 | Jul 21, 2015 | The General Hospital Corporation | Methods and systems for optical imaging or epithelial luminal organs by beam scanning thereof |

US9173572 | Nov 25, 2013 | Nov 3, 2015 | The General Hospital Corporation | System, method and computer-accessible medium for tracking vessel motion during three-dimensional coronary artery microscopy |

US9176319 | Mar 21, 2008 | Nov 3, 2015 | The General Hospital Corporation | Methods, arrangements and apparatus for utilizing a wavelength-swept laser using angular scanning and dispersion procedures |

US9178330 | Feb 4, 2010 | Nov 3, 2015 | The General Hospital Corporation | Apparatus and method for utilization of a high-speed optical wavelength tuning source |

US9186066 | Feb 1, 2007 | Nov 17, 2015 | The General Hospital Corporation | Apparatus for applying a plurality of electro-magnetic radiations to a sample |

US9186067 | May 24, 2013 | Nov 17, 2015 | The General Hospital Corporation | Apparatus for applying a plurality of electro-magnetic radiations to a sample |

US9226660 | Nov 16, 2011 | Jan 5, 2016 | The General Hospital Corporation | Process, system and software arrangement for determining at least one location in a sample using an optical coherence tomography |

US9226665 | May 23, 2013 | Jan 5, 2016 | The General Hospital Corporation | Speckle reduction in optical coherence tomography by path length encoded angular compounding |

US20050136509 * | Sep 10, 2004 | Jun 23, 2005 | Bioimagene, Inc. | Method and system for quantitatively analyzing biological samples |

US20050266395 * | Jan 31, 2005 | Dec 1, 2005 | Bioimagene, Inc. | Method and system for morphology based mitosis identification and classification of digital images |

US20060067887 * | Dec 16, 2003 | Mar 30, 2006 | Qinetiq Limited | Scoring estrogen and progesterone receptors expression based on image analysis |

US20080097225 * | Oct 19, 2007 | Apr 24, 2008 | The General Hospital Corporation | Apparatus and method for obtaining and providing imaging information associated with at least one portion of a sample, and effecting such portion(s) |

US20090003789 * | Sep 5, 2008 | Jan 1, 2009 | The General Hospital Corporation | Imaging system and related techniques |

US20090022463 * | Sep 5, 2008 | Jan 22, 2009 | The General Hospital Corporation | Imaging system and related techniques |

US20100284583 * | Aug 21, 2008 | Nov 11, 2010 | Phadia Ab | Read-out method and apparatus |

US20110200240 * | Oct 16, 2009 | Aug 18, 2011 | Koninklijke Philips Electronics N.V. | Colour management for biological samples |

USRE44042 | Nov 24, 2008 | Mar 5, 2013 | The General Hospital Corporation | System and method for optical coherence imaging |

USRE45512 | Sep 12, 2012 | May 12, 2015 | The General Hospital Corporation | System and method for optical coherence imaging |

WO2005045734A1 * | Oct 22, 2004 | May 19, 2005 | Bioimagene Inc | Method and system for automatically determinig diagnostic saliency of digital images |

WO2010046821A1 * | Oct 16, 2009 | Apr 29, 2010 | Koninklijke Philips Electronics N.V. | Colour management for biological samples |

Classifications

U.S. Classification | 382/133, 382/165 |

International Classification | G06T7/00, G01N21/27, G01N15/00, G06T1/00, G01N33/483, G01N33/48, G06K9/00, G01J3/46 |

Cooperative Classification | Y10S128/922, G06K9/00127 |

European Classification | G06K9/00B |

Legal Events

Date | Code | Event | Description |
---|---|---|---|

Dec 23, 2002 | AS | Assignment | Owner name: QINETIQ LIMITED, UNITED KINGDOM Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:HAMER, MICHAEL J.;PETROU, MARIA;KESIDIS, TASOS;REEL/FRAME:013610/0833;SIGNING DATES FROM 20021031 TO 20021205 |

Rotate