Title of Invention | "A SYSTEM FOR COMPUTER-IMPLEMENTED HISTOLOGICAL ASSESSMENT OF NUCLEAR PLEOMORPHISM" |
---|---|
Abstract | A system for computer-implemented histological assessment of nuclear pleomorphism by identifying image regions potentially corresponding to cell nuclei in histological image data, the system incorporating a microscope and a camera for photographing a histopathological specimen to obtain digitised colour image data, and computer means for receiving the digitised colour image data, and the system being arranged to, transform the digitised colour image data into greyscale image data with improved image definition compared to an individual red, green or blue plane in colour image data, threshold the greyscale image data to render it binary, determine perimeters and areas of identified image regions in the binary greyscale image data and calculate image region shape factors from the perimeters and areas and assess pleomorphism from the shape factors' statistical parameters. |
Full Text | The present invention relates to a system for computer-implemented histological assessment of nuclear pleomorphism This invention relates to a method, an apparatus and a computer program for histological assessment of nuclear ,'piebmorphism: it is particularly (although not exclusively) relevant to assessment :;of histological Images to provide clinical information on cancers such as breast cancer. The method is also relevant.to other -A cancers, 'e.g. colon and cervical cancer as well as breast cancer. Breast cancer is a common form of female cancer, and it occurs also in the male albeit to a lesser extent: once a lesion indicative of breast cancer has been detected, tissue samples are taken and examined by a histopaihologist to establish a diagnosis, prognosis and treatment plan. However, pathological analysis of tissue samples is a time consuming and subjective process. It entails interpretation of images by human • eye: it can be characterised by inconsistencies in observations of the same samples by different observers and even by the same observer at different times. For example, two different observers assessing images of the same tissue samples may give different opinions for a number of the images. Different opinions- may arise in as many as 30% of the images. The problem is exacerbated by heterogeneity, i.e. complexity of some tissue sample features. There is a need to provide an objective measurement of the degree of nuclear pleomorphism to support the pathologist's diagnosis and the patient's treatment. The present invention provides a method of histological assessment of nuclear pleomorphism by identifying image regions potentially corresponding to cell nuclei in histological image data, the method including thresholding the image data to render it •binary, determining perimeters and areas of identified image regions, calculating image region shape factors from the perimeters and areas and assessing pleomorphism from the shape factors' statistical parameters. The invention provides the advantage that it is an objective measurement of aleomorphism to-inform a pathologist's diagnosis and patient treatment. the shape factors' statistical parameters. The invention provides the advantage that it is an objective measurement of pleomorphism to inform a pathologist's diagnosis and patient treatment. The shape factors' statistical parameters may comprise at least one of their mean, weighted mean, median, mode, maximum and minimum. The step of thresholding the imaged data may be Otsu thresholding. The step of assessing pleomorphism may determine pleomorphism as being relatively low, moderate or high according to whether the mean or median of the shape factors is relatively low, moderate or high respectively. A shape factor S for an image region kP2 potentially corresponding to a cell nucleus may be given by S = , where k is a A constant, P is image region perimeter and A is image region area, and a mean shape factor Sm for a set of image regions potentially corresponding to cell nuclei may be thresholded as Sm 35/c (high) corresponding to pleomorphism being relatively low, moderate or high respectively. The step of thresholding the image data to render it binary may be preceded by transforming colour image data into greyscale image data with improved image definition compared to an individual red green or blue plane in colour image data, and the step of thresholding the image data is then carried out upon the greyscale image data. The step of transforming colour image data into greyscale image data may be carried out by Principal Component Analysis (PCA) in which the greyscale image data is a first principal component. The step of identifying image regions potentially corresponding to cell nuclei in histological image data includes filtering the image data to overwrite regions which are not of interest using a filtering process which does not appreciably affect image region perimeter. The overwriting step may include setting relatively small image regions to a background pixel value and setting hole pixels in relatively larger image regions to a non-hole image region pixel value. Image regions potentially corresponding to cell nuclei in histological image data may be identified by a procedure including: a) dividing the image data into overlapping sub-images, b) applying PCA to each sub-image to provide a respective greyscale sub-image, and c) removing from the greyscale sub-images: i) image regions touching or intersecting sub-image boundaries, ii) unsuitably small image regions, and iii) holes in relatively large image regions, and d) reassembling the sub-images into a binary image. In another aspect, the present invention provides an apparatus for histological assessment of nuclear pleomorphism by identifying image regions potentially corresponding to cell nuclei in histological image data, the apparatus incorporating a computer programmed to threshold the image data to render it binary, determine perimeters and areas of identified image regions, calculate image region shape factors from the perimeters and areas and assess pleomorphism from the shape factors' statistical parameters. In a further aspect, the present invention provides computer software for use in histological assessment of nuclear pleomorphism and having instructions for controlling a computer to identify image regions potentially corresponding to cell nuclei in histological image data, the software also having instructions for controlling a computer to threshold the image data to render it binary, determine perimeters and areas of identified image regions, calculating image region shape factors from the perimeters and areas and assess pleomorphism from the shape factors' statistical parameters. The computer apparatus and computer software aspects of the invention may have preferred features equivalent to corresponding method aspects of the invention. In order that the invention might be more fully understood, embodiments thereof will now be described, by way of example only, with reference to the accompanying drawings, in which:- Figure 1 is a block diagram of a procedure for measuring pleomorphism to assist in formulating diagnosis and treatment; and Figure 2 is a block diagram showing in more detail pleomorphism feature detection in accordance with the invention for use in the procedure of Fiaure 1. A procedure 10 for the assessment of tissue samples in the form of histopathological slides of carcinomas of the breast is shown in Figure 1. This drawing illustrates processes which measure degree of pleomorphism for use in assessment of patient condition. The procedure 10 employs a database 12, which maintains digitised image data obtained from histological slides as will be described later. Sections are taken (cut) from breast tissue samples (biopsies) and placed on respective slides. Slides are stained using the staining agent haemotoxylin & eosin (H&E). H&E is a very common stain for delineating tissue and cellular structure. Tissue stained with H&E is used to assess pleomorphism. Nuclear pleomorphism is a measurement of degree of variability in shape of the nucleus of tumour cells within tissue. In normal tissue, cell nuclei have a regular structure in terms of shape and size, whereas nuclei of cancer cells can become larger and irregularly shaped, with a marked variation in shape and size. In a prior art manual procedure, a clinician places a slide under a microscope with magnification of 40X and examines a region of it (often referred to as a tile) for indications of the degree of pleomorphism. This manual procedure requires a pathologist subjectively to assess unusual size and shape of cell nuclei in a tissue sample. The values obtained in this way are combined to give a single measurement for use in diagnosis. The process of the invention replaces the prior art subjective procedure with an objective procedure. In the present example, image data were obtained from histological slides by a pathologist using Zeiss Axioskop microscope with a Jenoptiks Progres 3012 digital camera. Image data from each slide is a set of digital images obtained at a linear magnification of 40 (i.e. 40X linear, 1600X area), each image being an electronic squivalent of a tile. Images obtained by other microscopes and cameras can also be jsed. To obtain images, a pathologist scans the microscope over a slide, and at 40X nagnification selects regions of the slide which appear to be most promising in terms )f pleomorphism assessment. Each of these regions is then photographed using the microscope and camera mentioned above, and this produces for each region a respective digitised image in three colours, red, green and blue (R, G & B). Three intensity values are obtained for each pixel in a pixel array to provide an image as a combination of R, G and B image planes. This image is stored temporarily at 12 for later use. Two tiles are required for pleomorphism measurement by a process 16: the results of the process 16 are converted into a measurement at 20 for input to a diagnostic report at 22. Referring now to Figure 2, the pleomorphism feature detection process 16 is shown in more detail. It is carried out for each of the two tiles or digitised images (raw (RGB) input images) mentioned above, and will be described for one such image. At a first stage 30 the raw (RGB) input image is separated into overlapping windows of size 128x128 pixels. In each window, 64 pixels overlap with 64 pixels of respective neighbouring windows in both horizontal and vertical directions. For example, an image of 256x256 would give a 3x3 set of 128x128 overlapping windows: thus, each window overlaps half of each of its row and column neighbours. To each window a technique referred to as "Principal Component Analysis" (PCA, Karhunen-Loeve Transform) is applied. PCA is a standard mathematical technique described by Jollie IT., 'Principal Component Analysis', Springer series in statistics, Springer Verlag, 1986. It is also described by Jackson J.E., 'A User Guide to Principal Components', John Wiley & Sons, 1991, pp 1-25. PCA is a transformation into a domain or representation in which data is more easily exploited or understood (e.g. separable or classified into different classes or categories). It is a standard mathematical technique for transforming a set of (possibly) correlated variables into a smaller number of uncorrelated variables called principal components. The principal components are associated with respective eigenvalues: a first principal component has the largest eigenvalue and accounts for as much of the variables' variability as possible compared to other principal components. In this first principal component, image definition, contrast and characteristics are improved compared to R, G or B image planes in colour image data. Only the first principal component is used in this example, but alternatively a linear combination of all the principal components can be used and might be appropriate if their eigenvalues were not very different. There are other transformations (analysis techniques or filters) which could also be used, and which have particular characteristics (e.g. linear or non-linear, exploiting data characteristics, means, variances etc. in different ways). Neural networks/adaptive networks and related techniques are also used to implement such transformations. PCA involves calculating a covariance matrix and solving for its eigenvalues and eigenvectors. The image is now treated as being arranged as an Nx3 matrix, i.e. having N pixels and three planes (R, G and B). The covariance matrix is calculated using a formula for its matrix elements Cy as follows: N-l Cu=~ rr-i (D where Cv is the covariance of variable / with variable j, xk and yk are the ith and jth feature values of the kth object, //„ is the mean of all N values of xk, ^ is the mean of all N values of yk. The covariance matrix is 3x3 and PCA yields three eigenvectors: the eigenvectors are treated as a 3x3 matrix, which is used to multiply the transpose of the Nx3 image matrix to produce a product matrix. The product matrix has an Nx1 first column which is the first principal component, which may be considered as the most important component: it is the component with the maximum eigenvalue, and it provides a greyscale sub-image (one pixel value for each of N pixels) with a maximum range of information compared to equivalents associated with other components. PCA therefore produces monochromatic or greyscale image data. It is carried out for each of the overlapping windows defined above, and each provides a respective first principal component and greyscale sub-image of size 128x128 pixels. At 32, a thresholding technique referred to as "Otsu" is applied to each sub-image resulting from 30 to transform it into a respective binary sub-image. Otsu is a standard thresholding technique published by Otsu N., 'A thresholding selection method from grey level histograms', IEEE Trans Systems, Man & Cybernetics, vol. 9, 1979, pp 62-66. The Otsu threshold selection method aims to minimise for two classes a ratio of within-class variance to between-class variance: i.e. the higher the variance between classes the better the separation. Otsu is a particularly preferred thresholding technique. In the present example the two classes are a below-threshold class (pixel value 0) and an above-threshold class (pixel value 1), so Otsu thresholding converts each greyscale sub-image to a binary sub-image containing a set of blobs: here blobs are image regions (objects in the image) each of which is a respective group of mutually contiguous pixels all having value 1 and possibly corresponding to a cell nucleus. The blobs may have holes ( pixel value 0) in them. At 34 all blobs that touch or are intersected by sub-image boundaries are removed. Thus, if at any pixel a blob meets a border it is removed by setting its pixels to a background pixel value. This is because boundaries meeting blobs produce artificially straight blob edges which can give misleading results later. Because of sub-image overlap, a blob which appears partly in one image may appear wholly in another sub-image and so is not necessarily lost. At 36 the sub-images from 34 are inverted (pixel value 0 changes to 1 and vice versa) and connected component labelling (CCL) is applied to remove holes in blobs. CCL is a known image processing technique (sometimes referred to as 'blob colouring') published by Klette R., Zamperoniu P., 'Handbook of Image Processing Operators', John Wiley & Sons, 1996, and Rosenfeld A., Kak A.C., 'Digital Picture Processing', i/ols. 1 & 2, Academic Press, New York, 1982. CCL gives a respective label to each group of contiguous pixels of pixel value 1. Because of the inversion, areas of pixel value 1 which are labelled in CCL are now holes within blobs together with background pixels. Holes within each blob are now removed (filled) by setting their pixels to the value of other pixels of the blob. Background pixels are left unchanged. At 38, the sub-images from 36 are inverted once more and CCL is applied again: due to this second inversion, areas labelled by CCL are now filled blobs within each sub-image. CCL has a facility for removal of blobs with areas smaller than a user-defined minimum area threshold. In this example, using 40X magnification, the minimum area threshold in this example is 400 pixels: all blobs with areas less than 400 pixels are therefore rejected and merged into the background by setting their pixels to a background value (0). Remaining blobs with areas of at least 400 pixels are accepted for further processing as set of labelled blobs. CCL also gives for each remaining blob its perimeter P and area A in number of pixels in each case. After stages 30 to 38, each sub-image is clear of unwanted small blobs: all remaining blobs have been filled to remove holes within them, so they consist of pixels which are all the same value. The advantage of stages 30 to 38 is that they provide spatial filtering but do not appreciably affect perimeter shapes of blobs, which is important for subsequent processing. Such filtering is not essential but it is helpful to reduce processing burden At 40, the sub-images output from step 38 are reassembled into a new binary image: the new binary image has the same size as the original raw (RGB) input image, and contains only blobs that will be assessed in subsequent pleomorphism processing. Steps 30 to 40 are pre-processing steps which identify a set of blobs within the original raw (RGB) input image: each of these blobs should now correspond to a cell nucleus. P2 \t 42 a statistical analysis is applied. A shape factor S =— is calculated for each A )lob from blob perimeter P and area A obtained in CCL at 38. S could also be a P2 multiple or fraction of —- were this to be convenient, and other functions of P and A A could be used to indicate shape. The value of S increases with increasing irregularity of blob shape, it is 4n (-12.57) for a perfect circle. A mean value Sm of S is calculated for the complete image by adding the S values of all blobs in the image together and dividing by their number. Sm is thresholded to derive a measure of the pleomorphism of the original raw (RGB) input image: Sm thresholds were derived from an analysis of a test set of 200 pleomorphism images. There are three threshold categories, Sm 35 (high) with pleomorphism scores 1, 2 and 3 P2 respectively as tabulated below. These thresholds are for S = —: other shape factor A expressions may need different thresholds. (Table Remove) ForS A , where k is a constant, the three threshold categories become Sm (low), 30k Sm may be calculated by other procedures. It might simply be the median of an odd number of values of S, or the average of two central values of an even number of values of S . It is also possible to use the maximum or minimum of a series of values of S, or with a largely single mode distribution of values of S, the mode value. However, this will affect thresholds. A weighted mean S^ could be calculated by multiplying each value of S by a weight factor w/, adding the weighted values and dividing their sum by the sum of the weights; each weight would indicate the probability of the associated blob or cell nucleus giving a reliable indication of pleomorphism: i.e. o —.__!_ (2) The above discussion shows that there are a number of ways an overall shape factor can be determined from the statistical characteristics of a set of shape factors, eg. their mean, weighted mean, median, mode, maximum and minimum may all be used individually. At 20 the score for the nuclear pleomorphism measurement has a value 1, 2 or 3 and a respective score is obtained for each of the input tiles. The maximum of these two scores is taken as the overall pleomorphism score. Measurement of Nuclear Pleomorphism Meaning Points Uniform Minimal increase in size and variation in size compared to normal cell nuclei, i.e. nuclei are relatively small and uniform in size. Moderate Variation Moderate increase and variation in size and shape with vesicular nuclei. Marked Variation Marked variation in size and shape with vesicular nuclei. The measurement of nuclear pleomorphism may be combined with others obtained for mitosis and tubules by different methods to derive an overall grading referred to in medicine as a "Bloom and Richardson grading" or modified versions of this grading: it is used by clinicians as a measure of cancer status. The examples given in the foregoing description for calculating results can clearly be evaluated by an appropriate computer program on a carrier medium and running on a conventional computer system. Such a program is straightforward for a skilled programmer to implement without requiring invention because the procedures are well known, and it will therefore not be described further. We claim: 1. A system for computer-implemented histological assessment of nuclear pleomorphism by identifying image regions potentially corresponding to cell nuclei in histological image data, the system incorporating a microscope and a camera for photographing a histopathological specimen to obtain digitised colour image data, and computer means as herein described, for receiving the digitised colour image data, and the system being arranged to: a) transform the digitised colour image data into greyscale image data with improved image definition compared to an individual red, green or blue plane in colour image data, b) threshold the greyscale image data to render it binary, c) determine perimeters and areas of identified image regions in the binary greyscale image data, and d) calculate image region shape factors from the perimeters and areas and assess pleomorphism from the shape factors' statistical parameters. 2. A system as claimed in Claim 1 wherein the shape factors' statistical parameters comprise at least one of their mean, weighted mean, median, mode, maximum and minimum. 3. A system as claimed in Claim 1 arranged to threshold the greyscale image data using Otsu thresholding. 4. A system as claimed in Claim 1 arranged to assess pleomorphism as being relatively low, moderate or high according to whether the mean or median of the shape factors is relatively low, moderate or high respectively. 5. A system as claimed in Claim 4 arranged to a) determine a shape factor S for an image region potentially corresponding to a cell nucleus is given by where (Formula Removed) is a constant, P is image region perimeter and A is image region area, and b) determine a mean shape factor Sm for a set of image regions potentially corresponding to cell nuclei, to threshold the mean shape factor as Sm (moderate) and Sm > 35 k (high) and to indicate pleomorphism being relatively low, moderate or high respectively. 6. A system as claimed in Claim 1 arranged to transform the histological colour image data into grey scale image data using Principal Component Analysis (PCA) in which the greyscale image data is a first principal component. 7. A system as claimed in Claim 6 arranged to: a) divide the image data into overlapping sub-images, b) apply PCA to each sub-image to provide a respective greyscale sub-image, c) remove from the greyscale sub-images: i) image regions touching or intersecting sub-image boundaries, ii) unsuitably small image regions, and iii) holes in relatively large image regions, and d) reassemble the sub-images into a binary image. 8. A system as claimed in Claim 1 arranged to set relatively small image regions to a background pixel value and to set hole pixels in relatively larger image regions to a non-hole image region pixel value. |
---|
2092-DELNP-2005-Abstract-13-05-2008.pdf
2092-delnp-2005-claims-(30-05-2008).pdf
2092-DELNP-2005-Claims-13-05-2008.pdf
2092-DELNP-2005-Correspondence Others-(22-03-2011).pdf
2092-delnp-2005-correspondence-others-(30-05-2008).pdf
2092-DELNP-2005-Correspondence-Others-13-05-2008.pdf
2092-delnp-2005-correspondence-others.pdf
2092-delnp-2005-description (complete)-13-05-2008.pdf
2092-delnp-2005-description (complete)-30-05-2008.pdf
2092-delnp-2005-description (complete).pdf
2092-DELNP-2005-Drawings-13-05-2008.pdf
2092-DELNP-2005-Form-1-13-05-2008.pdf
2092-DELNP-2005-Form-2-13-05-2008.pdf
2092-DELNP-2005-Form-27-(22-03-2011).pdf
2092-delnp-2005-pct-notificatian.pdf
2092-delnp-2005-pct-search report.pdf
2092-delnp-2005-Petition Others-(29-03-2012).pdf
Patent Number | 221958 | |||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
Indian Patent Application Number | 2092/DELNP/2005 | |||||||||||||||
PG Journal Number | 32/2008 | |||||||||||||||
Publication Date | 08-Aug-2008 | |||||||||||||||
Grant Date | 12-Jul-2008 | |||||||||||||||
Date of Filing | 17-May-2005 | |||||||||||||||
Name of Patentee | QINETIQ LIMITED | |||||||||||||||
Applicant Address | 85, BUCKINGHAM GATE, LONDON SW1E 6PD, GREAT BRITAIN. | |||||||||||||||
Inventors:
|
||||||||||||||||
PCT International Classification Number | G06F 19/00 | |||||||||||||||
PCT International Application Number | PCT/GB2003/004910 | |||||||||||||||
PCT International Filing date | 2003-11-13 | |||||||||||||||
PCT Conventions:
|