In image segmentation, we want to segment or separate a region of interest (ROI) from the entire image. We usually do this to extract useful information or identify objects from the image. The segmentation is done based on the features unique to the ROI.
In this activity, we want to segment objects from the background based on their color information. But real 3D objects in images, although monochromatic, may have shading variations. Hence, it is better to use the normalized chromaticity coordinates (NCC) instead of the RGB color space to enable the separation of brightness and pure color information.
To do this, we consider each pixel and the image and let the total intensity, I, for that pixel be I = R + G + B. Then for that pixel, the normalized chromaticity coordinates are computed as:
r = R/I; g = G/I; b = B/I
The sum of all three is equal to unity, so it is enough to express chromaticity using only two coordinates r and g since b is dependent on both (i.e. b = 1- r - g). The r-g color space is shown in Figure 1.
I. Parametric probability distribution estimation
Figure 2. The yellow car is to be segmented from the entire image.
Image was taken from: http://x-wp.blogspot.com/2011/08/yellow-lamborghini-wallpaper-hd.html
Figure 3. ROI Segmented from the image using parametric estimation and the corresponding reference patches used where (a) was taken from the hood of the car and (b) was taken from the door.
For non-parametric probability distribu, we use the image's 2D histogram to determine the membership of a pixel to the region of interest. To do this, we use histogram back-projection in which we give each pixel location a value equal to it's histogram value in chromaticity space.
The 2D histogram (32 bins) of the two reference patches in Figure 3 are shown in Figure 4. We know that this is correct since the location of the peaks are in approximately the same region as the observed color for the ROI (i.e. yellow). The histogram of the second reference patch covers a larger area since it has more shade variations.
Figure 4. 2D histogram of the two reference patches for the ROI (a) for the patch taken from the hood and (b) for the patch taken from the door
Again, we try to segment the car in Figure 2 using the same reference patches. The results are shown in Figure 5. Again, the second patch gave better results than the first patch.
Figure 5. ROI Segmented from the image using non- parametric estimation and the corresponding reference patches used where (a) was taken from the hood of the car and (b) was taken from the door.
Segmented ROIs using the first method produced images with smoother variations in shade since we used an analytic function to estimate the PDF, giving us continuous values. Segmentation using non-parametric was able to segment more of the car (observe the roof part) but it also included other parts of the image which had yellowish hue such as the portions of land visible through the vegetation. This means that parametric estimation is more selective in pixel-membership tagging compared to non-parametric estimation.
I would like to thank Ms. Maria Isabel Saludares for helpful discussions.
Finally, I give myself a grade of 10/10 for successfully segmenting a region of interest (ROI) from the background. :)
Reference:
1. M. Soriano, "A11-Color Image Segmentation, " AP 186 Manual, 2012
Comments
Post a Comment