(untagged)

Computer Vision Applications with C# - Fuzzy C-means Clustering

Christophe Gauge

0.00/5 (No votes)

9 Mar 2011

This code performs a fuzzy C-means clustering and segmentation of color images, and can be used for feature extraction.

Download source code - 56.6 KB

Introduction

Image segmentation, the partitioning of an image into homogeneous regions based on a set of characteristics, is a key element in image analysis and computer vision. Clustering is one of the methods available for this purpose. Clustering is a process which can be used for classifying pixels based on similarity according to the pixel's color or gray-level intensity.

The K-means algorithm has been used for a fast and crisp "hard" segmentation. The Fuzzy Set theory has improved this process by allowing the concept of partial membership, in which an image pixel can belong to multiple clusters. This "soft" clustering allows for a more precise computation of the cluster membership, and has been used successfully for image clustering and for the unsupervised segmentation of medical, geological, and satellite images.

Algorithm

The fuzzy C-means (FCM) algorithm follows the same principles as the K-means algorithm in that it compares the RGB value of every pixel with the value of the cluster center. The main difference is that instead of making a hard decision about which cluster the pixel should belong to, it assigns a value between 0 and 1 describing "how much this pixel belongs to that cluster" for each cluster. Fuzzy rule states that the sum of the membership value of a pixel to all clusters must be 1. The higher the membership value, the more likely that pixel is to belong to that cluster. The FCM clustering is obtained by minimizing an objective function shown in equation (1):

(1)

Where:

J is the objective function
n is the number of pixels in the image E
c is the number of clusters
µ is the fuzzy membership value from table
m is a fuzziness factor (a value > 1)
pi is the ith pixel in E
vk is the centroid of the kth cluster
|pi – vk| is the Euclidean distance between pi and vk defined by equation (2):

(2)

The calculation of the centroid of the kth cluster is achieved using equation (3):

(3)

The fuzzy membership table is calculated using the original equation (4):

(4)

This algorithm has been extended for clustering of color images in the RGB color space. Hence, the computation given in equation (2) to compute the Euclidean distance between the values pi and vk is modified to incorporate RGB colors, and is shown in equation (5):

(5)

Pseudo-Code

As mentioned earlier, this is an iterative process. The pseudo-code is as follows:

Step 1: Set the number of clusters, the fuzzy parameter (a constant > 1), and the stopping condition
Step 2: Initialize the fuzzy partition matrix
Step 3: Set the loop counter k = 0
Step 4: Calculate the cluster centroids, calculate the objective value J
Step 5: For each pixel, for each cluster, compute the membership values in the matrix
Step 6: If the value of J between consecutive iterations is less than the stopping condition, then stop; otherwise, set k=k+1 and go to step 4
Step 7: Defuzzification and segmentation

Using the Code

The GUI is pretty straightforward, but the calculations can be very intensive. For that reason, the processing is done in a worker thread, which complicates the interaction with the GUI.

First, use the "File" menu to load an image. Beware, large images will tale a long time!

The options available to be changed from the GUI are the number of clusters, the maximum number of iterations (so that the program doesn't just keep running forever), and the precision. The algorithm will stop before the maximum number of iterations if the precision is achieved.

Clicking the "Fuzzy C-means Clustering" button will start the computation. The program will start by creating a ClusterPoint object for every pixel in the image:

List<ClusterPoint> points = new List<ClusterPoint>();
for (int row = 0; row < originalImage.Width; ++row)
{
    for (int col = 0; col < originalImage.Height; ++col)
    {
        Color c2 = originalImage.GetPixel(row, col);
        points.Add(new ClusterPoint(row, col, c2));
    }
}

Then, create the requested number of ClusterCentroid cluster objects:

List<ClusterCentroid> centroids = new List<ClusterCentroid>();

//Create random points to use a the cluster centroids
Random random = new Random();
for (int i = 0; i < numClusters; i++)
{
    int randomNumber1 = random.Next(sourceImage.Width);
    int randomNumber2 = random.Next(sourceImage.Height);
    centroids.Add(new ClusterCentroid(randomNumber1, randomNumber2, 
                  filteredImage.GetPixel(randomNumber1, randomNumber2))); 
}

The cluster centers (or centroids) are selected randomly for the first pass, and will be adjusted by the algorithm.

Finally, create the FCM object and start the iterations:

FCM alg = new FCM(points, centroids, 2, filteredImage, (int)numericUpDown2.Value);
k++;
alg.J = alg.CalculateObjectiveFunction();
alg.CalculateClusterCentroids();
alg.Step();
double Jnew = alg.CalculateObjectiveFunction();
Console.WriteLine("Run method i={0} accuracy = {1} delta={2}", 
                  k, alg.J, Math.Abs(alg.J - Jnew));
backgroundWorker.ReportProgress((100 * k) / maxIterations, "Iteration " + k);
if (Math.Abs(alg.J - Jnew) < accuracy) break;

Progress is displayed on the status bar:

Once the iterations are done, the algorithm will perform a defuzzification process to assign the pixel to the cluster for which it has the highest membership value. For each cluster, the program will then save a segmented image containing the pixels from the original image that are contained in that cluster.

As an example, the clustering of the following test image with 2 clusters:

will result in the following clustered image:

By using the cluster information and the pixels from the original image, the following regions can be extracted:

Because this algorithm is very computationally intensive, it has a tendency to "lock" the GUI and not refresh the status and the working image. For that reason, I had to use a worker thread.

Points of Interest

This is my first post, and my first C# program, so I am sure that it leaves a lot of room for improvement. But overall, I hope that you will find this project fun and interesting!

References

Java Image Processing Cookbook, the Fuzzy C-means algorithm by Rafael Santos.
Modified Fuzzy C-means Clustering Algorithm with Spatial Distance to Cluster Center of Gravity, 2010 IEEE International Symposium on Multimedia, Taichung, Taiwan, December 13-December 15 2010.

License

This article has no explicit license attached to it but may contain usage terms in the article text or the download files themselves. If in doubt please contact the author via the discussion board below.

A list of licenses authors might use can be found here