1 Introduction
Machine vision is a promising solution to automatic visual inspection and performs competitively against human vision. Machine vision generally consists of image acquisition, image processing, and decision making. Image processing is important in the detection process. It extracts features that are useful for decision making. Image-based defect detection methods are used in steel bars [
1], printed circuit boards [
2], fabric [
3], solar wafers [
4,
5], thin-film transistor-liquid crystal displays [
6], bottle caps [
7], and many other applications.
Buttons, as important clothing accessories, contain various defects, such as cracks, stains, printing deviation, fading, and uneven surfaces. Button defects are irregular and difficult to identify. The fast production of buttons and the various patterns on their surfaces make the automatic detection of defects even more difficult. Therefore, a fast and generally applicable defect detection method is urgently needed.
Several researchers have converted the defect detection problem to a template matching problem by calculating the similarity between an inspection image and a corresponding defect-free template image [
8,
9]; this approach is called the referential method. Normalized cross correlation is widely used to measure the similarity. In the frequency domain, many techniques, such as Fourier transform [
10], wavelet transform [
11,
12], and Gabor filter [
13,
14], are also widely used in defect detection.
Although calculating the similarity between an inspection image and a corresponding defect-free template image is a practical and widely used method, the process requires high alignment accuracy. To reduce the false alarms caused by misalignment, Wang et al. [
15] applied the gray-level threshold method to extract significant points in an inspection window and calculated the correlation coefficient of only the selected significant pixel points. Bai et al. [
8] extracted salient regions through phase-only Fourier transform and detected defects in these regions. Meanwhile, Zontak and Cohen [
16] proposed a defect detection method based on anisotropic kernel reconstruction. In their method, the source image is estimated with the weighted sum of neighboring features from the reference image. Their method does not require image alignment and is robust to pattern variations. However, its computation cost is relatively high. Tsai et al. [
9] proposed a dissimilarity measure based on the optical-flow technique. The measure is robust to misalignment and random product variation. It is applicable to gray-level images and suitable for detecting defects that are sparse signals with a uniform background.
Static templates and gray-level images are widely used in all these methods. The methods are sensitive to illumination vibration because the templates are static. A referential method using the weighted least-squares model was developed in the current study. The method does not require precise alignment and is robust to illumination variations. Instead of using gray-level images, all three channels of red–green–blue (RGB) images are used to improve the accuracy of the method.
The remainder of this paper is organized as follows. The proposed model and its implementation are presented in Section 2. The experiment results and a comparison of the proposed method with two other methods are provided in Section 3. The tuning of key parameter values is also included in this section. Section 4 presents the conclusions.
2 Weighted least-squares model and implementation details
The referential method relies heavily on alignment accuracy. Even if two compared images are perfectly aligned, local deviations may still exist in the aligned image due to random product variations in manufacturing. As a result, the residual image exhibits high intensity near the edge regions. To address this problem, the proposed method uses the weighted least-squares model to reconstruct a template image for each test sample adaptively (some details are shown in Section 2.2) and obtain a corresponding residual image. To segment defects effectively, the Gaussian mixture model (GMM) is used to model the background distribution according to the defect-free residual images. The entire procedure is summarized as follows (Fig. 1).
Fig.1 Block diagram of the proposed method for defect detection |
Full size|PPT slide
1) Preprocess original images to extract the region of interest (ROI) and align the images to achieve a similar orientation;
2) Combine defect-free images to obtain matrix B through the weighted least-squares model;
3) Input test samples and decompose them into defect-free template and residual images adaptively;
4) Learn the GMM parameters with the defect-free residual images;
5) Input a test sample’s residual image and detect the defect region with the GMM parameters.
2.1 Preprocessing
In this step, the objects (buttons) were extracted from original RGB images {I} (). The buttons were then rotated at certain degrees to ensure that all buttons had the same orientation. Most types of buttons have a common characteristic: circular holes distributed around the center symmetrically. Therefore, we calculated the rotation degrees after locating the holes and button center by using auto-threshold algorithms. Subsequently, we rotated the buttons around their centers to achieve the same orientation.
2.2 Weighted least-squares model
Given the data set, we assumed that response variable can be represented by a linear regression of n-vector variablewith residual term. The common least-squares model is in the form
This equation can be presented in a vectorized form as
where , and . The error function is defined as
To minimize , its derivatives must be determined with respect to . Thereafter, these derivatives need to be set to zero.
To minimize the ith error term in , a large weight should be provided to . The weighted least-squares model is described as follows:
where . The weighted estimated parameters are
where. For each RGB image , M and N represent the height and width of an image, respectively, and 3 represents the three channels of an RGB image. is the intensity of the pixel at (i, j) of an image in the c channel. Furthermore, I can be vectorized as a 1D column vector.
If defect-free samples are selected and vectorized into column vectors, the training data matrix can be described as, where n is the number of training samples and represents the kth sample in the training set. In the training step, the acquisition of matrix B can be implemented offline. In the testing step, inspection image D is vectorized as column vector d. As a result, the best parameter,, and the appropriate item can be described as . Adaptive template imagecan be reconstructed from vector a, and residual image is, as shown in Fig. 2.
Fig.2 Image decomposition process. (a) Inspection image; (b) adaptive template image; and (c) residual image |
Full size|PPT slide
In the experiment, the residual images sometimes showed high intensity near the edge, and this high intensity may cause a false alarm when the common least-squares method is used (Fig. 3). In the weighted least-squares model, a large weight is provided to pixels near the edge regions to solve the problem. Figure 3(c) shows that the residual error was suppressed near the edge regions.
Fig.3 Detection results of different methods. (a) is the input image. (b) and (d) are the results of the common least-squares model. (b) is the enhanced residual image, and (d) is the defect segmentation result. (c) and (e) are the results of the weighted least-squares model. (c) is the enhanced residual image, and (e) is the defect segmentation result |
Full size|PPT slide
In all the experiments, the weight matrix was in the form
where represents the mean gradient intensity of the training samples. is a diagonal matrix with the mean gradient intensity lying on the main diagonal.
2.3 Defect segmentation
GMM has been widely used in automated visual surveillance. GMM segments moving regions from image sequences. Given that our defect detection problem is rather similar to moving object detection, GMM was used to segment defects. The basic idea was to model each background pixel by a mixture of K Gaussian distributions, where K is a small number from 3 to 5. The probability that a certain pixel with a value of x exists at time t can be written as
where K is the number of Gaussian components and is the weight of theith Gaussian component. It satisfies.is the normal distribution of the component represented by
where
and
are the mean value and covariance of the
ith component, respectively. In particular,
is defined as
. To model background distributions, parameters are updated by adding new samples and discarding old ones. The parameters of the first Gaussian component are updated when they match the value of the new sample. The implementation details can be found in Refs. [
17–
19]. In the test stage, each new pixel value is checked against existing model components according to the order of the fitness value. The match criterion is
where C is the control constant and dist(x, y) represents the Euclidean distance between vectors x and y. If no match is found, this pixel will be identified as a foreground pixel (defect).
3 Experiment results and discussion
The performance of the proposed method was evaluated in terms of true positive recognition (TPR) rate, true negative recognition (TNR) rate, and recognition (R) rate, which are defined as
where is the number of correctly detected defect-free samples, is the number of falsely detected defect-free samples, is the number of correctly detected defective samples, and is the number of falsely detected defective samples. Positive and negative means defect-free and defective, respectively.
We evaluated the performance of the proposed method on three typical types of buttons by comparing it with two other methods, namely, saliency-based method in Ref. [
8] and independent component analysis (ICA)-based method in Ref. [
5]. For simplicity, we denoted these three types of buttions as sample1, sample2, and sample3. The details of the test samples are presented in Table 1. The proposed method was implemented in Matlab 2014a on an Intel 2.1 GHz PC with 4 GB of memory. The speed of this method was about 15 fps, which meets the requirement of button quality assurance.
Tab.1 Details of the test samples used in the experiments |
sample | number of defect-free buttons | number of defective buttons | ROI size/pixel |
---|
sample1 | 51 | 104 | 233×233×3 |
sample2 | 60 | 39 | 199×199×3 |
sample3 | 151 | 189 | 167×167×3 |
The detailed detection rates are shown in Table 2. The saliency-based method showed a relatively high true positive rate but a low true negative rate because several defective samples were missed. The ICA-based method showed a good true negative rate but a low true positive rate because several normal regions were incorrectly identified as defects. In this experiment, the performance of our method was better than that of the two other methods in terms of the three detection rates.
Tab.2 Experimental performance of different methods |
sample | saliency-based method | | ICA-based method | | proposed method |
---|
TPR | TNR | R | TPR | TNR | R | TPR | TNR | R |
---|
sample1 | 0.92 | 0.72 | 0.78 | | 0.78 | 0.91 | 0.87 | | 0.94 | 0.94 | 0.94 |
sample2 | 0.98 | 0.51 | 0.79 | 0.77 | 0.87 | 0.81 | 0.98 | 1 | 0.99 |
sample3 | 0.95 | 0.71 | 0.82 | 0.75 | 0.86 | 0.81 | 0.97 | 0.99 | 0.98 |
3.1 Comparison of detection results
The defect detection results of samples 1–3 are shown in Fig. 4. In the experiment, the parameters were according to the recommendation of ICA-based [
5] and saliency-based [
8] methods. Figure 4(a) shows the detection results for sample1, in which printed words exist on the surfaces of buttons. The main defect types were printing deviation, stain, crack, and other defects. The defect points are shown in white in the binary images. The first column presents one defect-free and three defective samples. The second and third columns show the detection results of the saliency-based and ICA-based methods, respectively. The last column presents the detection results of the proposed method.
Figure 4(b) shows the detection results for sample2, which contains the fade-type defect. The first column presents one defect-free and two defective samples. The second and third columns show the detection results of the saliency-based and ICA-based methods, respectively. The results of the proposed method are shown in the last column. The proposed method performed better than the two other methods because the two methods ignored color information. This result suggests that color information contributes significantly to the detection of fade-type defects.
Figure 4(c) presents detection results for sample3, whose main defects consist of uneven surfaces and cracking. The first column shows one defect-free and three defective samples. The second column presents the detection results of saliency-based method. This method detected edge-type and dot-type defects effectively, given that these defects showed large salient values in the saliency detection step. Owing to the use of the search window strategy (SW), the method is robust to spatial misalignments. The third column shows detection results of the ICA-based method, which is sensitive to misalignment and has a high false alarm rate. The result of proposed method is shown in the last column. Our method is robust to spatial misalignments and has a high recognition rate.
Fig.4 Comparison of defect detection results on three samples by using different methods. (a)–(c) correspond to samples1–3. The first column contains the original images; the second and third columns show the results of saliency-based and ICA-based methods, respectively; and the last column presents the detection results of the proposed method |
Full size|PPT slide
3.2 Tuning of training dataset size
An experiment was conducted to reveal the relationship between recognition rate and training dataset size. Figure 5 shows that the recognition rate increased as the size of the training dataset increased. However, the recognition rate decreased slightly when the number was over 40 with the increasing computation cost. Therefore, the appropriate size of the dataset was 40. Our training dataset was composed of eight buttons. Aside from the original images, another four deviation images were generated by each button by rotating at a small angle around the button center. From the view of linear regression, insufficient data may cause underfitting, and excessive data may cause overfitting. Both scenarios may cause poor performance in the test dataset.
Fig.5 Recognition rate versus training dataset sizes in samples 1–3 |
Full size|PPT slide
3.3 Effect of weight matrix on recognition rate
We adopted a weight matrix to reduce the false alarms caused by large residual errors near edges. An experiment was conducted to verify the effectiveness of the weight matrix. Three weight matrices were used. indicates that the mean gradient intensity was used as weight matrix W in the proposed method. indicates that W is an identity matrix, and all pixels have the same weight. indicates that pixels on edges have a smaller weight than non-edge pixels.
Table 3 shows that when was used, the recognition rate improved. This result means that using the mean gradient intensity as the weight matrix can improve the recognition result.
Tab.3 Relationship between recognition rate and weight matrix W |
| recognition rate |
---|
sample1 | sample2 | sample3 |
---|
| 0.94 | 0.99 | 0.98 |
| 0.88 | 0.93 | 0.95 |
| 0.75 | 0.78 | 0.89 |
3.4 Effect of parameter setting
The key parameters included in the segmentation with GMM consisted of control constant C and number of Gaussian components K. Sensitivity analyses of these parameters were conducted as follows.
3.4.1 Receiver operating characteristic (ROC) analysis of control constant C
Control constant C determines if a pixel belongs to a Gaussian component. A large control constant provides a loose control limit and may ignore several defects, whereas a small control constant provides a tight control limit and may generate false alarms. To verify the detection performance quantitatively, the ROC curve of control constant C was constructed by scanning the control constant from a small number (1.5) to a large number (10), as shown in Fig. 6. In this experiment, the parameters of the number of Gaussian components K were all set to 5.
Fig.6 Receiver operating characteristic curves of the proposed method |
Full size|PPT slide
The ROC curve of the proposed method rises swiftly upward when C ranges from 1.5 to 4.5. In Fig. 6, when C is equal to 4, good performance is obtained with both low false positive and high true positive rates.
3.4.2 Analysis of Gaussian components K
Gaussian components K determines the ability to describe the distribution of the background. A large K can describe a highly complex distribution. If K is set to 1, the model degenerates into a single Gaussian model. According to Refs. [17,18], Kshould be set from 3 to 5. To further verify this point, we conducted experiments on samples 1–3 under different K values (from 1 to 5). Control constant C was set to 4. The experiment results are presented in Table 4. The table shows that setting K from 4 to 5 is appropriate, so K was set to 5 in the rest of our experiments.
Tab.4 Recognition rate versus number of Gaussian components K |
Gaussian components K | recognition rate |
---|
sample1 | sample2 | sample3 |
---|
1 | 0.61 | 0.62 | 0.66 |
2 | 0.82 | 0.88 | 0.81 |
3 | 0.90 | 0.98 | 0.98 |
4 | 0.94 | 0.99 | 0.98 |
5 | 0.94 | 0.99 | 0.98 |
3.5 Effect of rotation deviation
In the proposed method, we rotated the buttons to the same orientation as that in the preprocessing step. Therefore, all buttons were coarsely aligned. However, small deviations still existed among the aligned buttons (most of the deviations were within 5° according to the experiment result). To evaluate the robustness of proposed method to this subtle rotation deviation, we generated another seven test sets by rotating the aligned images at angles ranging from 1° to 7° and evaluated the performances in the eight test sets. Figure 7 shows the performance in sample3 using the three methods. The saliency-based method and the proposed method presented a more stable performance than the ICA-based method, and the proposed method achieved a higher detection rate.
Fig.7 Comparison of the recognition rate of the proposed method with those of the saliency-based and ICA-based methods in terms of rotation deviation angles |
Full size|PPT slide
3.6 Effect of illumination variation
To visualize defect detection performance, Fig. 8 shows the detection results for the same defective button under different illumination conditions. The first column presents the original inspection images under different illumination intensities ranging from 2500 to 4000 lx. The second column shows the reconstructed adaptive template images corresponding to the first column. The last column presents the binary results, which show that the four defective images can be detected steadily. In the experiment, illumination variation was equal to the effect of adding a light intensity offset to every pixel of the inspection image. The reason for the robustness to illumination variation is that to minimize the error function (Eq. (6)), the proposed method adjusts combination coefficients adaptively. As a result, it generates the most appropriate template image adaptively for each inspection image.
Fig.8 Effect of illumination variations. (a) Original inspection images under different illumination intensities (top to bottom correspond to 2500, 3000, 3500, and 4000 lx); (b) adaptive template images corresponding to the first column; and (c) binary results corresponding to the first column |
Full size|PPT slide
To further evaluate the robustness of our method to illumination variation, we performed experiments with sample3 under different illuminations (ranging from 2500 to 4500 lx). In this experiment, the samples under 3500 lx were used as training datasets. Table 5 shows that the proposed method presents a good recognition rate in both under-exposed (2500 lx) and over-exposed (4500 lx) situations.
Tab.5 Recognition rates under different illuminations |
illumination/lux | 2500 | 3000 | 3500 | 4000 | 4500 |
---|
recognition rates | 0.93 | 0.96 | 0.98 | 0.97 | 0.93 |
4 Conclusions
Referential methods have been widely used in surface quality inspection, but these methods rely heavily on alignment accuracy. To address this problem, we proposed a defect detection method based on the weighted least-squares model to suppress the residual error near the image edge. The defects were segmented with GMM.
The proposed method can be applied successfully to different types of buttons. It does not require precise alignment and is robust to both illumination variation and rotation deviation. The method can be potentially applied to various patterned surfaces without high alignment accuracy. To further improve the detection rate, defect detection considering local texture features is worthy of further investigation.
{{custom_sec.title}}
{{custom_sec.title}}
{{custom_sec.content}}