1. Introduction
In the past two decades, a great deal of research effort has been devoted to computer vision and pattern recognition, and tremendous progress has been achieved [1,31-34]. Face recognition, as a nonintrusive biometric technology, has been one of the most active research topics in this field, owing to its scientific challenges and useful applications. Nowadays, appearance-based face recognition methods are the mainstream [1]; they usually represent a face image as a high-dimensional vector that contains much redundant information. It is therefore necessary to extract discriminative information from such a high-dimensional vector to form a new low-dimensional representation. This process, called feature extraction, benefits both recognition accuracy and computational cost.
Face recognition systems depend critically on discriminative feature extraction, for which PCA [2,3] and LDA [4] are the two most representative methods. PCA applies the Karhunen-Loève transform to the training face images to find a set of optimal orthogonal bases (also called Eigenfaces) spanning a low-dimensional subspace and represents a face as a linear combination of Eigenfaces. Though PCA is optimal in the sense of minimum reconstruction error, it may not be favorable to classification because it does not incorporate any class-specific discriminatory information into feature extraction. In contrast, LDA takes class discriminatory information into account and seeks a set of optimal projection vectors by maximizing the ratio of the between-class scatter to the within-class scatter of the training samples. Existing experimental results demonstrate that LDA generally outperforms PCA in recognition rate when sufficient and representative training samples are available for each subject [5].
Unfortunately, in some real-world applications, the number of samples per subject is very small. For example, only one image per person is available in scenarios such as identity card or passport verification, law enforcement, surveillance and access control. This is the so-called Single Sample per Person (SSPP) problem, which severely challenges existing feature extraction algorithms, especially their robustness to variations such as expression, illumination and disguise. For example, the performance of PCA degrades seriously [6], while LDA fails to work altogether because the within-class scatter matrices of the training samples cannot be estimated directly.
In this paper, we propose a simple yet effective method, called local similarity based linear discriminant analysis (LS_LDA), to address the SSPP problem. The intuitive idea is that, to make full use of the single image, we can divide it into many local blocks and analyze each of them. Following the “divide and conquer” [18] strategy, we first classify each local block, and then integrate all the classification results by majority voting to make a final decision. However, classifying each local block faces the difficulty that local blocks corresponding to the same location but coming from different subjects might be close to each other. Intuitively, if we project these blocks from different persons into a lower-dimensional subspace and keep their respective projections in this subspace as far apart as possible, the classification will achieve a higher recognition rate. Although LDA is the best choice for this task, it cannot work under the SSPP condition because the intra-class scatter matrices cannot be calculated. To make LDA feasible for the SSPP problem, we further divide each local block into overlapped patches and propose the local similarity assumption. As the patches in a local block have strong similarity, they can be regarded as belonging to the same class. Based on this idea, the within-class scatter can be computed from the overlapped patches in a local block. Finally, the classification outputs of all the local blocks are aggregated by majority voting, which further improves the performance. Moreover, considering that the class sample average used in existing LDA models cannot provide an accurate estimate of the class population mean when there are outliers in the training set, we further propose local similarity based median discriminant analysis (LS_MDA), which uses the class median vector to estimate the class population mean in LDA. To evaluate the proposed methods, we perform a series of experiments on three datasets: the Extended Yale B, PIE and AR face databases. Experimental results demonstrate that the proposed methods not only outperform methods specially designed for the SSPP problem, but also have strong robustness to expression, illumination, occlusion and time variation.
This paper extends our earlier conference paper [26]. The new contributions are as follows:
(1) We propose median discriminant analysis (MDA) to replace LDA and further improve the performance. In our previous work, we proposed LS_LDA to solve the problem that LDA cannot work under the SSPP condition. Nonetheless, the class average vector used in LDA cannot provide an accurate estimate of the class population mean, especially when there are outliers. The newly proposed LS_MDA takes advantage of the class median vector to estimate the class center and further improves the performance.
(2) We have conducted more experiments to verify the effectiveness of the proposed methods. Previously, we only provided experimental results for the SSPP problem on two databases. In this paper, experimental results on three databases with more training samples are also reported.
(3) We further analyze the advantages of the proposed methods and provide a theoretical explanation, which was not included in our previous work.
The rest of this paper is organized as follows. We introduce related work in Section 2. In Section 3, we present the proposed local similarity based discriminant analysis. Section 4 presents the experiments and results. Finally, we conclude in Section 5 by highlighting the key points of our work.
2. Related Work
To address the SSPP problem, several studies have proposed to use virtual samples or a generic training set. Virtual sample generation methods aim to generate extra samples for each person in order to extract the discriminatory information embedded in the intra-personal variations. For example, Shan et al. [7] presented a face-specific subspace method based on PCA, which first generates a few virtual samples from the single gallery image of each subject and then uses PCA to build a projection subspace for each person. To make LDA suitable for the SSPP problem, they also extended Fisherfaces by generating virtual faces via geometric transforms and photometric changes. In addition, Gao et al. [8] applied SVD decomposition to the only face image of a person, and the obtained non-significant SVD basis images were used to approximate the within-class scatter matrix of this person. However, these methods rely heavily on prior information about the human face. Generic learning methods adopt a generic training set, in which each person has more than one training sample, to extract the discriminatory information. For example, Su et al. proposed an Adaptive Generic Learning (AGL) [24] method, which adapts a generic discriminant model to better distinguish the persons with a single sample. Yang et al. proposed the sparse variation dictionary learning (SVDL) [27] scheme, which exploits the relationship between the gallery set and an external generic set. Recently, Deng et al. [28] proposed a generic learning method that maps the intra-class facial differences of the generic faces to the zero vector to further enhance the generalization capability of their linear regression analysis (LRA). They also proposed the extended sparse representation-based classifier (ESRC) [29] to solve the SSPP problem.
All the above-mentioned methods for the SSPP problem treat the whole image as a high-dimensional vector and belong to holistic representation. Some other schemes favor local representation, in which a face image is divided into blocks and feature vectors are formed block by block rather than globally. Compared with holistic representation, local representation has proved to be more robust against variations [9]. For example, Chen et al. [10] proposed the BlockFLD method, which generates multiple training samples for each person by partitioning each face image into a set of equally sized blocks and then applies FLD-based methods to these blocks. However, the large appearance differences between distant blocks of one image work against the compactness of the within-class scatter after projection. With the same blocking idea, they also proposed sub-pattern based PCA (SpPCA) [22], which applies PCA directly to the partitioned subpatterns, extracts the corresponding local sub-features and then combines them into final global features. However, decision fusion may be more effective than the feature fusion scheme of SpPCA. Recently, Zhu et al. [11] proposed patch based CRC (PCRC) for the small sample size (SSS) problem. They also proposed a local generic representation (LGR) [30] based framework for the SSPP problem, which takes advantage of patch based local representation and generic variation representation. Kumar et al. [12] proposed a patch based n-nearest classifier to improve stability and generalization ability. Lu et al. [25] proposed a discriminative multi-manifold analysis (DMMA) method that learns discriminant features from image patches. Generally speaking, these patch based methods share a commonality: they consider each patch independently. Hence, they lose the correlation information between patches, which is very important for classification.
3. Local Similarity based Discriminant Analysis for Face Recognition
3.1 Local Similarity based Assumption
To describe the local structure, we illustrate three kinds of neighborhood in Fig. 1. The P neighbor pixels on a square of radius R form a squarely symmetric neighbor set. Suppose there are N pixels in an image. For the i-th pixel in the image, its P neighbor pixels can be denoted by {i1, i2, …, iP}.
Fig. 1. Squarely symmetric neighbor sets for different R
Each pixel ij (j = 1,…,P) in the neighbor set {i1, …, iP} is regarded as the center of an S × S local patch (e.g., S = 3, 5). All the S2 pixels within the patch form an m-dimensional local patch vector xij, j = 1,…,P, where m = S2. Similarly, the center pixel i corresponds to a same-sized local patch, which is denoted by the vector xi0. For the i-th pixel in the image, all the local patches xij (j = 0,…,P) determine a local block centered at the i-th pixel. Fig. 2 shows an example of a local block containing a central patch and 16 neighbor patches; here the patch size is S2 = 3 × 3 = 9 and the block size is 7 × 7 = 49. For pixels on the margin of the image, we first apply a mirror transform and then determine the local block.
Fig. 2. Illustration of local patches in a local block
As the patches are overlapped and concentrated in a small block, they are strongly similar. Intuitively, they can be regarded as being from the same class. Therefore, we assume that the overlapped patches in a local block are from the same class.
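To make the patch construction concrete, the following minimal NumPy sketch extracts the (P + 1)-patch local block around a pixel. The function name `local_patches`, the array layout, and the use of symmetric padding as the mirror transform are our own illustrative choices rather than code released with the paper.

```python
import numpy as np

def local_patches(img, row, col, R=1, S=3):
    """Collect the central patch and its P = 8R squarely symmetric neighbor
    patches around pixel (row, col).  A minimal sketch under the assumptions
    stated in the text above."""
    half = S // 2
    # Mirror transform for pixels near the image margin (symmetric padding).
    padded = np.pad(img, half + R, mode="symmetric")
    r, c = row + half + R, col + half + R              # coordinates in the padded image

    def patch(pr, pc):
        # S x S patch centered at (pr, pc), flattened to an S^2-dimensional vector.
        return padded[pr - half:pr + half + 1, pc - half:pc + half + 1].ravel()

    # Central patch x_i0 followed by the P neighbors on the square of radius R.
    patches = [patch(r, c)]
    for dr in range(-R, R + 1):
        for dc in range(-R, R + 1):
            if max(abs(dr), abs(dc)) == R:             # pixels on the square of radius R
                patches.append(patch(r + dr, c + dc))
    return np.stack(patches)                           # (P + 1) x S^2 matrix: one local block
```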
3.2 Local Similarity based Linear Discriminant Analysis (LS_LDA)
Suppose there are C training images {x1,x2,…,xC}, each belonging to a different person. Following the “divide and conquer” [18] strategy, we first divide these training images into many local blocks. Specifically, the only training image xk of the k-th person is divided into a set of N overlapped blocks {bk1,bk2,…,bkN}, where the i-th pixel of xk corresponds to the local block bki. The i-th local blocks from all the C training images form the block training set Bi. Similarly, the probe image y is decomposed into N overlapped blocks {y1,y2,…,yN} in the same way. Then, each block can be classified separately. However, we observe that local blocks corresponding to the same location but coming from different subjects might be close to each other because some individuals look similar.
As shown in Fig. 3(a), yi is supposed to be from Class 1, and b1i, b2i, b3i are the training image blocks of Class 1, Class 2 and Class 3, respectively. It can be seen that the distance between yi and b1i is similar to d1,2 and d1,3. Hence, such probe image blocks are easily misclassified. To make such misclassifications less likely, we aim to learn a mapping that projects these samples into a low-dimensional subspace where the between-class distance is enlarged and the intra-class distance is shortened, as shown in Fig. 3(b). Although LDA is the best choice for this task, it cannot work under the SSPP condition because the intra-class scatter matrices cannot be calculated. To address this problem, we further divide each block into overlapped patches and take advantage of the above-mentioned local structure. As shown in Fig. 4, the testing block yi is divided into P + 1 overlapped patches yij, j = 0,…,P. Similarly, each training block bki can be decomposed into P + 1 overlapped patches xkij, j = 0,…,P. According to the description of the local structure, the overlapped patches in a local block have strong similarity and can be regarded as being from the same class. Then, the within-class scatter and the between-class scatter for the block training set Bi are computed as

$$S_w^i=\sum_{k=1}^{C}\sum_{j=0}^{P}\left(x_{kij}-\mu_{ki}\right)\left(x_{kij}-\mu_{ki}\right)^{T},\qquad S_b^i=\sum_{k=1}^{C}\left(\mu_{ki}-\mu_{i}\right)\left(\mu_{ki}-\mu_{i}\right)^{T}\tag{1}$$

where μki and μi denote the mean of the k-th class and the mean of all classes in Bi, respectively. Then, LDA can be used to seek the projection matrix W that enlarges the between-class distance and shortens the intra-class distance in the projected subspace. This is equivalent to solving the following optimization problem:

$$W^{*}=\arg\max_{W}\frac{\left|W^{T}S_b^{i}W\right|}{\left|W^{T}S_w^{i}W\right|}\tag{2}$$
This optimization problem is equivalent to the following generalized eigenvalue problem:

$$S_b^{i}w=\lambda S_w^{i}w\tag{3}$$

The solution of (3) can be obtained by applying an eigen-decomposition to $(S_w^{i})^{-1}S_b^{i}$ if $S_w^{i}$ is nonsingular, or to $(S_b^{i})^{-1}S_w^{i}$ if $S_b^{i}$ is nonsingular. The rows of the projection matrix W are the eigenvectors corresponding to the C − 1 nonzero eigenvalues.
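The per-block discriminant analysis of Eqs. (1)-(3) can be sketched as follows. The helper name `block_lda`, the data layout, and the pseudo-inverse used to cope with a possibly singular within-class scatter are assumptions of this illustration, not the authors' implementation.

```python
import numpy as np

def block_lda(block_train):
    """LS_LDA projection for one block position.
    block_train: array of shape (C, P + 1, m) -- for each of the C persons,
    the P + 1 overlapped patch vectors of the block at this position."""
    C, n_patch, m = block_train.shape
    class_means = block_train.mean(axis=1)                 # mu_ki, one per person
    total_mean = class_means.mean(axis=0)                  # mu_i

    Sw = np.zeros((m, m))
    Sb = np.zeros((m, m))
    for k in range(C):
        diff_w = block_train[k] - class_means[k]           # patches vs. class mean
        Sw += diff_w.T @ diff_w
        diff_b = (class_means[k] - total_mean)[:, None]    # class mean vs. total mean
        Sb += diff_b @ diff_b.T

    # Generalized eigenvalue problem Sb w = lambda Sw w, solved via pseudo-inverse.
    eigvals, eigvecs = np.linalg.eig(np.linalg.pinv(Sw) @ Sb)
    order = np.argsort(-eigvals.real)[:C - 1]              # keep the C - 1 leading directions
    return eigvecs[:, order].real.T                        # rows of W are projection vectors
```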
Fig. 3. Illustration of the basic idea
Fig. 4. The flowchart of local similarity based linear discriminant analysis for face recognition
After computing the projection matrix W, we project the block training set Bi and the testing block yi into the lower-dimensional subspace, and a cosine-distance based Nearest Neighbor (NN) classifier is used to classify the testing block. To reduce the computational cost, we only classify the central patch of the testing block. After classifying each block, the classification outputs of all blocks are aggregated. Majority voting is used for the final decision, i.e., the test sample is classified into the class that receives the largest number of votes. Fig. 4 shows the whole recognition procedure, which can be summarized as a “divide-conquer-aggregate” procedure.
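A hedged sketch of this “divide-conquer-aggregate” procedure is given below, with a cosine-distance NN decision on the projected central patch and majority voting over blocks. The function names and the choice to use all projected training patches of a block as the gallery are illustrative assumptions rather than details confirmed by the paper.

```python
import numpy as np
from collections import Counter

def cosine_nn(query, gallery):
    """Index of the gallery row with the largest cosine similarity to the query."""
    g = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)
    q = query / np.linalg.norm(query)
    return int(np.argmax(g @ q))

def classify_probe(probe_blocks, train_blocks, projections):
    """Classify the projected central patch of every probe block, then take a
    majority vote over the N block decisions.
    probe_blocks: (N, P + 1, m); train_blocks: (N, C, P + 1, m);
    projections: list of N matrices W returned by block_lda (a sketch only)."""
    votes = []
    for i, W in enumerate(projections):
        query = W @ probe_blocks[i, 0]                            # projected central patch of y_i
        C, n_patch, m = train_blocks[i].shape
        gallery = train_blocks[i].reshape(C * n_patch, m) @ W.T   # all projected training patches
        labels = np.repeat(np.arange(C), n_patch)
        votes.append(int(labels[cosine_nn(query, gallery)]))
    return Counter(votes).most_common(1)[0][0]                    # class with the most votes
```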
3.3 Local Similarity based Median Discriminant Analysis (LS_MDA)
In current LDA models, the class sample average is used to estimate the class population mean and to construct the between-class and within-class scatter matrices. However, when the number of training samples is limited, the class sample average cannot provide an accurate estimate of the class population mean, particularly when there are outliers in the training set. To overcome this weakness, Yang et al. [20] proposed the median Fisher discriminator (MFD), which uses the class median vector, rather than the class sample average, to estimate the class population mean vector in LDA modeling.
In probability theory and statistics, the median is defined as the middle value in a distribution, above and below which lie an equal number of values. Like the sample average, the median can be used to estimate the central tendency of a population, and it is more robust than the sample average, especially when there are outliers in the training set. Specifically, given a set of Nk training sample vectors {x1,x2,⋯,xNk} in Class k, the class median vector Mk is obtained as Mk = (m1,m2,⋯,mn)T, where mj is the median value of the j-th row of the data matrix X. The data matrix X is defined as

$$X=\left[x_{1},x_{2},\cdots,x_{N_k}\right]\in\mathbb{R}^{n\times N_k}\tag{4}$$
Then the total mean vector of all samples can be estimated by

$$M=\frac{1}{N_{all}}\sum_{k=1}^{C}N_{k}M_{k}\tag{5}$$
where Nall is the total number of samples. Then the within-class scatter and the between-class scatter can be computed by

$$S_w=\sum_{k=1}^{C}\sum_{j=1}^{N_k}w_{kj}\left(x_{kj}-M_{k}\right)\left(x_{kj}-M_{k}\right)^{T},\qquad S_b=\sum_{k=1}^{C}N_{k}\left(M_{k}-M\right)\left(M_{k}-M\right)^{T}\tag{6}$$
where wkj is a weighting coefficient introduced to alleviate the influence of outliers on the construction of the within-class scatter matrix. A small weighting coefficient is imposed on samples that are far away from the class center. Therefore, the weighting coefficients wkj are generated by a monotonically decreasing function of the distance between the sample xkj and the class median vector Mk. In [20], Yang et al. chose the Gaussian function as the weighting function:

$$w_{kj}=\exp\!\left(-\frac{\left\|x_{kj}-M_{k}\right\|^{2}}{t}\right)\tag{7}$$
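The sketch below illustrates the MFD-style construction with class median vectors and Gaussian weights in the spirit of Eqs. (6)-(7); the exact normalization factors in [20] may differ, so treat this as an approximation rather than a faithful reimplementation.

```python
import numpy as np

def mfd_scatters(samples_by_class, t=4.0):
    """Within- and between-class scatter with class median vectors and Gaussian
    down-weighting of samples far from the class median (a hedged sketch).
    samples_by_class: list of arrays, each of shape (N_k, n)."""
    n = samples_by_class[0].shape[1]
    medians = [np.median(Xk, axis=0) for Xk in samples_by_class]     # class median vectors M_k
    Nk = np.array([len(Xk) for Xk in samples_by_class])
    M = sum(N * m for N, m in zip(Nk, medians)) / Nk.sum()           # total mean estimate

    Sw = np.zeros((n, n))
    Sb = np.zeros((n, n))
    for Xk, Mk, N in zip(samples_by_class, medians, Nk):
        d = Xk - Mk
        w = np.exp(-np.sum(d * d, axis=1) / t)                       # Gaussian weights w_kj
        Sw += (d * w[:, None]).T @ d                                 # weighted within-class scatter
        Sb += N * np.outer(Mk - M, Mk - M)                           # between-class scatter
    return Sw, Sb
```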
Theoretically speaking, MFD is more robust to noise and outliers than LDA for the following reasons: 1) the class median vector is a more robust estimate of the class center than the class sample average; 2) the weighting strategy further alleviates the influence of outliers. In small sample size cases, although there are only a few training samples for each class, the class median vector Mk generally provides a more accurate approximation to the true class population mean, especially when there are outliers in the training set.
Inspired by MFD, we further propose local similarity based median discriminant analysis (LS_MDA), which uses MFD to replace LDA in the LS_LDA model. Specifically, the within-class scatter and the between-class scatter for the block training set Bi are computed as

$$S_w^{i}=\sum_{k=1}^{C}\sum_{j=0}^{P}w_{kj}\left(x_{kij}-M_{ki}\right)\left(x_{kij}-M_{ki}\right)^{T},\qquad S_b^{i}=\sum_{k=1}^{C}\left(M_{ki}-M_{i}\right)\left(M_{ki}-M_{i}\right)^{T}\tag{8}$$

where Mki is the median vector of the k-th class in the block training set Bi and Mi is the total mean vector of all samples in Bi.
3.4 Analysis of the Proposed Methods
Compared with global appearance based methods, the proposed methods have several advantages. First, the dilemma of high data dimensionality and small sample size is alleviated by the “divide-conquer” strategy, whereas global appearance based methods must first apply dimension reduction to cope with it. Furthermore, each sub-image classification problem produced by the “divide-conquer” strategy can be solved quickly and in parallel. Second, as local feature based methods, our proposed methods can eliminate or reduce the effects of illumination changes, occlusion and expression changes by analyzing face images locally, since local facial information is less sensitive to these changes, whereas global appearance based methods are easily affected by them. Third, the local similarity assumption not only describes the relationship of the patches in a local block but also makes LDA feasible for the SSPP problem. Finally, majority voting over the classification results of the different blocks further improves the performance; majority voting has been shown to be by far the simplest yet as effective as more complicated combination schemes [13,14]. By contrast, global appearance based methods output only a single classification result from a single classifier based on the global appearance representation.
Like SpPCA [22] and BlockFLD [10], our proposed LS_LDA and LS_MDA utilize the blocking idea. However, we classify each block and combine the decisions, rather than fusing the features of all blocks as SpPCA [22] does. Compared with the feature fusion scheme, decision fusion handles much smaller data. Moreover, decision fusion has strong anti-interference capacity because the misclassifications of some blocks can be suppressed by majority voting. As for BlockFLD [10], although it makes LDA feasible for the SSPP problem by dividing each face image into a set of equally sized blocks, the large differences between distant blocks of one image enlarge the within-class scatter after projection. LS_LDA and LS_MDA successfully avoid this problem by applying LDA within each block.
4. Experimental Results and Analysis
In this section, we use the Extended Yale B [15], PIE [19] and AR [16] databases to evaluate the proposed methods and compare them with popular methods for the SSPP problem. These state-of-the-art methods include PCA [21], SpPCA [22], 2DPCA [23], BlockFLD [10], FLDA_single [8], AGL [24], DMMA [25], and patch based CRC (PCRC) [11].
For our proposed methods, the neighbor set is fixed at P = 8, R = 1 and the patch size is fixed at 9 × 9. Since the classification of each local block can be solved independently before combining decisions, we use 12 MATLAB workers for parallel computation to improve efficiency. The kernel parameter t in Eq. (7) is set to t = 4. Note that when t = +∞, the weight wkj = 1 for any k and j, which means that the within-class scatter matrix is constructed in a non-weighted way.
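Because the block-level classifications are independent, they parallelize trivially. The sketch below is a Python analogue of the 12 MATLAB workers used above; the executor, worker function, and argument packing are our own illustrative choices, not the authors' setup.

```python
import numpy as np
from collections import Counter
from concurrent.futures import ProcessPoolExecutor

def vote_for_block(args):
    """Cosine-distance NN vote for a single block position (cf. Section 3.2)."""
    probe_central, gallery, labels = args          # vectors already projected by this block's W
    g = gallery / np.linalg.norm(gallery, axis=1, keepdims=True)
    q = probe_central / np.linalg.norm(probe_central)
    return int(labels[np.argmax(g @ q)])

def parallel_decision(per_block_args, n_workers=12):
    """Run the N independent block classifications concurrently, then vote."""
    with ProcessPoolExecutor(max_workers=n_workers) as pool:
        votes = list(pool.map(vote_for_block, per_block_args))
    return Counter(votes).most_common(1)[0][0]
```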
4.1 Extended Yale B Database
The Extended Yale B face database [15] contains 38 subjects under 9 poses and 64 illumination conditions. The 64 images of a subject in a particular pose were acquired at a camera frame rate of 30 frames per second, so there is only a small change in head pose and facial expression across those 64 images. However, its extreme lighting conditions still make it a challenging task for most face recognition methods. All frontal-face images marked with P00 were used in our experiment. The cropped and normalized 192×168 face images were captured under various laboratory-controlled lighting conditions [17] and are resized to 80×80 in our experiments. Some sample images of one person are shown in Fig. 5.
Fig. 5. Samples of a person under different illuminations in the Extended Yale B face database
To evaluate the performance of the proposed methods on the SSPP problem, we use the image under the best illumination condition (azimuth and elevation both 0 degrees) for training. The remaining 63 images are used for testing and the average results are reported. The average results on the five subsets of the Extended Yale B are also reported separately; the details of the five subsets are given in Table 1. The experimental results are shown in Table 2. From the table, we can see that our proposed methods achieve the best average results. Moreover, LS_MDA further improves on LS_LDA. The experimental results demonstrate that our proposed methods are robust not only to the SSPP problem but also to illumination variations.
Table 1. Five Subsets of Extended Yale B
Table 2. Recognition rates on Extended Yale B
4.2 PIE Database
The CMU PIE face database [19] contains 68 subjects and 41,368 face images in total. Images of each person were taken across 13 different poses, under 43 different illumination conditions, and with 4 different expressions. All images have been cropped and resized to 64 × 64. Some sample images of one person are shown in Fig. 6.
Fig. 6. Samples of a person in the PIE face database.
The images under five near-frontal poses (C05, C07, C09, C27 and C29) are used in our experiment. We select 6 images with different illumination under each pose, giving 30 images for each individual. The single image with good illumination from the frontal pose C27 is used for training, and the remaining images of each subject are used for testing. The experimental results are shown in Fig. 7. We can see that LS_LDA and LS_MDA still achieve the best results and that LS_MDA outperforms LS_LDA. Compared with the other methods, they lead to at least a 9.3% improvement. To further evaluate the performance of the proposed LS_LDA and LS_MDA, we also conduct experiments with more training samples. We randomly select 1-10 samples from the 30 images of each individual for training and use the remaining images for testing. The experiment is repeated 5 times and the average results are reported in Fig. 8. It can be seen that LS_MDA shows obvious advantages, especially when the number of training samples is small. This is because, as the number of training samples increases, the influence of outliers is suppressed by the larger number of clean samples.
Fig. 7. Recognition rates on the PIE face database.
Fig. 8. Recognition rates with different training sample numbers.
4.3 AR Database
The AR face database [16] contains over 4,000 color face images of 126 people (70 men and 56 women), including frontal views of faces with different facial expressions, lighting conditions and occlusions. The pictures of 120 individuals (65 men and 55 women) were taken in two sessions (separated by two weeks), and each session contains 13 color images; these 120 individuals are used in our experiment. To demonstrate the performance of the proposed methods on the SSPP problem, we use the single image with neutral expression and illumination from session 1 for training, and the other images of the two sessions for testing. Some samples of one person are shown in Fig. 9. The images are resized to 32×32 and converted to gray scale.
Fig. 9. Samples of a person in the AR database. Top row: 8 samples with different expression, illumination and occlusion from session 1; bottom row: 8 samples from session 2 taken under the same conditions as those in the top row.
The classification results on the two sessions are shown in Table 4 and Table 5, respectively. One can see from the tables that our methods outperform the other methods; they are not only robust to expression and illumination variations but also show strong robustness to occlusion. In the session 1 experiments, they lead to at least a 10% improvement over the other methods. We also compare them with a method specially designed for occlusion (GSRC [19]), which used 7 samples per subject and high-resolution images but achieved lower recognition rates of 93% and 79% in the sunglasses and scarf cases. Although some occluded blocks are misclassified by LS_LDA, the unoccluded blocks produce correct votes that suppress the wrong ones and yield an accurate final decision. The session 2 experiments show that the proposed methods are also robust to time variation: compared with the other methods, they achieve at least a 20% improvement. To further compare the performance of the proposed LS_LDA and LS_MDA, we also conduct experiments with more training samples. We randomly select 1-5 samples from each subject for training and use the remaining samples for testing. The experiment is repeated 5 times and the average results under different training sample numbers are shown in Table 6. From the table, we can see that LS_MDA is superior to LS_LDA.
Table 4. Recognition accuracy (%) on session 1 of the AR database
Table 5. Recognition accuracy (%) on session 2 of the AR database
Table 6. Recognition accuracy (%) under different training sample numbers
5. Conclusion
In this paper, we propose local similarity based linear discriminant analysis (LS_LDA) to solve the SSPP problem. Motivated by the “divide-conquer” strategy, we first divide the face image into local blocks, classify each block, and then integrate all the classification results to make the final decision. To make LDA feasible for the classification of each local block, we further divide each block into overlapped patches and assume that these patches are from the same class. This assumption not only reflects the local structural relationship of the overlapped patches but also makes LDA feasible for the SSPP problem. Considering that blocks affected by illumination, expression or occlusion changes may be invalid for classification, we further propose LS_MDA to improve the performance. Experimental results on three popular databases show that our methods not only generalize well to the SSPP problem but also have strong robustness to expression, illumination, occlusion and time variation. However, the proposed methods rely on the assumption that all training and testing images are well aligned, so drastic pose variation will degrade their performance. We plan to address this problem in future work.
References
- W. Zhao, R. Chellappa, A. Rosenfeld, and P. J. Phillips, “Face recognition: A literature survey,” ACM Computing Surveys, vol. 35, no. 4, pp. 399-458, 2003. https://doi.org/10.1145/954339.954342
- M. Kirby and L. Sirovich, “Application of the Karhunen-Loeve procedure for the characterization of human faces,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 12, no. 1, pp. 103-108, 1990. https://doi.org/10.1109/34.41390
- M. Turk and A. Pentland, “Eigenfaces for recognition,” Journal of Cognitive Neuroscience, vol. 3, no. 1, pp. 71-86, 1991. https://doi.org/10.1162/jocn.1991.3.1.71
- P. Belhumeur, J. Hespanha, and D. Kriegman, “Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 19, no. 7, pp. 711-720, 1997. https://doi.org/10.1109/34.598228
- A. M. Martínez and A. C. Kak, “PCA versus LDA,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 2, pp. 228-233, 2001. https://doi.org/10.1109/34.908974
- X. Tan, S. C. Chen, Z. H. Zhou, and F. Zhang, “Face recognition from a single image per person: A survey,” Pattern Recognition, vol. 39, no. 9, pp. 1725-1745, 2006. https://doi.org/10.1016/j.patcog.2006.03.013
- S. Shan, W. Gao, and D. Zhao, “Face identification based on face-specific subspace,” International Journal of Imaging Systems and Technology, vol. 13, no. 1, pp. 23-32, 2003. https://doi.org/10.1002/ima.10047
- Q. X. Gao, L. Zhang, and D. Zhang, “Face recognition using FLDA with single training image per person,” Applied Mathematics and Computation, vol. 205, no. 2, pp. 726-734, 2008. https://doi.org/10.1016/j.amc.2008.05.019
- A. M. Martinez, “Recognizing imprecisely localized, partially occluded, and expression variant faces from a single sample per class,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 24, no. 6, pp. 748-763, 2002. https://doi.org/10.1109/TPAMI.2002.1008382
- S. C. Chen, J. Liu, and Z. H. Zhou, “Making FLDA applicable to face recognition with one sample per person,” Pattern Recognition, vol. 37, no. 7, pp. 1553-1555, 2004. https://doi.org/10.1016/j.patcog.2003.12.010
- P. F. Zhu, L. Zhang, Q. H. Hu, and S. C. K. Shiu, “Multi-scale patch based collaborative representation for face recognition with margin distribution optimization,” in Proc. of European Conference on Computer Vision, pp. 822-835, October 7-13, 2012.
- R. Kumar, A. Banerjee, B. C. Vemuri, and H. Pfister, “Maximizing all margins: Pushing face recognition with kernel plurality,” in Proc. of IEEE International Conference on Computer Vision, pp. 2375-2382, November 6-13, 2011.
- N. C. de Condorcet, Essai sur l’Application de l’Analyse à la Probabilité des Décisions Rendues à la Pluralité des Voix. Paris, France: Imprimérie Royale, 1785.
- L. Lam and C. Y. Suen, “Application of majority voting to pattern recognition: an analysis of its behavior and performance,” IEEE Transactions on Systems, Man, and Cybernetics, Part A: Systems and Humans, vol. 27, no. 5, pp. 553-568, 1997. https://doi.org/10.1109/3468.618255
- A. Georghiades, P. Belhumeur, and D. Kriegman, “From few to many: Illumination cone models for face recognition under variable lighting and pose,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 23, no. 6, pp. 643-660, 2001. https://doi.org/10.1109/34.927464
- A. M. Martinez and R. Benavente, “The AR Face Database,” CVC Technical Report 24, 1998.
- K. Lee, J. Ho, and D. Kriegman, “Acquiring linear subspaces for face recognition under variable lighting,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 5, pp. 684-698, 2005. https://doi.org/10.1109/TPAMI.2005.92
- Q. F. Stout, “Supporting divide-and-conquer algorithms for image processing,” Journal of Parallel and Distributed Computing, vol. 4, no. 1, pp. 95-115, 1987. https://doi.org/10.1016/0743-7315(87)90010-4
- T. Sim, S. Baker, and M. Bsat, “The CMU pose, illumination, and expression database,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 25, no. 12, pp. 1615-1618, 2003. https://doi.org/10.1109/TPAMI.2003.1251154
- J. Yang, J. Yang, and D. Zhang, “Median Fisher discriminator: a robust feature extraction method with applications to biometrics,” Frontiers of Computer Science in China, vol. 2, no. 3, pp. 295-305, 2008. https://doi.org/10.1007/s11704-008-0029-4
- M. Turk and A. Pentland, “Eigenfaces for recognition,” Journal of Cognitive Neuroscience, vol. 3, no. 1, pp. 71-86, 1991. https://doi.org/10.1162/jocn.1991.3.1.71
- S. C. Chen and Y. L. Zhu, “Subpattern-based principle component analysis,” Pattern Recognition, vol. 37, no. 1, pp. 1081-1083, 2004. https://doi.org/10.1016/j.patcog.2003.09.004
- J. Yang, D. Zhang, and J. Y. Yang, “Two-dimensional PCA: A new approach to appearance-based face representation and recognition,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 26, no. 1, pp. 131-137, 2004. https://doi.org/10.1109/TPAMI.2004.1261097
- Y. Su, S. Shan, X. Chen, and W. Gao, “Adaptive generic learning for face recognition from a single sample per person,” in Proc. of IEEE Conference on Computer Vision and Pattern Recognition, pp. 2699-2706, June 13-18, 2010.
- J. Lu, Y. P. Tan, G. Wang, and X. Zhou, “Discriminative multi-manifold analysis for face recognition from a single training sample per person,” in Proc. of IEEE International Conference on Computer Vision, pp. 1943-1950, November 6-13, 2011.
- F. Liu, Y. Bi, Y. Cui, and Z. Tang, “Local similarity based linear discriminant analysis for face recognition with single sample per person,” in Proc. of Asian Conference on Computer Vision Workshops (FSLCV), pp. 85-95, November 1-5, 2014.
- M. Yang, L. Van Gool, and L. Zhang, “Sparse variation dictionary learning for face recognition with a single training sample per person,” in Proc. of IEEE International Conference on Computer Vision, pp. 689-696, December 3-6, 2013.
- W. Deng, J. Hu, X. Zhou, and J. Guo, “Equidistant prototypes embedding for single sample based face recognition with generic learning and incremental learning,” Pattern Recognition, vol. 47, no. 12, pp. 3738-3749, 2014. https://doi.org/10.1016/j.patcog.2014.06.020
- W. Deng, J. Hu, and J. Guo, “Extended SRC: Undersampled face recognition via intraclass variant dictionary,” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 34, no. 9, pp. 1864-1870, 2012. https://doi.org/10.1109/TPAMI.2012.30
- P. Zhu et al., “Local generic representation for face recognition with single sample per person,” in Proc. of Asian Conference on Computer Vision, pp. 34-50, November 1-5, 2014.
- J. Yu, R. Hong, M. Wang, and J. You, “Image clustering based on sparse patch alignment framework,” Pattern Recognition, vol. 47, no. 11, pp. 3512-3519, 2014. https://doi.org/10.1016/j.patcog.2014.05.002
- J. Yu, D. Tao, M. Wang, and Y. Rui, “Learning to rank using user clicks and visual features for image retrieval,” IEEE Transactions on Cybernetics, vol. 45, no. 4, pp. 767-779, 2015. https://doi.org/10.1109/TCYB.2014.2336697
- J. Yu, M. Wang, and D. Tao, “Semisupervised multiview distance metric learning for cartoon synthesis,” IEEE Transactions on Image Processing, vol. 21, no. 11, pp. 4636-4648, 2012. https://doi.org/10.1109/TIP.2012.2207395
- J. Yu, D. Tao, and M. Wang, “Adaptive hypergraph learning and its application in image classification,” IEEE Transactions on Image Processing, vol. 21, no. 7, pp. 3262-3272, 2012. https://doi.org/10.1109/TIP.2012.2190083