Training data collected from complex chemical processes often contain both normal and faulty samples. However, some monitoring methods, such as partial least squares (PLS), principal component analysis (PCA), independent component analysis (ICA) and Fisher discriminant analysis (FDA), require fault-free data to build the normal operation model. These techniques can therefore be applied only after a preliminary data clustering step. Here we propose a novel monitoring approach for chemical processes that combines hyperplane distance neighbor clustering (HDNC) with local discriminant analysis (LDA). First, faulty samples are separated from normal ones using the HDNC method. Then, the optimal subspace for fault detection and classification is obtained using the LDA approach. Because the proposed method takes the multimodality within the faulty data into account, it significantly improves process monitoring capability. The HDNC-LDA monitoring approach is applied to two simulated processes and compared with conventional FDA based on K-nearest neighbor clustering (KNN-FDA). The results obtained in two different scenarios demonstrate the superiority of the HDNC-LDA approach in terms of fault detection and classification accuracy.
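The two-stage idea described above (cluster the mixed training data first, then learn a discriminant subspace from the resulting groups) can be illustrated with a minimal sketch. This is not the HDNC-LDA method itself: plain 2-means clustering stands in for HDNC, a two-class Fisher discriminant stands in for local discriminant analysis, and the synthetic data, the function names `two_means` and `fisher_direction`, and the assumption that normal samples form the majority are all hypothetical choices made for illustration.

```python
import numpy as np

def two_means(X, n_iter=100):
    """Plain 2-means clustering (a stand-in for HDNC, not HDNC itself)."""
    # Deterministic initialisation: the sample closest to the overall mean
    # and the sample farthest from it.
    dist_to_mean = np.linalg.norm(X - X.mean(axis=0), axis=1)
    centers = np.stack([X[dist_to_mean.argmin()], X[dist_to_mean.argmax()]])
    labels = None
    for _ in range(n_iter):
        # Distance of every sample to both centers, then nearest-center labels.
        d = np.linalg.norm(X[:, None, :] - centers[None, :, :], axis=2)
        new_labels = d.argmin(axis=1)
        if labels is not None and np.array_equal(new_labels, labels):
            break
        labels = new_labels
        centers = np.stack([X[labels == k].mean(axis=0) for k in (0, 1)])
    # Assumption: normal operation is the majority class, so it gets label 0.
    if (labels == 0).sum() < (labels == 1).sum():
        labels = 1 - labels
    return labels

def fisher_direction(X, labels):
    """Two-class Fisher discriminant (a stand-in for local discriminant analysis)."""
    Xn, Xf = X[labels == 0], X[labels == 1]
    mn, mf = Xn.mean(axis=0), Xf.mean(axis=0)
    # Within-class scatter matrix summed over both groups.
    Sw = (Xn - mn).T @ (Xn - mn) + (Xf - mf).T @ (Xf - mf)
    w = np.linalg.solve(Sw, mf - mn)
    # Decision threshold: projection of the midpoint between the class means.
    threshold = w @ (mn + mf) / 2.0
    return w, threshold

# Hypothetical, well-separated synthetic data: 100 normal and 30 faulty samples.
rng = np.random.default_rng(0)
normal = rng.normal(0.0, 0.5, size=(100, 2))
faulty = rng.normal(0.0, 0.5, size=(30, 2)) + np.array([5.0, 5.0])
X = np.vstack([normal, faulty])

labels = two_means(X)                       # stage 1: separate faulty from normal
w, threshold = fisher_direction(X, labels)  # stage 2: discriminant direction

def is_faulty(x):
    """Flag a new sample as faulty if it projects above the threshold."""
    return bool(w @ x > threshold)
```

With a fitted direction, a new measurement `x` would be screened by calling `is_faulty(x)`. A single global discriminant like this one ignores multimodality within the faulty data, which is the limitation the local discriminant analysis step is meant to address.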