Overall
- Language
- English
- Conflict of Interest
- In relation to this article, we declare that there is no conflict of interest.
- Publication history
-
Received October 22, 2024
Accepted January 20, 2025
Available online May 25, 2025
-
This is an Open-Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/bync/3.0) which permits
unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Most Cited
Dimensionality Reduction for Clustering of Nonlinear Industrial Data: A Tutorial
https://doi.org/10.1007/s11814-025-00402-7
Abstract
Dimensionality reduction is essential for industrial process data with numerous nonlinear variables to retain only the important
features for visualization or subsequent tasks. This study serves as a tutorial demonstrating how various dimensionality
reduction techniques perform as the complexity of process variables in toy examples increases. Among the variables, there
are those containing fault signals, aiming to demonstrate the process of performing a fault detection task. The results evaluated
based on three criteria showed that Uniform Manifold Approximation and Projection (UMAP) demonstrated notable
results, particularly with sparse and noisy data, while also off ering adequate robustness to out-of-sample test data. This tutorial
provides guidance on selecting the appropriate dimensionality reduction technique based on data complexity, ultimately
enabling more eff ective execution of subsequent tasks.

