Rajabioun, RaminHatas, HasanAfshar, MojtabaAtan, OzkanAkin, Bilal2026-01-302026-01-3020262687-97352687-974310.1109/JESTIE.2025.36363802-s2.0-105023449983https://doi.org/10.1109/JESTIE.2025.3636380https://hdl.handle.net/20.500.14720/29682This study presents a highly accurate and cost-efficient machine learning framework for diagnosing distributed bearing faults by identifying an optimal and compact set of frequency-domain features. The analysis is based on a comprehensive dataset collected from an experimental testbed comprising a coupled induction motor operating under 50 distinct combinations of speed and load. The dataset encompasses both healthy bearings and those exhibiting various distributed faults-such as lubrication degradation, contamination, electrical erosion, and flaking-as well as a localized fault in the form of a single-point defect on the outer race. Data were acquired using a multisensor module integrating three-axis vibration, stray magnetic flux, and two phase current signals. The primary objective is to reduce an initial set of 102 frequency-based features to a minimal subset suitable for deployment on resource-constrained microcontrollers without sacrificing classification performance. Employing Mutual Information (MI) and Random Forest (RF) techniques for feature selection, the study successfully identifies a critical subset of five features. A Random Forest classifier trained on these features achieved an impressive test accuracy of 99.58%. Furthermore, the combination of top-ranked features from both MI and RF methods led to improved classification performance over using either method independently, thereby enhancing the robustness and reliability of industrial bearing fault detection.eninfo:eu-repo/semantics/closedAccessFault DiagnosisLubricationFeature ExtractionErosionVibrationsContaminationFault DetectionAccuracyRandom ForestsBenchmark TestingDistributed Bearing Faults DetectionFrequency-Based Feature SelectionMachine LearningMultisensor DataOptimal Feature Selection for Distributed Bearing Fault Detection Using Cost-Effective, High-Performance Machine Learning AlgorithmsArticle