A novel self-learning feature selection approach based on feature attributions

2021
Feature selection has shown its effectiveness in improving the accuracy and generalization of machine learning models, especially for those tasks with high-dimensional data. In this article, a novel self-learning feature selection (SLFS) approach based on feature attributions is proposed as a wrapper method, which has higher search efficiency for optimal feature subsets with three main improvements. First, we regard feature selection as a combinatorial optimization problem and propose a unified local search framework for wrapper methods by analyzing meta-heuristic algorithms in feature selection. Second, for the binary search space of feature selection, we propose two types of neighborhood structures, namely, ring-type and line-type structures, for the local search framework. Third, we focus on feature attribution methods, such as SHAP (SHapley Additive explanations) (Lundberg & Lee, 2017), which can interpret each feature's importance to predictions. In each iteration, we adopt SHAP values and other attributes from previous subsets to guide the next selection of new subsets. To validate the performance of our SLFS approach, we collected 16 classification datasets from the UCI repository for comparison with other meta-heuristic wrapper approaches in terms of fitness, accuracy, F1 scores and selection ratios. The experimental results show that the SLFS approach can be used to obtain an optimal subset with fewer iterations and a small population, and SHAP values play a role in improving search efficiency.
EXPERT SYSTEMS WITH APPLICATIONS
卷号:183
ISSN:0957-4174
收录类型
SSCI
发表日期
2021
学科领域
循证管理学
国家
中国
语种
英语
DOI
10.1016/j.eswa.2021.115219
其他关键词
WHALE OPTIMIZATION ALGORITHM; HYBRID GENETIC ALGORITHM; CLASSIFICATION; SEARCH; SYSTEM
EISSN
1873-6793
资助机构
National Key R&D Program of China [2019YFB1704402]
资助信息
This work was supported by National Key R&D Program of China (2019YFB1704402) .
被引频次(WOS)
0
被引更新日期
2022-01
来源机构
Tongji University Utah System of Higher Education Utah State University
关键词
Feature selection Local search Feature attribution Self-learning