TY - GEN
T1 - Reconstruction-based unsupervised feature selection
T2 - 26th International Joint Conference on Artificial Intelligence, IJCAI 2017
AU - Li, Jundong
AU - Tang, Jiliang
AU - Liu, Huan
N1 - Funding Information: This material is, in part, supported by National Science Foundation (NSF) under grant number 1614576.
PY - 2017
Y1 - 2017
N2 - Feature selection has been proven to be effective and efficient in preparing high-dimensional data for data mining and machine learning problems. Since real-world data is usually unlabeled, unsupervised feature selection has received increasing attention in recent years. Without label information, unsupervised feature selection needs alternative criteria to define feature relevance. Recently, data reconstruction error emerged as a new criterion for unsupervised feature selection, which defines feature relevance as the capability of features to approximate original data via a reconstruction function. Most existing algorithms in this family assume predefined, linear reconstruction functions. However, the reconstruction function should be data dependent and may not always be linear especially when the original data is high-dimensional. In this paper, we investigate how to learn the reconstruction function from the data automatically for unsupervised feature selection, and propose a novel reconstruction-based unsupervised feature selection framework REFS, which embeds the reconstruction function learning process into feature selection. Experiments on various types of realworld datasets demonstrate the effectiveness of the proposed framework REFS.
AB - Feature selection has been proven to be effective and efficient in preparing high-dimensional data for data mining and machine learning problems. Since real-world data is usually unlabeled, unsupervised feature selection has received increasing attention in recent years. Without label information, unsupervised feature selection needs alternative criteria to define feature relevance. Recently, data reconstruction error emerged as a new criterion for unsupervised feature selection, which defines feature relevance as the capability of features to approximate original data via a reconstruction function. Most existing algorithms in this family assume predefined, linear reconstruction functions. However, the reconstruction function should be data dependent and may not always be linear especially when the original data is high-dimensional. In this paper, we investigate how to learn the reconstruction function from the data automatically for unsupervised feature selection, and propose a novel reconstruction-based unsupervised feature selection framework REFS, which embeds the reconstruction function learning process into feature selection. Experiments on various types of realworld datasets demonstrate the effectiveness of the proposed framework REFS.
UR - http://www.scopus.com/inward/record.url?scp=85029090920&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85029090920&partnerID=8YFLogxK
U2 - 10.24963/ijcai.2017/300
DO - 10.24963/ijcai.2017/300
M3 - Conference contribution
T3 - IJCAI International Joint Conference on Artificial Intelligence
SP - 2159
EP - 2165
BT - 26th International Joint Conference on Artificial Intelligence, IJCAI 2017
A2 - Sierra, Carles
PB - International Joint Conferences on Artificial Intelligence
Y2 - 19 August 2017 through 25 August 2017
ER -