TY - GEN
T1 - A marginalized denoising method for link prediction in relational data
AU - Chen, Zheng
AU - Zhang, Weixiong
N1 - Publisher Copyright:
© SIAM.
PY - 2014
Y1 - 2014
N2 - Missing information is ubiquitous in relational datasets. Imputation of missing relations, a.k.a. link prediction, has become an increasingly crucial problem in relational data analysis as a huge amount of data has been accumulated in various fields. Recent advances in the latent variable models have greatly improved the state-of-the-art in the link prediction accuracy, however it comes at the price of increasing complexity. In this paper, we propose a novel link prediction algorithm, marginalized denoising model (MDM), where the problem of predicting unobserved or missing links in a given relational matrix is cast as a problem of matrix denoising. The method learns a mapping function that models the embedded topological structures of the relational network by capturing the so-called indirect affinities among entities. We train the mapping function by recovering the originally observed matrix from a conceptually "infinite" number of corrupted matrices where some links are randomly masked from the observed matrix. By re-applying the learned function to the observed relational matrix, we aim to "denoise" the observed matrix and thus to recover the unobserved links. Experimental results on several benchmarks demonstrate the superior performance of the new method over several stateof-the-art link prediction methods.
AB - Missing information is ubiquitous in relational datasets. Imputation of missing relations, a.k.a. link prediction, has become an increasingly crucial problem in relational data analysis as a huge amount of data has been accumulated in various fields. Recent advances in the latent variable models have greatly improved the state-of-the-art in the link prediction accuracy, however it comes at the price of increasing complexity. In this paper, we propose a novel link prediction algorithm, marginalized denoising model (MDM), where the problem of predicting unobserved or missing links in a given relational matrix is cast as a problem of matrix denoising. The method learns a mapping function that models the embedded topological structures of the relational network by capturing the so-called indirect affinities among entities. We train the mapping function by recovering the originally observed matrix from a conceptually "infinite" number of corrupted matrices where some links are randomly masked from the observed matrix. By re-applying the learned function to the observed relational matrix, we aim to "denoise" the observed matrix and thus to recover the unobserved links. Experimental results on several benchmarks demonstrate the superior performance of the new method over several stateof-the-art link prediction methods.
UR - http://www.scopus.com/inward/record.url?scp=84925504830&partnerID=8YFLogxK
U2 - 10.1137/1.9781611973440.34
DO - 10.1137/1.9781611973440.34
M3 - Conference contribution
AN - SCOPUS:84925504830
T3 - SIAM International Conference on Data Mining 2014, SDM 2014
SP - 298
EP - 306
BT - SIAM International Conference on Data Mining 2014, SDM 2014
A2 - Zaki, Mohammed J.
A2 - Banerjee, Arindam
A2 - Parthasarathy, Srinivasan
A2 - Ning-Tan, Pang
A2 - Obradovic, Zoran
A2 - Kamath, Chandrika
PB - Society for Industrial and Applied Mathematics Publications
T2 - 14th SIAM International Conference on Data Mining, SDM 2014
Y2 - 24 April 2014 through 26 April 2014
ER -