TY - GEN
T1 - Identity matching based on probabilistic relational models
AU - Li, Jiexun
AU - Wang, Gang
AU - Chen, Hsinchun
PY - 2006
Y1 - 2006
N2 - Identity management is critical to various organizational practices ranging from citizen services to crime investigation. The task of searching for a specific identity is difficult because multiple identity representations may exist due to issues related to unintentional errors and intentional deception. In this study we propose a probabilistic relational model (PRM) based approach to match identities in databases. By exploring a database relational structure, we derive three categories of features, namely personal identity features, social activity features, and social relationship features. Based on these derived features, a probabilistic prediction model can be constructed to make a matching decision on a pair of identities. An experimental study using a real criminal dataset demonstrates the effectiveness of the proposed PRM-based approach. By incorporating social activity features, the average precision of identity matching increased from 53.73 % to 54.64%; furthermore, the incorporation of social relation features increased the average precision to 68.27%.
AB - Identity management is critical to various organizational practices ranging from citizen services to crime investigation. The task of searching for a specific identity is difficult because multiple identity representations may exist due to issues related to unintentional errors and intentional deception. In this study we propose a probabilistic relational model (PRM) based approach to match identities in databases. By exploring a database relational structure, we derive three categories of features, namely personal identity features, social activity features, and social relationship features. Based on these derived features, a probabilistic prediction model can be constructed to make a matching decision on a pair of identities. An experimental study using a real criminal dataset demonstrates the effectiveness of the proposed PRM-based approach. By incorporating social activity features, the average precision of identity matching increased from 53.73 % to 54.64%; furthermore, the incorporation of social relation features increased the average precision to 68.27%.
KW - Feature construction
KW - Identity matching
KW - Probabilistic relational models
UR - http://www.scopus.com/inward/record.url?scp=84870208179&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84870208179&partnerID=8YFLogxK
M3 - Conference contribution
SN - 9781604236262
T3 - Association for Information Systems - 12th Americas Conference On Information Systems, AMCIS 2006
SP - 1457
EP - 1464
BT - Association for Information Systems - 12th Americas Conference On Information Systems, AMCIS 2006
T2 - 12th Americas Conference on Information Systems, AMCIS 2006
Y2 - 4 August 2006 through 6 August 2006
ER -