TY - GEN
T1 - Long-Term Effect Estimation with Surrogate Representation
AU - Cheng, Lu
AU - Guo, Ruocheng
AU - Liu, Huan
N1 - Funding Information: This material is based upon work supported by the National Science Foundation (NSF) Grant 1614576. Publisher Copyright: © 2021 ACM.
PY - 2021/8/3
Y1 - 2021/8/3
N2 - There are many scenarios where short- and long-term causal effects of an intervention are different. For example, low-quality ads may increase short-term ad clicks but decrease the long-term revenue via reduced clicks. This work, therefore, studies the the problem of long-term effect where the outcome of primary interest, orprimary outcome, takes months or even years to accumulate. The observational study of long-term effect presents unique challenges. First, the confounding bias causes large estimation error and variance, which can further accumulate towards the prediction of primary outcomes. Second, short-term outcomes are often directly used as the proxy of the primary outcome, i.e., thesurrogate. Nevertheless, this method entails the strong surrogacy assumption that is often impractical. To tackle these challenges, we propose to build connections between long-term causal inference and sequential models in machine learning. This enables us to learnsurrogate representations that account for thetemporal unconfoundedness and circumvent the stringent surrogacy assumption by conditioning on the inferred time-varying confounders. Experimental results show that the proposed framework outperforms the state-of-the-art.
AB - There are many scenarios where short- and long-term causal effects of an intervention are different. For example, low-quality ads may increase short-term ad clicks but decrease the long-term revenue via reduced clicks. This work, therefore, studies the the problem of long-term effect where the outcome of primary interest, orprimary outcome, takes months or even years to accumulate. The observational study of long-term effect presents unique challenges. First, the confounding bias causes large estimation error and variance, which can further accumulate towards the prediction of primary outcomes. Second, short-term outcomes are often directly used as the proxy of the primary outcome, i.e., thesurrogate. Nevertheless, this method entails the strong surrogacy assumption that is often impractical. To tackle these challenges, we propose to build connections between long-term causal inference and sequential models in machine learning. This enables us to learnsurrogate representations that account for thetemporal unconfoundedness and circumvent the stringent surrogacy assumption by conditioning on the inferred time-varying confounders. Experimental results show that the proposed framework outperforms the state-of-the-art.
KW - long-term effect
KW - representation learning
KW - sequential models
KW - surrogates
UR - http://www.scopus.com/inward/record.url?scp=85102987840&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85102987840&partnerID=8YFLogxK
U2 - 10.1145/3437963.3441719
DO - 10.1145/3437963.3441719
M3 - Conference contribution
T3 - WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining
SP - 274
EP - 282
BT - WSDM 2021 - Proceedings of the 14th ACM International Conference on Web Search and Data Mining
PB - Association for Computing Machinery, Inc
T2 - 14th ACM International Conference on Web Search and Data Mining, WSDM 2021
Y2 - 8 March 2021 through 12 March 2021
ER -