TY - GEN
T1 - Causal learning in question quality improvement
AU - Li, Yichuan
AU - Guo, Ruocheng
AU - Wang, Weiying
AU - Liu, Huan
N1 - Funding Information: This material is based upon work supported by ARO/ARL and the National Science Foundation (NSF) Grant #1610282, NSF #1909555. Publisher Copyright: © Springer Nature Switzerland AG 2020.
PY - 2020
Y1 - 2020
N2 - To improve the quality of questions asked in Community-based questions answering forums, we create a new dataset from the website, Stack Overflow, which contains three components: (1) context: the text features of questions, (2) treatment: categories of revision suggestions and (3) outcome: the measure of question quality (e.g., the number of questions, upvotes or clicks). This dataset helps researchers develop causal inference models towards solving two problems: (i) estimating the causal effects of aforementioned treatments on the outcome and (ii) finding the optimal treatment for the questions. Empirically, we performed experiments with three state-of-the-art causal effect estimation methods on the contributed dataset. In particular, we evaluated the optimal treatments recommended by the these approaches by comparing them with the ground truth labels – treatments (suggestions) provided by experts.
AB - To improve the quality of questions asked in Community-based questions answering forums, we create a new dataset from the website, Stack Overflow, which contains three components: (1) context: the text features of questions, (2) treatment: categories of revision suggestions and (3) outcome: the measure of question quality (e.g., the number of questions, upvotes or clicks). This dataset helps researchers develop causal inference models towards solving two problems: (i) estimating the causal effects of aforementioned treatments on the outcome and (ii) finding the optimal treatment for the questions. Empirically, we performed experiments with three state-of-the-art causal effect estimation methods on the contributed dataset. In particular, we evaluated the optimal treatments recommended by the these approaches by comparing them with the ground truth labels – treatments (suggestions) provided by experts.
UR - http://www.scopus.com/inward/record.url?scp=85087036166&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85087036166&partnerID=8YFLogxK
U2 - 10.1007/978-3-030-49556-5_20
DO - 10.1007/978-3-030-49556-5_20
M3 - Conference contribution
SN - 9783030495558
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 204
EP - 214
BT - Benchmarking, Measuring, and Optimizing - 2nd BenchCouncil International Symposium, Bench 2019, Revised Selected Papers
A2 - Gao, Wanling
A2 - Zhan, Jianfeng
A2 - Fox, Geoffrey
A2 - Lu, Xiaoyi
A2 - Stanzione, Dan
PB - Springer
T2 - 2nd International Symposium on Benchmarking, Measuring, and Optimization, Bench 2019
Y2 - 14 November 2019 through 16 November 2019
ER -