Generalization Bounds and Representation Learning for Estimation of Potential Outcomes and Causal Effects

Johansson, Fredrik D.; Shalit, Uri; Kallus, Nathan; Sontag, David

arXiv:2001.07426 (cs)

[Submitted on 21 Jan 2020 (v1), last revised 31 Jul 2023 (this version, v4)]

Abstract: Practitioners in diverse fields such as healthcare, economics and education are eager to apply machine learning to improve decision making. The cost and impracticality of performing experiments and a recent monumental increase in electronic record keeping have brought attention to the problem of evaluating decisions based on non-experimental observational data. This is the setting of this work. In particular, we study estimation of individual-level causal effects, such as a single patient's response to alternative medication, from recorded contexts, decisions and outcomes. We give generalization bounds on the error in estimated effects based on distance measures between groups receiving different treatments, allowing for sample re-weighting. We provide conditions under which our bound is tight and show how it relates to results for unsupervised domain adaptation. Led by our theoretical results, we devise representation learning algorithms that minimize our bound by regularizing the representation's induced treatment group distance, and that encourage sharing of information between treatment groups. We extend these algorithms to simultaneously learn a weighted representation to further reduce treatment group distances. Finally, an experimental evaluation on real and synthetic data shows the value of our proposed representation architecture and regularization scheme.
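The idea of regularizing a treatment group distance under sample re-weighting can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `alpha` trade-off parameter, and the choice of distance (squared distance between weighted group means, i.e. a squared MMD with a linear kernel) are assumptions made here for illustration, and the paper's actual bound and objective may differ.

```python
import numpy as np

def weighted_mean_distance(phi, t, w=None):
    """Squared Euclidean distance between the weighted mean representations
    of the treated (t == 1) and control (t == 0) groups.

    This equals the squared MMD under a linear kernel; it is an assumed,
    illustrative choice of treatment group distance.
    """
    phi = np.asarray(phi, dtype=float)
    t = np.asarray(t).astype(bool)
    w = np.ones(len(t)) if w is None else np.asarray(w, dtype=float)
    mean_treated = np.average(phi[t], axis=0, weights=w[t])
    mean_control = np.average(phi[~t], axis=0, weights=w[~t])
    diff = mean_treated - mean_control
    return float(diff @ diff)

def regularized_objective(y, y_hat, phi, t, w=None, alpha=1.0):
    """Weighted squared error on observed outcomes plus alpha times the
    group distance (hypothetical objective, for illustration only)."""
    y = np.asarray(y, dtype=float)
    y_hat = np.asarray(y_hat, dtype=float)
    w = np.ones(len(y)) if w is None else np.asarray(w, dtype=float)
    factual_loss = float(np.average((y - y_hat) ** 2, weights=w))
    return factual_loss + alpha * weighted_mean_distance(phi, t, w)
```

In this sketch, `phi` holds one learned representation per row, `t` is the binary treatment indicator, and `w` holds optional per-sample weights. Choosing weights that bring the two group means together drives the penalty toward zero, which is the intuition behind learning a weighted representation.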

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2001.07426 [cs.LG]
	(or arXiv:2001.07426v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2001.07426

Submission history

From: Fredrik D. Johansson [view email]
[v1] Tue, 21 Jan 2020 10:16:33 UTC (1,239 KB)
[v2] Wed, 17 Mar 2021 09:21:02 UTC (1,233 KB)
[v3] Mon, 14 Feb 2022 12:30:55 UTC (1,476 KB)
[v4] Mon, 31 Jul 2023 08:36:45 UTC (1,476 KB)
