Counterfactual Explanations in Sequential Decision Making Under Uncertainty

Tsirtsis, Stratis; De, Abir; Gomez Rodriguez, Manuel

Local TagsRelease HistoryDetailsSummary

Counterfactual Explanations in Sequential Decision Making Under Uncertainty

Tsirtsis, S., De, A., & Gomez Rodriguez, M. (2021). Counterfactual Explanations in Sequential Decision Making Under Uncertainty. Retrieved from https://arxiv.org/abs/2107.02776.

Item is Released

show all hide all

Basic

show hide

Item Permalink: https://hdl.handle.net/21.11116/0000-0009-19F0-C Version Permalink: https://hdl.handle.net/21.11116/0000-0009-19F1-B

Genre: Paper

Files

show Files

hide Files

:

arXiv:2107.02776.pdf (Preprint), 745KB

View Save

File Permalink:
https://hdl.handle.net/21.11116/0000-0009-19F2-A

Name:
arXiv:2107.02776.pdf

Description:
File downloaded from arXiv at 2021-09-01 08:55 To appear at the ICML 2021 workshop on Interpretable Machine Learning in Healthcare

OA-Status:

Visibility:
Public

MIME-Type / Checksum:
application/pdf / [MD5]

Technical Metadata:

View

Copyright Date:
-

Copyright Info:
-

License:
http://creativecommons.org/licenses/by/4.0/

Locators

show

Creators

show

hide

Creators:
Tsirtsis, Stratis¹, Author
De, Abir², Author
Gomez Rodriguez, Manuel¹, Author

Affiliations:
1Group M. Gomez Rodriguez, Max Planck Institute for Software Systems, Max Planck Society, ou_2105290
2External Organizations, ou_persistent22

Content

show

hide

Free keywords: Computer Science, Learning, cs.LG,Computer Science, Computers and Society, cs.CY,Statistics, Machine Learning, stat.ML

Abstract: Methods to find counterfactual explanations have predominantly focused on one
step decision making processes. In this work, we initiate the development of
methods to find counterfactual explanations for decision making processes in
which multiple, dependent actions are taken sequentially over time. We start by
formally characterizing a sequence of actions and states using finite horizon
Markov decision processes and the Gumbel-Max structural causal model. Building
upon this characterization, we formally state the problem of finding
counterfactual explanations for sequential decision making processes. In our
problem formulation, the counterfactual explanation specifies an alternative
sequence of actions differing in at most k actions from the observed sequence
that could have led the observed process realization to a better outcome. Then,
we introduce a polynomial time algorithm based on dynamic programming to build
a counterfactual policy that is guaranteed to always provide the optimal
counterfactual explanation on every possible realization of the counterfactual
environment dynamics. We validate our algorithm using both synthetic and real
data from cognitive behavioral therapy and show that the counterfactual
explanations our algorithm finds can provide valuable insights to enhance
sequential decision making under uncertainty.

Details

show

hide

Language(s): eng - English

Dates: Created: 2021-07-06Published Online: 2021

Publication Status: Published online

Pages: 18 p.

Publishing info: -

Table of Contents: -

Rev. Type: -

Identifiers: arXiv: 2107.02776
BibTex Citekey: Tsirtsis_2107.02776
URI: https://arxiv.org/abs/2107.02776

Degree: -

Event

show

Legal Case

show

Project information

show hide

Project name : HumanML

Grant ID : 945719

Funding program : Horizon 2020 (H2020)

Funding organization : European Commission (EC)

Source

show