Counterfactual Explanations in Sequential Decision Making Under Uncertainty

Tsirtsis, Stratis; De, Abir; Gomez Rodriguez, Manuel

Counterfactual Explanations in Sequential Decision Making Under Uncertainty

Tsirtsis, S., De, A., & Gomez Rodriguez, M. (2021). Counterfactual Explanations in Sequential Decision Making Under Uncertainty. Retrieved from https://arxiv.org/abs/2107.02776.

Item is 公開

表示: 全項目非表示: 全項目

基本情報

表示: 非表示:

アイテムのパーマリンク: https://hdl.handle.net/21.11116/0000-0009-19F0-C 版のパーマリンク: https://hdl.handle.net/21.11116/0000-0009-19F1-B

資料種別: 成果報告書

ファイル

表示: ファイル

非表示: ファイル

:

arXiv:2107.02776.pdf (プレプリント), 745KB

表示保存

ファイルのパーマリンク:
https://hdl.handle.net/21.11116/0000-0009-19F2-A

ファイル名:
arXiv:2107.02776.pdf

説明:
File downloaded from arXiv at 2021-09-01 08:55 To appear at the ICML 2021 workshop on Interpretable Machine Learning in Healthcare

OA-Status:

閲覧制限:
公開

MIMEタイプ / チェックサム:
application/pdf / [MD5]

技術的なメタデータ:

表示

著作権日付:
-

著作権情報:
-

CCライセンス:
http://creativecommons.org/licenses/by/4.0/

作成者

表示:

非表示:

作成者:
Tsirtsis, Stratis¹, 著者
De, Abir², 著者
Gomez Rodriguez, Manuel¹, 著者

所属:
1Group M. Gomez Rodriguez, Max Planck Institute for Software Systems, Max Planck Society, ou_2105290
2External Organizations, ou_persistent22

内容説明

表示:

非表示:

キーワード: Computer Science, Learning, cs.LG,Computer Science, Computers and Society, cs.CY,Statistics, Machine Learning, stat.ML

要旨: Methods to find counterfactual explanations have predominantly focused on one
step decision making processes. In this work, we initiate the development of
methods to find counterfactual explanations for decision making processes in
which multiple, dependent actions are taken sequentially over time. We start by
formally characterizing a sequence of actions and states using finite horizon
Markov decision processes and the Gumbel-Max structural causal model. Building
upon this characterization, we formally state the problem of finding
counterfactual explanations for sequential decision making processes. In our
problem formulation, the counterfactual explanation specifies an alternative
sequence of actions differing in at most k actions from the observed sequence
that could have led the observed process realization to a better outcome. Then,
we introduce a polynomial time algorithm based on dynamic programming to build
a counterfactual policy that is guaranteed to always provide the optimal
counterfactual explanation on every possible realization of the counterfactual
environment dynamics. We validate our algorithm using both synthetic and real
data from cognitive behavioral therapy and show that the counterfactual
explanations our algorithm finds can provide valuable insights to enhance
sequential decision making under uncertainty.

資料詳細

表示:

非表示:

言語: eng - English

日付: 作成: 2021-07-06オンライン出版: 2021

出版の状態: オンラインで出版済み

ページ: 18 p.

出版情報: -

目次: -

査読: -

識別子（DOI, ISBNなど）: arXiv: 2107.02776
BibTex参照ID: Tsirtsis_2107.02776
URI: https://arxiv.org/abs/2107.02776

学位: -

訴訟

表示:

Project information

表示: 非表示:

Project name : HumanML

Grant ID : 945719

Funding program : Horizon 2020 (H2020)

Funding organization : European Commission (EC)

出版物

表示:

アイテム詳細

基本情報

ファイル

関連URL

作成者

内容説明

資料詳細

関連イベント

訴訟

Project information

出版物