日本語
 
Help Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細

  Enhancing statistical power in temporal biomarker discovery through representative shapelet mining

Gumbsch, T., Bock, C., Moor, M., Rieck, B., & Borgwardt, K. (2020). Enhancing statistical power in temporal biomarker discovery through representative shapelet mining. Bioinformatics, 36(Supplement_2), i840-i848. doi:10.1093/bioinformatics/btaa815.

Item is

基本情報

表示: 非表示:
アイテムのパーマリンク: https://hdl.handle.net/21.11116/0000-000C-F059-1 版のパーマリンク: https://hdl.handle.net/21.11116/0000-000C-F05A-0
資料種別: 学術論文

ファイル

表示: ファイル

関連URL

表示:
非表示:
説明:
-
OA-Status:
Not specified

作成者

表示:
非表示:
 作成者:
Gumbsch, Thomas, 著者
Bock, Christian, 著者
Moor, Michael, 著者
Rieck, Bastian, 著者
Borgwardt, Karsten1, 著者                 
所属:
1ETH Zürich, ou_persistent22              

内容説明

表示:
非表示:
キーワード: -
 要旨: Motivation Temporal biomarker discovery in longitudinal data is based on detecting reoccurring trajectories, the so-called shapelets. The search for shapelets requires considering all subsequences in the data. While the accompanying issue of multiple testing has been mitigated in previous work, the redundancy and overlap of the detected shapelets results in an a priori unbounded number of highly similar and structurally meaningless shapelets. As a consequence, current temporal biomarker discovery methods are impractical and underpowered. Results We find that the pre- or post-processing of shapelets does not sufficiently increase the power and practical utility. Consequently, we present a novel method for temporal biomarker discovery: Statistically Significant Submodular Subset Shapelet Mining (S5M) that retrieves short subsequences that are (i) occurring in the data, (ii) are statistically significantly associated with the phenotype and (iii) are of manageable quantity while maximizing structural diversity. Structural diversity is achieved by pruning non-representative shapelets via submodular optimization. This increases the statistical power and utility of S5M compared to state-of-the-art approaches on simulated and real-world datasets. For patients admitted to the intensive care unit (ICU) showing signs of severe organ failure, we find temporal patterns in the sequential organ failure assessment score that are associated with in-ICU mortality. Availability and implementation S5M is an option in the python package of S3M: github.com/BorgwardtLab/S3M.

資料詳細

表示:
非表示:
言語:
 日付: 2020-12-302020-12
 出版の状態: 出版
 ページ: -
 出版情報: -
 目次: -
 査読: 査読あり
 識別子(DOI, ISBNなど): DOI: 10.1093/bioinformatics/btaa815
ISSN: 1367-4803, 1460-2059
 学位: -

関連イベント

表示:

訴訟

表示:

Project information

表示:

出版物 1

表示:
非表示:
出版物名: Bioinformatics
種別: 学術雑誌
 著者・編者:
所属:
出版社, 出版地: -
ページ: - 巻号: 36 (Supplement_2) 通巻号: - 開始・終了ページ: i840 - i848 識別子(ISBN, ISSN, DOIなど): -