日本語
 
Help Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細

登録内容を編集ファイル形式で保存
 
 
ダウンロード電子メール
  Faster Approximate Pattern Matching: A Unified Approach

Charalampopoulos, P., Kociumaka, T., & Wellnitz, P. (2020). Faster Approximate Pattern Matching: A Unified Approach. Retrieved from https://arxiv.org/abs/2004.08350.

Item is

基本情報

表示: 非表示:
アイテムのパーマリンク: https://hdl.handle.net/21.11116/0000-0007-8C63-C 版のパーマリンク: https://hdl.handle.net/21.11116/0000-000E-2250-1
資料種別: 成果報告書
LaTeX : Faster Approximate Pattern Matching: {A} Unified Approach

ファイル

表示: ファイル
非表示: ファイル
:
arXiv:2004.08350.pdf (プレプリント), 2MB
ファイルのパーマリンク:
https://hdl.handle.net/21.11116/0000-0007-8C65-A
ファイル名:
arXiv:2004.08350.pdf
説明:
File downloaded from arXiv at 2020-12-10 08:25
OA-Status:
閲覧制限:
公開
MIMEタイプ / チェックサム:
application/pdf / [MD5]
技術的なメタデータ:
著作権日付:
-
著作権情報:
-

関連URL

表示:

作成者

表示:
非表示:
 作成者:
Charalampopoulos, Panagiotis1, 著者
Kociumaka, Tomasz1, 著者
Wellnitz, Philip2, 著者                 
所属:
1External Organizations, ou_persistent22              
2Algorithms and Complexity, MPI for Informatics, Max Planck Society, ou_24019              

内容説明

表示:
非表示:
キーワード: Computer Science, Data Structures and Algorithms, cs.DS
 要旨: Approximate pattern matching is a natural and well-studied problem on
strings: Given a text $T$, a pattern $P$, and a threshold $k$, find (the
starting positions of) all substrings of $T$ that are at distance at most $k$
from $P$. We consider the two most fundamental string metrics: the Hamming
distance and the edit distance. Under the Hamming distance, we search for
substrings of $T$ that have at most $k$ mismatches with $P$, while under the
edit distance, we search for substrings of $T$ that can be transformed to $P$
with at most $k$ edits.
Exact occurrences of $P$ in $T$ have a very simple structure: If we assume
for simplicity that $|T| \le 3|P|/2$ and trim $T$ so that $P$ occurs both as a
prefix and as a suffix of $T$, then both $P$ and $T$ are periodic with a common
period. However, an analogous characterization for the structure of occurrences
with up to $k$ mismatches was proved only recently by Bringmann et al.
[SODA'19]: Either there are $O(k^2)$ $k$-mismatch occurrences of $P$ in $T$, or
both $P$ and $T$ are at Hamming distance $O(k)$ from strings with a common
period $O(m/k)$. We tighten this characterization by showing that there are
$O(k)$ $k$-mismatch occurrences in the case when the pattern is not
(approximately) periodic, and we lift it to the edit distance setting, where we
tightly bound the number of $k$-edit occurrences by $O(k^2)$ in the
non-periodic case. Our proofs are constructive and let us obtain a unified
framework for approximate pattern matching for both considered distances. We
showcase the generality of our framework with results for the fully-compressed
setting (where $T$ and $P$ are given as a straight-line program) and for the
dynamic setting (where we extend a data structure of Gawrychowski et al.
[SODA'18]).

資料詳細

表示:
非表示:
言語: eng - English
 日付: 2020-04-172020-11-162020
 出版の状態: オンラインで出版済み
 ページ: 74 p.
 出版情報: -
 目次: -
 査読: -
 識別子(DOI, ISBNなど): arXiv: 2004.08350
BibTex参照ID: Charalampopoulos_arXiv2004.08350
URI: https://arxiv.org/abs/2004.08350
 学位: -

関連イベント

表示:

訴訟

表示:

Project information

表示:

出版物

表示: