金融データ・マイニング/動的クラスタリングとクラスター変化検出のバックアップ(No.4)

バックアップ一覧
差分を表示
現在との差分を表示
ソースを表示
金融データ・マイニング/動的クラスタリングとクラスター変化検出へ行く。
- 1 (2023-12-22 (金) 15:31:51)
- 2 (2023-12-22 (金) 16:28:42)
- 3 (2023-12-22 (金) 20:41:58)
- 4 (2023-12-23 (土) 15:52:11)
- 5 (2023-12-23 (土) 19:53:43)
- 6 (2023-12-25 (月) 11:56:35)
- 7 (2023-12-26 (火) 11:03:47)

動的クラスタリングについてのメモ

Gu (2022) による動的クラスタリングの定義 †

原文 †

Given the dataset [math]D[/math] and its clustering results [math]\mathcal{L}_D[/math], let [math]D_\mathit{new}[/math] be a new set of objects with some updates on [math]D[/math] where [math]D \cap D_\mathit{new} \neq \emptyset[/math]. Dynamic clustering computes the clustering results of [math]D_\mathit{new}[/math] based on [math]\mathcal{L}_D[/math]. The dynamic clustering is denoted by [math]F[/math] and its clustering result is denoted as [math]F(D, D_\mathit{new}, \mathcal{L}_D)[/math].

↑

訳 †

データセット [math]D[/math] とそのクラスタリング結果 [math]\mathcal{L}_D[/math] が与えられ、[math]D[/math] を更新してできた新しいデータセットを [math]D_\mathit{new}[/math] とする。このとき、[math]D \cap D_\mathit{new} \neq \emptyset[/math]、つまり、[math]D[/math] と [math]D_\mathit{new}[/math] には共通するデータが存在する。 動的クラスタリングは、[math]\mathcal{L}_D[/math] に基づいて [math]D_\mathit{new}[/math] のクラスタリング結果を計算する。 動的クラスタリングを [math]F[/math]、その結果を[math]F(D, D_\mathit{new}, \mathcal{L}_D)[/math] と表す。

動的クラスタリングの目的は、データセット [math]D[/math] が与えられたときに [math]D[/math] のクラスタリング結果 [math]\mathcal{L}_D[/math] を得るバッチ・アルゴリズム [math]\mathcal{L}_D = B(D)[/math] に対して、[math]B(D_\mathit{new})[/math] より効率的に [math]F(D, D_\mathit{new}, \mathcal{L}_D) \approx \mathcal{L}_{D_\mathit{new}}[/math] を得ること。

↑

Gruenheid (2014) による動的クラスタリングの定義 †

↑

原文 †

Let [math]D[/math] be a set of records and [math]\Delta D[/math] be an increment to [math]D[/math]. Let [math]\mathcal{L}_D[/math] be the clustering of records in [math]D[/math]. Incremental linkage clusters records in [math]D + \Delta D[/math] based on [math]\mathcal{L}_D[/math]. We denote the incremental linkage method by [math]f[/math], and denote the results by [math]f(D, \Delta D, \mathcal{L}_D)[/math].

↑

訳 †

[math]D[/math] をレコードの集合、[math]\Delta D[/math] を [math]D[/math]　の増分、[math]D[/math]　のクラスタリング結果を [math]\mathcal{L}_D[/math] とする。 動的クラスタリング は、[math]\mathcal{L}_D[/math]　に基づいて [math]D + \Delta D[/math] に含まれるレコードをクラスタリングする。動的クラスタリングの手法を [math]f[/math]、その結果を [math]f(D, \Delta D, \mathcal{L}_D)[/math] と表す。

動的クラスタリングの目的は、次の2つ。

[math]D+\Delta D[/math] のクラスタリング [math]F(D + \Delta D)[/math] より動的クラスタリング [math]f(D, \Delta D, \mathcal{L}_D)[/math] を速く実行すること
動的クラスタリングの結果 [math]f(D, \Delta D, \mathcal{L}_D)[/math] と（バッチ）クラスタリングの結果 [math]F(D + \Delta D)[/math] が似ていること、つまり [math]f(D, \Delta D, \mathcal{L}_D \approx F(D + \Delta D)[/math]

↑

論文 †

小西智貴 (2022). 複数の時系列データに対する動的クラスタリング. 中央大学大学院研究年報理工学研究科編 52.
- クラスタ構造が時間によって変化するデータに対するディリクレ過程混合モデルによる動的クラスタリングについて説明している。
Gu B, Kargar S, and Nawab F (2022). Efficient dynamic clustering: Capturing patterns from historical cluster evolution. Open Proceedings 25:351–363. doi: 10.48786/edbt.2022.21
- 動的クラスタリングの定義
Gruenheid A, Dong XL, and Srivastava D (2014). Incremental record linkage. VLDB Endowment 7:697–708 (PDF)
- 動的クラスタリングの定義

↑

手法 †

CulStream
- Aggarwal CC, Philip SY, Han J, and Wang J (2003). A framework for clustering evolving data streams. VLDB 2003. (PDF)
DenStream
- Cao F, Estert M, Qian W, and Zhou A (2006). Density-based clustering over an evolving data stream with noise. 2006 SIAM International Conference on Data Mining:328–339. doi: 10.1137/1.97816119727(PDF)
DynStream
- Kim IS, Liao S, and Imai K (2020). Measuring trade profile with granular product-level data. American Journal of Political Science 64(1):102–117. doi: 10.1111/ajps.12473

金融データ・マイニング/動的クラスタリングとクラスター変化検出 のバックアップ(No.4)

Gu (2022) による動的クラスタリングの定義 †

原文 †

訳 †

Gruenheid (2014) による動的クラスタリングの定義 †

原文 †

訳 †

論文 †

手法 †

金融データ・マイニング/動的クラスタリングとクラスター変化検出のバックアップ(No.4)