UiPath Marketplace

Jaccard - String Matching Algorithm

Author: Internal Labs

Type: Snippet

Downloads: <100

Version 1.0.0, released January 27, 2025

The Jaccard string matching algorithm has been integrated into our RPA process to improve string comparison accuracy where similarity checks are required. It scores the similarity of two strings (words) by comparing their sets of unique tokens: the number of tokens the two sets share divided by the number of distinct tokens across both. This is particularly useful for text-based data processing, name matching, and record linkage.
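As a rough illustration of the idea, the Python sketch below scores two strings by their token sets. The snippet's own tokenization is not documented on this page, so the lowercase, whitespace-split tokens, the function names `tokenize` and `jaccard_similarity`, and the choice to score two empty strings as 1.0 are all assumptions made for this example.

```python
def tokenize(text: str) -> set:
    """Split text into a set of unique tokens.

    Assumption: tokens are lowercase and whitespace-delimited.
    """
    return set(text.lower().split())


def jaccard_similarity(a: str, b: str) -> float:
    """Return |A ∩ B| / |A ∪ B| for the token sets of two strings."""
    tokens_a, tokens_b = tokenize(a), tokenize(b)
    union = tokens_a | tokens_b
    if not union:
        # Both strings have no tokens: treated as identical (an assumption).
        return 1.0
    return len(tokens_a & tokens_b) / len(union)


# "john" and "smith" are shared; "a" appears only in the first string,
# so the score is 2 shared tokens out of 3 unique tokens.
print(jaccard_similarity("John A Smith", "john smith"))
```

Because the score depends only on which tokens are present, word order and repeated words do not affect it; a different tokenization (for example, character n-grams) would give different scores for the same strings.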

Test sets:

Because the Jaccard index measures the similarity between two sets, test it with pairs of sets whose intersection and union are known in advance:

High Overlap

Set A: {"apple", "banana", "cherry", "date"}

Set B: {"apple", "banana", "cherry", "fig"}

Expected Jaccard Index: 0.6 (3 shared out of 5 unique elements)

Partial Overlap

Set C: {"dog", "cat", "rabbit", "horse"}

Set D: {"cat", "rabbit", "hamster", "turtle"}

Expected Jaccard Index: ~0.33 (2 shared out of 6 unique elements)

No Overlap

Set E: {"car", "bus", "train"}

Set F: {"plane", "boat", "bicycle"}

Expected Jaccard Index: 0.0 (no shared elements)

Identical Sets

Set G: {"sun", "moon", "stars"}

Set H: {"sun", "moon", "stars"}

Expected Jaccard Index: 1.0 (3 shared out of 3 unique elements)

Large Set with Small Overlap

Set I: { "apple", "banana", "cherry", "date", "fig", "grape", "honeydew" }

Set J: { "fig", "grape", "kiwi", "lemon" }

Expected Jaccard Index: ~0.22 (2 shared out of 9 unique elements)

Small Set in Large Set

Set K: { "red", "green", "blue", "yellow", "purple" }

Set L: { "red", "green" }

Expected Jaccard Index: 0.4 (2 shared out of 5 unique elements)
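The test sets can also be checked mechanically rather than counted by hand. The Python sketch below recomputes the shared count, the unique count, and the resulting index for each pair; the helper name `jaccard_index` and the `TEST_SETS` table are illustrative and not part of the published snippet.

```python
def jaccard_index(a: set, b: set) -> float:
    """Return |A ∩ B| / |A ∪ B| for two sets."""
    union = a | b
    if not union:
        # Two empty sets: treated as identical (an assumption).
        return 1.0
    return len(a & b) / len(union)


# Each entry maps a case name to the pair of sets listed for that case.
TEST_SETS = {
    "High Overlap": (
        {"apple", "banana", "cherry", "date"},
        {"apple", "banana", "cherry", "fig"},
    ),
    "Partial Overlap": (
        {"dog", "cat", "rabbit", "horse"},
        {"cat", "rabbit", "hamster", "turtle"},
    ),
    "No Overlap": (
        {"car", "bus", "train"},
        {"plane", "boat", "bicycle"},
    ),
    "Identical Sets": (
        {"sun", "moon", "stars"},
        {"sun", "moon", "stars"},
    ),
    "Large Set with Small Overlap": (
        {"apple", "banana", "cherry", "date", "fig", "grape", "honeydew"},
        {"fig", "grape", "kiwi", "lemon"},
    ),
    "Small Set in Large Set": (
        {"red", "green", "blue", "yellow", "purple"},
        {"red", "green"},
    ),
}

for name, (a, b) in TEST_SETS.items():
    shared, unique = len(a & b), len(a | b)
    print(f"{name}: {shared} shared / {unique} unique = {jaccard_index(a, b):.2f}")
```

The unique count is the size of the union, which equals the two set sizes added together minus the shared count; forgetting to subtract the shared elements is the usual source of an expected value that is too low.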

Publisher: Internal Labs

Technical information

Version: 1.0.0

Updated: January 27, 2025

Compatible products: Studio 22.10.12 - 24.10.5

Certification: Silver Certified

Support: UiPath Community Support