はじめてのオートメーションをわずか数分で作成できます。Studio Web を試す →
作成者: Internal Labs
0
スニペット
<100
バージョン | リリース日 | ダウンロード リンク |
---|---|---|
1.0.0 | 2025年1月27日 | |
The Jaccard String Matching Algorithm has been integrated into our RPA process to enhance string comparison accuracy in cases requiring similarity checks. This algorithm helps evaluate the similarity between two strings(Words) by comparing the intersection of unique token sets, particularly useful for text-based data processing, name matching, and record linkage. Test set: Since the Jaccard Index calculates the similarity between two sets, create sets with known intersections: Test Set Examples: High Overlap Set A: {"apple", "banana", "cherry", "date"} Set B: {"apple", "banana", "cherry", "fig"} Expected Jaccard Index: 0.6 (3 shared out of 5 unique elements) Partial Overlap Set C: {"dog", "cat", "rabbit", "horse"} Set D: {"cat", "rabbit", "hamster", "turtle"} Expected Jaccard Index: 0.4 (2 shared out of 5 unique elements) No Overlap Set E: {"car", "bus", "train"} Set F: {"plane", "boat", "bicycle"} Expected Jaccard Index: 0.0 (no shared elements) Identical Sets Set G: {"sun", "moon", "stars"} Set H: {"sun", "moon", "stars"} Expected Jaccard Index: 1.0 (3 shared out of 3 unique elements) Large Set with Small Overlap Set I: { "apple", "banana", "cherry", "date", "fig", "grape", "honeydew" } Set J: { "fig", "grape", "kiwi", "lemon" } Expected Jaccard Index: ~0.18 (2 shared out of 11 unique elements) Small Set in Large Set Set K: { "red", "green", "blue", "yellow", "purple" } Set L: { "red", "green" } Expected Jaccard Index: ~0.4 (2 shared out of 5 unique elements) | ||