Uipath Marketplace
MarketplaceStudioアクティビティString Similarity Calculation

はじめてのオートメーションをわずか数分で作成できます。Studio Web を試す

String Similarity Calculation

String Similarity Calculation

作成者: Patrick Flöß

StarStarStarStarStarStarStarStarStarStar

0

アクティビティ

ダウンロード数

<100

[戻る] ボタン
[戻る] ボタン
carouselImage0
[次へ] ボタン
[次へ] ボタン

要約

要約

Calculate the similarity of strings, e.g. Aa / Ab = 50%.

概要

概要

Calculates Levenshtein distance and puts it relative to the longest string, so you get a value between 0 and 1 that tells you how similar two strings are. You can use this whenever a 100% match is not feasible and a machine learning model is to expensive for the use case. 

I also added a normalization and sorting activity, that can be put upfront the string similarity calculation if you want to ignore any non-word characters and / or if you don't care in what order the strings are filled.

I use this to compare addresses from diffferent systems, as users from different systems often type in addresses kind of similar, but not exactly the same. A value above 0.7 is good enough for me, to validate whether addresses match. Every set of data needs to be analyzed separately. A lot of testing is helpful to find a good value. 

Activities Details

1. Compare two strings: The core feature is the comparison of two strings and the return of a similarity value. 

2. Remove Non-Word Characters: This feature involves eliminating any character from the text that doesn't contribute to its meaning. These may include punctuations, special symbols, numbers (unless they are relevant), and any other non-alphabetic characters. This process helps to simplify the text, making it easier to analyze and process.

3. Convert to Uppercase: This feature translates all the letters in the text to uppercase. It is a commonly used method for normalizing text data, ensuring consistency across different inputs. This function can be particularly useful when case sensitivity is not relevant to the application, ensuring the text can be evaluated uniformly.

4. Sort the String Alphanumeric: This feature involves organizing the content of a string in alphanumeric order. This is a type of sorting where both the numbers (0-9) and the letters (A-Z/a-z) in a string are arranged systematically.

First, all numbers are sorted in ascending order, followed by letters sorted in ascending order (typically, uppercase letters first, then lowercase). For example, the string 'b2a1C3' would become '123Cab'. Sorting the string alphanumeric can be beneficial in a variety of scenarios, such as enhancing data consistency, simplifying search and retrieval processes, and providing a standardized view of the data, especially when dealing with large amounts of text data.

機能

機能

  • Compare two strings
  • Normalize strings

その他の情報

その他の情報

依存関係

UiPath.System.Activities >= 23.4.2

コード言語

Visual Basic

ランタイム

Windows (.NET 5.0 以上)

公開者

Patrick Flöß

公開者ページを表示

ライセンス契約とプライバシー条項

MIT

プライバシー条項

技術情報

バージョン

1.0.3

更新日

2023年6月2日

対応製品

Studio: 23.4.1+

認定

シルバー 認定済み

タグ

サポート

UiPath コミュニティ サポート

類似のコンポーネント