Fuzzy string searching is a term that describes a technique of finding strings that approximately match a pattern. This is widely used when comparing names or other data that might not be otherwise matchable. These activities will be of great use for searching double entries in a CRM or matching suppliers' names in an OCR case.
With the added ability to compare images, this makes a complete package for all your needs for comparing and searching in almost all project situations.
All dependencies are bundled within the package!
This activity provides the following functionalities:
GetJaroWinklerDistance: String metric measuring an edit distance between two sequences which means how many characters would need to be edited to match two strings.
GetLevenshteinDistance: The Levenshtein distance between two words is the minimum number of single-character edits (insertions, deletions, or substitutions) required to change one word into the other.
GetNGramDistance: Represents the sum of absolute differences of occurrences of n-gram vectors between two strings. This means you can use different sizes (1 - n) to compare strings which can be used for also larger text comparisons for example.
LuceneSearchString: Uses an array of strings to search for Keywords guided by the Lucene query language which can use fuzzy, wildcard, regular expression, and RegEx matching.
Image Comparison: This activity will compare two images and will create an image that will show the differences which can be used for example to compare screenshots for activities vs. error screenshots