Text similarity computation.

Parameters:
Type of similarity
Selects the algorithm used to compute similarity:
Damereau LevenStein similarity: Measures how similar two strings are by counting insertions, deletions, substitutions, and transpositions.Jaro Winkler similarity: Emphasizes similarity at the beginning of the strings, useful for matching names.Dice Coefficient similarity: Based on bigram overlap between the two strings.Damereau LevenStein distance: Returns the distance (not similarity), i.e., the number of edits.First text
Column containing the first string to compare.
Second Text
Column containing the second string to compare.
See dedicated page for more information.
See dedicated page for more information
The Text Similarity action calculates a similarity score or distance between two strings using selected string matching algorithms. This is commonly used in data cleaning, deduplication, or approximate string matching tasks.
Damereau LevenStein similarity for general fuzzy matching tasks.Jaro Winkler when comparing short strings or names.Dice Coefficient works well for linguistic similarity based on shared parts.Damereau LevenStein distance.| String_A | String_B |
|---|---|
| kitten | sitting |
| saturday | sunday |
| context | contact |
| night | knight |
| intention | execution |
| apple | appel |
| robot | robot |




See CorrectSpelling action documentation for more information on the topic.
