editdistance

Description

Returns the Levenshtein distance between str1 and str2.

Optional argument max_distance specifies a maximum distance to compute. If the computed distance reaches max_distance, the computation is stopped and max_distance is returned. A negative max_distance causes the function to use 0 as the max_distance and return 0. Empty str<> parameters return 0. A null value in either str1 or str2 or both returns null.

Return type

int64

Domain

This is a scalar function (calculates a single output value for a single input row.)

Categories

Usage

editdistance(str1, str2, [ maxDistance ])

Argument

Type

Optional

Repeatable

Restrictions

str1

string

no

no

none

str2

string

no

no

none

maxDistance

int64

yes

no

none

Examples

filter editdistance(name, last_name, 3) > 2

Filter rows whose name and last_name columns have an edit distance greater than two. Cap the computation at a max_distance of 3, since we don’t care if the actual edit distance is larger.