@jfseb
Contributor

@jfseb jfseb commented Jan 30, 2017

Bug: distinct strings with common prefixes get a weight > 1.0
jaroWinkler('commonlongprefixword', 'commonlongprefixworm')

The former code easily gives weights > 1.0 for distinct strings.

@Yomguithereal
Owner

Hello @jfseb. Thanks for noticing this one. Reading my code again, I also found some other inconsistencies, notably that for Jaro-Winkler the length of the common prefix should normally not exceed 4. Give me some time to read up on the metric more thoroughly so I can fix it correctly.
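
For reference, here is a minimal sketch of why the prefix cap matters. It is not the library's actual code: the names `jaro` and `jaroWinklerSketch`, and the `scaling` and `maxPrefix` parameters, are hypothetical and chosen only for illustration. It assumes the usual formulation `jw = j + l * p * (1 - j)`, where `l` is the common prefix length and `p` the scaling factor. With `p = 0.1`, any `l` above 10 makes `l * p` exceed 1, which pushes the result above 1.0 for distinct strings.

```javascript
// Plain Jaro similarity, returning a value in [0, 1].
function jaro(a, b) {
  if (a === b) return 1;
  if (a.length === 0 || b.length === 0) return 0;

  // Characters match only if they are equal and no farther apart than `range`.
  const range = Math.max(0, Math.floor(Math.max(a.length, b.length) / 2) - 1);
  const matchedA = new Array(a.length).fill(false);
  const matchedB = new Array(b.length).fill(false);
  let matches = 0;

  for (let i = 0; i < a.length; i++) {
    const lo = Math.max(0, i - range);
    const hi = Math.min(b.length - 1, i + range);
    for (let j = lo; j <= hi; j++) {
      if (!matchedB[j] && a[i] === b[j]) {
        matchedA[i] = true;
        matchedB[j] = true;
        matches++;
        break;
      }
    }
  }
  if (matches === 0) return 0;

  // Count matched characters that appear in a different order (half-transpositions).
  let k = 0;
  let half = 0;
  for (let i = 0; i < a.length; i++) {
    if (!matchedA[i]) continue;
    while (!matchedB[k]) k++;
    if (a[i] !== b[k]) half++;
    k++;
  }
  const t = half / 2;

  return (matches / a.length + matches / b.length + (matches - t) / matches) / 3;
}

// Jaro-Winkler with the common prefix capped at `maxPrefix` (assumed default: 4).
function jaroWinklerSketch(a, b, scaling = 0.1, maxPrefix = 4) {
  const j = jaro(a, b);
  const limit = Math.min(maxPrefix, a.length, b.length);
  let prefix = 0;
  while (prefix < limit && a[prefix] === b[prefix]) prefix++;
  return j + prefix * scaling * (1 - j);
}

const capped = jaroWinklerSketch('commonlongprefixword', 'commonlongprefixworm');
// Passing Infinity disables the cap and reproduces the kind of overflow reported here.
const uncapped = jaroWinklerSketch('commonlongprefixword', 'commonlongprefixworm', 0.1, Infinity);
console.log(capped < 1, uncapped > 1);
```

With the cap in place, the result stays within [0, 1] for any scaling factor up to 0.25, since `l * p` can then never exceed 1.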

@Yomguithereal
Owner

@jfseb I'm gonna go ahead and merge your PR and fix the code even more.

@Yomguithereal Yomguithereal merged commit e9ea764 into Yomguithereal:master Jan 31, 2017