
I've used this recently to do some fuzzy matching of column names in datasets. I also added it to a small Python one-liner library I've been making for practice. P.S. Don't give me flak, I know this isn't an efficient way to do things.

jaccard = lambda A, B: len(set(A).intersection(set(B))) / len(set(A).union(set(B)))

https://github.com/b-mc2/MiniMath
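
For anyone curious how this might look for the column-name case, here is a rough sketch, not what MiniMath actually does. It assumes names are compared by character bigrams after lowercasing and stripping underscores and spaces; `bigrams` and `best_match` are hypothetical helpers made up for illustration, and the sample column names are invented. It also treats two empty inputs as identical (1.0), since the one-liner above would divide by zero there.

```python
def jaccard(a, b):
    """Jaccard similarity of two iterables: |A & B| / |A | B|."""
    set_a, set_b = set(a), set(b)
    union = set_a | set_b
    if not union:
        # Assumption: two empty inputs count as identical.
        return 1.0
    return len(set_a & set_b) / len(union)


def bigrams(name):
    """Character bigrams of a normalized name (hypothetical helper)."""
    s = name.lower().replace("_", "").replace(" ", "")
    # Fall back to the whole string when it is too short to have bigrams.
    return {s[i:i + 2] for i in range(len(s) - 1)} or {s}


def best_match(name, candidates):
    """Return (candidate, score) with the highest bigram Jaccard score."""
    scored = ((c, jaccard(bigrams(name), bigrams(c))) for c in candidates)
    return max(scored, key=lambda pair: pair[1])


# Invented column names, purely for illustration.
match, score = best_match("Customer Name", ["cust_name", "order_id", "customer_nm"])
print(match, round(score, 3))
```

Bigrams are only one choice here. Passing the raw strings to `jaccard` compares sets of single characters, which is what the one-liner does with string inputs and tends to score unrelated names higher.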


