A common problem faced by data analysts, data scientists, and many developers who need to analyze and compare data, is that texts are often similar, but not quite identical to one another. This can result from the existence of multiple ways to say the same thing, typos and abbreviations, common yet unindicative words (such as “the”) and punctuation, that can all skew the results.
During this talk, I will walk you through several methods to compare inexact texts, using a few different libraries, cover the usages as well as advantages & disadvantages of each method, and tackle some commonly faced issues.
By the end of the talk, you should have a good basis to start comparing texts efficiently and elegantly in your code.
Priority access to all content
Community Discord
Exclusive promotions and giveaways