Please see our Training page for details about all of our courses.
In the last few months I've given two different talks about scalable fuzzy matching.
The first was a Strata in San Jose, titled Similarity at Scale. In that talk I focused mostly on techniques for doing fuzzy matching (or joins) between large data sets, primarily via Cascading workflows.
More recently I presented more...
Ken will be giving a talk on Thursday, September 11th at this year's Cassandra Summit in San Francisco. His presentation describes how Early Warning (one of Scale Unlimited's clients) uses Cassandra and Solr to handle fuzzy entity matching across hundreds of millions of people and companies.