Fill out the form below if you’d like to learn how Scale Unlimited can solve your big data processing/training and web crawling problems.
Note that fields marked with an ‘*‘ are required.
In the last few months I've given two different talks about scalable fuzzy matching.
The first was a Strata in San Jose, titled Similarity at Scale. In that talk I focused mostly on techniques for doing fuzzy matching (or joins) between large data sets, primarily via Cascading workflows.
More recently I presented more...
Ken will be giving a talk on Thursday, September 11th at this year's Cassandra Summit in San Francisco. His presentation describes how Early Warning (one of Scale Unlimited's clients) uses Cassandra and Solr to handle fuzzy entity matching across hundreds of millions of people and companies.