Flink-based Web Crawler Talk at Flink Forward 2018

February 19, 2018

On April 10th, at 11am, I’ll be presenting a talk at this year’s Flink Forward conference in San Francisco. What’s it about? My talk tries to answer the question “Is it possible to build an efficient, focused web crawler using Apache Flink?” It’s actually a bit deeper than that – the challenge I set was whether this could be done using ONLY Flink, without adding any additional infrastructure. Which took more…

ApacheCon Big Data 2016

May 28, 2016

Earlier this month I flew to Vancouver, a wonderful city I’d never had the chance to visit. My excuse was that I was giving a talk at this year’s ApacheCon Big Data conference, which took place in Vancouver from May 9th to 12th. Part of the fun of attending a conference like this is the chance to meet people I’d only interacted with via email. For example, Nick Burch is more…