Computing at Scale: Meet Hadoop

Thursday 10 April 2014, 6.00pm - 8.00pm

The Davidson Building, 5 Southampton Street, London WC2E 7HA. The nearest underground stations are Covent Garden and Charing Cross.

Edgar Meij


Hadoop is a massively scalable, parallel storage and computation platform. It is used by the largest web companies in the world and is seeing increasing adoption in other, more traditional enterprise and research environments. This talk gives a broad introduction to Hadoop. I will provide some background on the history of Hadoop and the key ideas, intuitions, and programming paradigms behind it, and will detail how Hadoop works in practice using real-life use cases and code examples. I will then zoom in on the wider Hadoop ecosystem and conclude with current developments and an outlook.


Dr. Edgar Meij is a research scientist at Yahoo Labs in Barcelona. Before this, he was a postdoc at the University of Amsterdam, where he obtained his PhD in Computer Science. There, he also collaborated with SARA to set up the first Dutch national Hadoop cluster for academic and non-profit use. His current research mainly focuses on entity linking and semantic search at web scale, employing large-scale computational platforms such as Hadoop on a daily basis. He regularly teaches at the post-graduate and graduate level, including university courses and conference tutorials. He is a co-organizer of various entity-related NLP and IR workshops, including Reputation 2012 and RepLab.


Free. To gain admission, please email our Membership Secretary, Algirdas Pakstas, in advance of the meeting, including the title of the event and your name in the subject line of your email. Attendance lists will normally be finalised on the Monday preceding each meeting, but late admission may be accepted by signing in to the Davidson Building as a visitor.


View the slides for this event (PDF)

YouTube video of the event