Speaker: Dr Dario Freni, Google
Series: CeNACS seminars
Organising the world's information and making it universally accessible and useful requires technologies that are able to handle petabytes of data quickly and reliably. This talk focuses on three crucial aspects of Google's infrastructure: storage, processing and reliability. We will present popular technologies within Google, giving an overview of their principles and main use cases. We will cover distributed storage solutions including GFS (distributed file system), Bigtable (distributed multi-dimensional sorted map), Spanner and F1 (globally distributed databases). Processing solutions that will be covered include MapReduce, Flume (distributed processing of batch data), and MillWheel (distributed processing of streaming data). Finally, we discuss techniques which are used to ensure 24/7 availability for services which use the above technologies, and in general all Google services.
Share this event
When and where
1.00pm - 3.00pmWednesday 8th October 2014