7CCSMBDT – Big Data Technologies
Week 4
Grigorios Loukides, PhD (grigorios.loukides@kcl.ac.uk)
Spring 2017/2018

Objectives
Today:
    MapReduce patterns
        Numerical summarization (count, max)
        Filtering
        Distinct
        Binning (partitioning records into bins)
        Sorting
    Read: Chapter 3.2 from Bagha
        https://github.com/mattwg/mrjob-examples
    MapReduce (join, cost measurement)
    NoSQL databases (intro)

MapReduce with Python […]
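The five patterns named in the objectives can be sketched in plain Python. This is a minimal in-memory simulation and not the mrjob code the slides point to. The `run_mapreduce` helper, the mapper and reducer names, and the `readings` data are all hypothetical and exist only for illustration. Each pattern is expressed as a mapper that emits (key, value) pairs and a reducer that receives one key with all of its values.

```python
from collections import defaultdict


def run_mapreduce(records, mapper, reducer):
    """Simulate map -> shuffle -> reduce in memory (no cluster, no mrjob)."""
    groups = defaultdict(list)
    for record in records:
        for key, value in mapper(record):
            groups[key].append(value)        # shuffle: group values by key
    output = []
    for key in sorted(groups):               # keys reach the reducer in sorted order
        output.extend(reducer(key, groups[key]))
    return output


# Hypothetical input: (station, temperature) readings.
readings = [("A", 12), ("B", 7), ("A", 15), ("C", 9), ("B", 7)]


# Numerical summarization: count and max per station.
def summarize_mapper(record):
    station, temp = record
    yield station, temp


def summarize_reducer(station, temps):
    yield station, (len(temps), max(temps))


# Filtering: the mapper drops records that fail the predicate.
def filter_mapper(record):
    if record[1] >= 9:
        yield record, None


def emit_each_reducer(key, values):
    for _ in values:                         # one output per surviving record
        yield key


# Distinct: the record is the key, so duplicates collapse in the shuffle.
def distinct_mapper(record):
    yield record, None


def distinct_reducer(key, values):
    yield key                                # emit each distinct record once


# Binning: the key is the bin label, the value is the record.
def binning_mapper(record):
    yield ("low" if record[1] < 10 else "high"), record


def binning_reducer(bin_label, records):
    yield bin_label, records


# Sorting: emit the sort field as the key and rely on the sorted shuffle.
def sort_mapper(record):
    station, temp = record
    yield temp, station


def sort_reducer(temp, stations):
    for station in stations:
        yield temp, station


summary = run_mapreduce(readings, summarize_mapper, summarize_reducer)
filtered = run_mapreduce(readings, filter_mapper, emit_each_reducer)
distinct = run_mapreduce(readings, distinct_mapper, distinct_reducer)
binned = run_mapreduce(readings, binning_mapper, binning_reducer)
by_temp = run_mapreduce(readings, sort_mapper, sort_reducer)
```

In a real framework, filtering and binning are often map-only jobs; the reducers here exist only so that every pattern runs through the same simulated pipeline.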