General files or links of interest
File(s) Description Last modified C introduction Local UW C Tutorial Summer, 2013 MMDS site Mining of Massive Datasets web site Constantly Online textbook or with links Big Data online textbook March, 2014 Web archive Way back web searching Constantly survey paper Wooyoung Kim, Parallel Clustering Algorithms: Survey 2009 hash-table.zip A fully functional hash table system with an example of its usage 2011 J48.java Decision tree algorithm and code 1999?
The lecture links below will be filled in during the semester. The order of the talks is not what is in the table.
Topic Link Description DBDDAS dbddas+examples Lectures about Big Data, dynamic data apps, and real examples concepts sentences sentences Distance k computing Basics hashtables Hash tables and hash functions Big Data and mapreduce MapReduce and friends data mining pig-oink Apache Pig and Sandia Oink concepts sentence Big data sentence problem description find Finding similar items mining-data-streams Data mining of streams marketbasket Market baskets anomalies Case study on anomalies clustering Clustering algorithms machine-learning Machine learning algorithms dim-reduction Dimensionality reduction