CS 374: Database Systems
James Madison University, Spring 2022

Apr 26: Big Data: Hadoop, Hive, Spark

Lesson Outline

40 min Ch19 Slides
* Apache Hadoop: MapReduce Tutorial
* Free book: Mining of Massive Datasets
35 min Show and Tell
* Each group has 4 minutes
* Share 1 tip from your code

Before Tuesday

  1. Textbook: Skim PDBM Chapter 19 (35 pages).

  2. Exam: Submit both portions by Sunday 11:59pm.
    * I will announce when the exams files are ready.

  3. Project: Tie up loose ends, prepare to present.