Description
Exercise 1) Read the article "Beyond Batch Processing: Towards Real-Time and Streaming Big Data", available in the ‘Articles’ section on Blackboard, and provide a half-page summary and analysis.
Exercise 2) Read the article "Real-time stream processing for Big Data", available in the ‘Articles’ section on Blackboard, and provide a half-page summary and analysis.
Exercise 3) Extra credit (5 points)
- (1 point) Get the Spark Streaming demo code to work in your Hortonworks sandbox. Provide screenshots of the output for various inputs.
- (4 points) Modify consume.py to output a count of only those words that begin with the letters a through h, inclusive, using just RDD transformations and actions. Provide screenshots of the output for various inputs.
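
As a starting point for the 4-point item, the filtering step might be sketched as below. This is only an illustration, not the contents of consume.py, which are not shown here. The names `starts_with_a_to_h` and `count_a_to_h` are hypothetical, and the sketch assumes that the input is an RDD of text lines, that words are separated by whitespace, and that matching is case-insensitive.

```python
def starts_with_a_to_h(word):
    """Return True if the word begins with a letter from a to h.

    Case-insensitive matching is an assumption; the exercise does not
    say how uppercase letters should be treated.
    """
    return bool(word) and "a" <= word[0].lower() <= "h"


def count_a_to_h(lines_rdd):
    """Count the words beginning with a-h in an RDD of text lines.

    Hypothetical helper: it uses only RDD transformations
    (flatMap, filter) followed by a single action (count).
    """
    return (
        lines_rdd
        .flatMap(lambda line: line.split())  # split each line into words on whitespace
        .filter(starts_with_a_to_h)          # keep only words starting with a-h
        .count()                             # action: number of remaining words
    )
```

In a Spark Streaming job, a helper like this could be called on each batch from within `foreachRDD`, assuming the stream carries lines of text. How it fits into the demo code depends on how consume.py is structured.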

