CIND719 Spark Programming Assignment

$30.00

Download Details:

  • Name: CIND-719-Assignment-3-Spark-abdmbk.zip
  • Type: zip
  • Size: 88.94 KB

Category:

Description

5/5 - (1 vote)

1. ODD/EVEN NUMBER (30 pts)
(Hint: Note that you are reading the file as text and need to convert the numbers to int())

Input: number_list.txt (a list of 1000 integers)
Output: Count the number of odd numbers and even numbers in the file

2. Top 10 and bottom 10 words (30 pts)
(Hint: Search and try takeOrdered() method)

Input: shakespeare.txt
Output: 10 words with the highest count and 10 words with lowest count

3. Group and Count (40 pts)

Input: fulltext_txt
Output: Count the number of tweets for each user_id and save the results in a text file.

 

SUBMIT YOUR SCRIPT AND THE OUTPUT OF YOUR SCRIPT.