Use a crawler to get at least 20 webpages from a website.
Count the occurrences of words in those webpages on Hadoop.
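As a rough starting point for the crawling step, the sketch below does a breadth-first crawl that stays on one host and stops after a fixed number of pages. It assumes Java 11+ (for `java.net.http.HttpClient`) and uses a naive regular expression to find `href` attributes; a real HTML parser would be more robust. The class name `SimpleCrawler`, the method names, and the start URL passed on the command line are illustrative assumptions, not part of the assignment.

```java
import java.io.IOException;
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;
import java.util.ArrayDeque;
import java.util.ArrayList;
import java.util.Deque;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class SimpleCrawler {
    // Naive href matcher (assumption: quoted attribute values, fragments ignored).
    private static final Pattern HREF =
        Pattern.compile("href\\s*=\\s*[\"']([^\"'#]+)", Pattern.CASE_INSENSITIVE);

    /** Returns the absolute http(s) links in html that share the host of base, in document order. */
    static List<URI> extractLinks(String html, URI base) {
        List<URI> links = new ArrayList<>();
        Matcher m = HREF.matcher(html);
        while (m.find()) {
            try {
                URI link = base.resolve(m.group(1).trim());
                String scheme = link.getScheme();
                boolean web = "http".equals(scheme) || "https".equals(scheme);
                if (web && base.getHost() != null
                        && base.getHost().equalsIgnoreCase(link.getHost())) {
                    links.add(link);
                }
            } catch (IllegalArgumentException e) {
                // Skip hrefs that are not valid URIs.
            }
        }
        return links;
    }

    /** Breadth-first crawl from start; returns at most limit successfully fetched URLs. */
    static List<URI> crawl(URI start, int limit) throws InterruptedException {
        HttpClient client = HttpClient.newBuilder()
            .followRedirects(HttpClient.Redirect.NORMAL)
            .build();
        Set<URI> seen = new HashSet<>();      // every URL ever queued
        Deque<URI> queue = new ArrayDeque<>();
        List<URI> fetched = new ArrayList<>();
        seen.add(start);
        queue.add(start);
        while (!queue.isEmpty() && fetched.size() < limit) {
            URI url = queue.poll();
            HttpResponse<String> resp;
            try {
                HttpRequest req = HttpRequest.newBuilder(url).GET().build();
                resp = client.send(req, HttpResponse.BodyHandlers.ofString());
            } catch (IOException | IllegalArgumentException e) {
                continue; // unreachable or unsupported URL: move on
            }
            if (resp.statusCode() != 200) {
                continue;
            }
            fetched.add(url);
            // In a full solution the page body would also be saved here for the Hadoop job.
            for (URI link : extractLinks(resp.body(), url)) {
                if (seen.add(link)) {
                    queue.add(link);
                }
            }
        }
        return fetched;
    }

    public static void main(String[] args) throws Exception {
        // args[0] is the start URL of the website you chose to crawl.
        for (URI url : crawl(URI.create(args[0]), 20)) {
            System.out.println(url);
        }
    }
}
```

The printed list doubles as the list of crawled URLs asked for in the hand-in. A polite crawler would also respect robots.txt and pause between requests, which this sketch leaves out.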
Hand in:
- Each student or group should crawl a different website. List the website URL as well as the URLs of the crawled webpages.
- Count the word occurrences on Hadoop, with code in both Java and another language such as Pig Latin. Print out your code.
- Print out your result.
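To make the word-count logic concrete before moving it to Hadoop, here is a minimal plain-Java sketch of the map and reduce steps. It deliberately does not use the Hadoop API: in an actual job the same logic would live in subclasses of `org.apache.hadoop.mapreduce.Mapper` and `org.apache.hadoop.mapreduce.Reducer`, and the framework would do the grouping between them. The class name `WordCountSketch`, the tokenization rule (lowercase, split on anything that is not a letter or digit), and the sample input are assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.TreeMap;

class WordCountSketch {
    /** Map step: emit one (word, 1) pair per token in a line of text. */
    static List<Map.Entry<String, Integer>> map(String line) {
        List<Map.Entry<String, Integer>> pairs = new ArrayList<>();
        // Assumption: words are runs of letters or digits, compared case-insensitively.
        for (String token : line.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{Nd}]+")) {
            if (!token.isEmpty()) {
                pairs.add(Map.entry(token, 1));
            }
        }
        return pairs;
    }

    /** Shuffle and reduce steps combined: group pairs by word and sum their counts. */
    static Map<String, Integer> reduce(List<Map.Entry<String, Integer>> pairs) {
        Map<String, Integer> counts = new TreeMap<>(); // sorted by word, like Hadoop's key order
        for (Map.Entry<String, Integer> pair : pairs) {
            counts.merge(pair.getKey(), pair.getValue(), Integer::sum);
        }
        return counts;
    }

    /** Runs the map step on every line, then reduces all emitted pairs. */
    static Map<String, Integer> countWords(List<String> lines) {
        List<Map.Entry<String, Integer>> all = new ArrayList<>();
        for (String line : lines) {
            all.addAll(map(line));
        }
        return reduce(all);
    }

    public static void main(String[] args) {
        // Sample input only; the real input would be the text of the crawled pages.
        List<String> lines = List.of("Hadoop counts words", "words, words; HADOOP!");
        countWords(lines).forEach((word, n) -> System.out.println(word + "\t" + n));
    }
}
```

Running it locally on a small sample is a quick way to cross-check the counts produced by the Hadoop and Pig Latin versions.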
Homework due: 4/12
You are allowed to form a group of no more than 4 fellow students.
https://github.com/ewanlee/machine-learning-ECNU-/blob/master/Hadoop%20wordcount%20demo_cutted.pdf