Apache Storm Hashtags Analyzer

An Apache Storm analyzer that finds the top 5 hashtags in each country, using MongoDB for geolocation calculations and a heavy hitters algorithm for counting over streams.

Workflow

  • The tweet source is the Twitter API (via Twitter4J), which supplies the tweet body and the geolocation information.
  • Each tweet is then parsed to extract its hashtags and its country name. I used the MongoDB driver to find the nearest country to the tweet's geolocation; the countries' longitude and latitude data come from Google, here.
  • Hashtags are then counted using a heavy hitters algorithm to get the top 5 occurrences in the stream.
  • Finally, the results are visualized using D3 and a Flask (Python) server.
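
The parsing and counting steps above can be sketched as follows. This is a minimal illustration, not the project's actual code: the README does not say which heavy hitters algorithm is used, so the sketch assumes the Misra-Gries summary, one classic choice. It is written in plain Python rather than as a Storm bolt, and the names `extract_hashtags`, `MisraGries`, and `top_hashtags_by_country` are hypothetical. The country is assumed to be already resolved for each tweet.

```python
import re
from collections import defaultdict

# Assumption: a hashtag is '#' followed by word characters.
HASHTAG_RE = re.compile(r"#(\w+)")


def extract_hashtags(text):
    """Return the lowercase hashtags found in a tweet body."""
    return [tag.lower() for tag in HASHTAG_RE.findall(text)]


class MisraGries:
    """Approximate frequent-item summary that keeps at most k - 1 counters."""

    def __init__(self, k):
        if k < 2:
            raise ValueError("k must be at least 2")
        self.k = k
        self.counters = {}

    def add(self, item):
        if item in self.counters:
            # Item is already tracked: bump its counter.
            self.counters[item] += 1
        elif len(self.counters) < self.k - 1:
            # There is a free slot: start tracking the item.
            self.counters[item] = 1
        else:
            # No free slot: decrement every counter and drop those at zero.
            for key in list(self.counters):
                self.counters[key] -= 1
                if self.counters[key] == 0:
                    del self.counters[key]

    def top(self, n):
        """Return up to n (item, estimate) pairs, highest estimate first."""
        ranked = sorted(self.counters.items(), key=lambda kv: (-kv[1], kv[0]))
        return ranked[:n]


def top_hashtags_by_country(tweets, n=5, k=50):
    """tweets is an iterable of (country, text) pairs."""
    # One independent summary per country.
    summaries = defaultdict(lambda: MisraGries(k))
    for country, text in tweets:
        for tag in extract_hashtags(text):
            summaries[country].add(tag)
    return {country: summary.top(n) for country, summary in summaries.items()}
```

The stored estimates never exceed the true counts, and memory per country is bounded by `k - 1` counters regardless of stream length, which is why this family of algorithms suits an unbounded tweet stream. A larger `k` gives tighter estimates at the cost of more memory.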

You can find code for many maps here.

Sample run

First img

Second img

Third img

Fourth img