Project Summary

Parallelization

Parallelization: 3 points

The data obtained from Twitter included about 410,000 customer reviews that had to be classified to predict airline rankings. The data was distributed across the cluster so that it could be processed in parallel, resulting in efficient computation.
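
The partition, classify, and merge pattern behind this kind of parallelization can be sketched in plain Python. This is only an illustration under stated assumptions, not the project's code: the keyword lists, the toy classify function, and the names partition, count_labels, and classify_parallel are all hypothetical. A local thread pool stands in for the cluster workers, so it shows the shape of the computation rather than real distributed speed-up.

```python
from collections import Counter
from concurrent.futures import ThreadPoolExecutor

# Hypothetical keyword lists; the project's real classifier is not described here.
POSITIVE = {"great", "good", "friendly", "comfortable"}
NEGATIVE = {"late", "lost", "rude", "delayed"}


def classify(review):
    """Toy stand-in classifier: label a review by counting keyword hits."""
    words = review.lower().split()
    score = sum(w in POSITIVE for w in words) - sum(w in NEGATIVE for w in words)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


def partition(items, n):
    """Split items into n roughly equal, contiguous chunks."""
    size, extra = divmod(len(items), n)
    chunks, start = [], 0
    for i in range(n):
        end = start + size + (1 if i < extra else 0)
        chunks.append(items[start:end])
        start = end
    return chunks


def count_labels(chunk):
    """Map step: classify every review in one partition and count the labels."""
    return Counter(classify(review) for review in chunk)


def classify_parallel(reviews, workers=4):
    """Classify partitions concurrently, then merge the per-partition counts."""
    total = Counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Each partition is processed independently, like a task on a cluster worker.
        for partial in pool.map(count_labels, partition(reviews, workers)):
            total.update(partial)  # Reduce step: add up the partial counts.
    return total
```

Calling classify_parallel on a list of review strings returns a Counter of label totals. Because each partition is counted independently and the counts are simply added, the result does not depend on how many partitions are used.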

UI

UI: 3 points

The UI is a website built with HTML5 and Bootstrap CSS in order to visualize the results and present information about our project.

Visualization

Visualization: 2 points

The visualization was implemented in Tableau by connecting it dynamically to Spark Cassandra, and the resulting charts were integrated into the HTML website.

Technologies

Technologies: 1 point

Some of the new technologies learnt as part of this project are Spark MLlib, Tableau, HTML5 and Bootstrap CSS.