Big Data Tools on Amazon Product Reviews Dataset

For my final project in the Computational Tools for Big Data course (Technical University of Denmark, AY 2016/17), I used Apache Spark, Neo4j, Pandas DataFrames, and SQL to perform extensive data mining on the massive Amazon Product Reviews Dataset. The project explored the potential and the challenges of storing and processing inconveniently large datasets with big data technologies.

Project report

Source code