DVD Rental ETL Project

  • Tech Stack: Python, Hadoop, Spark, Hive, Superset, Postgresql
  • Github URL: Project Link

This is example project about ETL with spark, hadoop, hive

User can setup a bigdata stack to run local. Example structure pyspark project, build dependencies and run spark-submit in local integrate with Postgres, Hdfs, Hive.