GHCN Data Analysis Using Spark & Hadoop

About the Project
The Global Historical Climatology Network (GHCN) is an integrated database of daily climate summaries from more than 100,000 surface stations in 180 countries and territories.
Code
For the full code, visit the project's GitHub page below:
Outcome