GHCN Data Analysis using Spark & Hadoop

About the Project

The Global Historical Climatology Network (GHCN) is an integrated database of daily climate summaries from more than 100,000 surface stations in 180 countries and territories.

 

Code

For the full code, visit the project's GitHub page below:

GitHub

 

 

Outcome