Practical session, using Spark for emergency datasources: Difference between revisions
No edit summary |
No edit summary |
||
Line 1: | Line 1: | ||
'''Practical session:''' | '''Practical session:''' | ||
* Running Apache Hadoop and MapReduce | |||
* Querying analyzing open data source with Apache park. | * Querying analyzing open data source with Apache park. | ||
Revision as of 12:53, 17 September 2018
Practical session:
- Running Apache Hadoop and MapReduce
- Querying analyzing open data source with Apache park.
Steps:
- 1) Download data from: https://data.sfgov.org/Public-Safety/Fire-Department-Calls-for-Service/nuek-vuh3. or Session file: File:data.zip.
- 2) Setup an account in Data bricks: https://databricks.com/try-databricks.
- 3) Create a cluster in Databricks.
- 4) Import files from zip folder to workspace.
- 5) Open Fire incidents exploration - RunMe file in cloud.databricks browser.