Practical session, using Spark for emergency datasources
Revision as of 12:54, 17 September 2018
Practical session:
Tasks:
- Running Apache Hadoop and MapReduce:
  - Getting started with Hadoop
- Querying and analyzing an open data source with Apache Spark.
Steps:
- 1) Download the data from https://data.sfgov.org/Public-Safety/Fire-Department-Calls-for-Service/nuek-vuh3 or use the session file: File:data.zip.
- 2) Set up an account on Databricks: https://databricks.com/try-databricks
- 3) Create a cluster in Databricks.
- 4) Import the files from the zip archive into your Databricks workspace.
- 5) Open the "Fire incidents exploration - RunMe" file in the cloud.databricks workspace in your browser.
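
Once the cluster is running and the data is uploaded, a first query on the fire calls could look like the sketch below. It is not taken from the RunMe notebook; it is a minimal PySpark example under stated assumptions: the data is a CSV file with a header row, it has a column named "Call Type" (renamed to call_type here), and the helper names to_snake_case and count_calls_by_type are made up for illustration. Adjust the column name if your export differs.

```python
import re


def to_snake_case(column_name: str) -> str:
    """Turn a CSV header such as 'Call Type' into 'call_type'."""
    cleaned = re.sub(r"[^0-9A-Za-z]+", "_", column_name.strip())
    return cleaned.strip("_").lower()


def count_calls_by_type(csv_path: str, type_column: str = "call_type", limit: int = 10):
    """Return the `limit` most frequent call types as (type, count) tuples.

    Assumes the CSV has a header row and, after renaming, a column called
    `type_column` (an assumption about the dataset, not a confirmed schema).
    """
    # Imported lazily so to_snake_case can be used without Spark installed.
    from pyspark.sql import SparkSession
    from pyspark.sql import functions as F

    # In a Databricks notebook this returns the session attached to the cluster.
    spark = SparkSession.builder.getOrCreate()

    calls = spark.read.csv(csv_path, header=True, inferSchema=True)
    # Rename every column so it can be referenced without backticks.
    calls = calls.toDF(*[to_snake_case(c) for c in calls.columns])

    top_types = (
        calls.where(F.col(type_column).isNotNull())
        .groupBy(type_column)
        .count()
        .orderBy(F.desc("count"))
        .limit(limit)
    )
    return [(row[type_column], row["count"]) for row in top_types.collect()]
```

In a notebook cell, call count_calls_by_type with the location of the uploaded CSV file (the exact path depends on where the upload placed it) and print the returned list. Renaming the columns first is a design choice: headers with spaces are awkward to reference in Spark SQL expressions.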