Jump to content

info319

Practical session, using Spark for emergency datasources: Difference between revisions

From info319

Revision as of 12:54, 17 September 2018

Practical session:

Tasks:

Running Apache Hadoop and MapReduce:
- Getting started with Hadoop

Querying analyzing open data source with Apache park.

Steps:

1) Download data from: https://data.sfgov.org/Public-Safety/Fire-Department-Calls-for-Service/nuek-vuh3. or Session file: File:data.zip.
2) Setup an account in Data bricks: https://databricks.com/try-databricks.
3) Create a cluster in Databricks.
4) Import files from zip folder to workspace.
5) Open Fire incidents exploration - RunMe file in cloud.databricks browser.

Retrieved from "http://info319.wiki.uib.no/index.php?title=Practical_session,_using_Spark_for_emergency_datasources&oldid=172"