Practical session, using Spark for emergency datasources: Difference between revisions

Revision as of 08:10, 18 September 2019

Practical session:

2) Load data and run queries on an Apache Spark cluster in Azure HDInsight.
- Steps:
  - 1) Download data from: https://data.sfgov.org/Public-Safety/Fire-Department-Calls-for-Service/nuek-vuh3. or Session file: File:data.zip.
  - 2) Apache Spark cluster in Azure HDInsight [1]

Solve Variables error: File:Variables.pdf

@@ Line 6: / Line 6: @@
 **[[Running Hadoop | Getting started with Hadoop]]
-* '''2) Querying and analyzing open data source with Apache Spark.'''
+* '''2) Load data and run queries on an Apache Spark cluster in Azure HDInsight.'''
 **'''Steps:'''
 *** 1) Download data from: https://data.sfgov.org/Public-Safety/Fire-Department-Calls-for-Service/nuek-vuh3. or Session file: [[:File:data.zip]].
-***
+*** 2) Apache Spark cluster in Azure HDInsight [https://docs.microsoft.com/en-us/azure/hdinsight/spark/apache-spark-load-data-run-query]
 '''Solve Variables error:''' [[:File:Variables.pdf]]