Practical session, using Spark for emergency datasources: Difference between revisions
No edit summary |
No edit summary |
||
Line 6: | Line 6: | ||
**[[Running Hadoop | Getting started with Hadoop]] | **[[Running Hadoop | Getting started with Hadoop]] | ||
* '''2) | * '''2) Load data and run queries on an Apache Spark cluster in Azure HDInsight.''' | ||
**'''Steps:''' | **'''Steps:''' | ||
*** 1) Download data from: https://data.sfgov.org/Public-Safety/Fire-Department-Calls-for-Service/nuek-vuh3. or Session file: [[:File:data.zip]]. | *** 1) Download data from: https://data.sfgov.org/Public-Safety/Fire-Department-Calls-for-Service/nuek-vuh3. or Session file: [[:File:data.zip]]. | ||
*** | *** 2) Apache Spark cluster in Azure HDInsight [https://docs.microsoft.com/en-us/azure/hdinsight/spark/apache-spark-load-data-run-query] | ||
'''Solve Variables error:''' [[:File:Variables.pdf]] | '''Solve Variables error:''' [[:File:Variables.pdf]] |
Revision as of 08:10, 18 September 2019
Practical session:
Tasks:
- 1) Running Apache Hadoop and MapReduce:
- 2) Load data and run queries on an Apache Spark cluster in Azure HDInsight.
- Steps:
- 1) Download data from: https://data.sfgov.org/Public-Safety/Fire-Department-Calls-for-Service/nuek-vuh3. or Session file: File:data.zip.
- 2) Apache Spark cluster in Azure HDInsight [1]
- Steps:
Solve Variables error: File:Variables.pdf