Exercises: Difference between revisions
(Created page with "We do not need to plan exercises yet. Selecting central themes and tools come first. But any ideas for good datasets, pre-packaged cases etc. could be listed here if you have!") |
No edit summary |
||
(29 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
Outline of the exercises. Because the exercises are new this year, it is hard to plan exactly, so this is likely to change a bit! | |||
* Exercise 1: [[Getting started with Apache Spark]] and [[Processing tweets with Spark]]. | |||
* Exercise 2: [[Streaming tweets with Twitter API]] | |||
* Exercise 3: [[Streaming tweets with Kafka and Spark]] | |||
* Exercise 4: | |||
** [[Create Spark cluster]] | |||
** [[Install HDFS and YARN on the cluster]] | |||
** [[Install Spark on the cluster]] | |||
** [[Install Kafka on the cluster]] | |||
* Exercise 5: | |||
** [[Create Spark cluster using Terraform]] | |||
** [[Configure Spark cluster using Ansible]] |
Latest revision as of 13:56, 31 October 2022
Outline of the exercises. Because the exercises are new this year, it is hard to plan exactly, so this is likely to change a bit!
- Exercise 1: Getting started with Apache Spark and Processing tweets with Spark.
- Exercise 2: Streaming tweets with Twitter API
- Exercise 3: Streaming tweets with Kafka and Spark
- Exercise 4:
- Exercise 5: