Exercises: Difference between revisions

From info319
(Created page with "We do not need to plan exercises yet. Selecting central themes and tools come first. But any ideas for good datasets, pre-packaged cases etc. could be listed here if you have!")
 
No edit summary
 
(29 intermediate revisions by 2 users not shown)
Line 1: Line 1:
We do not need to plan exercises yet. Selecting central themes and tools come first. But any ideas for good datasets, pre-packaged cases etc. could be listed here if you have!
Outline of the exercises. Because the exercises are new this year, it is hard to plan exactly, so this is likely to change a bit!
* Exercise 1: [[Getting started with Apache Spark]] and [[Processing tweets with Spark]].
* Exercise 2: [[Streaming tweets with Twitter API]]
* Exercise 3: [[Streaming tweets with Kafka and Spark]]
* Exercise 4:
** [[Create Spark cluster]]
** [[Install HDFS and YARN on the cluster]]
** [[Install Spark on the cluster]]
** [[Install Kafka on the cluster]]
* Exercise 5:
** [[Create Spark cluster using Terraform]]
** [[Configure Spark cluster using Ansible]]

Latest revision as of 13:56, 31 October 2022