Exercises: Difference between revisions

Revision as of 15:36, 22 August 2022

Outline of the exercises. Because the exercises are new this year, it is hard to plan exactly, so this is likely to change a bit!

Exercise 1: Getting started with Apache Spark. Installing Spark and running your first examples.
Exercise 2: Programming Spark. Basic Spark programming. Simple streaming.
Exercise 3: Streaming Spark. We will continue analysing tweets or other streaming types of information.
Exercise 4: Spark in the cloud. We will run Apache Spark on a cluster of virtual machines in the OpenStack cloud.
Exercise 5: Cloud management. We will automate upgrading and scaling of clusters using Terraform and Ansible.

I also hope to be able to do something with Docker, Docker Swarm and/or Kubernetes.

@@ Line 1: / Line 1: @@
 Outline of the exercises. Because the exercises are new this year, it is hard to plan exactly, so this is likely to change a bit!
-* Exercise 1: [Getting started with Apache Spark]. Installing Spark and running your first programs. <!-- [[Apache Spark]] -->
+* Exercise 1: [[Getting started with Apache Spark]]. Installing Spark and running your first examples.
 * Exercise 2: '''Programming Spark.''' Basic Spark programming. Simple streaming. <!-- [[Spark Streaming Twitter]] [[Sentiment analysis using Spark Streaming]] -->
 * Exercise 3: '''Streaming Spark.''' We will continue analysing tweets or other streaming types of information. <!-- Kafka -->