Exercises: Difference between revisions

From info319

Revision as of 08:17, 22 August 2022

Outline of the exercises. Because the exercises are new this year, it is hard to plan them exactly, so this outline is likely to change a bit!

  • Exercise 1: Getting started with Apache Spark. Installing Spark and running your first programs.
  • Exercise 2: Programming Spark. Basic Spark programming. Simple streaming.
  • Exercise 3: Streaming Spark. We will continue analysing tweets or other types of streaming data.
  • Exercise 4: Spark in the cloud. We will run Apache Spark on a cluster of virtual machines in the OpenStack cloud.
  • Exercise 5: Cloud management. We will automate upgrading and scaling of clusters using Terraform and Ansible.

I also hope to be able to do something with Docker, Docker Swarm and/or Kubernetes.