Exercises: Difference between revisions

From info319
(6 intermediate revisions by the same user not shown)
Line 3:
* Exercise 2: [[Streaming tweets with Twitter API]]
* Exercise 3: [[Streaming tweets with Kafka and Spark]]
- * Exercise 4 - not completely finished:
+ * Exercise 4:
** [[Create Spark cluster]]
** [[Install HDFS and YARN on the cluster]]
** [[Install Spark on the cluster]]
** [[Install Kafka on the cluster]]
- * Exercise 5: '''Cloud management.''' We will automate upgrading and scaling of clusters using Terraform and Ansible.
-
- I also hope to be able to do something with Docker, Docker Swarm and/or Kubernetes.
+ * Exercise 5:
+ ** [[Create Spark cluster using Terraform]]
+ ** [[Configure Spark cluster using Ansible]]

Latest revision as of 13:56, 31 October 2022