Sessions

From info319

Revision as of 15:58, 4 September 2022

Tentative themes for each session

  • Thursday August 18th: Introduction meeting (File:IntroductionMeeting.pdf)
  • Thursday September 1st: Session 1 - Introduction to big data. Big-data processing. Spark
  • Thursday September 15th: Session 2 - More about Spark. Data sources. Twitter's API and tweepy
  • Thursday September 29th: Session 3 - Streaming Spark. Big data architecture. Kafka
  • Thursday October 13th: Session 4 - Cloud computing. NREC and OpenStack
  • Thursday October 27th: Session 5 - Cloud management. Terraform and Ansible. Docker and Kubernetes
  • Thursday November 10th: Session 6 - Societal issues. Privacy. GDPR
  • Thursday November 24th: Session 7 - Essay presentations
  • Thursday December 8th: Session 8 - Project demonstrations

Session 1 - Introduction to big data. Big-data processing. Spark

Supplementary:

Session 2 - More about Spark. Data sources. Twitter's API and tweepy

  • Chambers & Zaharia, chapters 4-9
  • Kitchin, chapter 3

Guest presentation: Daniel Rosnes on using Twitter data for the news

Supplementary:

Session 3 - Streaming Spark. Big data architecture. Kafka

  • Chambers & Zaharia, chapters 20-21
  • Paper on big-data architecture (TBA)
  • Gallofré, M., Opdahl, A. L., Stoppel, S., Tessem, B., & Veres, C. (2021). The News Angler Project: Exploring the Next Generation of Journalistic Knowledge Platforms. In Proceedings of Norsk IKT-konferanse for forskning og utdanning. Short Paper [Poster]
  • Kafka Introduction

Guest presentation: Marc Gallofré Ocaña on the News Hunter platform and its big-data ready architecture

Supplementary:

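  • Berven, A., Christensen, O. A., Moldeklev, S., Opdahl, A. L., & Villanger, K. J. (2020). A knowledge-graph platform for newsrooms. Computers in Industry, 123, 103321. Paper: https://scholar.google.com/scholar?output=instlink&q=info:0K5dB1_9nusJ:scholar.google.com/&hl=en&as_sdt=0,5&as_ylo=2018&scillfp=11776208952974186557&oi=lle
  • Opdahl, A. L., & Tessem, B. (2021). Ontologies for finding journalistic angles. Software and Systems Modeling, 20(1), 71-87. Paper: https://link.springer.com/article/10.1007/s10270-020-00801-w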

Session 4 - Cloud computing. NREC and OpenStack

  • NREC and OpenStack (https://docs.nrec.no/index.html), the following sections/pages: Introduction, Project application, Logging in, The dashboard, Create a Linux virtual machine (skip: Windows), Using SSH, Working with Security Groups, Create and manage volumes, Create and manage snapshots (skip: images), Instance console

Guest presentation: Sohail Khan on computer vision and deep networks for image analysis

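Session 5 - Cloud management. Terraform and Ansible. Docker and Kubernetes

  • Terraform and NREC part I (https://docs.nrec.no/terraform-part1.html), part II (https://docs.nrec.no/terraform-part2.html), and part III (https://docs.nrec.no/terraform-part3.html)
  • How Ansible Works (https://www.ansible.com/overview/how-ansible-works) and the Ansible Community portal (https://docs.ansible.com/ansible_community.html)
  • Material on Docker and Kubernetes (TBA)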

Session 6 - Societal issues. Privacy. GDPR

Guest presentation: Laurence Dierickx on aspects of big-data quality

Supplementary:

Session 7 - Essay presentations

Session 8 - Project demonstrations