Sessions
Tentative themes for each session
- Thursday August 18th: Introduction meeting
- Thursday September 1st: Session 1 - Introduction to big data. Big-data processing. Spark
- Thursday September 15th: Session 2 - Big data architecture. News Angler/News Hunter. Kafka
- Thursday September 29th: Session 3 - Cloud. OpenStack
- Thursday October 13th: Session 4 - Virtual instances. Docker
- Thursday October 27th:Session 5 - Big data storage. Cassandra
- Thursday November 10th: Session 6 - Privacy. GDPR
- Thursday November 24th: Session 7 - Essay presentations
- Thursday December 8th: Session 8 - Project demonstrations
Session 1 - Introduction to big data. Big-data processing. Spark
- Kitchin, chapter 4-5
- Section 1 in Opdahl, A. L., & Nunavath, V. (2020). Big Data. Big Data in Emergency Management: Exploitation Techniques for Social and Mobile Data, 15-29. Paper
- Spark Quick Start (with Python examples)
Session 2 - Big data architecture. News Angler/News Hunter. Kafka
- Kitchin, chapter 1-3
- Opdahl, A. L., & Tessem, B. (2021). Ontologies for finding journalistic angles. Software and Systems Modeling, 20(1), 71-87. Paper
- Gallofré, M., Opdahl, A. L., Stoppel, S., Tessem, B., & Veres, C. (2021). The News Angler Project: Exploring the Next Generation of Journalistic Knowledge Platforms. In Proceedings of Norsk IKT-konferanse for forskning og utdanning. Short Paper [Poster]
- Kafka Introduction
Session 3 - Cloud. OpenStack
- NREC. OpenStack. TerraForm. Ansible
Session 4 - Virtual instances. Docker
- TDB
Session 5 - Big data storage. Cassandra
- TDB
Session 6 - Privacy. GDPR
- TDB
Session 7 - Essay presentations
- TDB
Session 8 - Project demonstrations
- TDB