Readings

From info319
* Sigma: Cassavia, N., & Masciari, E. (2021, March). Sigma: a scalable high performance big data architecture. In 2021 29th Euromicro International Conference on Parallel, Distributed and Network-Based Processing (PDP) (pp. 236-239). IEEE. [https://bibsys-almaprimo.hosted.exlibrisgroup.com/primo-explore/openurl?sid=google&auinit=N&aulast=Cassavia&atitle=Sigma:%20a%20scalable%20high%20performance%20big%20data%20architecture&id=doi:10.1109%2FPDP52278.2021.00044&vid=UBB&institution=UBB&url_ctx_val=&url_ctx_fmt=null&isSerivcesPage=true Paper]
* Maamouri, A., Sfaxi, L., & Robbana, R. (2021, December). Phi: A Generic Microservices-Based Big Data Architecture. In European, Mediterranean, and Middle Eastern Conference on Information Systems (pp. 3-16). Springer, Cham. [https://link.springer.com/chapter/10.1007/978-3-030-95947-0_1 Paper]
-->
 
<!--
* Michael Armbrust, Armando Fox, Rean Griffith, Anthony D Joseph, Randy Katz, Andy Konwinski, Gunho Lee, David Patterson, Ariel Rabkin, Ion Stoica, Matei Zaharia (2010). A view of cloud computing. Communications of the ACM 53 (4), 50-58. [https://dl.acm.org/doi/fullHtml/10.1145/1721654.1721672 Paper]
* M Zaharia, M Chowdhury, MJ Franklin, S Shenker, I Stoica (2010). Spark: Cluster computing with working sets. 2nd USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 10). [https://www.usenix.org/event/hotcloud10/tech/full_papers/Zaharia.pdf Paper]
* Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave, Justin Ma, Murphy McCauley, Michael J Franklin, Scott Shenker, Ion Stoica (2012). Resilient distributed datasets: A Fault-Tolerant abstraction for In-Memory cluster computing. In Proc. 9th USENIX Symposium on Networked Systems Design and Implementation (NSDI 12), pp. 15-28. [https://scholar.google.com/citations?view_op=view_citation&hl=en&user=I1EvjZsAAAAJ&citation_for_view=I1EvjZsAAAAJ:Tyk-4Ss8FVUC Paper]
* Karun, A. K., & Chitharanjan, K. (2013, April). A review on hadoop—HDFS infrastructure extensions. In 2013 IEEE conference on information & communication technologies (pp. 132-137). IEEE. [https://scholar.google.com/scholar?output=instlink&q=info:GIm8aG-ScOsJ:scholar.google.com/&hl=en&as_sdt=0,5&scillfp=6854624816870725192&oi=lle Paper]
* Kafka?
-->


Supplementary:

Revision as of 14:11, 17 August 2022

Books

We will use two textbooks:

  • Rob Kitchin. The Data Revolution - Big Data, Open Data, Data Infrastructures & Their Consequences. Sage, 2014.
    • At least chapters 1, 3-5 and some later chapters are mandatory.
  • Bill Chambers and Matei Zaharia. Spark: The Definitive Guide - Big Data Processing Made Simple. O'Reilly, 2018. File:Spark-TheDefinitiveGuide.pdf
    • At least chapters 1-9 and some later chapters are mandatory.


Papers

Selected papers will become available here, including:

  • Gallofré, M., Opdahl, A. L., Stoppel, S., Tessem, B., & Veres, C. (2021). The News Angler Project: Exploring the Next Generation of Journalistic Knowledge Platforms. In Proceedings of Norsk IKT-konferanse for forskning og utdanning. Short Paper File:A1-Poster-NIKT2021.pdf


Supplementary:

  • Section 1 in Opdahl, A. L., & Nunavath, V. (2020). Big Data. Big Data in Emergency Management: Exploitation Techniques for Social and Mobile Data, 15-29. Book chapter
  • Opdahl, A. L., & Tessem, B. (2021). Ontologies for finding journalistic angles. Software and Systems Modeling, 20(1), 71-87. Paper
  • Berven, A., Christensen, O. A., Moldeklev, S., Opdahl, A. L., & Villanger, K. J. (2020). A knowledge-graph platform for newsrooms. Computers in Industry, 123, 103321. Paper


Technical introductions

Selected web pages will become available here.

Additional non-mandatory materials will be made available to support the exercises further.


Lecture slides

See the Sessions page for lecture slides after each session.

Readings for each session and exercise

The Sessions and Exercises pages will suggest specific readings for each session and exercise.