Time: Tuesday, 4. DS (13:00 bis 14:30)
Location: APB E008
Quantity: 2V/2Ü/0P
Language: German
Modules: INF-B-510, INF-B-520, INF-B-530, INF-B-540, INF-BAS4, INF-BI-2, INF-PM-ANW, INF-VERT4, IST-05-FG-AVS, MINF-04-FG-IAS, WI-BA-08, INF-04-FG-AVS


Although we already live in a world of data, we just see the tip of an iceberg. Data is everywhere and decisions based on large data sets are driving not only business related ut more and more personal decisions. The challenges are enormous and range from technical questions on how to setup and run an efficient and cost-effective data management platform to security and privacy concerns to prevent the loss of personal self-determination.
In this course we give a broad overview of Big Data technologies. First, we revisit the traditional way tackling structured Big Data by introducing the relational database system DB2. Second, we present the most popular state-of-the-art platforms such as Redis, HBase, MongoDB, Neo4j to manage unstructured or semi-structured data as well as graphs. Third, we will give an overview of Apache Spark, a parallel in-memory data processing engine, that is used for Big Data analytics. The exercises for the individual Big Data platforms are supplemented by a specific real-world use-case.

Course Outline

Teil Inhalt Folien
00 Organisatorisches pdf
01 Overview pdf
02 SQL (Basics and Advanced) a) pdf b) pdf
03 JDBC und ODBC pdf
04 Redis – A Key-Value / Data Structure Store pdf
05 Neo4j – Graph Database System pdf
06 HBase (Big Table) pdf
07 MongoDB – Document Stores pdf
08 ArangoDB – A Multi-model Database System pdf
09 Apache Spark – A Data Flow System pdf


Send us an email to: sya-db-bdp@groups.tu-dresden.de