Time: Tuesday, 4. DS (13:00 bis 14:30)
Location: APB E008
Quantity: 2V/2Ü/0P
Language: German
Modules: INF-B-510, INF-B-520, INF-B-530, INF-B-540, INF-BAS4, INF-BI-2, INF-PM-ANW, INF-VERT4, IST-05-FG-AVS, MINF-04-FG-IAS, WI-BA-08, INF-04-FG-AVS

Goals

Although we already live in a world of data, we just see the tip of an iceberg. Data is everywhere and decisions based on large data sets are driving not only business related ut more and more personal decisions. The challenges are enormous and range from technical questions on how to setup and run an efficient and cost-effective data management platform to security and privacy concerns to prevent the loss of personal self-determination.
In this course we give a broad overview of Big Data technologies. First, we revisit the traditional way tackling structured Big Data by introducing the relational database system DB2. Second, we present the most popular state-of-the-art platforms such as Redis, HBase, MongoDB, Neo4j to manage unstructured or semi-structured data as well as graphs. Third, we will give an overview of Apache Spark, a parallel in-memory data processing engine, that is used for Big Data analytics. The exercises for the individual Big Data platforms are supplemented by a specific real-world use-case.

Course Outline

Teil Inhalt Folien
00 Organisatorisches pdf
01 Overview pdf
02 SQL (Basics and Advanced) a) pdf b) pdf
03 JDBC pdf
04 ODBC pdf
05 UDFs und SPs a) pdf b) pdf
06 Redis – A Key-Value / Data Structure Store pdf
07 HBase (Big Table) pdf
08 MongoDB – Document Stores pdf
09 Neo4j – Graph Database System pdf
10 ArangoDB – A Multi-model Database System pdf
11 Apache Spark – A Data Flow System pdf

Contact

Send us an email to: sya-db-bdp@groups.tu-dresden.de