Department of Computer Science | Institute of Theoretical Computer Science
with applications to Data Science
Organization: | Michael Böhlen (UZH), Sven Helmer (UZH), Paolo Penna (ETH) |
Teaching language: | English |
Level: | PhD, MSc and advanced BSc students |
Academic Year: | Spring 2019 (FS19) |
Dates: |
Tuesday 19.2.2019, 16.30 - 18.00h UZH, BIN 0.K.11/12/13 (kickoff meeting) Saturday 13.4.2019, 9.00 - 15.00h BIN 2.A.01 Saturday 11.5.2019, 9.00 - 15.00h ETH CAB H.52 |
Participation at all three meetings is compulsory. The assessment depends on the quality of the report, presentation, active participation during the seminar, and input as a buddy.
Topics
1. Architectures and Systems
2. Column Stores
3. Streams
4. Spark
5. Query Processing
6. Clustering
Presentation | Student | Buddy | Advisor |
---|---|---|---|
Spark SQL: Relational Data Processing in Spark, SIGMOD 2015. |
Luca Wolf | Decova Sara | Sven Helmer |
SHC: Distributed Query Processing for Non-Relational Data Store, ICDE 2018. |
Donn Edward Anin | Lorenzo Selvatici | Sven Helmer |
Clive Charles Javara | Syed Shahvaiz Ahmed | Sven Helmer | |
A Minimal Variance Estimator for the Cardinality of Big Data Set Intersection, KDD 2017. |
Emilien Pierre Carlo Pilloud | Mesut Ceylan | Paolo Penna |
Orca: A Modular Query Optimizer Architecture for Big Data, SIGMOD 2014. |
Maximilian Wolfertz | Mike Suter | Michael Böhlen |
Optimizing Big Data Queries Using Program Synthesis, SOSP 2017. |
Alex Wolf | Yichun Xie | Michael Böhlen |
Clustering with Same-Cluster Queries, NIPS 2016. |
Michael Studer | Peter Giger | Paolo Penna |
Han-Mi Nguyen | Pascal Engeli | Paolo Penna | |
Coconut: A Scalable Bottom-Up Approach for Building Data Series Indexes, VLDB 2018. |
Timon Stampfli | Catharina Dekker | Michael Böhlen |