Schedule*:

Date Activity
Paper Presenter
Feb 1 Course Organization
- Stan
Feb 8 **Snow Day**
- -
Feb 15 ETL Systems
A Survey of ETL
ETL Tools for Data Migration
TPC-DI
Group 4
Feb 22 No Class - President's Day
- -
Feb 29 Streaming
Data Stream Management Issues
S-Store
Group 5
Mar 7 Schema Mapping and Semantic Heterogeneity
#4. Tutorial: Uncertain Schema Matching
#1. Schema Mapping as Query Discovery
#2. Data Tamer
Group 3
Mar 14 Data Cleaning
DC: A Practical Perspective (Chapters 1-4)
DC: Problems and Current Approaches
Potter's Wheel
Group 1
Mar 21 De-Duplication (incl. Crowd Sourcing)
Introduction to Duplicate Detection
Demystifying Data Deduplication
Indexing Techniques
Group 2
Mar 28 No Class - Spring Break
- -
Apr 4 Data Loading (incl. Bulk Load)
Optimizing Data Warehouse Loading Procedures
Optimized Data Loading
Transaction Reordering and Grouping
Group 4
Apr 11 NoDB
Group 3
Apr 18 Federated Databases
Group 1
Apr 25 Lambda Architecture
Group 2
May 2 Publish/Subscribe
Group 5
May 9 Project Presentations
- All
* This schedule is subject to changes as the course evolves.