Papers:
The
highlighted papers in each section are the primary readings and should be emphasized in class. You must be behind the department firewall in order to access these papers.
-
Parallel Database Systems
- D. Dewitt, S. Ghandeharizadeh, and D. Schneider, A Performance Analysis of the Gamma Database Machine, in Proceedings of the 1988 SIGMOD Conference, Chicago, Ill., June, 1988.
- D. Dewitt, S. Ghandeharizadeh, D. Schneider, A. Bricker, H. Hsiao, and R. Rasmussen, The Gamma Database Machine Project, in IEEE Transactions on Knowledge and Data Engineering, Vol. 2, No. 1, March, 1990.
- M. Stonebraker, R.H. Katz, D.A. Patterson, and J.K. Ousterhout, The Design of XPRS, in Proceedings of the 14th International Conference on Very Large Data Bases, 1988.
- A. Silberschatz, Parallel Databases, in Database Concepts (6th Edition), 2001.
-
NoSQL Databases Background - Part One
- S. Gilbert and N. Lynch, Brewer’s conjecture and the feasibility of consistent, available, partition-tolerant web services, in ACM SIGACT News, 33(2):51-59, 2002.
- M. Stonebraker, Errors in Database Systems, Eventual Consistency, and the CAP Theorem, in the Communications of the ACM (BLOG@CACM), April 2010.
- E. Brewer, Towards Robust Distributed Systems, in PODC Keynote Talk, 2000.
- A. Thomson, D.J. Abadi, The Case for Determinism in Database Systems, in PVLDB 3(1): 70-80 2010.
-
NoSQL Databases Background - Part Two
- G. DeCandia et al., Dynamo: Amazon's highly available key-value store, in SOSP 2007.
- F. Chang et al., Bigtable: A distributed storage system for structured data, in OSDI 2006.
- B. F. Cooper et al., PNUTS: Yahoo!’s hosted data serving platform, in VLDB 2008.
- R. Sears, M. Callaghan, and E. Brewer, Rose: Compressed Log Structured Replication, in VLDB, 2008.
-
NoSQL Databases
- A.Lakshman, P. Malik., Cassandra: A Decentralized Structured Storage System, in ACM SIGOPS Operating Systems Review, Volume 44 Issue 2, April 2010.
- J. Baker, et al., MegaStore: Providing Scalable Highly-Available Storage for Interactive Services, in CIDR 2011.
- A.Thusoo, et al., Hive: A Warehousing Solution Over a MapReduce Framework, in VLDB 2009.
- M. Stonebraker, SQL databases v. NoSQL databases, in Communications. ACM 53(4): 10-11 (2010).
- Systems:
- Amazon SimpleDB
- Apache Cassandra
- Apache CouchDB
- Apache HBase
- Dynomite Framework
- Hypertable
- MongoDB
- Project Voldemort
- Riak
- Redis
- Membase
- Tokyo Cabinet
- Scalaris
-
Databases in the Cloud - Part One
- M. Isard et al., Dryad: Distributed Data-Parallel Programs From Sequential Building Blocks, in EuroSys 2007.
- B.F. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan, and R. Sears. Benchmarking Cloud Serving Systems with YCSB. ACM Symposium on Cloud Computing (SoCC), 2010.
- J. Dean and S. Ghemawat, MapReduce: Simplified Data Processing on Large Clusters, in OSDI 2004.
- C. Olston et al., Pig Latin: a not so foreign language for data processing, in SIGMOD 2008.
-
Databases in the Cloud - Part Two
- A. Pavlo, E. Paulson, A. Rasin, D. J. Abadi, D. J. DeWitt, S. Madden, and M. Stonebraker, A Comparison of Approaches to Large Scale Data Analysis, in SIGMOD '09: Proceedings of the 35th SIGMOD international conference on Management of data, New York, NY, USA, 2009, pp. 165-178. [VIDEO]
- M. Stonebraker, D. Abadi, D. J. DeWitt, S. Madden, E. Paulson, A. Pavlo, and A. Rasin, MapReduce and parallel DBMSs: friends or foes?, in Communications of the ACM, January 2010.
- T. Kraska, M. Hentschel, G. Alonso, and D. Kossmann, Consistency rationing in the cloud: pay only when it matters, in Proceedings of the VLDB Endowment, Volume 2 Issue 1, August 2009.
- C. Ooi and S. Parthasarathy, Special issue on data management on cloud computing platforms, in IEEE Data Engineering Bulletin, vol. 32, 2009.
-
Partitioned Main-Memory Databases
- M. Stonebraker, S. Madden, D. Abadi, S. Harizopoulos, N. Hachem, P. Helland, The End of an Architectural Era (It's Time for a Complete Rewrite), in VLDB 2007: 1150-1160.
- I. Pandis, R. Johnson, N. Hardavellas, and A. Ailamaki, Data-Oriented Transaction Execution, in Proc. VLDB Endow. 2010: 928-939.
- R. Kallman, H. Kimura, J. Natkins, A. Pavlo, A. Rasin, S. Zdonik, E. P. C. Jones, S. Madden, M. Stonebraker, Y. Zhang, J. Hugg, and D. J. Abadi, H-Store: A High-Performance, Distributed Main Memory Transaction Processing System, in Proc. VLDB Endow., vol. 1, iss. 2, pp. 1496-1499, 2008.
- Evan P. C. Jones, Daniel J. Abadi, Samuel Madden, Low overhead concurrency control for partitioned main memory databases, in SIGMOD Conference 2010: 603-614.
- P. Apers, C. Van Den Berg, J. Flokstra, P. Grefen, M. Kersten, A. Wilschut, PRISMA/DB: A Parallel Main Memory Relational DBMS, in IEEE Transactions on Knowledge and Data Engineering, 1992: 541-554.
-
Column Stores
- M. Stonebraker et al., C-Store: A Column-oriented DBMS, in VLDB 2005.
- D. Abadi et al, Column-Oriented Database Systems, in VLDB 2009.
- P. Boncz, M. Kersten, and S. Manegold, Breaking the Memory Wall in Monet DB, in Communications of the ACM, Volume 51 Issue 12, December 2008.
- D. J. Abadi, A. Marcus, S. R. Madden, and K. Hollenbach, Scalable Semantic Web Data Management Using Vertical Partitioning, in Proceedings of the 33rd International Conference on Very Large Data Bases (VLDB), pages 411–422, September 2007.
- D. Abadi, N. Hachem, S. Madden, Column-Stores vs. Row-Stores: How different are they really?, in SIGMOD 2008.
-
Miscellaneous
- P. Unterbrunner, et al., Predictable Performance for Unpredictable Workload, in VLDB 2009.
- J. Dittrich and A. Jindal, Towards a One-Size Fits All DB Architecture, in CIDR 2011.
- A. Abouzeid et al., HadoopDB: An Architectural Hybrid of MapReduce and DBMS Technologies for Analytical Workloads, in VLDB 2009.
- Azza Abouzied, Kamil Bajda-Pawlikowski, Jiewen Huang, Daniel J. Abadi, Avi Silberschatz, HadoopDB in action: building real world applications, in SIGMOD Conference 2010: 1111-1114.
- Daniel J. Abadi, Tradeoffs between Parallel Database Systems, Hadoop, and HadoopDB as Platforms for Petabyte-Scale Analysis, in SSDBM 2010: 1-3.